en_us_normalization.production.classify.VerbatimFst
- class en_us_normalization.production.classify.VerbatimFst[source]
Finite state transducer for classifying verbatims - anything that has extra symbols and doesn’t match available semiotic classes. Verbatim takes any characters, ommitting spaces (boudnary between tokens) and trailing punctuation marks.
Example of input/output string:
jo234 -> verbatim { name: “jo234” }