en_us_normalization.production.classify.VerbatimFst

class en_us_normalization.production.classify.VerbatimFst[source]

Finite state transducer for classifying verbatims - anything that has extra symbols and doesn’t match available semiotic classes. Verbatim takes any characters, ommitting spaces (boudnary between tokens) and trailing punctuation marks.

Example of input/output string:

  • jo234 -> verbatim { name: “jo234” }

__init__()[source]