lexnlp.nlp.en.transforms.characters: Transforming text into character-oriented features

The lexnlp.nlp.en.transforms.characters module contains methods that transform text into character distributions or related feature vectors.

Attention

The sections below are a work in progress. Thank you for your patience while we continue to expand and improve our documentation coverage.

If you have any questions in the meantime, please feel free to log issues on GitHub at the URL below or contact us at the email below:

lexnlp.nlp.en.transforms.characters Module

Transforms related to characters for English

Functions

get_character_distribution(text[, …]) Get character distribution of text, potentially lowercasing and stopwording first.
get_character_ngram_distribution(text, n[, …]) Get character n-gram distribution of text, potentially lowercasing and stopwording first.
get_token_list(text: str, …) Get token list from text.
get_tokens(text[, lowercase, stopword]) Get token generator from text.
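
To make the idea of a character distribution and a character n-gram distribution concrete, here is a minimal standalone sketch in plain Python. It does not call or reproduce the LexNLP implementation. The helper names `char_distribution_sketch` and `char_ngram_distribution_sketch` and the `normalize` parameter are hypothetical and chosen for illustration only. Stopword removal is omitted, and whether the library functions return raw counts or relative frequencies is not stated on this page.

```python
from collections import Counter
from typing import Dict


def char_distribution_sketch(text: str, lowercase: bool = True,
                             normalize: bool = True) -> Dict[str, float]:
    """Hypothetical helper: map each character in `text` to its count or relative frequency."""
    if lowercase:
        text = text.lower()
    counts = Counter(text)
    total = sum(counts.values())
    if normalize and total:
        # Relative frequencies sum to 1.0 over all observed characters.
        return {ch: c / total for ch, c in counts.items()}
    return dict(counts)


def char_ngram_distribution_sketch(text: str, n: int, lowercase: bool = True,
                                   normalize: bool = True) -> Dict[str, float]:
    """Hypothetical helper: same idea, over overlapping character n-grams of length `n`."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    if lowercase:
        text = text.lower()
    # Slide a window of width n across the text, one character at a time.
    counts = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    total = sum(counts.values())
    if normalize and total:
        return {gram: c / total for gram, c in counts.items()}
    return dict(counts)


if __name__ == "__main__":
    print(char_distribution_sketch("Aab", normalize=False))
    print(char_ngram_distribution_sketch("abab", 2, normalize=False))
```

With `n=1` the n-gram sketch reduces to the single-character case, which is why the two helpers share the same structure.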

Variables

MODULE_PATH str(object='') -> str