lexnlp.nlp.en package

Submodules

lexnlp.nlp.en.stanford module

Stanford parsing methods for English.

This module implements an interface to Stanford parsing methods for English, including token and parts of speech.

Todo:

lexnlp.nlp.en.stanford.check_stanford()
lexnlp.nlp.en.stanford.get_nouns(text, lowercase=False, lemmatize=False)

Get only nouns from text using Stanford libraries.

Parameters:
  • text
  • lowercase
  • lemmatize
Returns:

lexnlp.nlp.en.stanford.get_tokens(text, lowercase=False, stopword=False) → Generator

Get token generator from text using Stanford libraries. :param text: :param lowercase: :param stopword: :return:

lexnlp.nlp.en.stanford.get_tokens_list(text, lowercase=False, stopword=False) → List

Get token list form text using Stanford libraries. :param text: :param lowercase: :param stopword: :return:

lexnlp.nlp.en.stanford.get_verbs(text, lowercase=False, lemmatize=False) → Generator

Get only verbs from text using Stanford libraries.

Parameters:
  • text
  • lowercase
  • lemmatize
Returns:

lexnlp.nlp.en.tokens module

Token parsing for English.

This module implements token parsing, such as tokens, stems, and lemma tokenization functionality in English.

Todo:

lexnlp.nlp.en.tokens.get_adjectives(text, lowercase=False, lemmatize=False) → Generator

Get only adjectives from text.

lexnlp.nlp.en.tokens.get_adverbs(text, lowercase=False, lemmatize=False) → Generator

Get only adverbs from text.

lexnlp.nlp.en.tokens.get_lemma_list(text, lowercase=False, stopword=False, lemmatizer=<WordNetLemmatizer>) → List

Get lemmas materialized from text.

lexnlp.nlp.en.tokens.get_lemmas(text, lowercase=False, stopword=False, lemmatizer=<WordNetLemmatizer>) → Generator

Get lemmas from text. :param text: :param lowercase: :param stopword: :param lemmatizer: :return:

lexnlp.nlp.en.tokens.get_nouns(text, lowercase=False, lemmatize=False) → Generator

Get only nouns from text.

lexnlp.nlp.en.tokens.get_stem_list(text, lowercase=False, stopword=False, stemmer=<EnglishStemmer>) → List

Get stems materialized from text. N.B.: when stemmer is SnowballStemmer, lowercase is always returned no matter the parameter.

Parameters:
  • text
  • lowercase
  • stopword
  • stemmer
Returns:

lexnlp.nlp.en.tokens.get_stems(text, lowercase=False, stopword=False, stemmer=<EnglishStemmer>) → Generator

Get stems from text. N.B.: when stemmer is SnowballStemmer, lowercase is always returned no matter the parameter. :param text: :param lowercase: :param stopword: :param stemmer: :return:

lexnlp.nlp.en.tokens.get_token_list(text: str, lowercase: bool = False, stopword: bool = False, preserve_line: bool = True) → List

Get token list from text. :param text: :param lowercase: :param stopword: :param preserve_line: keep the preserve the sentence and not sentence tokenize it. :return:

lexnlp.nlp.en.tokens.get_tokens(text, lowercase=False, stopword=False, preserve_line=True) → Generator

Get token generator from text. :param text: :param lowercase: :param stopword: :param preserve_line: keep the preserve the sentence and not sentence tokenize it. :return:

lexnlp.nlp.en.tokens.get_verbs(text, lowercase=False, lemmatize=False) → Generator

Get only verbs from text.

lexnlp.nlp.en.tokens.get_wordnet_pos(treebank_tag)

Return wordnet POS object from Treebank POS tag. :param treebank_tag: :return:

Module contents