lexnlp.nlp.en package¶
Subpackages¶
- lexnlp.nlp.en.segments package
- lexnlp.nlp.en.tests package
- Submodules
- lexnlp.nlp.en.tests.test_pages module
- lexnlp.nlp.en.tests.test_paragraphs module
- lexnlp.nlp.en.tests.test_sections module
- lexnlp.nlp.en.tests.test_sentences module
- lexnlp.nlp.en.tests.test_stanford module
- lexnlp.nlp.en.tests.test_stanford_missing module
- lexnlp.nlp.en.tests.test_titles module
- lexnlp.nlp.en.tests.test_tokens module
- lexnlp.nlp.en.tests.test_transforms module
- Module contents
- lexnlp.nlp.en.transforms package
Submodules¶
lexnlp.nlp.en.stanford module¶
Stanford parsing methods for English.
This module implements an interface to Stanford parsing methods for English, including token and parts of speech.
Todo:
-
lexnlp.nlp.en.stanford.
check_stanford
()¶
-
lexnlp.nlp.en.stanford.
get_nouns
(text, lowercase=False, lemmatize=False)¶ Get only nouns from text using Stanford libraries.
Parameters: - text –
- lowercase –
- lemmatize –
Returns:
-
lexnlp.nlp.en.stanford.
get_tokens
(text, lowercase=False, stopword=False) → Generator¶ Get token generator from text using Stanford libraries. :param text: :param lowercase: :param stopword: :return:
-
lexnlp.nlp.en.stanford.
get_tokens_list
(text, lowercase=False, stopword=False) → List¶ Get token list form text using Stanford libraries. :param text: :param lowercase: :param stopword: :return:
-
lexnlp.nlp.en.stanford.
get_verbs
(text, lowercase=False, lemmatize=False) → Generator¶ Get only verbs from text using Stanford libraries.
Parameters: - text –
- lowercase –
- lemmatize –
Returns:
lexnlp.nlp.en.tokens module¶
Token parsing for English.
This module implements token parsing, such as tokens, stems, and lemma tokenization functionality in English.
Todo:
-
lexnlp.nlp.en.tokens.
get_adjectives
(text, lowercase=False, lemmatize=False) → Generator¶ Get only adjectives from text.
-
lexnlp.nlp.en.tokens.
get_adverbs
(text, lowercase=False, lemmatize=False) → Generator¶ Get only adverbs from text.
-
lexnlp.nlp.en.tokens.
get_lemma_list
(text, lowercase=False, stopword=False, lemmatizer=<WordNetLemmatizer>) → List¶ Get lemmas materialized from text.
-
lexnlp.nlp.en.tokens.
get_lemmas
(text, lowercase=False, stopword=False, lemmatizer=<WordNetLemmatizer>) → Generator¶ Get lemmas from text. :param text: :param lowercase: :param stopword: :param lemmatizer: :return:
-
lexnlp.nlp.en.tokens.
get_nouns
(text, lowercase=False, lemmatize=False) → Generator¶ Get only nouns from text.
-
lexnlp.nlp.en.tokens.
get_stem_list
(text, lowercase=False, stopword=False, stemmer=<EnglishStemmer>) → List¶ Get stems materialized from text. N.B.: when stemmer is SnowballStemmer, lowercase is always returned no matter the parameter.
Parameters: - text –
- lowercase –
- stopword –
- stemmer –
Returns:
-
lexnlp.nlp.en.tokens.
get_stems
(text, lowercase=False, stopword=False, stemmer=<EnglishStemmer>) → Generator¶ Get stems from text. N.B.: when stemmer is SnowballStemmer, lowercase is always returned no matter the parameter. :param text: :param lowercase: :param stopword: :param stemmer: :return:
-
lexnlp.nlp.en.tokens.
get_token_list
(text: str, lowercase: bool = False, stopword: bool = False, preserve_line: bool = True) → List¶ Get token list from text. :param text: :param lowercase: :param stopword: :param preserve_line: keep the preserve the sentence and not sentence tokenize it. :return:
-
lexnlp.nlp.en.tokens.
get_tokens
(text, lowercase=False, stopword=False, preserve_line=True) → Generator¶ Get token generator from text. :param text: :param lowercase: :param stopword: :param preserve_line: keep the preserve the sentence and not sentence tokenize it. :return:
-
lexnlp.nlp.en.tokens.
get_verbs
(text, lowercase=False, lemmatize=False) → Generator¶ Get only verbs from text.
-
lexnlp.nlp.en.tokens.
get_wordnet_pos
(treebank_tag)¶ Return wordnet POS object from Treebank POS tag. :param treebank_tag: :return: