lexnlp.nlp.en.segments.utils: Utilities for segmenting

The lexnlp.nlp.en.segments.utils module contains utility methods for segmenting text.

Attention

The sections below are a work in progress. Thank you for your patience while we continue to expand and improve our documentation coverage.

If you have any questions in the meantime, please log an issue on GitHub or contact us by email.

lexnlp.nlp.en.segments.utils Module

Utility methods for segmentation classifiers

This module implements utility methods for segmentation, such as shared methods to generate document character distributions.

Functions

build_document_distribution(text[, …]) Build a document character distribution over a fixed character set, optionally normalizing it.
build_document_line_distribution(text[, …]) Build document and line character distributions for section segmentation over a fixed character set, optionally normalizing the vectors.
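
To illustrate what a character distribution over a fixed character set might look like, here is a minimal sketch. It is not the LexNLP implementation. The function names `char_distribution` and `line_and_doc_distribution`, their parameters, the default character set (`string.printable`), and the return shapes are all assumptions made for illustration. Consult the module source for the actual signatures and output format.

```python
import string
from collections import Counter


def char_distribution(text, characters=string.printable, norm=True):
    """Hypothetical helper: count each character of a fixed set in `text`.

    If `norm` is true, counts are divided by their sum so the values add up
    to 1. An all-zero vector is returned unchanged.
    """
    counts = Counter(text)
    # Restrict the vector to the fixed character set, in a stable order.
    vec = {c: counts.get(c, 0) for c in characters}
    if norm:
        total = sum(vec.values())
        if total > 0:
            vec = {c: n / total for c, n in vec.items()}
    return vec


def line_and_doc_distribution(text, characters=string.printable, norm=True):
    """Hypothetical helper: one document-level vector plus one per line."""
    doc_vec = char_distribution(text, characters, norm)
    line_vecs = [
        char_distribution(line, characters, norm)
        for line in text.splitlines()
    ]
    return doc_vec, line_vecs


if __name__ == "__main__":
    # Raw counts over a two-character set.
    print(char_distribution("aab", characters="ab", norm=False))
    # → {'a': 2, 'b': 1}
```

Fixing the character set ahead of time keeps every vector the same length regardless of the input. A classifier generally needs that when comparing a line's distribution with the distribution of the whole document.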