build_document_line_distribution¶
-
lexnlp.nlp.en.segments.utils.
build_document_line_distribution
(text, characters='0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ!"#$%&\'()*+, -./:;<=>?@[\\]^_`{|}~ \t\n\r\x0b\x0c', norm=True)¶ Build document and line character distribution for section segmenting based on fixed character, optionally normalizing vector.