pre_process_document

lexnlp.nlp.en.segments.sentences.pre_process_document(text: str) → str

Pre-process text of the specified document before splitting it to the sentences. Removes obsolete formatting, page-splitting markers, page numbers e.t.c. :param text: :return: