.. _changes: ============ Changelog ============ 2.3.0 - November 30, 2022 ---------------- * Updated Python version and upgraded all dependencies. * Started using pipenv 2.2.1.0 - August 10, 2022 ---------------- * Improved LexNLP handling for companies for the "EN" locale. * Unified API pattern for sentence and paragraph segmentation. 2.2.0 - July 7, 2022 ---------------- * Improved LexNLP handling for dates, durations and persons for the all locales. * Added parameterizable contract classifiers. * Improved LexNLP handling for ML models. * Updated python requirements and tests, retrained ML models to use gensim-4. 2.1.0 - September 16, 2021 ---------------- * Improved LexNLP handling for companies for the "EN" locale. * Improved LexNLP handling for dates for all locales and dates parser accuracy for the "DE" locale. 2.0.0 - May 10, 2021 ---------------- * Tune extracting facts from text for different locales. * Updated regex patterns and tests for "DE" amount parser. * Added support for delimiter inference in "EN" and "DE" amount parsers. 1.8.0 - December 2, 2020 ---------------- * Improved LexNLP handling for definitions for the "EN" locale. * Implemented rating OCR quality in texts. * Migrated numeric data in parsers results to decimal format to avoid losing fraction digits. 1.7.0 - August 27, 2020 ---------------- * Improved LexNLP handling for dates for the "EN" locale. * Implemented lists of exceptions for entity extractors. * Implemented strongly typed response for entity extractors. * Updated third-party python requirements. 1.6.0 - May 27, 2020 ---------------- * Update psutil package version from 5.4.0 to 5.6.6. 1.4.0 - December 20, 2019 ---------------- * Improved accuracy of locating and converting date phrases into typed format. * Introduced new text vectorizing and classifying models. * Implemented ML-based definitions locator. 1.3.0 - November 1, 2019 ---------------- * Made massive improvements to EN definitions and companies parsers. * Updated EN dates parser to catch more date formats. * Made company parsing strongly typed 0.2.7 - August 1, 2019 ---------------- * Standardized LexNLP methods response to return a generator of Annotation objects or a generator of dictionaries (tuples) * Improved LexNLP handling for definitions for the "EN" locale. * Improved LexNLP handling for companies for the "EN" locale. * Improved sentence splitting logic. * Improved LexNLP unit test coverage. * Updated python requirements in python-requirements*.txt. * Dropped support for python 3.4 and 3.5. 0.2.6 - Jun 12, 2019 ---------------- * Improved LexNLP handling for dates for all locales. * Improved LexNLP handling for currencies for "EN" locale. * Updated documentation for ReadTheDocs. * Improved LexNLP unit test coverage. 0.2.5 - Mar 1, 2019 ---------------- * Improved LexNLP handling for courts for "DE" and "ES" locales. * Improved LexNLP handling for dates for "ES" locale. * Improved LexNLP handling for amounts, acts, regulations and definitions for "EN" locale. * Added CUSIP parser for "EN" locale. * Improved LexNLP unit test coverage. 0.2.4 - Feb 1, 2019 ---------------- * Added universal courts parser, configured LexNLP handling for courts for "DE" locale. * Added universal dates parser, configured LexNLP handling for dates for "DE" and "ES" locales. * Added definitions, citations and dates parsers for "DE" locale. * Added amounts, percents and durations parsers for "DE" locale. * Added geo entities parser for "DE" locale. * Added courts and definitions parsers for "ES" locale. * Added acts parser for "EN" locale. * Improved LexNLP unit test coverage. 0.2.3 - Jan 10, 2019 ---------------- * Updated python requirements. * Improved LexNLP handling for definitions and paragraphs. * Improved LexNLP unit test coverage. 0.2.2 - Sep 30, 2018 ---------------- * Improved LexNLP handling for different date formats. * Improved LexNLP handling for titles. * Improved LexNLP unit test coverage. 0.2.1 - Aug 24, 2018 ---------------- * Updated python requirements. * Improved LexNLP handling for amounts. * Optimized processing of sentences and titles. * Improved LexNLP unit test coverage. 0.2.0 - Aug 1, 2018 ---------------- * Improved LexNLP handling for addresses and sentences. * Improved LexNLP unit test coverage. 0.1.9 - Jul 1, 2018 ---------------- * Improved handling of TOC during sentence processing. * Added contracts locator to LexNLP. * Improved LexNLP handling for citations, titles and definitions. * Improved LexNLP unit test coverage. 0.1.8 - May 1, 2018 ---------------- * Improved LexNLP handling for addresses and currencies. * Improved LexNLP unit test coverage. 0.1.7 - Apr 1, 2018 ---------------- * Improved LexNLP handling for companies, organizations and dates. * Implemented generating train/test dataset for addresses. * Exclude common false positives for persons parser. 0.1.6 - Mar 1, 2018 ---------------- * Improved LexNLP unit test coverage. 0.1.5 - Feb 1, 2018 ---------------- * Improved LexNLP unit test coverage. 0.1.4 - Jan 1, 2018 ---------------- * Improved LexNLP unit test coverage. * Implemented method to get sentence ranges in addition to sentence texts. 0.1.3 - Dec 1, 2017 ---------------- * Improved LexNLP unit test coverage. 0.1.2 - Nov 1, 2017 ---------------- * Implemented LexNLP title locator. * Implemented additional LexNLP transforms for skipgrams and n-grams. * Improved LexNLP handling for parties with abbreviations and other cases. * Improved LexNLP handling for amounts with mixed alpha and numeric characters. * Improved LexNLP unit test coverage. 0.1.1 - Oct 1, 2017 ---------------- * Improve unit test framework handling for language and locales. * Implemented method and input-level CPU and memory benchmarking for unit tests. * Migrated all unit tests to 60 separate CSV files. * Added over 1,000 new unit tests for most LexNLP methods. * Reduced memory usage for paragraph and section segmenters. * Improved handling of brackets and parentheses within noun phrases. * Added URL locator to LexNLP. * Added trademark locator to LexNLP. * Added copyright locator to LexNLP. * Improved default Punkt sentence boundary detection. * Added custom sentence boundary training methods. * Improved handling of multilingual text, especially around geopolitical entities. * Improved default handling of party names with non-standard characters. * Enhanced metadata related to party type in LexNLP. * Improved continuous integration for public repositories. 0.1.0 - Sep 1, 2017 ---------------- * Refactored and integrate core extraction into separate LexNLP package. * Released nearly 200 unit tests with over 500 real-world test cases in LexNLP. * Improved definition, date, and financial amount locators for corner cases. * Integrated PII locator for phone numbers, SSNs, and names from LexNLP. * Integrated ratio locator from LexNLP. * Integrated percent locator from LexNLP. * Integrated regulatory locator from LexNLP. * Integrated distance locator from LexNLP. * Integrated case citation locator from LexNLP. * Improved geopolitical locator to allow non-master-data entity location. * Improved party locator to allow configuration and better handle corner cases