Changelog for the Extraction Performance base processor in Extend.
Version | Date | Changelog | |
---|---|---|---|
4.0.0 | 2025-02-10 | - New foundation model in the core extraction pipeline that leads to a big jump in performance, especially on complex array extraction. | |
3.11.0 | 2025-04-02 | More robust figure parsing with better captions and improved handling of signature fields to reduce false positive rate in most key use cases. | |
3.10.1 | 2025-03-21 | Small patch to improve handling of some deeply nested extraction fields. | |
3.10.0 | 2025-02-09 | Significantly improved figure parsing in our pre-processing to better segment out and transform sub-classes of figures (e.g. charts, logos, etc.) in markdown | |
3.9.0 | 2024-11-15 | Add support for nested arrays and objects. See here for an example schema. | |
3.8.0 | 2024-11-04 | Add support for nested enums in array and object fields. Make advanced multimodal enabled by default. | |
3.7.0 | 2024-10-10 | Add support for an new, optimized enum field type. Updates to document pre-processing for handwritten text and dense/large tables. | |
3.6.0 | 2024-09-16 | Add base support for advanced multimodal features, which can be enabled in the Processor Settings in Studio. | |
3.5.0 | 2024-08-27 | Several changes to improve extraction performance, including minor model version upgrade, a new bounding box system, and model insights. | |
3.4.0 | 2024-08-16 | Updates to our document pre-processing to better handle more complex document/table layouts. | |
3.3.0 | 2024-07-30 | Improvements to signature extraction accuracy. | |
3.2.0 | 2024-07-14 | Updates to our document pre-processing to better handle checkboxes. | |
3.1.0 | 2024-06-01 | Updates to our document pre-processing to better handle more complex document/table layouts. | |
3.0.0 | 2024-05-14 | Promoting a new foundation model to default - it’s faster and more accurate across all of our internal benchmarks for extraction. | |
2.0.0 | 2024-04-12 | Promoting a new foundation model to default. Very minor increases in accuracy and speed. | |
1.0.0 | 2023-08-01 | Initial (and now legacy) extraction model. |