2024
February 27
4min
Our Re-Extraction feature is a powerful tool designed to ensure adjustments to your document extraction experience are seamless and quick. This feature enables users to re-extract values from a document against Patterns of their choice in such a way that any adjustments to the OCR-obtained tables and text structure can be considered.
Pages that are run as part of re-extract are not counted toward your organization's pages processed count, saving your organization from the costs typically associated with re-running a document when adjustments are needed.
There are three cases that make the most of our re-extract feature:
- Sometimes a modification to your extraction automation is necessary for a more accurate and up-to-date extraction of values, which can be accomplished via a machine learning model update or a Pattern change
- Re-extract can be used to extract updated values with the updated Pattern automation
- On rare occassions with misaligned or small text, the OCR model may encounter a minor hiccup, such as treating two lines of text as separate boxes or missing a final, less populated row in a table
- After making the adjustment, you can use Re-extract to use your Pattern to extract updated values based on newly redrawn boxes, post modification
- If you would like to associate a new Pattern with your document, you can designate additional active Patterns for re-extract to run
- These Patterns will extract values from the document and can be viewed upon completion