NuExtract
NuExtract is a large language model specialized in extracting structured information from documents of any format, including raw text, scanned images, PDFs, PowerPoints, spreadsheets, and more, supporting over a dozen languages and mixed‑language inputs. It delivers JSON‑formatted output that faithfully follows user‑defined templates, with built‑in verification and null‑value handling to minimize hallucinations. Users define extraction tasks by creating a template, either by describing the desired fields or importing existing schemas—and can improve accuracy by adding document, output examples in the example set. The NuExtract Platform provides an intuitive workspace for designing templates, testing extractions in a playground, managing teaching examples, and fine‑tuning settings such as model temperature and document rasterization DPI. Once validated, projects can be deployed via a RESTful API endpoint that processes documents in real time.
Learn more
Parseflow
Stop manual data entry; extract structured data & integrate it with everything. Parseflow offers a wide range of options for importing your documents for parsing. Forward your emails and attachments to Parseflow's inbox. Import your documents from your favorite apps. Specify your fields and watch Parseflow automate. Accelerate your workflow, intelligent extraction suggestions speed up your process. Powering accurate and fast data extraction. Parseflow automates data extraction from emails and files. Export to Zoho, Xero, Tally, and thousands of other apps. Export parsed data to your favorite apps and platforms. Fast data extraction with our OCR & AI engine. Set up takes just a few minutes. No coding is required, no classification, and no custom model training is necessary. Extract data even from documents you've never seen before. With instructions and support, just describe the data you need in plain language.
Learn more
Extend
Extend is a complete document processing platform that turns complex, unstructured files into clean, accurate data in minutes. Its advanced multimodal vision models are designed to handle messy handwriting, massive tables, tricky checkboxes, and irregular layouts with precision. Extend’s AI agents learn from your documents, run autonomous experiments, and optimize your extraction schemas for maximum accuracy. With flexible APIs for parsing, classification, extraction, and splitting, you can embed fast, polished document workflows directly into your product. Confidence scoring, human-in-the-loop review, and built-in validations ensure accuracy at scale for mission-critical operations. Extend helps technical teams ship production-ready pipelines in days—not months.
Learn more
reciTAL
reciTAL is an Artificial Intelligence software editor. First Intelligent Document Processing player with a Deep Tech label, reciTAL automates your extraction, classification and search processes, for all types of document and email flows. At any time, you can re-train a model taking into consideration user feedback. The reciTAL team guides you through deployment in your internal Kubernetes or via Docker Compose. Basic business rules are then implemented in a few minutes to configure your data points. Depending on the level of confidence reached, the extracted data are validated or not by an operator. The configuration of a new type of document is done with unparalleled simplicity and speed. Validated data is used for continuous performance improvement.
Learn more