Can untxt. split multiple invoices from one PDF?
Yes. untxt. automatically detects document boundaries, groups related pages, classifies each document, and extracts each record separately.
What it means
Splitting a mixed PDF is a bookkeeping problem, not just a PDF problem. The system has to detect where one document ends, where the next starts, whether pages belong together, and what accounting data should be extracted from each detected document.
Example
A client uploads one scanner PDF containing invoices, receipts, statements, and credit notes. untxt. separates the batch into individual documents and sends uncertain page groupings to review.
Where review matters
Very poor scans, missing pages, repeated blank pages, or overlapping documents can still need human judgment.
Who this helps
This helps teams that receive scanner bundles, email exports, or client uploads where many receipts, invoices, statements, and credit notes arrive inside one file.
What untxt. does
untxt. automatically looks for page boundaries, document type changes, repeated headers, totals, dates, page numbers, vendor shifts, and layout cues. It then groups pages before extraction, so each detected document becomes its own reviewable record.
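To make the grouping step concrete, here is a minimal sketch that uses only two of the cues listed above: a printed page number that resets to 1, and a change of vendor. It is an illustration under stated assumptions, not untxt.'s actual implementation. The `Page` fields and the `group_pages` function are hypothetical names, and the vendor and page number are assumed to have already been read from each page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Page:
    """Hypothetical per-page cues, assumed to be extracted upstream."""
    index: int               # position in the uploaded PDF, 0-based
    vendor: Optional[str]    # vendor name read from the page, if any
    page_no: Optional[int]   # printed "Page N" value, if any


def group_pages(pages: list[Page]) -> list[list[int]]:
    """Group consecutive pages into documents using two simple cues."""
    groups: list[list[int]] = []
    current_vendor: Optional[str] = None
    for page in pages:
        # Cue 1: the printed page number resets to 1.
        number_reset = page.page_no == 1
        # Cue 2: the vendor differs from the vendor of the current group.
        vendor_shift = (
            page.vendor is not None
            and current_vendor is not None
            and page.vendor != current_vendor
        )
        if not groups or number_reset or vendor_shift:
            groups.append([page.index])      # start a new document
            current_vendor = page.vendor
        else:
            groups[-1].append(page.index)    # continue the current document
            if current_vendor is None:
                current_vendor = page.vendor
    return groups


batch = [
    Page(0, "Acme", 1),
    Page(1, "Acme", 2),
    Page(2, "Globex", 1),
    Page(3, None, None),      # no readable cues: stays with the previous page
    Page(4, "Initech", None),
]
print(group_pages(batch))
```

A page with no readable cues is kept with the preceding page here. A real system would more likely treat that grouping as uncertain and send it to review.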
What it does not pretend to do
It does not silently guess when the split is unclear. If pages are missing, duplicated, out of order, or visually ambiguous, those sections should be flagged for review.
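The review rule can be sketched in the same spirit. The example below checks the printed page numbers of one detected document and returns the reasons it might need a human look. It is a simplified assumption about how such a check could work, not a description of untxt.'s internals. `review_reasons` is a hypothetical name, and visual ambiguity is reduced here to an unreadable page number.

```python
from typing import Optional


def review_reasons(page_numbers: list[Optional[int]]) -> list[str]:
    """Return reasons to flag one detected document for review.

    page_numbers holds the printed page number of each page in the
    order the pages appear, or None where no number could be read.
    """
    reasons: list[str] = []
    known = [n for n in page_numbers if n is not None]

    if len(known) < len(page_numbers):
        reasons.append("unreadable page number")
    if len(set(known)) < len(known):
        reasons.append("duplicated page")
    if known != sorted(known):
        reasons.append("out of order")
    # Every number from 1 up to the highest one seen should be present.
    if known and set(known) != set(range(1, max(known) + 1)):
        reasons.append("missing page")
    return reasons


print(review_reasons([1, 2, 3]))   # clean document, nothing to flag
print(review_reasons([1, 3]))      # page 2 is absent
print(review_reasons([2, 1]))      # pages scanned in the wrong order
```

An empty list means the document can proceed to extraction. Any non-empty list sends that group to review with the reasons attached, rather than guessing.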