Session

6 Million Pages: Agentic Solution to Scan, Classify, Extract, and Find What Matters

Every business drowns in documents - contracts, statements, invoices, loan agreements, you name it. Collecting them is easy – “just send it as an attachment”. The real money pit starts after: scanning, classifying, extracting key info, and turning a chaotic pile into organised and searchable storage. That’s where budgets vanish and people quit.

The reality: 6,000,000 pages in mixed-quality PDFs with upside-down scans, handwriting, stitched bundles, and unpredictable layouts.

The challenge: accurate OCR, reliable classification, validated extraction, and audit-ready search.

In this session, we’ll show how an agentic solution - built on Foundry Agent Service - and the team of helpers: Azure Functions, Logic Apps, Document Intelligence, and Azure AI Search - transforms messy inbound files into structured, searchable, and actionable knowledge that drives faster decisions and better compliance.

Agent lineup:

- Intake & Triage Agent: performs OCR, detects doc sets, splits bundles, fixes rotation, tags, and routes using Document Intelligence / Foundry tools.
- Extraction Agent: pulls entities and key fields (people, orgs, dates, terms) and normalises to your schema.
- Validation & Rules Agent: checks completeness, flags missing artefacts, applies policy, and kicks off follow-ups.
- Search & Knowledge Agent: indexes content for semantic and agentic retrieval in Azure AI Search.

Low-confidence pages don’t block the pipeline. The agents route exceptions to a review queue (human-in-the-loop), capture corrections, and re-run extraction with the updated signal so the system gets cleaner over time.

Result: documents classified, organised, verified, and available in seconds, not hours. Bring Copilot family to the party, if you wish.

George Doubinski

Microsoft MVP | Power Platform Enabler

Sydney, Australia

Actions

Please note that Sessionize is not responsible for the accuracy or validity of the data provided by speakers. If you suspect this profile to be fake or spam, please let us know.

Jump to top