Steps in document processing

1

Document Ingestion

  • Documents can be received via web upload, email, SFTP, or REST API
  • Unstructured data from PDFs and scanned documents is supported
  • Structured documents such as CSV and Excel files are supported

2

Image to Text

For images and scanned PDF documents, optical character recognition (OCR) is used to extract the text. Structured files such as Excel tables already contain machine-readable text and do not need OCR.

3

Classify and Prepare Documents

Determining the type of document ensures that the right data elements are extracted. The following are supported:

  • Invoice
  • Packing List
  • Certificate of Origin
  • and more

The documents are pre-classified by the user or via the upload API.
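
To illustrate why the document type matters, the following minimal sketch maps a pre-classified type to the data elements an extractor would look for. The type keys and field names are hypothetical and chosen for illustration only; they are not the MIC DOCFLOW schema.

```python
# Hypothetical mapping from document type to the data elements to extract.
# All field names are illustrative assumptions, not the MIC DOCFLOW schema.
FIELDS_BY_TYPE = {
    "invoice": ["invoice_number", "invoice_date", "total_amount", "currency"],
    "packing_list": ["package_count", "gross_weight", "net_weight"],
    "certificate_of_origin": ["exporter", "country_of_origin"],
}


def fields_for(document_type: str) -> list[str]:
    """Return the fields to extract for a pre-classified document type."""
    # Normalize labels such as "Packing List" to the key "packing_list".
    key = document_type.strip().lower().replace(" ", "_")
    if key not in FIELDS_BY_TYPE:
        raise ValueError(f"Unsupported document type: {document_type!r}")
    return FIELDS_BY_TYPE[key]
```

Calling `fields_for("Packing List")` returns the packing list fields, while an unknown type raises an error instead of silently extracting the wrong elements.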

4

AI-Based Structuring

Extraction is performed by our Vision API, which uses large language models (LLMs). The Vision API also provides a self-assessment of the extraction quality. Basic validations and data enhancements are implemented directly in MIC DOCFLOW.
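
As a rough sketch of how a quality self-assessment and basic validations could be combined, the example below collects issues that would flag an extraction for closer attention. It assumes the self-assessment is a score between 0 and 1, which the source does not state; the `Extraction` type, the required fields, and the threshold are likewise hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Extraction:
    fields: dict[str, str]
    confidence: float  # assumed self-assessment score in [0, 1]


# Hypothetical required fields for an invoice.
REQUIRED = ("invoice_number", "total_amount")


def validate(extraction: Extraction, threshold: float = 0.8) -> list[str]:
    """Return a list of issues; an empty list means no issue was found."""
    issues = []
    # Check that every required field is present and not blank.
    for name in REQUIRED:
        if not extraction.fields.get(name, "").strip():
            issues.append(f"missing {name}")
    # Check that the amount parses as a number (thousands separators removed).
    amount = extraction.fields.get("total_amount", "")
    if amount.strip():
        try:
            float(amount.replace(",", ""))
        except ValueError:
            issues.append("total_amount is not numeric")
    # Flag extractions whose self-assessed quality is below the threshold.
    if extraction.confidence < threshold:
        issues.append("self-assessed confidence below threshold")
    return issues
```

An extraction with a non-empty issue list could, for example, be highlighted for the reviewer.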

5

Human Review

Like all AI models, the Vision API can make mistakes. Therefore, the results should be verified by a human. MIC DOCFLOW enables users to compare the original document with the extracted data elements and make any necessary additions or corrections via a document editor.

6

Integrate with MIC Modules

In the final step, the data is made available in JSON format for further use in a subsequent MIC module.
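
A minimal sketch of what such a JSON hand-off could look like is shown here. The key names (`documentType`, `reviewed`, `fields`) are assumptions made for illustration and do not describe the actual MIC DOCFLOW output format.

```python
import json


def to_json(document_type: str, fields: dict[str, str], reviewed: bool) -> str:
    """Serialize extracted data elements to a JSON string.

    The payload layout is a hypothetical example, not the real output schema.
    """
    payload = {
        "documentType": document_type,
        "reviewed": reviewed,  # whether a human has verified the result
        "fields": fields,
    }
    # ensure_ascii=False keeps non-ASCII characters (e.g. in names) readable.
    return json.dumps(payload, indent=2, sort_keys=True, ensure_ascii=False)
```

A downstream consumer can parse the string with `json.loads` and read the values under `fields`.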
