Mistral AI Document OCR is an optical character recognition (OCR) service that converts PDFs, Office documents (DOCX, PPTX), and images into structured text. It uses Mistral AI's latest OCR model and returns the extracted text as either Markdown or structured JSON.

| Property | Value |
|---|---|
| Service Name | Mistral AI document OCR |
| Status | Enabled |
| Compatible Nodes | Transform Content |

Mistral AI OCR is ideal for:

This service requires:

| Parameter | Type | Default | Options | Description |
|---|---|---|---|---|
| Result format | Choice | Markdown | Markdown, JSON Structured | Output format for the OCR results |
| Select pages | Text | all | Comma-separated page numbers or "all" | Pages to process: a comma-separated list of page numbers (e.g., "1,3,5"), or "all" to process every page |
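
The accepted shapes of the Select pages value can be illustrated with a small validation helper. This is a minimal sketch, not the service's own parser: the function name `parse_page_selection` is hypothetical, and the choices to treat page numbers as 1-based, to ignore surrounding whitespace, and to drop duplicates are assumptions that go beyond what the table above states.

```python
from typing import List, Optional


def parse_page_selection(value: str) -> Optional[List[int]]:
    """Parse a "Select pages" value such as "1,3,5" or "all".

    Returns None when every page is selected, otherwise a sorted list of
    unique page numbers. Hypothetical helper: the real service may apply
    different rules (assumed here: 1-based numbers, whitespace ignored).
    """
    text = value.strip()
    if text.lower() == "all":
        return None  # None stands for "process all pages"

    pages = set()
    for part in text.split(","):
        token = part.strip()
        # Reject empty items ("1,,2"), non-numeric items, and page 0.
        if not token.isdecimal() or int(token) < 1:
            raise ValueError(f"invalid page number: {token!r}")
        pages.add(int(token))
    return sorted(pages)
```

Checking the value before the node runs, for example `parse_page_selection("1,3,5")`, gives an early error for malformed input such as `"1,,2"` instead of a failed OCR request.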
When Result format is set to "JSON Structured":