Release Date: January 2025
Feature Status: Production Ready
Breaking Changes: None
Feature Summary
HyperFlow introduces the LLM PDF Transform service, an intelligent document processing capability that uses multimodal large language models to perform advanced PDF-to-Markdown conversion with exceptional accuracy. The service excels at standard document conversion with built-in instructions while also supporting custom transformations for specialized document processing needs. This combination of ease-of-use and flexibility makes PDFs fully accessible for RAG applications and specialized data extraction workflows.
What's New
HyperFlow PDF Transform Service
- Advanced PDF-to-Markdown conversion: Built-in instructions optimized for most document types with superior accuracy
- Custom transformation capabilities: Per-page instructions enable specialized data extraction and format conversion
- Smart image reference system: Automatic image extraction with intelligent placeholder replacement in Markdown output
- Cross-page continuity: Maintains document context between pages for seamless multi-page analysis
- LLM-recognized page numbers: Intelligent page number detection and insertion by multimodal models
- Mathematical content handling: Automatic LaTeX/Markdown formatting for equations and formulas
Key Capabilities
- Dual transformation modes: Standard Markdown conversion or custom instruction-driven processing
- Multiple LLM service support: Works with both specialized OCR models and general multimodal LLMs
- Advanced image handling: Extraction, storage, and smart reference replacement in output text
- Flexible output organization: Per-page or per-file content structuring with comprehensive metadata
- Intelligent text extraction: PDF text hints provided to LLM for improved character recognition accuracy
Output Options