A streamlined document processing library that leverages AI vision models for OCR and data extraction. Designed to handle complex document layouts including tables, charts, and irregular formatting.
Zerox is available as both a Node.js and Python package.
Try out the hosted version here: https://getomni.ai/ocr-demo.
Here’s the SDK feature support table in markdown format:
Feature | Node.js | Python |
---|---|---|
PDF Processing | ✓ (requires graphicsmagick) | ✓ (requires poppler) |
Image Processing | ✓ | ✓ |
OpenAI Support | ✓ | ✓ |
Azure OpenAI Support | ✓ | ✓ |
AWS Bedrock Support | ✓ | ✓ |
Google Gemini Support | ✓ | ✓ |
Vertex AI Support | ✗ | ✓ |
Data Extraction | ✓ ( schema ) | ✗ |
Per-page Extraction | ✓ ( extractPerPage ) | ✗ |
Custom System Prompts | ✗ | ✓ ( custom_system_prompt ) |
Maintain Format Option | ✓ ( maintainFormat ) | ✓ ( maintain_format ) |
Async API | ✓ | ✓ |
Error Handling Modes | ✓ ( errorMode ) | ✗ |
Concurrent Processing | ✓ ( concurrency ) | ✓ ( concurrency ) |
Temp Directory Management | ✓ ( tempDir ) | ✓ ( temp_dir ) |
Page Selection | ✓ ( pagesToConvertAsImages ) | ✓ ( select_pages ) |
Orientation Correction | ✓ ( correctOrientation ) | ✗ |
Edge Trimming | ✓ ( trimEdges ) | ✗ |
Direct Image Extraction | ✓ ( directImageExtraction ) | ✗ |
A streamlined document processing library that leverages AI vision models for OCR and data extraction. Designed to handle complex document layouts including tables, charts, and irregular formatting.
Zerox is available as both a Node.js and Python package.
Try out the hosted version here: https://getomni.ai/ocr-demo.
Here’s the SDK feature support table in markdown format:
Feature | Node.js | Python |
---|---|---|
PDF Processing | ✓ (requires graphicsmagick) | ✓ (requires poppler) |
Image Processing | ✓ | ✓ |
OpenAI Support | ✓ | ✓ |
Azure OpenAI Support | ✓ | ✓ |
AWS Bedrock Support | ✓ | ✓ |
Google Gemini Support | ✓ | ✓ |
Vertex AI Support | ✗ | ✓ |
Data Extraction | ✓ ( schema ) | ✗ |
Per-page Extraction | ✓ ( extractPerPage ) | ✗ |
Custom System Prompts | ✗ | ✓ ( custom_system_prompt ) |
Maintain Format Option | ✓ ( maintainFormat ) | ✓ ( maintain_format ) |
Async API | ✓ | ✓ |
Error Handling Modes | ✓ ( errorMode ) | ✗ |
Concurrent Processing | ✓ ( concurrency ) | ✓ ( concurrency ) |
Temp Directory Management | ✓ ( tempDir ) | ✓ ( temp_dir ) |
Page Selection | ✓ ( pagesToConvertAsImages ) | ✓ ( select_pages ) |
Orientation Correction | ✓ ( correctOrientation ) | ✗ |
Edge Trimming | ✓ ( trimEdges ) | ✗ |
Direct Image Extraction | ✓ ( directImageExtraction ) | ✗ |