Document Extraction
Accepted File Types
Learn about the different file types that OmniAI can process
File Types
The extract endpoints accept the following document types, supplied as either a file buffer or a URL:
.pdf - Portable Document Format
.doc - Microsoft Word 97-2003
.docx - Microsoft Word 2007-2019
.odt - OpenDocument Text
.ott - OpenDocument Text Template
.rtf - Rich Text Format
.txt - Plain Text
.html - HTML Document
.htm - HTML Document (alternative extension)
.xml - XML Document
.wps - Microsoft Works Word Processor
.wpd - WordPerfect Document
.ods - OpenDocument Spreadsheet
.ots - OpenDocument Spreadsheet Template
.ppt - Microsoft PowerPoint 97-2003
.pptx - Microsoft PowerPoint 2007-2019
.odp - OpenDocument Presentation
.otp - OpenDocument Presentation Template
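
Checking a file's extension on the client before sending it can catch unsupported uploads early. The following is a minimal sketch of such a check, not part of the OmniAI API: the names `ACCEPTED_EXTENSIONS` and `is_accepted_document` are hypothetical, and the code assumes the extension in a filename or URL path reflects the real file type.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extensions copied from the list above.
ACCEPTED_EXTENSIONS = frozenset({
    ".pdf", ".doc", ".docx", ".odt", ".ott", ".rtf", ".txt", ".html", ".htm",
    ".xml", ".wps", ".wpd", ".ods", ".ots", ".ppt", ".pptx", ".odp", ".otp",
})


def is_accepted_document(path_or_url: str) -> bool:
    """Hypothetical helper: True if the extension is in the accepted list.

    Works for plain filenames and for URLs; query strings and fragments
    are ignored because only the URL path component is inspected.
    """
    path = urlparse(path_or_url).path
    # Compare case-insensitively so "REPORT.PDF" matches ".pdf".
    return PurePosixPath(path).suffix.lower() in ACCEPTED_EXTENSIONS


if __name__ == "__main__":
    for candidate in ("report.PDF",
                      "https://example.com/files/deck.pptx?download=1",
                      "photo.png"):
        print(candidate, is_accepted_document(candidate))
```

This check only looks at the name, so a mislabeled file would still pass it; treat it as a convenience filter ahead of the request rather than a guarantee that the endpoint will accept the file.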