Extract
Extract structured data from a document asynchronously
This is an asynchronous API endpoint. The initial request returns a jobId and status. You can use the jobId to check the processing status and fetch results.
Body Parameters
Either file or URL is required but not both. See Accepted File Types.
URL of the document to extract data from
The file to extract data from
JSON schema to define the structure of extracted data. Recommended to use our UI to define the schema. See JSON schema examples.
Whether to exclude OCR result from the response. Defaults to false.
Whether to maintain format from the previous page. Defaults to false.
Array of page numbers to process. Defaults to all pages.
Array of schema properties to extract per page. Defaults to empty array.
Unique identifier for the webhook callback
Custom JSON data to be included in the response
Example JSON Schema
This is a JSON Schema, which defines the structure and validation rules for the JSON. For more examples and details, see JSON Schema Examples.
Response
Unique identifier for the extraction request
Status of the extraction (success, processing, or error)
URL for polling the extraction result