Documents Extract
This endpoint extracts text from an uploaded document, optionally using OCR.
Headers
Authorization
Bearer authentication of the form Bearer <token>, where token is your auth token.
Toxicity
Whether to check the output for toxicity.
Pii
Whether to check the output for PII, and what to do if any is found. Supported values are 'replace' and 'block'.
Replace-Method
Method to replace any found PII. Supported values are 'category', 'fake', 'mask', and 'random'.
Injection
Whether to check the output for a prompt injection.
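As a rough illustration of the headers above, the following sketch assembles them into a dictionary and rejects values outside the documented sets. The helper name build_headers is hypothetical, and encoding the Toxicity and Injection flags as the strings "true" and "false" is an assumption, since the text does not say how boolean headers are written.

```python
from typing import Dict, Optional

# Values listed in the header descriptions above.
PII_ACTIONS = {"replace", "block"}
REPLACE_METHODS = {"category", "fake", "mask", "random"}


def build_headers(
    token: str,
    toxicity: Optional[bool] = None,
    pii: Optional[str] = None,
    replace_method: Optional[str] = None,
    injection: Optional[bool] = None,
) -> Dict[str, str]:
    """Hypothetical helper: build the request headers for the extract call."""
    if not token:
        raise ValueError("an auth token is required")

    headers = {"Authorization": f"Bearer {token}"}

    # Assumption: boolean checks are sent as the strings "true" / "false".
    if toxicity is not None:
        headers["Toxicity"] = "true" if toxicity else "false"
    if injection is not None:
        headers["Injection"] = "true" if injection else "false"

    if pii is not None:
        if pii not in PII_ACTIONS:
            raise ValueError(f"Pii must be one of {sorted(PII_ACTIONS)}")
        headers["Pii"] = pii

    if replace_method is not None:
        if replace_method not in REPLACE_METHODS:
            raise ValueError(
                f"Replace-Method must be one of {sorted(REPLACE_METHODS)}"
            )
        headers["Replace-Method"] = replace_method

    return headers
```

Headers left as None are omitted, so the service's own defaults (whatever they are) apply.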
Request
This endpoint expects a multipart form containing a file.
file
The document file to upload.
embedImages
Whether to embed images from the document.
outputFormat
The output format for the content of the document.
chunkDocument
Whether to separate the document into chunks.
chunkSize
The size of each chunk when chunkDocument is enabled.
enableOCR
Whether to enable OCR for document parsing.
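The request fields above could be packed into a multipart body roughly as follows, using only the standard library. This is a sketch under assumptions: the helper names are hypothetical, the options are assumed to travel as form fields alongside the file (the text does not say whether they are form fields or query parameters), booleans are assumed to be written as "true" and "false", and the file part is given a generic application/octet-stream content type.

```python
import uuid
from typing import Dict, Optional, Tuple


def build_form_fields(
    embed_images: Optional[bool] = None,
    output_format: Optional[str] = None,
    chunk_document: Optional[bool] = None,
    chunk_size: Optional[int] = None,
    enable_ocr: Optional[bool] = None,
) -> Dict[str, str]:
    """Hypothetical helper: map the documented options to string form fields."""
    def as_flag(value: bool) -> str:
        # Assumption: booleans are serialized as "true" / "false".
        return "true" if value else "false"

    fields: Dict[str, str] = {}
    if embed_images is not None:
        fields["embedImages"] = as_flag(embed_images)
    if output_format is not None:
        fields["outputFormat"] = output_format
    if chunk_document is not None:
        fields["chunkDocument"] = as_flag(chunk_document)
    if chunk_size is not None:
        fields["chunkSize"] = str(chunk_size)
    if enable_ocr is not None:
        fields["enableOCR"] = as_flag(enable_ocr)
    return fields


def encode_multipart(
    filename: str,
    file_bytes: bytes,
    fields: Dict[str, str],
    boundary: Optional[str] = None,
) -> Tuple[bytes, str]:
    """Return (body, content_type) for a multipart/form-data request."""
    boundary = boundary or uuid.uuid4().hex
    parts = []

    # One part per plain text field.
    for name, value in fields.items():
        parts.append(
            (
                f"--{boundary}\r\n"
                f'Content-Disposition: form-data; name="{name}"\r\n'
                f"\r\n{value}\r\n"
            ).encode("utf-8")
        )

    # The document itself goes in the part named "file".
    parts.append(
        (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="file"; '
            f'filename="{filename}"\r\n'
            f"Content-Type: application/octet-stream\r\n\r\n"
        ).encode("utf-8")
    )
    parts.append(file_bytes)
    parts.append(f"\r\n--{boundary}--\r\n".encode("utf-8"))

    return b"".join(parts), f"multipart/form-data; boundary={boundary}"
```

The returned content type string belongs in the Content-Type header of the request, next to the Authorization header described earlier.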
Response
Successful response.
title
The parsed document title.
contents
The parsed document contents.
count
The word count for the document.
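A small sketch of reading the response fields above, assuming the body is a JSON object whose keys are exactly title, contents, and count, with count as an integer. The text does not state the response encoding or types, so both are assumptions, and the class and function names are hypothetical.

```python
import json
from dataclasses import dataclass


@dataclass
class ExtractedDocument:
    title: str      # the parsed document title
    contents: str   # the parsed document contents
    count: int      # the word count for the document


def parse_extract_response(raw: str) -> ExtractedDocument:
    """Hypothetical helper: decode a response body into ExtractedDocument."""
    data = json.loads(raw)

    # Fail early if any documented field is absent.
    missing = [key for key in ("title", "contents", "count") if key not in data]
    if missing:
        raise ValueError(f"response is missing fields: {missing}")

    return ExtractedDocument(
        title=str(data["title"]),
        contents=str(data["contents"]),
        count=int(data["count"]),
    )
```

For example, a made-up body such as {"title": "Report", "contents": "Hello world", "count": 2} would yield a document whose count is 2.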