Document Extraction
- Updated on 27 Feb 2025
The Agentic Document Extraction API extracts structured information from documents with varying layouts. It returns the extracted data in a structured, hierarchical format that contains text, tables, pictures, charts, and other information.
You can send documents to the Agentic Document Extraction API through the web-based app. You can also send documents programmatically using the Python, JavaScript, and cURL commands available in the app.
The Agentic Document Extraction API is built on VisionAgent, the agentic framework from LandingAI.
File Support
The Agentic Document Extraction tool supports these file types and limits:
- PNG, JPEG, PDF (up to 5 pages)
- Max size: 50MB per file
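The limits above can be checked locally before a file is uploaded. The following is a minimal sketch, not part of the product: the function name `check_file` is hypothetical, the extension list assumes that JPEG files may end in `.jpg` or `.jpeg`, and the size constant assumes the stricter reading of "50MB" (50,000,000 bytes), since the exact definition is not stated. It does not check the 5-page PDF limit, which would require a PDF library.

```python
from pathlib import Path

# Limits taken from the File Support list; the interpretation is an assumption.
ALLOWED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".pdf"}
MAX_FILE_BYTES = 50 * 1000 * 1000  # assumed: "50MB" read as 50,000,000 bytes


def check_file(path):
    """Return a list of problems found; an empty list means these checks passed."""
    p = Path(path)
    problems = []
    # Compare the extension case-insensitively so "scan.PNG" is accepted.
    if p.suffix.lower() not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported extension: {p.suffix or '(none)'}")
    if not p.is_file():
        problems.append("file not found")
    elif p.stat().st_size > MAX_FILE_BYTES:
        problems.append(f"file is {p.stat().st_size} bytes, over the 50MB limit")
    return problems
```

Calling `check_file("invoice.pdf")` on a small existing PDF returns an empty list, while a `.txt` file or a missing path returns a description of each problem.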
Rate Limits
The free plan includes a rate limit. For higher usage, additional features, and custom solutions, please schedule a call to discuss our enterprise plans.
If you exceed the rate limit, the API returns a 429 (Too Many Requests) error response.
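One common way for a client to handle a 429 response is to wait and retry with an increasing delay. The sketch below illustrates that pattern only; it is not taken from the API documentation. The helper name `call_with_backoff`, the attempt count, and the delays are all assumptions, and `send` stands for any function of your own that performs the request and returns a status code and body.

```python
import time


def call_with_backoff(send, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call send() until it returns a status other than 429 or attempts run out.

    send is any zero-argument callable that returns (status_code, body).
    """
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        # Wait before the next attempt, doubling the delay each time.
        # No wait after the final attempt.
        if attempt < max_attempts - 1:
            sleep(base_delay * 2 ** attempt)
    # Every attempt was rate limited; hand the last response back to the caller.
    return status, body
```

Passing `sleep` as a parameter keeps the helper easy to test without real waiting. The right delays depend on your plan's rate limit, which is not specified here.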
Send Files in the Document Extraction App
To send files in the web-based Agentic Document Extraction app:
- Go to the Agentic Document Extraction app.
- Upload a file or click one of the Examples.
Upload or Select a File
- The Agentic Document Extraction app processes the file. This might take a few moments.
- After the app processes the file, it displays the results:
  - A bounding box displays around each element on the document preview.
  - The API response displays in the right-hand panel. You can toggle between the JSON and Markdown output.
Extracted Data
- Use the Chat with Document tool in the right-hand panel to interact with the document.
- Happy with the output? Try sending the files programmatically or build an app on top of the API! Share your projects in our VisionAgent Discord.
Chat with Document
After the Agentic Document Extraction app processes the file, you can use the Chat with Document tool in the right-hand panel to interact with the document.
The Chat with Document tool is an LLM layered on top of the Agentic Document Extraction API. The chat tool showcases how the API accurately extracts and understands document data, including element locations. Use the chat tool for ideas on how to build custom solutions on top of the API.
The Chat with Document tool suggests a few prompts based on your document. You can also enter your own prompts.
Send Files Programmatically
You can programmatically send documents to the Agentic Document Extraction API using Python, JavaScript, and cURL commands in the app.
All scripts require a VisionAgent API key. For more information about getting your VisionAgent API key, go to VisionAgent API Key.
You can copy the scripts in the following sections, or click the Extraction API button in the Agentic Document Extraction app.
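For orientation, the general shape of a file upload over HTTP can be sketched with the Python standard library. This is an illustration under stated assumptions, not the script from the app: the endpoint URL is a placeholder, and the form field name (`file`) and the `Bearer` authorization scheme are guesses. Copy the real values from the Extraction API button in the app.

```python
import mimetypes
import uuid
import urllib.request
from pathlib import Path

# Placeholder only: replace with the endpoint shown in the app.
API_URL = "https://example.invalid/document-extraction"


def build_request(path, api_key, url=API_URL, field="file"):
    """Build (but do not send) a multipart/form-data POST carrying one file."""
    p = Path(path)
    boundary = uuid.uuid4().hex  # random separator between form parts
    content_type = mimetypes.guess_type(p.name)[0] or "application/octet-stream"
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{p.name}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    body = head + p.read_bytes() + tail
    headers = {
        # Assumed scheme: check the app's script for the exact header format.
        "Authorization": f"Bearer {api_key}",
        "Content-Type": f"multipart/form-data; boundary={boundary}",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")
```

To send the request, pass the result to `urllib.request.urlopen` once the placeholder URL, field name, and header format have been replaced with the values from the app. Keeping the VisionAgent API key in an environment variable rather than in the script avoids committing it to source control.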
Troubleshooting
If you receive any of the following errors when you try to upload a file, there might be latency or availability issues. Please wait a few minutes and try uploading the file again.
- LLM provider error
- Timeout error
- Availability error