Document Extraction
  • 27 Feb 2025
  • 2 Minutes to read
  • Dark
    Light
  • PDF

Document Extraction

  • Dark
    Light
  • PDF

Article summary

The Agentic Document Extraction API is a tool that can extract structured information out of documents with different layouts. It returns the extracted data in a structured hierarchical format containing text, tables, pictures, charts, and other information.

You can send documents to the Agentic Document Extraction API through our web-based app here. You can also programmatically send documents using the Python, JavaScript, and cURL commands available in the app.

The Agentic Document Extraction API is built on VisionAgent, the agentic framework from LandingAI

Document Extraction

File Support

The Agentic Document Extraction tool supports these files:

  • PNG, JPEG, PDF (up to 5 pages)
  • Max size: 50MB per file 

Rate Limits

The free plan includes a rate limit. For higher usage, additional features, and custom solutions, please schedule a call to discuss our enterprise plans. 

If you exceed the rate limit, you will get a 429 (Too Many Requests) error response.

Send Files in the Document Extraction App

To send files in the web-based Agentic Document Extraction app:

  1. Go to the Agentic Document Extraction app.
  2. Upload a file or click one of the Examples.
    Upload or Select a File
  3. The Agentic Document Extraction app processes the file. This might take a few moments.
  4. After the app processes the file, it displays the results:
    • A bounding box displays around each element on the document preview.
    • The API response displays in the right-hand panel. You can toggle between the JSON and Markdown output.
      Extracted Data
  5. Use the Chat with Document tool in the right-hand panel to interact with the document.
  6. Happy with the output? Try sending the files programmatically or build an app on top of the API! Share your projects in our VisionAgent Discord.

Chat with Document

After the Agentic Document Extraction app processes the file, you can use the Chat with Document tool in the right-hand panel to interact with the document.

The Chat with Document tool is an LLM layered on top of the Agentic Document Extraction API. The chat tool showcases how the API accurately extracts and understands document data, including element locations. Use the chat tool to get inspired for how you can build custom solutions on top of the API.

The Chat with Document tool suggests a few prompts based on your document. You can also enter your own prompts.

Chat with Document

Send Files Programmatically

You can programmatically send documents to the Agentic Document Extraction API using Python, JavaScript, and cURL commands in the app. 

All scripts require a VisionAgent API key. For more information about getting your VisionAgent API key, go to VisionAgent API Key.

You can copy the scripts in the following sections, or click the Extraction API button in the Agentic Document Extraction app.

Get the API for Document Extraction

Troubleshooting

If you receive any of the following errors when you try to upload a file, there might be latency or availability issues. Pleaes wait a few minutes and try uploading the file again.

  • LLM provider error
  • Timeout error
  • Availability error

Was this article helpful?