This Utility API focuses on AI-based document data extraction for large text-heavy documents in multiple file formats and global languages.
This API can extract print and handwritten text from PDF documents, scanned images, and various document formats, including Microsoft Word, Excel, PowerPoint, and HTML. This API includes features like higher-resolution scanning of document images for better handling of smaller and dense text; paragraph detection; and fillable form management, making it a valuable asset for document management and data extraction applications.
The atoms cost is subjected to change depending on the size of the input file and the provider selected. The list of providers and the atoms cost for each provider is given below:
Provider (requested_service) | Atoms |
---|
Azure | 500 |
ApyHub | 2000 |
Note: In order to test the API on API Playground, just click on "Show optional inputs" and enter the Authentication token for the provider before clicking on Send request. The output response structure and the result of the AI utility APIs depend on the service provider and it may vary depending on which service provider is selected.