How to Convert PDF to Database Records (MySQL, PostGres, MongoDB, …)

You have business documents you get in pdf format: invoices, work orders, purchase orders and others. Sometimes data is in the pdf as a table or documents were scanned into a pdf. They hold data you need to process in your ERP or other database-driven information system. Unfortunately, PDF documents do not come with an easy ‘PDF to database’ function which can be used to get hold of your data. Continue reading “How to Convert PDF to Database Records (MySQL, PostGres, MongoDB, …)”

Convert PDF to CSV – Extract Tables and Text Data From PDF

Whether you have 100 or 10,000 PDF files you want to extract text from, Docparser’s smart filters, and fixed & variable location extraction options will get your there. PDF’s are a very popular file format, BUT, that doesn’t mean that converting PDF to CSV and extracting text data from PDF files has always been a clear and easy mission. Until now. Continue reading “Convert PDF to CSV – Extract Tables and Text Data From PDF”

Build Your Own Automated Purchase Order System With Docparser

Does your business receive tons of purchase orders and sales orders by e-mail, fax, telephone or snail mail? No matter if you are dealing with just a handful of orders per week or hundreds each months. Handling those orders manually is time consuming and error prone to say the least. Even worse, purchase orders contain crucial data for your business which you just can’t miss. Continue reading “Build Your Own Automated Purchase Order System With Docparser”

Email PDF’s to our Parsing Engine

Are you receiving PDF Documents containing important data by email? Good news! With Docparser, it’s easy to extract data from PDF email attachments. If you have recurring PDFs, with the same physical layout, you can simply email them to the Docparser app and get structured data back in return.

Once you have created, and tested your PDF layout parser, you can upload additional PDFs with our email option. Simply select the layout parser you would like to send attachments to, select “Settings” from the navigation and you will see your layout parser “inbox”. Continue reading “Email PDF’s to our Parsing Engine”

OCR PDF Scanner

Optical Character Recognition (OCR) is a technology that allows you to extract data from scanned documents. Text which you can then edit, update, or aggregate with other tools for data analysis and a range of other uses.

Optical Character Recognition (OCR), is essentially the conversion of scanned images with text, be it typed, in print, or written by hand, into … well … text. Typically you see OCR used in extracting text information from photos, passports, and scanned documents. OCR is often used for “digitizing” recognized text, so it can be utilized later, edited, searched, aggregated for analysis, etc. Continue reading “OCR PDF Scanner”

Convert PDF to JSON – Turn PDF Documents Into Structured JSON Data Objects

Without a doubt, PDF became the de-facto exchange format for business documents. But PDF is “only” a replacement for paper and businesses around the globe have a hard time accessing important data which is trapped inside their PDF documents. On the other hand, JSON became probably the most popular data exchange format when it comes to syncing data between two web applications.

That being said, wouldn’t it be great to be able to automatically convert PDF documents into JSON data objects? What if it would actually be possible to leverage data which is trapped inside PDF documents to automate business processes?

This post will show you how you can do exactly that with Docparser. Docparser allows you to convert PDF to JSON data which can then be used to automate your document based workflows. Continue reading “Convert PDF to JSON – Turn PDF Documents Into Structured JSON Data Objects”