How To Extract Data From PDF: Converting Unstructured PDFs to Structured Data
The PDF is here to stay. In today’s work environment, the PDF has become ubiquitous as a digital replacement for paper and holds a variety of important business data. But what are the options if you want to extract data from PDF documents? Manually rekeying PDF data is often the first reflex, but fails most of […]
What Is a PDF Parser? How to Extract Data From PDFs
A PDF Parser (sometimes also called a PDF scraper) is software that can be used to extract data from PDF documents. PDF Parsers come in the form of libraries for developers or as standalone software products for end-users. PDF Parsers are used mainly to extract data from a batch of PDF files. Manual data entry […]
An Introduction to UBL: Automate Procurement with Standardized Data Exchange
In this article we explain what UBL (Universal Business Language) stands for, where it comes from and how it helps businesses around the globe to automate procurement.
Automatically Tag Files and Folders in Box
Tagging files and folders in Box is a great way of getting organized. But tagging documents in Box manually can be cumbersome and time-consuming. So what if there was a way to automatically tag documents in Box? In this article we will shed some light on existing auto-tagging options and talk about what we are […]
Invoice Scanning Software – Is Automated Invoice Scanning a Viable Solution?
In this article, we discuss how and when invoice capture software is a viable solution and can be used to eliminate manual data entry. We discuss in detail how invoice scanning software works in general and what methods lead to accurate data.
Post File From Salesforce Apex to External HTTP Webservices
This article will show you how to send files from Salesforce to an external webservice using the Apex HttpRequest Class. You’ll learn how to make a ‘multipart/form-data’ HTTP request which includes your file as an attachment.
Offering ERP Clients EDI Alternatives
For entities looking to minimize paper transactions, Docparser offers a seamless PDF data extraction and interchange solution. In as little as 15 minutes you can be up and running, watching your layout parser extract data from PDF documents. Clients often have a combination of Electronic Data Interchange (EDI) and PDF data transactions, and are looking […]
What is EDI?
Electronic Data Interchange (EDI) is a collection of standards that describe how two computers can communicate in a “strictly formatted”, standardized way. This form of communication has been in use for over 30 years and is a common method for governments and businesses to move information from one point to another.
PDF Metadata – Overview
PDF metadata, or “data about data”, provides additional information about a PDF file. Potential metadata could be the author, the date of creation, the application that was used to create the file, and more. This information is added to the file when it is created, or can be added along the way. Additionally, the metadata can […]
PDF to XML: How to Convert PDF to XML for Free
This post covers how to use Docparser for PDF to XML conversion. You’ll learn why converting PDF to XML is usually a challenging task and how easy it is to convert PDF to XML with Docparser. If you’re in business, there’s a good chance you deal with PDFs regularly. But what if you need to convert PDF […]