3.2.1. Extracting Data from PDF Files to CSV/XLS
In this sample video we will show you how to configure ChronoScan to dismount PDF files to a compressible data file (CSV/XLS).
This is our sample document. A PDF Text file with multiple pages. We want to extract some fields and all the detail lines.
For every processed document we want to get a .csv file named with the Inv.No:
106673N.csv
And with the detail lines:
"Invoice Date","Line No","Location","Part Number","Description","Description_PartNumber","Ordered","B.O.","Supplied","Unit List","Unit Net","GST Code","Total"
"23/10/2013","1","SS03EA","CP-TLB-35400G","SNORKEL CAP 1HZ L/C DSL 1/92-1/ 07 PLAST","1773066010","1","","1","191.81","67.13","10.00%","67.13"
"23/10/2013","2","RD09AA","CP-TLG-80010RHG","RH F/D VDJ79 L/C UTE 1/07- ST D","67001-60650","1","","1","1859.11","743.64","10.00%","743.64"
"23/10/2013","3","RD01AA","CP-TLG-80010LHG","LH F/D VDJ79 L/C UTE 1/07- ST D","67002-60650","1","","1","1859.11","929.55","10.00%","929.55"
"23/10/2013","4","RB05AA","CP-TLG-80011LHG","LH F/D VDJ79 L/C UTE 1/07- GX L","6700260660","1","","1","1859.11","743.64","10.00%","743.64"
"23/10/2013","5","RC12AA","CP-TLG-80011RHG","RH F/D VDJ79 L/C UTE 1/07- GX L","6700160660","1","","1","1859.11","929.55","10.00%","929.55"
........
Inv No
We are going to use Inv No. to generate the output .csv file name.
|
|
Date
We want the invoice date on all the extracted lines.
|
|
Grid Module
You will learn how to configure the grid module for details line capture on multiple pages.
|
|
Custom column processing
We will create a custom column to extract data from a multi-line column.
|
|
Split page rows
You will see how split rows are managed on ChronoScan.
|
|
Video Tutorial:
------------------------