×
Menu
Index

3.4.11.1. Advanced OCR Zone Reading

3.4.11.1. Advanced OCR Zone Reading
 
When using OCR Zones the user can set a series of advanced filters and capture options to enhance the OCR Engine capture. It is possible to capture intelli-tags, capture specific lines, run VBScripts, apply quick and advanced image filters and much more.
 
1

Local Charset

1. Local Charset
 
Using the Local charset option the user can restrict the characters used by the OCR Engine to perform the capture. If only numbers are used only numbers will be searched for by the OCR Engine. Very useful when common OCR problems such as "O" being detected as "0" and "1" as "I" or the other way around happen. The user can choose from the dropdown menu or manually type in all the desired characters.
 
Local charset options for the dropdown menu. The user can also manually type in the desired characters.
 
2

Intelli-Search

2. Intelli-Search
 
The intelli-search function will allow the user to tweak the OCR Engine algorithm to look for a specific type of data that fits one of the predefined filters. Using the intelli-search function when the data captured fits one of the predefined filters can help reduce OCR errors.
 
Available intelli-search filters. The default one is "Any text".
 
3

Search Mask

3. Search Mask
 
Search masks will create a template for the data being captured. Using special characters such as forward slash in combination with "A" or "D" (refer to the table below) will create a template over which the data being captured will be applied. Characters like "/" and "ยบ" will be added to the resulting value when used on the mask. The search mask function is very useful when the data being captured must be modified by addition of special characters like "/", "\" or "-" for dates or "." and "," for amounts.
The user can create custom search masks manually or use the ones available from the drop down menu. 
 
A
Alphanumeric character
D
Digit character
N
Number character
 
4

Regular Expression Search

4. Regular Expression Search
 
Using the regular expression tool it is possible to search within the captured data the desired value. This function will work great when the OCR reading is good and the user is looking for a very specific bit of data that is always formatted in a specific way.
The user can create custom regular expression or use the ones available from the drop down menu.
 
5

Lines to Capture

5. Lines to Capture
 
On this drop down menu the user can select the line or lines desired for the data capture process. To take advantage from this option Allow Carriage Return must be selected.
 
6

Strings to Remove

6. Strings to Remove
 
Strings can be removed from the captured data. Several strings can be added using "|" (pipe character) as separator. Using the "^" character will perform the Remove Strings function before the Regular Expression Search function if the latter is in use.
 
7

Input Date Format

7. Input Date Format
 
Having several different date formats it is possible for the user to override the default date type used by ChronoScan. By default ChronoScan will use the exact same date date format as the system date format when looking for dates. Whenever the date format on the document does not match the system format the user should change the Date Format setting to the one that fits the date format on the document. After the date capture was performed the result on the data field will be converted to the current system date format unless the user specifies a different setting.
 
8

Negative Numbers Format

8. Negative Numbers Format
 
By default ChronoScan will set numbers with a "-" sign in front of them as negative numbers. Using the Negative Numbers Format setting the user can choose the right format for negative numbers for the current document type and OCR Zone. Very useful when non-standard negative number format is used.
 
Currently available negative numbers format options for ChronoScan.
 
9

Allow Spaces/Carriage Return

9. Allow Spaces/Carriage Return
 
By default ChronoScan will identify spaces between characters and carriage return (paragraph) information. Should any of those bits of information not be necessary those options can be disabled for a more accurate and easier to process OCR result.
 
10

OCR Engine Selection

10. OCR Engine Selection
 
On the OCR Engine Selection box the user can quickly switch between OCR Engines for that specific OCR Zone while checking the results for each one. On the OCR Read column it is possible to read what the currently selected OCR Engine is reading from the documents and the Value column will display the final value that will be assigned to the field after all filters, validation rules and search masks have been applied.
 
11

Auto Adjust

11. Auto Adjust
 
The auto adjust filter will automatically crop the OCR Zone area trying to fit it to existing data. It can help fix problems when there is the need for a big OCR Zone for data that will vary in length.
 
12

Background Filter

12. Background Filter
 
The Background filter will try to remove background images and colors. Very useful when trying to handle data with stamps or colors in the background. The user can choose from two different algorithms, GAUSSIAN and MEAN.
 
13

Noise Filter

13. Noise Filter
 
This option will apply a quick noise filter to the image. It can improve results with images with a lot of background noise.
 
14

Erode Filter

14. Erode Filter
 
The erode filter will detect lines on the image and erode them, making the lines thinner.
 
15

Dilate Filter

15. Dilate Filter
 
The dilate filter will detect lines on the image and dilate them, making the lines thicker.
 
16

Advanced Filters

16. Advanced Filters
 
When using OCR Zones the user can enable Advanced Filters that can solve OCR errors to some degree. Learn more about advanced filters here. The Advanced Filters on the OCR Zones will be applied on a OCR Zone level. To apply advanced filter on a global level use the Image Processing Options on the Document Toolbar on the Scan/Input Tab.
 
17

Advanced Data Extraction

17. Advanced Data Extraction
 
Advanced data extraction can be applied to the data captured by OCR Zones. The advanced extraction options can be very useful when there is a big block of data and only a few lines or specific pieces of data should be captured. A single big OCR Zone can be created instead of several small ones.
 
18

Preprocessed Image

18. Preprocessed Image
 
OCR Zones can be setup to bypass the global image processing set on the Image Processing Window. By default the OCR Zone will use the image processed at global level. To change that behavior disable the "Use preprocessed image" checkbox.
 
19

Refresh Image/Reading

19. Refresh Image/Reading
 
When making changes to the processing options use the Refresh button to see the result of the new processing settings.