Poor OCR Results in HOCR.xml

Issue

OCR results may be incorrect or missing data for non-EText operations.

Root Cause

This issue may occur because of an incorrect entry in the associated RSP file for the batch class. The default occurrence of this issue is in the FPR.rsp file, but this file name may vary depending on your configuration.

You may find FindTextBlocks=”true” in the associated RSP file, which may be the cause of the loss of quality. Try configuring FindTextBlocks=”false” to understand if that is the root cause of the issue. 

Solution

Test if manually setting FindTextBlocks=“false” in the associated RSP file for the batch class resolves the issue.

  1. Open the FPR.rsp file, located at [Ephesoft_Directory]\SharedFolders\<your batch class>\fixed-form-extraction.
  2. Locate the following XML tag:
<LayoutOperator FindTextBlocks=”true” Name=”LayoutOperator”/>
  1. Set “FindTextBlocks” as “false”.
<LayoutOperator FindTextBlocks=”false” Name=”LayoutOperator”/>
  1. Save and close the file.