KB00010300: EText Limitations

Issue

Some PDFs cannot be extracted using EText mode. When EText is enabled in the batch class, Ephesoft Transact uses Recostar’s EText .rsp file to OCR any documents for extraction. EText mode will fail if the headers and labels inside the PDF are not editable or cannot be copied to notepad.

Solution

Note: These changes are made at the batch class level. You will need to repeat the following steps for each batch class with which you want to use EText mode.

  1. Go to [Ephesoft_Directory]\SharedFolders\[your_batch_class]\fixed-form-extraction.
  2. Open the FPR_EText.rsp file.
  3. Find the XML tag that ends with </ImageSequence2Operator>.
  4. Add the following after the XML tag:
<LayoutOperator FindTextBlocks=”true” Name=”LayoutOperator”/>
  1. Change the ProcessingMode option from ExtractfromEText to VoteOCRAndEText.

Figure 1. Processing Mode

  1. Save and close the file.