Overview

Prior to the 3030 release, the HTML TO XML generation plugin created the HOCR XML file from the HTML file produced by the RECOSTAR_HOCR/TESSERACT_HOCR plugin, and the HOCR files were generated through a thread pool executor. As of 3030, the RECOSTAR_HOCR/TESSERACT_HOCR plugins directly generate the HOCR XML file corresponding to each image file, so this plugin is now obsolete.

Other plugins read the image data from this HOCR XML file.

Configuration

  • Property File: {Ephesoft-install-dir}/WEB-INF/classes/META-INF/dcma-core/dcma-core.properties/*
  • Property: thread.pool_size=5

 

[table caption="" width="800" colwidth="100|100|200|200" colalign="center|center|center|left"]
Configurable property,Type of value,Value options,Description
thread.pool_size,String,Positive integer value,Size of the thread pool. The value is stored as a string and governs how many files are processed simultaneously.

[/table]
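
To illustrate how a property such as thread.pool_size can bound concurrency, the following is a minimal Java sketch, not the plugin's actual implementation. The class name ThreadPoolSizeSketch, the helper methods readPoolSize and processAll, the fallback value of 5 for missing or invalid input, and the stand-in conversion task are all assumptions made for illustration; only the property name and its positive-integer constraint come from the table above.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Properties;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ThreadPoolSizeSketch {

    // Assumption: fall back to 5 (the value shown in this document) when the
    // property is missing or is not a positive integer.
    static final int FALLBACK_POOL_SIZE = 5;

    // Reads thread.pool_size, which is stored as a string, and parses it.
    static int readPoolSize(Properties props) {
        String raw = props.getProperty("thread.pool_size");
        if (raw == null) {
            return FALLBACK_POOL_SIZE;
        }
        try {
            int size = Integer.parseInt(raw.trim());
            return size > 0 ? size : FALLBACK_POOL_SIZE;
        } catch (NumberFormatException e) {
            return FALLBACK_POOL_SIZE;
        }
    }

    // Submits one task per file to a fixed-size pool, so at most poolSize
    // files are processed at the same time. The task is a hypothetical
    // stand-in: it only derives an output file name.
    static List<String> processAll(List<String> fileNames, int poolSize) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(poolSize);
        try {
            List<Future<String>> futures = new ArrayList<>();
            for (String name : fileNames) {
                futures.add(pool.submit(() -> name + ".xml"));
            }
            List<String> results = new ArrayList<>();
            for (Future<String> future : futures) {
                results.add(future.get()); // waits for each task, in submission order
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        // In a real installation the value would come from dcma-core.properties;
        // here it is loaded from an in-memory string to keep the sketch self-contained.
        Properties props = new Properties();
        props.load(new StringReader("thread.pool_size=5\n"));

        int poolSize = readPoolSize(props);
        List<String> outputs = processAll(Arrays.asList("page1.html", "page2.html"), poolSize);
        System.out.println("pool size " + poolSize + ", outputs " + outputs);
    }
}
```

A fixed-size pool is used here because it maps directly onto "how many files will be processed simultaneously": any files beyond the pool size wait in the executor's queue until a thread becomes free.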

 

Dependencies

One of the following two plugins must be ON to generate HOCR XML files:

  • RECOSTAR_HOCR
  • TESSERACT_HOCR