{"id":31870,"date":"2018-03-22T08:32:08","date_gmt":"2018-03-22T16:32:08","guid":{"rendered":"https:\/\/ephesoft.com\/docs\/2019-1-2\/moduleplugin-configuration\/page-process-module\/recostar-hocr-plugin\/ocr-languages-selection-from-the-ui\/"},"modified":"2022-03-16T08:20:02","modified_gmt":"2022-03-16T15:20:02","slug":"ocr-languages-selection-from-the-ui","status":"publish","type":"docs","link":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/recostar-hocr-plugin\/ocr-languages-selection-from-the-ui\/","title":{"rendered":"OCR Languages Selection from the UI"},"content":{"rendered":"

Ephesoft Transact supports the OCR engines Recostar (for Windows systems) and Nuance (for Linux systems). Transact also offers the option for Tesseract. The user can select any one of them depending on preference and system requirements.<\/p>\n

In Transact versions prior to Release 4.5.x.x or 2019.1, to define OCR languages for Recostar\/Nuance, the user had to find the required backend folders on the server and edit the OCR input file manually. Tessaract OCR language could be specified from the UI \u2013 for that the language name had to be manually typed in the corresponding field.<\/p>\n

Starting from Ephesoft Transact 4.5.0.0 and continuing with 2019.x releases, a new multi-select-suggestion widget<\/strong> has been added to the Plugin Configuration<\/strong> screen for all three OCR engines under the Page Process<\/strong> module. Using this widget, the user can select the language(s) and update the OCR engine input file automatically from the UI rather than having to make this change manually.<\/p>\n

The name of the new Plugin Configuration field for Nuance and Tesseract OCR engines is OCR Language<\/strong>. The Recostar OCR engine, on the other hand, takes only the country name as the language input; therefore, to make it compatible with other definitions, the same field in the Recostar HOCR plugin is called OCR Country\/Language<\/strong>.<\/p>\n

NUANCE_HOCR Plugin Configuration screen<\/strong><\/p>\n

\"C:UsersEphesoftAppDataLocalMicrosoftWindowsINetCacheContent.Word2.png\"<\/p>\n

RECOSTAR_HOCR Plugin Configuration screen<\/strong><\/p>\n

\"C:UsersEphesoftAppDataLocalMicrosoftWindowsINetCacheContent.Word3.png\"<\/p>\n

TESSERACT_HOCR Plugin Configuration screen<\/strong><\/p>\n

\"C:UsersEphesoftAppDataLocalMicrosoftWindowsINetCacheContent.Word1.png\"<\/p>\n

When you select or type the language name, the widget will help you by giving suggestions. The complete suggestion list will be opened by the suggestion token, which is a semi-colon (;) or by clicking in the field with predictive typing if no language is selected. The suggestion token will automatically list languages based on the user\u2019s input.<\/p>\n

\"C:UsersEphesoftAppDataLocalMicrosoftWindowsINetCacheContent.Word4-1.png\"<\/p>\n

When you start typing the first letters of the required language name, the widget will suggest languages according to the letters already entered.<\/p>\n

\"C:UsersEphesoftAppDataLocalMicrosoftWindowsINetCacheContent.Word5.png\"<\/p>\n

The multi-select-suggestion widget<\/strong> has several icons associated with it:<\/p>\n

– Help icon is used to provide suggestions (for example, it will remind you to use suggestion token to view the language suggestions list (;).<\/p>\n

– Error icon is used to indicate that you have provided\/selected wrong input (for example, if you leave the field empty or enter invalid value).<\/p>\n

Note<\/em><\/strong>: The error icon will also be shown if you select\/use a non-licensed language for Nuance (Arabic and Asian (Chinese_Simplified, Chinese_Traditional, Japanese, Korean) languages) or Recostar (Chinese, Japanese, Korean, Thai).<\/em><\/p>\n

– Warning icon is used to warn and provide information (for example, it will remind that Tessaract Test-Data folder should contain Test Data for selected Tesseract languages).<\/p>\n

Notes<\/em><\/strong>: <\/em><\/p>\n