{"id":47334,"date":"2020-11-03T16:20:27","date_gmt":"2020-11-03T23:20:27","guid":{"rendered":"https:\/\/ephesoft.com\/docs\/?p=47334"},"modified":"2021-01-25T12:17:18","modified_gmt":"2021-01-25T19:17:18","slug":"poor-ocr-results-in-hocr-xml","status":"publish","type":"post","link":"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/","title":{"rendered":"Poor OCR Results in HOCR.xml"},"content":{"rendered":"

Applies to:\u00a0<\/strong>Ephesoft Transact 2020.1 to 2020.1.03<\/span>
\nPlanned Resolution:\u00a0<\/strong>2020.1.04<\/span><\/p>\n

Issue<\/b><\/span><\/h2>\n

OCR results may be incorrect or missing data for non-EText operations.<\/span><\/p>\n

Root Cause<\/b><\/span><\/h2>\n

This issue may occur because of an incorrect entry in the associated RSP<\/span> file for the batch class. The default occurrence of this issue is in the <\/span>FPR.rsp <\/b>file, but this file name may vary depending on your configuration.<\/span><\/span><\/p>\n

You may find <\/span>FindTextBlocks=\u201dtrue\u201d<\/code><\/span> in the associated RSP file, which may be the cause of the loss of quality. Try configuring <\/span>FindTextBlocks=\u201dfalse\u201d<\/code><\/span> to understand if that is the root cause of the issue.\u00a0<\/span><\/span><\/p>\n

Solution<\/b><\/span><\/h2>\n

Test if manually setting <\/span>FindTextBlocks=\u201cfalse\u201d<\/span> in the associated RSP<\/span> file for the batch class resolves the issue.<\/span><\/span><\/p>\n

    \n
  1. Open the <\/span>FPR.rsp <\/b>file, located at <\/span>[Ephesoft_Directory]<\/span><\/i>\\SharedFolders\\<your batch class><\/strong>\\fixed-form-extraction.<\/span><\/span><\/li>\n
  2. Locate the following XML tag:<\/span><\/li>\n<\/ol>\n
    <LayoutOperator FindTextBlocks=\u201dtrue\u201d Name=\u201dLayoutOperator\u201d\/><\/span><\/pre>\n
      \n
    1. Set \u201cFindTextBlocks\u201d as \u201cfalse\u201d.<\/span><\/li>\n<\/ol>\n
      <LayoutOperator FindTextBlocks=\u201d<\/span>false<\/b>\u201d Name=\u201dLayoutOperator\u201d\/><\/span><\/span><\/pre>\n
        \n
      1. Save and close the file.<\/span><\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"

        Applies to:\u00a0Ephesoft Transact 2020.1 to 2020.1.03 Planned Resolution:\u00a02020.1.04 Issue OCR results may be incorrect or missing data for non-EText operations. […]<\/p>\n","protected":false},"author":62,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[12361],"tags":[1430],"yoast_head":"\nPoor OCR Results in HOCR.xml | Ephesoft Docs<\/title>\n<meta name=\"robots\" content=\"noindex, follow\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Poor OCR Results in HOCR.xml\" \/>\n<meta property=\"og:description\" content=\"Applies to:\u00a0Ephesoft Transact 2020.1 to 2020.1.03 Planned Resolution:\u00a02020.1.04 Issue OCR results may be incorrect or missing data for non-EText operations. […]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/\" \/>\n<meta property=\"og:site_name\" content=\"Ephesoft Docs\" \/>\n<meta property=\"article:published_time\" content=\"2020-11-03T23:20:27+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-01-25T19:17:18+00:00\" \/>\n<meta name=\"author\" content=\"Breanna Fitzgerald\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Breanna Fitzgerald\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/\",\"url\":\"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/\",\"name\":\"Poor OCR Results in HOCR.xml | Ephesoft Docs\",\"isPartOf\":{\"@id\":\"https:\/\/ephesoft.com\/docs\/#website\"},\"datePublished\":\"2020-11-03T23:20:27+00:00\",\"dateModified\":\"2021-01-25T19:17:18+00:00\",\"author\":{\"@id\":\"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/d74c698404588430489bf05dfdf4bedd\"},\"breadcrumb\":{\"@id\":\"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ephesoft.com\/docs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Poor OCR Results in HOCR.xml\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ephesoft.com\/docs\/#website\",\"url\":\"https:\/\/ephesoft.com\/docs\/\",\"name\":\"Ephesoft Docs\",\"description\":\"Intelligent Document Processing Made Easy\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ephesoft.com\/docs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/d74c698404588430489bf05dfdf4bedd\",\"name\":\"Breanna Fitzgerald\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e0624b0af4f5f3caa370053f6eef54c8?s=96&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e0624b0af4f5f3caa370053f6eef54c8?s=96&r=g\",\"caption\":\"Breanna Fitzgerald\"}}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Poor OCR Results in HOCR.xml | Ephesoft Docs","robots":{"index":"noindex","follow":"follow"},"og_locale":"en_US","og_type":"article","og_title":"Poor OCR Results in HOCR.xml","og_description":"Applies to:\u00a0Ephesoft Transact 2020.1 to 2020.1.03 Planned Resolution:\u00a02020.1.04 Issue OCR results may be incorrect or missing data for non-EText operations. […]","og_url":"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/","og_site_name":"Ephesoft Docs","article_published_time":"2020-11-03T23:20:27+00:00","article_modified_time":"2021-01-25T19:17:18+00:00","author":"Breanna Fitzgerald","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Breanna Fitzgerald","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/","url":"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/","name":"Poor OCR Results in HOCR.xml | Ephesoft Docs","isPartOf":{"@id":"https:\/\/ephesoft.com\/docs\/#website"},"datePublished":"2020-11-03T23:20:27+00:00","dateModified":"2021-01-25T19:17:18+00:00","author":{"@id":"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/d74c698404588430489bf05dfdf4bedd"},"breadcrumb":{"@id":"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/ephesoft.com\/docs\/poor-ocr-results-in-hocr-xml\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ephesoft.com\/docs\/"},{"@type":"ListItem","position":2,"name":"Poor OCR Results in HOCR.xml"}]},{"@type":"WebSite","@id":"https:\/\/ephesoft.com\/docs\/#website","url":"https:\/\/ephesoft.com\/docs\/","name":"Ephesoft Docs","description":"Intelligent Document Processing Made Easy","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ephesoft.com\/docs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/d74c698404588430489bf05dfdf4bedd","name":"Breanna Fitzgerald","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e0624b0af4f5f3caa370053f6eef54c8?s=96&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e0624b0af4f5f3caa370053f6eef54c8?s=96&r=g","caption":"Breanna Fitzgerald"}}]}},"_links":{"self":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/posts\/47334"}],"collection":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/users\/62"}],"replies":[{"embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/comments?post=47334"}],"version-history":[{"count":0,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/posts\/47334\/revisions"}],"wp:attachment":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/media?parent=47334"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/categories?post=47334"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/tags?post=47334"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}