{"id":2771,"date":"2015-01-22T01:04:10","date_gmt":"2015-01-22T08:04:10","guid":{"rendered":"https:\/\/ephesoft.com\/docs\/?p=2771"},"modified":"2020-08-26T15:59:56","modified_gmt":"2020-08-26T22:59:56","slug":"kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates","status":"publish","type":"post","link":"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/","title":{"rendered":"KB00007775: Issue with Table Extraction Using the Column Header and Column Coordinates"},"content":{"rendered":"

Issue<\/strong><\/span><\/h2>\n

Trouble extracting data from a table using the Column Header<\/strong> and Column Coordinates<\/strong>.<\/span><\/p>\n

Root Cause<\/strong><\/span><\/h2>\n

Column Headers<\/strong> are highly dependent on the recognition of the Column Header Pattern in the OCR. Variations in the OCR can cause the table or column not to be extracted properly or complete rows to be skipped. For example, If you configured the extraction rule to look for a column named \u201cPart Number\u201d but the OCR value is \u201cPert Number\u201d the column or table will not extract properly.<\/span><\/p>\n

Column Coordinates<\/strong> will take the values identified based on a zonal pattern and extract the contents within. If your zone coordinates are not defined properly you may get values pertaining to the column next to it as well.<\/span><\/p>\n

Solution<\/strong><\/span><\/h2>\n

To resolve the issues regarding the recognition of Column Headers<\/strong>, you may need to account for variances in the OCR.<\/span><\/p>\n

Here are some potential solutions:<\/span><\/p>\n

    \n
  1. Try using a different image compression in your import settings (Group4 vs. LZW).<\/span><\/li>\n
  2. Try a higher DPI for quality retention of the image during batch processing (300 \u2013 600 DPI).<\/span><\/li>\n
  3. Account for variances in the OCR by using regular expressions.<\/span><\/li>\n<\/ol>\n

    For example, for a specific Column Header Pattern<\/strong> like \u201cPart Number\u201d, use a more generic Regex such as \u201cP[A-z0-9\\s]{7}ber\u201d. This finds any variation of alphanumeric values that start with a \u201cP\u201d and end with \u201cber\u201d.<\/span><\/p>\n

    To resolve any issues with the Column Coordinates<\/strong>, you may need to simply adjust your zonal areas so they fit and account for variations in the images. Variations include:<\/span><\/p>\n

      \n
    • Skewed coordinates.<\/span><\/li>\n
    • Changes in resolutions and overall image size.<\/span><\/li>\n<\/ul>\n

      To ensure the best results you should try to standardize your input images and have a minimum quality requirement. For example, Resolution: 2550\u00d73300, DPI: 300.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"

      Issue Trouble extracting data from a table using the Column Header and Column Coordinates. Root Cause Column Headers are highly […]<\/p>\n","protected":false},"author":62,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[12353],"tags":[722,1430,437,743],"yoast_head":"\nKB00007775: Issue with Table Extraction Using the Column Header and Column Coordinates | Ephesoft Docs<\/title>\n<meta name=\"robots\" content=\"noindex, follow\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"KB00007775: Issue with Table Extraction Using the Column Header and Column Coordinates\" \/>\n<meta property=\"og:description\" content=\"Issue Trouble extracting data from a table using the Column Header and Column Coordinates. Root Cause Column Headers are highly […]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/\" \/>\n<meta property=\"og:site_name\" content=\"Ephesoft Docs\" \/>\n<meta property=\"article:published_time\" content=\"2015-01-22T08:04:10+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2020-08-26T22:59:56+00:00\" \/>\n<meta name=\"author\" content=\"Breanna Fitzgerald\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Breanna Fitzgerald\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/\",\"url\":\"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/\",\"name\":\"KB00007775: Issue with Table Extraction Using the Column Header and Column Coordinates | Ephesoft Docs\",\"isPartOf\":{\"@id\":\"https:\/\/ephesoft.com\/docs\/#website\"},\"datePublished\":\"2015-01-22T08:04:10+00:00\",\"dateModified\":\"2020-08-26T22:59:56+00:00\",\"author\":{\"@id\":\"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/d74c698404588430489bf05dfdf4bedd\"},\"breadcrumb\":{\"@id\":\"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ephesoft.com\/docs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"KB00007775: Issue with Table Extraction Using the Column Header and Column Coordinates\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ephesoft.com\/docs\/#website\",\"url\":\"https:\/\/ephesoft.com\/docs\/\",\"name\":\"Ephesoft Docs\",\"description\":\"Intelligent Document Processing Made Easy\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ephesoft.com\/docs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/d74c698404588430489bf05dfdf4bedd\",\"name\":\"Breanna Fitzgerald\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e0624b0af4f5f3caa370053f6eef54c8?s=96&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e0624b0af4f5f3caa370053f6eef54c8?s=96&r=g\",\"caption\":\"Breanna Fitzgerald\"}}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"KB00007775: Issue with Table Extraction Using the Column Header and Column Coordinates | Ephesoft Docs","robots":{"index":"noindex","follow":"follow"},"og_locale":"en_US","og_type":"article","og_title":"KB00007775: Issue with Table Extraction Using the Column Header and Column Coordinates","og_description":"Issue Trouble extracting data from a table using the Column Header and Column Coordinates. Root Cause Column Headers are highly […]","og_url":"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/","og_site_name":"Ephesoft Docs","article_published_time":"2015-01-22T08:04:10+00:00","article_modified_time":"2020-08-26T22:59:56+00:00","author":"Breanna Fitzgerald","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Breanna Fitzgerald","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/","url":"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/","name":"KB00007775: Issue with Table Extraction Using the Column Header and Column Coordinates | Ephesoft Docs","isPartOf":{"@id":"https:\/\/ephesoft.com\/docs\/#website"},"datePublished":"2015-01-22T08:04:10+00:00","dateModified":"2020-08-26T22:59:56+00:00","author":{"@id":"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/d74c698404588430489bf05dfdf4bedd"},"breadcrumb":{"@id":"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/ephesoft.com\/docs\/kb00007775-having-trouble-extracting-data-from-a-table-using-the-column-header-and-column-coordinates\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ephesoft.com\/docs\/"},{"@type":"ListItem","position":2,"name":"KB00007775: Issue with Table Extraction Using the Column Header and Column Coordinates"}]},{"@type":"WebSite","@id":"https:\/\/ephesoft.com\/docs\/#website","url":"https:\/\/ephesoft.com\/docs\/","name":"Ephesoft Docs","description":"Intelligent Document Processing Made Easy","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ephesoft.com\/docs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/d74c698404588430489bf05dfdf4bedd","name":"Breanna Fitzgerald","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/ephesoft.com\/docs\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e0624b0af4f5f3caa370053f6eef54c8?s=96&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e0624b0af4f5f3caa370053f6eef54c8?s=96&r=g","caption":"Breanna Fitzgerald"}}]}},"_links":{"self":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/posts\/2771"}],"collection":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/users\/62"}],"replies":[{"embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/comments?post=2771"}],"version-history":[{"count":0,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/posts\/2771\/revisions"}],"wp:attachment":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/media?parent=2771"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/categories?post=2771"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/tags?post=2771"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}