{"id":31867,"date":"2015-03-09T12:51:47","date_gmt":"2015-03-09T20:51:47","guid":{"rendered":"https:\/\/ephesoft.com\/docs\/2019-1-2\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/"},"modified":"2022-03-09T11:55:10","modified_gmt":"2022-03-09T18:55:10","slug":"search-classification-plugin-2","status":"publish","type":"docs","link":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/","title":{"rendered":"Search Classification Plugin"},"content":{"rendered":"<p><strong>Available<\/strong>: on-premises, cloud<\/p>\n<h2>Introduction<\/h2>\n<p>This document describes how to configure and use the Search Classification plugin. The plugin classifies documents in the <strong>Page Process<\/strong> module of the workflow using Lucene-based indexing. Classification is how Ephesoft Transact chooses or associates the document to the Document Type. This document applies to Ephesoft Transact 2019.1 and above.<\/p>\n<h2>Configuring the Search Classification Plugin<\/h2>\n<p>Perform the following steps to configure the SEARCH_CLASSIFICATION plugin in the <strong>Page Process<\/strong> module. You must have administrator rights to complete these steps.<\/p>\n<ol>\n<li>Launch Ephesoft Transact and navigate to <strong>Administrator <\/strong>&gt;<strong> Batch Class Management<\/strong>. Enter login credentials when prompted.<\/li>\n<li>Select an existing batch class and click <strong>Open<\/strong> or create a new batch class. You can also copy or import an existing batch class, then modify it to create a new batch class.<br \/>\nThe following figure illustrates the SEARCH_CLASSIFICATION plugin in a typical batch class configuration.<\/li>\n<\/ol>\n<p><img decoding=\"async\" class=\"wp-image-35604 aligncenter\" src=\"https:\/\/ephesoft.com\/docs\/wp-content\/uploads\/2019\/11\/word-image-38.png\" width=\"246\" height=\"462\" srcset=\"https:\/\/ephesoft.com\/docs\/wp-content\/uploads\/2019\/11\/word-image-38.png 337w, https:\/\/ephesoft.com\/docs\/wp-content\/uploads\/2019\/11\/word-image-38-160x300.png 160w\" sizes=\"(max-width: 246px) 100vw, 246px\" \/><\/p>\n<p style=\"text-align: center\"><span style=\"color: #999999\"><em>Navigation to SEARCH_CLASSIFICATION Plugin<\/em><\/span><\/p>\n<p style=\"padding-left: 40px\">The SEARCH_CLASSIFICATION plugin works independently of the <strong>MULTIDIMENSIONAL_CLASSIFICATION_PLUGIN<\/strong> in the <strong>Page Process<\/strong> module. Both plugins can be present in the module.<\/p>\n<p>\u00a0 \u00a0 \u00a0 3. Select the SEARCH_CLASSIFICATION plugin to set up the configuration. The <strong>Plugin Configuration <\/strong>screen for the <strong>SEARCH_CLASSIFICATION<\/strong> plugin displays.<\/p>\n<p><img decoding=\"async\" class=\"wp-image-35605 aligncenter\" src=\"https:\/\/ephesoft.com\/docs\/wp-content\/uploads\/2019\/11\/word-image-39.png\" width=\"696\" height=\"418\" srcset=\"https:\/\/ephesoft.com\/docs\/wp-content\/uploads\/2019\/11\/word-image-39.png 832w, https:\/\/ephesoft.com\/docs\/wp-content\/uploads\/2019\/11\/word-image-39-300x180.png 300w, https:\/\/ephesoft.com\/docs\/wp-content\/uploads\/2019\/11\/word-image-39-768x462.png 768w\" sizes=\"(max-width: 696px) 100vw, 696px\" \/><\/p>\n<p style=\"text-align: center\"><em>SEARCH_CLASSIFICATION Plugin Configuration Screen<\/em><\/p>\n<h2>Configurable Properties<\/h2>\n<p>The following table lists and defines the configurable properties for the Search Classification plugin:<\/p>\n<table>\n<thead>\n<tr>\n<th>Configurable Property<\/th>\n<th>Type of Value<\/th>\n<th>Value Options<\/th>\n<th>Description<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Lucene Valid Extensions<\/td>\n<td>List of Values<\/td>\n<td>xml<\/p>\n<p>html<\/td>\n<td>This field defines the valid extension of the input file and is applied when classifying document types for the specified file format.<\/td>\n<\/tr>\n<tr>\n<td>Lucene Min Term Frequency<\/td>\n<td>Integer<\/td>\n<td>NA<\/td>\n<td>This field sets the frequency below which terms will be ignored in the source document.<\/td>\n<\/tr>\n<tr>\n<td>Lucene Min Document Frequency<\/td>\n<td>Integer<\/td>\n<td>NA<\/td>\n<td>This field sets the frequency at which words are ignored. When a word does not occur in at least x amount of documents indicated in this field, it gets ignored.<\/td>\n<\/tr>\n<tr>\n<td>Lucene Min Word Length<\/td>\n<td>Integer<\/td>\n<td>NA<\/td>\n<td>This field sets the minimum word length. Words smaller than this setting are ignored from the HOCR content.<\/td>\n<\/tr>\n<tr>\n<td>Lucene Min Query Terms<\/td>\n<td>Integer<\/td>\n<td>NA<\/td>\n<td>This field sets the minimum number of query terms that will be included in any generated query.<\/td>\n<\/tr>\n<tr>\n<td>Lucene Top Level Field<\/td>\n<td>String<\/td>\n<td>NA<\/td>\n<td>This property is used to configure the default field for query terms.<\/td>\n<\/tr>\n<tr>\n<td>Lucene No Of Pages<\/td>\n<td>Integer<\/td>\n<td>NA<\/td>\n<td>This property specifies the number of documents to be returned in a query search.<\/td>\n<\/tr>\n<tr>\n<td>Lucene Index Fields<\/td>\n<td>List of Values<\/td>\n<td>title<\/p>\n<p>summary<\/td>\n<td>This property is used as an index field for searching the document type using Lucene.<\/td>\n<\/tr>\n<tr>\n<td>Lucene Stop Words<\/td>\n<td>List of Values<\/td>\n<td>title<\/p>\n<p>name<\/td>\n<td>This property sets the words to be ignored when classifying a document.<\/td>\n<\/tr>\n<tr>\n<td>Search Classification Switch<\/td>\n<td>List of Values<\/td>\n<td>ON<\/p>\n<p>OFF<\/td>\n<td>This property enables or disables the SEARCH_CLASSIFICATION plugin for the batch class.<\/td>\n<\/tr>\n<tr>\n<td>Search Classification Max Results<\/td>\n<td>Integer<\/td>\n<td>NA<\/td>\n<td>This field defines the maximum number of alternate value results that will be generated in the batch.xml.<\/p>\n<p>The default value for this field is 5 in Ephesoft Transact to control the overall size of the batch.xml file.<\/td>\n<\/tr>\n<tr>\n<td>First Page Confidence Score Value<\/td>\n<td>Integer<\/td>\n<td>NA<\/td>\n<td>This property is used to update the confidence score based on the first page type.<\/td>\n<\/tr>\n<tr>\n<td>Middle Page Confidence Score Value<\/td>\n<td>Integer<\/td>\n<td>NA<\/td>\n<td>This property is used to update the confidence score based on the middle page type.<\/td>\n<\/tr>\n<tr>\n<td>Last Page Confidence Score Value<\/td>\n<td>Integer<\/td>\n<td>NA<\/td>\n<td>This property is used to update the confidence score based on the last page type.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>4. Define the settings, then click <strong>Deploy<\/strong> to save and enable the changes.<\/p>\n<h2>Search Classification Execution Process<\/h2>\n<p>This plugin operates in the <strong>Page Process<\/strong> module after all batch-level import processes are complete.<\/p>\n<p>Ephesoft recommends that document learning is completed for the batch class prior to using this plugin. This plugin classifies incoming document images using Lucene-based indexing. This plugin functions in two stages when classifying documents:<\/p>\n<ul>\n<li><strong>Learning \u2014 The <\/strong>learning process occurs when generating indexes for documents. This plugin uses the generated indexes to classify each document. This plugin uses the learned files that were created earlier in the workflow.<\/li>\n<li><strong>Classification \u2014 <\/strong>When this plugin classifies a document, the data it learns provides a reference for document classification. When this plugin classifies a document type, it uses the extracted HOCR content from the image and verifies the HOCR content, based on the data it learned in the previous learning process.<\/li>\n<\/ul>\n<p>The plugin generates HOCR content similar to the RecoStar HOCR and Tesseract HOCR plugins.<\/p>\n<ul>\n<li>After all images and documents in the batch instance have been classified, this plugin writes the data to the batch.xml file for the document type that is being classified.<\/li>\n<\/ul>\n<h2>Troubleshooting<\/h2>\n<p>The following table lists the possible error messages that may occur with this plugin along with a description of each possible root cause.<\/p>\n<table>\n<tbody>\n<tr>\n<td>Error message<\/td>\n<td>Possible root cause<\/td>\n<\/tr>\n<tr>\n<td>No index files exist inside folder<\/td>\n<td>The document learning is not complete for the batch class.<\/td>\n<\/tr>\n<tr>\n<td>Page Types not configured in Database.<\/td>\n<td>The index data contains invalid indexes for the batch class.<\/td>\n<\/tr>\n<tr>\n<td>CorruptIndexException while reading Index.<\/td>\n<td>The index data is corrupt in the index folder for the batch class.<\/td>\n<\/tr>\n<tr>\n<td>IOException while reading Index<\/td>\n<td>The plugin is unable to open the index data due to corruption in the get index file process, or there is a lock on the index file.<\/td>\n<\/tr>\n<tr>\n<td>No valid extensions are specified in resources<\/td>\n<td>The page contains an invalid HOCR file for processing.<\/td>\n<\/tr>\n<tr>\n<td>No pages found in batch XML.<\/td>\n<td>The pages tag was not found in the incoming batch.xml file.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h2>Conclusion<\/h2>\n<p>This concludes instructions to configure and troubleshoot the Search Classification plugin for a batch class.<\/p>\n<p>For additional information about configuring or using classification in Ephesoft Transact, refer to the following documents:<\/p>\n<ul>\n<li><a href=\"https:\/\/ephesoft.com\/docs\/2019-1\/moduleplugin-configuration\/page-process-module\/\">Page Process Module configuration articles<\/a><\/li>\n<li><a href=\"https:\/\/ephesoft.com\/docs\/how-to-improve-classification\/\">How to Improve Classification<\/a><\/li>\n<\/ul>\n<p>For additional information about batch class creation, setup and configuration, refer to the following documents:<\/p>\n<ul>\n<li><a href=\"https:\/\/ephesoft.com\/docs\/how-to-createcopy-a-new-batch-class-2\/\">How to Create\/Copy a New Batch Class<\/a><\/li>\n<li><a href=\"https:\/\/ephesoft.com\/docs\/4-1-0-0\/global-batch-class-management\/\">Global Batch Class Management<\/a><\/li>\n<\/ul>\n","protected":false},"featured_media":0,"parent":31858,"menu_order":0,"comment_status":"closed","ping_status":"open","template":"","doc_tag":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v19.0 (Yoast SEO v22.1) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Search Classification Plugin | Ephesoft Docs<\/title>\n<meta name=\"robots\" content=\"noindex, follow\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Search Classification Plugin\" \/>\n<meta property=\"og:description\" content=\"Available: on-premises, cloud Introduction This document describes how to configure and use the Search Classification plugin. The plugin classifies documents [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/\" \/>\n<meta property=\"og:site_name\" content=\"Ephesoft Docs\" \/>\n<meta property=\"article:modified_time\" content=\"2022-03-09T18:55:10+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/ephesoft.com\/docs\/wp-content\/uploads\/2019\/11\/word-image-38.png\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/\",\"url\":\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/\",\"name\":\"Search Classification Plugin | Ephesoft Docs\",\"isPartOf\":{\"@id\":\"https:\/\/ephesoft.com\/docs\/#website\"},\"datePublished\":\"2015-03-09T20:51:47+00:00\",\"dateModified\":\"2022-03-09T18:55:10+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ephesoft.com\/docs\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Transact\",\"item\":\"https:\/\/ephesoft.com\/docs\/products\/transact\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Features and Functions\",\"item\":\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"Administrator Role and Features\",\"item\":\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/\"},{\"@type\":\"ListItem\",\"position\":5,\"name\":\"Modules and Plugins\",\"item\":\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/\"},{\"@type\":\"ListItem\",\"position\":6,\"name\":\"Page Process Module\",\"item\":\"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/\"},{\"@type\":\"ListItem\",\"position\":7,\"name\":\"Search Classification Plugin\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ephesoft.com\/docs\/#website\",\"url\":\"https:\/\/ephesoft.com\/docs\/\",\"name\":\"Ephesoft Docs\",\"description\":\"Intelligent Document Processing Made Easy\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ephesoft.com\/docs\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Search Classification Plugin | Ephesoft Docs","robots":{"index":"noindex","follow":"follow"},"og_locale":"en_US","og_type":"article","og_title":"Search Classification Plugin","og_description":"Available: on-premises, cloud Introduction This document describes how to configure and use the Search Classification plugin. The plugin classifies documents [&hellip;]","og_url":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/","og_site_name":"Ephesoft Docs","article_modified_time":"2022-03-09T18:55:10+00:00","og_image":[{"url":"https:\/\/ephesoft.com\/docs\/wp-content\/uploads\/2019\/11\/word-image-38.png"}],"twitter_card":"summary_large_image","twitter_misc":{"Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/","url":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/","name":"Search Classification Plugin | Ephesoft Docs","isPartOf":{"@id":"https:\/\/ephesoft.com\/docs\/#website"},"datePublished":"2015-03-09T20:51:47+00:00","dateModified":"2022-03-09T18:55:10+00:00","breadcrumb":{"@id":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/search-classification-plugin-2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ephesoft.com\/docs\/"},{"@type":"ListItem","position":2,"name":"Transact","item":"https:\/\/ephesoft.com\/docs\/products\/transact\/"},{"@type":"ListItem","position":3,"name":"Features and Functions","item":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/"},{"@type":"ListItem","position":4,"name":"Administrator Role and Features","item":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/"},{"@type":"ListItem","position":5,"name":"Modules and Plugins","item":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/"},{"@type":"ListItem","position":6,"name":"Page Process Module","item":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/"},{"@type":"ListItem","position":7,"name":"Search Classification Plugin"}]},{"@type":"WebSite","@id":"https:\/\/ephesoft.com\/docs\/#website","url":"https:\/\/ephesoft.com\/docs\/","name":"Ephesoft Docs","description":"Intelligent Document Processing Made Easy","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ephesoft.com\/docs\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"}]}},"comment_count":0,"_links":{"self":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/docs\/31867"}],"collection":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/docs"}],"about":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/types\/docs"}],"replies":[{"embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/comments?post=31867"}],"version-history":[{"count":2,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/docs\/31867\/revisions"}],"predecessor-version":[{"id":49486,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/docs\/31867\/revisions\/49486"}],"up":[{"embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/docs\/31858"}],"next":[{"title":"Create OCR Input Plugin","link":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/page-process-module\/create-ocr-input-plugin-4040\/","href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/docs\/31860"}],"wp:attachment":[{"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/media?parent=31867"}],"wp:term":[{"taxonomy":"doc_tag","embeddable":true,"href":"https:\/\/ephesoft.com\/docs\/wp-json\/wp\/v2\/doc_tag?post=31867"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}