{"id":31817,"date":"2017-04-25T09:33:45","date_gmt":"2017-04-25T17:33:45","guid":{"rendered":"https:\/\/ephesoft.com\/docs\/2019-1-2\/moduleplugin-configuration\/extraction-module\/regular-regex-extraction-plugin-3\/"},"modified":"2020-11-24T09:46:54","modified_gmt":"2020-11-24T16:46:54","slug":"regular-regex-extraction-plugin-3","status":"publish","type":"docs","link":"https:\/\/ephesoft.com\/docs\/products\/transact\/features-and-functions\/administrator\/moduleplugin-configuration\/extraction-module\/regular-regex-extraction-plugin-3\/","title":{"rendered":"Regular Regex Extraction Plugin"},"content":{"rendered":"
Available<\/b>: on-premises, cloud<\/p>\n This plugin extracts index field values based on the pattern defined for that field.\u00a0 The pattern is a semicolon-separated list of one or more words followed by a regular expression.\u00a0 The system searches each page for the regular expression.\u00a0 If a match is found, the system looks to the left of the match and checks whether all of the preceding words in the pattern can be found.\u00a0 If all of the words are found (in order), the value is extracted.\u00a0 If only a subset of the words is found, or if none of the words are found, the value is not extracted.<\/p>\n Consider the following text defined for the pattern field of the InvoiceDate index field:\u00a0 Invoice;Date;\\d{1,2}[\/]\\d{1,2}[\/]\\d{2,4}<\/p>\n Example 1<\/strong><\/p>\n Text string in document:\u00a0 Invoice Date 21\/03\/2012<\/p>\n Result: “21\/03\/2012” will be extracted for the InvoiceDate index field.\u00a0 This happens because “21\/03\/2012” matches the regular expression, “Date” is found to its left, and “Invoice” is found to the left of “Date.”<\/p>\n Example 2<\/strong><\/p>\n Text string in document:\u00a0 Date 21\/03\/2012<\/p>\n Result:\u00a0 Nothing will be extracted for this index field.\u00a0 Even though “21\/03\/2012” matches the regular expression, and “Date” is found to its left, the word “Invoice” is not found to the left of “Date.”<\/p>\n The REGULAR_REGEX_EXTRACTION plugin can be configured in the following UI:<\/p>\nOverview<\/span><\/h1>\n
Examples<\/span><\/span><\/h3>\n
Plugin Configuration<\/span><\/span><\/h3>\n