Document Design Accelerator

Introduction

Applies to: Transact Version 2022.1.00 or newer.

Availability: Windows and Linux on-premise or cloud.

This article describes how to use the Document Design Accelerator in Ephesoft Transact. The Document Design Accelerator feature is a batch class design tool that uses the Semantik Artificial Intelligence (AI) Engine to process new document types and simplify key-value extraction rule creation. This feature automatically creates a new document type, pre-populated with index fields and extraction rules, for the batch class designer.

This feature supports all languages that are supported by Transact. For a list, see Supported Languages.

Use Cases

The Document Design Accelerator is useful when you need to quickly configure index fields and extraction rules for a new document type.

Licensing

Document Design Accelerator is included as part of the standard Transact license. There are no additional fees or licensing requirements.

Functional Limitations

  • Ephesoft recommends limiting the number of document types per batch class to fifty or fewer. If a larger number of document variants is expected, the best practice is to merge the rules for the variants into fewer document types.
  • Document Design Accelerator does not support Table Extraction in Transact 2022.1.00.
  • Batch classes cannot be encrypted or decrypted.

Prerequisites

To use the Document Design Accelerator:

  • You will need to install Transact version 2022.1.00 or newer.
  • You will need to download and install the Semantik AI Engine.
    • Note: Refer to Semantik AI Engine for steps. You will need to be a Transact system administrator.

Use the Document Design Accelerator

This guide assumes you have already created a batch class and that you have set up a training document.

Here’s an overview of how to use the Document Design Accelerator:

  1. Create a new document type in your batch class.
  2. Train the learn file for classification.
  3. Test the extraction.
  4. If needed, adjust the extraction rules.

Note on the Training Documents

  • In Step 1, your sample document must be a populated form.
  • In Step 2, your learn file must be a blank (unpopulated) form.

You can base the sample document and the learn file on the same form, but they must be prepared differently: use the populated version to create the document type and the blank version to train classification.

Step 1: Create a Document Type

  1. From the Batch Class Management screen, select your batch class and click Open.
  2. Drag and drop the populated sample document into the Create Document Type from Sample panel.

Warning: The sample document must be a populated form. The Document Design Accelerator will not create rules effectively if the user-entry/dynamic fields on the sample document have not been populated. Do not use the Document Design Accelerator with blank forms.

Figure 1: Select and upload files in the Create Document Type From Sample panel.

  3. Create a Name and Description for the new document type.

Important: The document type name cannot be changed once it has been created.

Figure 2: Create Document Type.

  4. Click OK.

Figure 3: New document type with automatic index fields and extraction rules.

Using the Semantik AI Engine, Transact will automatically create a new document type with index fields and extraction rules based on the data that was supplied in the sample.

For each Key-Value pair identified by the Semantik AI Engine, a new index field will be created, named after the key that was detected. If several identically named keys are detected, a unique index field name will be created for each one by appending a 1, 2, 3, etc. to the detected key name.
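
The naming behavior described above can be sketched as follows. This is an illustration only, not the Semantik AI Engine's implementation: the function name unique_field_names is hypothetical, and the sketch assumes that every occurrence of a repeated key (including the first) receives a numeric suffix with no separator. The exact format Transact produces may differ.

```python
from collections import Counter


def unique_field_names(detected_keys):
    """Return one index field name per detected key, in order.

    Assumption (not confirmed by the article): keys that occur once keep
    their name; keys that occur several times get 1, 2, 3, ... appended.
    """
    totals = Counter(detected_keys)
    # Names already taken by keys that occur exactly once.
    used = {key for key in detected_keys if totals[key] == 1}
    counters = Counter()
    names = []
    for key in detected_keys:
        if totals[key] == 1:
            names.append(key)
            continue
        # Advance the suffix until the candidate does not clash with a used name.
        while True:
            counters[key] += 1
            candidate = f"{key}{counters[key]}"
            if candidate not in used:
                break
        used.add(candidate)
        names.append(candidate)
    return names


if __name__ == "__main__":
    print(unique_field_names(["Date", "Name", "Date", "Date"]))
```

The loop that skips taken names covers the edge case where a form contains both a repeated key and a key that already ends in a digit.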

Each index field’s data type, field type, and regular expression pattern will be populated based on the values detected by the Semantik AI Engine in the sample document.
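
To make the idea of a pattern derived from a sample value concrete, the sketch below generalizes a value into a regular expression. It is purely illustrative: this article does not describe how the Semantik AI Engine infers patterns, and the function name pattern_from_sample and the run-by-run generalization are assumptions made for the example.

```python
import re

# Split a value into runs of digits, letters, whitespace, or single other characters.
TOKEN = re.compile(
    r"(?P<digits>[0-9]+)|(?P<letters>[A-Za-z]+)|(?P<space>\s+)|(?P<other>.)",
    re.DOTALL,
)


def pattern_from_sample(value):
    """Generalize a sample value into a regex (hypothetical helper)."""
    parts = []
    for match in TOKEN.finditer(value):
        run = match.group()
        if match.lastgroup == "digits":
            parts.append(rf"\d{{{len(run)}}}")        # e.g. "2022" -> \d{4}
        elif match.lastgroup == "letters":
            parts.append(rf"[A-Za-z]{{{len(run)}}}")  # e.g. "INV" -> [A-Za-z]{3}
        elif match.lastgroup == "space":
            parts.append(r"\s+")
        else:
            parts.append(re.escape(run))              # keep punctuation literally
    return "".join(parts)


if __name__ == "__main__":
    pattern = pattern_from_sample("12/31/2022")
    print(pattern)
    print(bool(re.fullmatch(pattern, "01/15/2023")))
```

A pattern built this way matches other values with the same shape as the sample, which is why a populated sample document matters: a blank form provides no values to generalize from.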

Step 2: Train the Document for Classification

  1. Select the document type you created in Step 1.
  2. Train the document for classification using the method appropriate for your document content and use case. For more information, see Test Classification. In this tutorial, we use the Search Classification method.

Important: The classification learn file should be a blank (unpopulated) form, unlike the populated form used in the previous step.

Step 3: Test Extraction

  1. From the Batch Class Management screen, drag and drop your document into the Upload Test Extraction File(s) panel.

Note: We recommend using a document different from the training document but from the same company or document issuer.

  2. Click Test Extraction.

Figure 4: Test Extraction.

  3. Click Extract.

Figure 5: Extract.

  4. Verify the extraction results from the index fields and extraction rules created by the Semantik AI Engine.

Figure 6: Verify the extraction results.

  5. Click Close.

Step 4 (Optional): Adjust Extraction Rules if Needed

The index fields and extraction rules that the Document Design Accelerator generates may still need adjustment for your documents. To fine-tune extraction rules, use the standard Transact method. For help, see Create Extraction Rule.

How to Hide or Display the “Create Document Type from Sample” Panel

This section describes how to hide or display the Create Document Type from Sample panel under Upload File(s). Note: You must be a Transact system administrator.

Hide the Panel

Figure 7: Create Document Type from Sample upload panel.

  1. Navigate to the [Ephesoft_Directory]\Application\WEB-INF\classes\META-INF folder.
  2. Open the application.properties file in your preferred text editor.
  3. Locate the following property:

display_key_value_accelerator_sample_panel = yes

  4. Change the property value from yes to no.
  5. Save the file.
  6. Restart Transact.


Re-display the Panel

To display the panel again:

  1. Complete steps 1-3 in Hide the Panel.
  2. Change the property value from no to yes.
  3. Save the file.
  4. Restart Transact.
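
For administrators who prefer to script this change, the edit described in the two procedures above could be automated roughly as follows. This is a minimal sketch, not an Ephesoft-provided tool: the function names set_panel_visibility and update_properties_file are hypothetical, the property name is taken from this article, and the sketch assumes the property appears on a single key = value line. Back up application.properties before running anything like this, and restart Transact afterward.

```python
from pathlib import Path

PROPERTY = "display_key_value_accelerator_sample_panel"


def set_panel_visibility(text, visible):
    """Return the properties text with PROPERTY set to yes or no."""
    value = "yes" if visible else "no"
    lines = text.splitlines(keepends=True)
    found = False
    for i, line in enumerate(lines):
        stripped = line.strip()
        # Skip comments and lines that are not key = value pairs.
        if stripped.startswith("#") or "=" not in stripped:
            continue
        key = stripped.partition("=")[0].strip()
        if key == PROPERTY:
            # Preserve the original line ending (LF or CRLF).
            ending = line[len(line.rstrip("\r\n")):]
            lines[i] = f"{PROPERTY} = {value}{ending}"
            found = True
    if not found:
        raise KeyError(f"{PROPERTY} not found")
    return "".join(lines)


def update_properties_file(path, visible):
    """Rewrite application.properties in place (hypothetical helper)."""
    path = Path(path)
    # latin-1 round-trips every byte, so unrelated lines are left untouched.
    original = path.read_bytes().decode("latin-1")
    path.write_bytes(set_panel_visibility(original, visible).encode("latin-1"))
```

To hide the panel, call update_properties_file with the path to application.properties in the [Ephesoft_Directory]\Application\WEB-INF\classes\META-INF folder and visible=False; pass visible=True to display it again.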

Troubleshooting

Refer to the issues below for assistance in troubleshooting the Document Design Accelerator.

Issue: An error occurs when dragging a file into the Create Document Type from Sample panel.

  • Possible Root Cause: The Semantik AI Engine is not installed or is not running.
  • Solution: Contact your Ephesoft administrator to install and/or start the Semantik AI Engine service.

Issue: The Create Document Type from Sample panel is not visible.

  • Possible Root Cause: Your Transact instance is running a version prior to 2022.1.00, or Transact 2022.1.00 or later is running but the display_key_value_accelerator_sample_panel property is set to no.
  • Solution: Contact your Ephesoft administrator and ask them to follow Hide or Display the “Create Document Type from Sample” Panel.

Issue: Document Design Accelerator returns few or no index fields.

  • Possible Root Cause: An unpopulated form was dragged into the Create Document Type from Sample panel.
  • Solution: Confirm that the sample used in the Create Document Type from Sample panel is a populated form. The Document Design Accelerator will not create index fields or extraction rules unless it detects both a key and its value. Blank forms are likely to contain only keys and will not result in the automatic creation of index fields and extraction rules.

Issue: Documents of the document type created using the Document Design Accelerator are not being classified correctly.

  • Possible Root Cause: The document type was not trained for classification.
  • Solution: Train the document for classification.

 
