Transact

⌘K
  1. Home
  2. Transact
  3. Features and Functions
  4. Administrator Role and Fe...
  5. Modules and Plugins
  6. Extraction Module
  7. Machine Learning | Support for Multiple JSON Files

Machine Learning | Support for Multiple JSON Files

Applies to: Ephesoft Transact 4.5.0.0 and above

Whenever you perform machine learning for a document, a new machine-learning-extraction subfolder is created in the batch class folder on the Ephesoft Transact server. This subfolder contains JSON files with machine learning data for each document type and index field.

C:UsersEphesoftAppDataLocalMicrosoftWindowsINetCacheContent.Word18.png

Figure 1. Folder Structure (part one)

If, for example, another user learns the same index field under the same document type and batch class using a Web Service, the application will save the JSON file for that learning as well.

C:UsersEphesoftAppDataLocalMicrosoftWindowsINetCacheContent.Word19.png

Figure 2. Folder Structure (part two)

Now, the next time you extract data from the document, the system will compare the anchors around the extracted value with anchors saved in all existing JSON files. The value with the highest confidence will be shown as the extraction result. The JSON files are then merged into one file containing the latest learning information.

Note: Anchors are words surrounding a specific value. During extraction, anchors help to determine if expected keywords are found and if the value’s neighbors match any of the neighbors found during training.

This feature helps ensure that all machine learning results are saved and used during further data extraction. Ephesoft does not recommend manually editing JSON files.