How to Configure Chinese language Support at the Database Level
Applicable Ephesoft Transact Versions: 4.x or above
This article provides steps on how to configure the key-value pair for KV extraction in the Chinese language at the database level.
- For MySQL/MariaDB:
- To insert Chinese characters in the key_pattern and value_pattern columns of the kv_extraction table, change the collation of these columns from latin1_swedish_ci to utf8_bin by executing the following MySQL query:
ALTER TABLE `kv_extraction` CHANGE COLUMN `key_pattern` `key_pattern` VARCHAR(700) NULL DEFAULT NULL COLLATE ‘utf8_bin’ AFTER `key_fuzziness`; ALTER TABLE `kv_extraction` CHANGE COLUMN `value_pattern` `value_pattern` VARCHAR(700) NULL DEFAULT NULL COLLATE ‘utf8_bin’ AFTER `use_existing_key`;
- For MS SQL server:
- To insert Chinese characters in the key_pattern and value_pattern column of the kv_extraction table, change the data type of these columns from varchar to nvarchar by executing the following MSSQL queries:
alter table [ephesoft].[dbo].[kv_extraction] alter column key_pattern nvarchar(700); alter table [ephesoft].[dbo].[kv_extraction] alter column value_pattern nvarchar(700);