In this section: |
Search method that uses single words from the input text to find known values.
Name |
Type |
Required |
Description |
---|---|---|---|
Search Single Words |
Boolean |
Yes |
Specifies searching for single words directly in the dictionaries Default value: False. |
Max Number Errors |
Integer |
Yes |
Maximum number of errors to accept the value when compared with the appropriate part of the input text. Default value: 0. |
Min Word Length |
Integer |
Yes |
Minimum length of the word to be searched in the dictionaries. This prevents searching words that are abbreviations such as: rd (road) or ul (ulice). Default value: 2. |
Excluded Words File Name |
String |
No |
Name of the dictionary containing values that are not searched in the dictionaries. This lookup is optional and, if not specified, all "searchable" words are searched. |
Tokenizer Config |
Configurable Tokenizer Config |
Yes |
Definition of tokenization input text to words. |
The methods tokenize input text into words and these words are used as keys to find known values for each component that has its property input set to true. The dictionaries are specified by the components themselves and are not specified here. The found values are then compared to the input text to verify positive matches up to a maximum error. The word is used as a key to the dictionaries only when its length is at least the value specified (property minWordLength) and is not found in an optional excluded words dictionary.
iWay Software |