Element Word Aware Method

In this section:

Search method that uses single words from the input text to find known values.

Name

Type

Required

Description

Search Single Words

Boolean

Yes

Specifies searching for single words directly in the dictionaries Default value: False.

Max Number Errors

Integer

Yes

Maximum number of errors to accept the value when compared with the appropriate part of the input text. Default value: 0.

Min Word Length

Integer

Yes

Minimum length of the word to be searched in the dictionaries. This prevents searching words that are abbreviations such as: rd (road) or ul (ulice). Default value: 2.

Excluded Words File Name

String

No

Name of the dictionary containing values that are not searched in the dictionaries. This lookup is optional and, if not specified, all "searchable" words are searched.

Tokenizer Config

Configurable Tokenizer Config

Yes

Definition of tokenization input text to words.



x
Detailed Description of Element Word Aware Method

The methods tokenize input text into words and these words are used as keys to find known values for each component that has its property input set to true. The dictionaries are specified by the components themselves and are not specified here. The found values are then compared to the input text to verify positive matches up to a maximum error. The word is used as a key to the dictionaries only when its length is at least the value specified (property minWordLength) and is not found in an optional excluded words dictionary.


iWay Software