Separates numbers meeting the following conditions from the rest of the input text separated by whitespace. This step is based on division of the input text into "words", where a word is either a sequence of letters, a sequence of numbers or a sequence of special characters (other than letters and numbers). The number separation is applied only if a word consisting of letters with length of at least 2 characters precedes the numbers. The number must be either the last input word or only a word consisting of special characters as follows. Note: If the number is separated from the preceding word and is followed by a word composed of special characters, only then will these two words (number and special word) not be separated.
<step id='alg' className='cz.adastra.cif.tasks.clean.SplitOutTrailingNumbers'> <properties> <column>city</column> <columnOut>std_city</columnOut> <scorer explanationColumn='expl'> <scoringEntries> <scoringEntry key='STN_NULL' score='100' explain='true' /> <scoringEntry key='STN_CHANGED' score='100' explain='true' /> </scoringEntries> </scorer> </properties> </step>
iWay Software |