Detailed Description of Split Out Trailing Numbers

Separates a trailing number from the rest of the input text by inserting whitespace, provided the conditions below are met. The step first divides the input text into "words", where a word is a sequence of letters, a sequence of digits, or a sequence of special characters (anything other than letters and digits). A number is split out only if the word preceding it consists of letters and is at least 2 characters long. In addition, the number must either be the last word of the input or be followed only by a word consisting of special characters. Note: When the number is followed by a word of special characters, the number is separated from the preceding word, but the number and the special word are not separated from each other.
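The rule above can be illustrated with a small Python sketch. This is not the implementation of the step, only one possible reading of the description. It assumes that the number is directly attached to the preceding letter word, that trailing whitespace is ignored, and that underscore counts as a special character. The function name `split_out_trailing_number` is hypothetical.

```python
import re

# Assumed reading of the rule: a letter word of at least 2 characters,
# directly followed by a number, optionally followed by one word of
# special characters, all at the end of the input.
_TRAILING = re.compile(
    r"(?P<letters>[^\W\d_]{2,})"     # letter word, at least 2 characters
    r"(?P<number>\d+)"               # the trailing number
    r"(?P<special>(?:[^\w\s]|_)*)"   # optional word of special characters
    r"\s*$"                          # nothing else may follow
)


def split_out_trailing_number(text: str) -> str:
    """Insert one space before a trailing number, if the rule applies."""
    match = _TRAILING.search(text)
    if match is None:
        return text  # conditions not met, input is left unchanged
    cut = match.start("number")
    # The number and any following special word stay together.
    return text[:cut] + " " + text[cut:]


if __name__ == "__main__":
    for sample in ["Praha5", "Praha5.", "A5", "Brno12 street"]:
        print(repr(sample), "->", repr(split_out_trailing_number(sample)))
```

Under these assumptions, a value such as `Praha5` becomes `Praha 5`, while `A5` is left unchanged because the preceding letter word has only one character.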



Example
<step id='alg' className='cz.adastra.cif.tasks.clean.SplitOutTrailingNumbers'>
        <properties>
                <column>city</column>
                <columnOut>std_city</columnOut>
                <scorer explanationColumn='expl'>
                        <scoringEntries>
                                <scoringEntry key='STN_NULL'    score='100' explain='true' />
                                <scoringEntry key='STN_CHANGED' score='100' explain='true' />
                        </scoringEntries>
                </scorer>
        </properties>
</step>

iWay Software