|
MergeSpellingData
merges the contents of multiple spelling map files into a
single spelling map file.
A spelling map file is a utf-8 file containing two fields
separated by a tab character. The first field is a variant
spelling. The second field is the standardized spelling
for the variant.
Usage:
mergespellingdata output.tab input.tab input2.tab ...
where
- output.txt -- output merged spelling map file.
- input*.txt -- input text files containing spelling maps to be merged.
Each input spelling map is a utf-8 file contain two fields
separated by a tab character. The first field is a variant
spelling. The second field is the standardized spelling
for the variant.
The output file is a utf-8 text file containing the merged spelling
maps from the input files. When a given variant appears more
than once with different standardized spellings in the input
files, the last mapping encountered is the one written to the
output file.
|