The SubString package is an open-source set of Unix shell scripts used for
substring reduction and frequency consolidation of word n-grams of
different lengths. In the
process, the frequencies of substrings are reduced by the frequencies
of their superstrings, and a consolidated list with n-grams of
different lengths is produced without inflating the overall word count.
The functions performed by SubString will primarily be of interest to
linguists working on formulaic language and multi-word sequences.
SubString is cross-platform and currently available as beta software.
It has been tested under MacOS X and Xubuntu Linux, but should work well on any
platform that can run the bash shell.
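
The substring reduction described above can be illustrated with a short sketch. It is written in Python for readability and is not taken from the package's own shell scripts; the function names `sub_ngrams` and `consolidate`, the simplified subtraction rule and the example frequencies are all assumptions made for illustration. N-grams are processed from longest to shortest, and each n-gram's already consolidated frequency is subtracted from every shorter n-gram in the list that it contains.

```python
def sub_ngrams(words):
    """Yield each distinct contiguous word sequence strictly shorter than `words`."""
    n = len(words)
    seen = set()
    for length in range(n - 1, 0, -1):
        for start in range(n - length + 1):
            sub = words[start:start + length]
            if sub not in seen:
                seen.add(sub)
                yield sub


def consolidate(freqs):
    """Reduce substring frequencies by the frequencies of their superstrings.

    `freqs` maps space-separated n-grams to raw frequencies (hypothetical
    input format). Simplified rule, assumed for this sketch: work from the
    longest n-grams down and subtract each n-gram's consolidated frequency
    from every listed n-gram it contains. Entries that fall to zero or
    below are dropped from the result.
    """
    result = {tuple(ngram.split()): count for ngram, count in freqs.items()}
    for ngram in sorted(result, key=len, reverse=True):
        count = result[ngram]
        if count <= 0:
            continue  # nothing left to attribute to this n-gram
        for sub in sub_ngrams(ngram):
            if sub in result:
                result[sub] -= count
    return {" ".join(ngram): count for ngram, count in result.items() if count > 0}


if __name__ == "__main__":
    # Invented example counts, not data from the SubString documentation.
    counts = {"have a lot of": 5, "a lot of": 15, "a lot": 20}
    print(consolidate(counts))
    # {'have a lot of': 5, 'a lot of': 10, 'a lot': 5}
```

In this example the three consolidated frequencies add up to 20, the raw frequency of "a lot", so each occurrence is counted once, under the longest listed n-gram that covers it.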