Trigrams & Bigrams
Trigrams
Ted, for example, recently searched through all the files in the
Compendium to find the frequencies of the 17,576 possible trigrams
(sequences of three letters drawn from the 26 in the alphabet). A quite
small program (60 lines of code) quickly produced all the frequencies
and a complete analysis of them, as shown by the following tables,
first published in WordsWorth magazine for October 2000.
Note that 44% of the possible trigrams do not occur at all. The counts
are the number of different trigrams having the same frequency: 0, 1, 2
and 3 individually, then ranges such as 4-100, and so on. A check will
reveal that 55% of all trigrams have frequencies less than four.
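
The counting described above can be sketched in a few lines of Python. This is only an illustration, not Ted's 60-line program, whose language and method are not given here. It assumes the words arrive as plain strings and that a trigram is any three consecutive letters inside a word; the function names are invented for this example.

```python
from collections import Counter
from itertools import product
from string import ascii_uppercase


def trigram_frequencies(words):
    """Count every run of three consecutive letters inside each word."""
    counts = Counter()
    for word in words:
        letters = word.upper()
        for i in range(len(letters) - 2):
            tri = letters[i:i + 3]
            # Skip windows containing hyphens, apostrophes or accented letters.
            if tri.isascii() and tri.isalpha():
                counts[tri] += 1
    return counts


def frequency_table(counts):
    """Map each frequency to the number of different trigrams that have it.

    All 26 * 26 * 26 = 17,576 possible trigrams are visited, so trigrams
    that never occur are tallied under frequency 0.
    """
    table = Counter()
    for combo in product(ascii_uppercase, repeat=3):
        table[counts["".join(combo)]] += 1
    return table


if __name__ == "__main__":
    # A toy word list; a real run would read every word file in turn.
    table = frequency_table(trigram_frequencies(["banana", "band"]))
    for frequency in sorted(table):
        print(frequency, table[frequency])
```

Grouping the higher frequencies into ranges such as 4-100 is then a matter of summing the relevant entries of the table.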
Bigrams
Another search, this time for bigrams within a word, revealed the
plural SOMATOSTATINOMAS, a type of tumour. Its 16 letters yield 15
consecutive bigrams, each of which is a valid word (see the April 1998
WordsWorth), and all of which are to be found in the Chambers Official
Scrabble Words 1995. In its singular form, this word could form part of
normal Scrabble© word play, leading to a score for a 15-letter word!
It would not be an easy task to discover a word with such properties
from the database directly; but for a simple program, which would read
each word one bigram at a time, checking whether each bigram existed in
the 2-letter files as a word, it is a doddle!
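
A minimal sketch of such a program follows, in Python. It is not the original program; the function names are invented for this example, and the small set of two-letter words is only a stand-in holding the bigrams of SOMATOSTATINOMAS, which the text above says are all valid. A real run would load the complete 2-letter file instead.

```python
# Stand-in for the 2-letter word file: just the bigrams of SOMATOSTATINOMAS.
SAMPLE_TWO_LETTER_WORDS = {
    "SO", "OM", "MA", "AT", "TO", "OS", "ST", "TA", "TI", "IN", "NO", "AS",
}


def consecutive_bigrams(word):
    """Return the overlapping two-letter slices of a word, left to right."""
    return [word[i:i + 2] for i in range(len(word) - 1)]


def all_bigrams_are_words(word, two_letter_words):
    """True if every consecutive bigram of the word is itself a listed word."""
    bigrams = consecutive_bigrams(word.upper())
    return bool(bigrams) and all(b in two_letter_words for b in bigrams)


def find_bigram_words(words, two_letter_words):
    """Return the qualifying words, longest first."""
    hits = [w for w in words if all_bigrams_are_words(w, two_letter_words)]
    return sorted(hits, key=len, reverse=True)


if __name__ == "__main__":
    candidates = ["SOMATOSTATINOMAS", "SOMATOSTATINOMA", "TREE"]
    for word in find_bigram_words(candidates, SAMPLE_TWO_LETTER_WORDS):
        print(word, len(consecutive_bigrams(word)))
```

Holding the two-letter words in a set keeps each lookup fast, so scanning a whole word list takes a single pass.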
More references to Word Puzzles can be found in Ted Clarke's new book
"HOW THICK IS A SHADOW?"