English Letter Frequency Counts per Google Books Analysis

Peter Norvig hat Mark Mayzner 1960er Studie über die Häufigkeit von Buchstabenkombinationen in der englischen Sprache per Google Books aktualisiert.

Here are the 24 words with length of 20 or more (that are mentioned at least 100,000 times each in the book corpus):

electroencephalographic
polytetrafluoroethylene
forschungsgemeinschaft
deinstitutionalization
counterrevolutionaries
dehydroepiandrosterone
electroencephalography
immunoelectrophoresis
institutionalisation
acetylcholinesterase
internationalization
institutionalization
radiopharmaceuticals
electroencephalogram
keratoconjunctivitis
counterrevolutionary
immunohistochemistry
internationalisation
hypercholesterolemia
phosphatidylinositol
compartmentalization
electrophysiological
electrocardiographic
uncharacteristically

English Letter Frequency Counts: Mayzner Revisited or ETAOIN SRHLDCU (via Adafruit)