Patterns in the word length distribution of synonymous words in the English language
Abstract
Utterance lengths in both fictional and online conversations are gradually shortening through time. One possible cause of shortening is the replacement of words by shorter synonyms. In this work, we analyze patterns in frequencies of words found in WordNet by getting their occurrences from the Google Books N-gram corpus. For sixty-nine percent (69%) of the unambiguous words in the WordNet dataset, the usage of words with shorter lengths rapidly increase after coinage and eventually becomes the most preferred word over its longer synonyms.