Word Mining with Google's Ngram Viewer
Posted by Greg at 05:29, 21 Dec 2010Google has taken on plenty of ambitious projects over the past decade, perhaps none more so than Google Books: the internet giant has so far managed to scan more than 5 million books into digital format. This has - for obvious reasons - generated much controversy regarding copyright and other aspects. Though it has bewildered me for a number of years how most people haven't understood the positive side of this incredible undertaking - basically making the corpus of all printed literature keyword searchable! Forget the old days of searching your local library for a title using the Dewey Decimal system - you can now search a mega-libary instantaneously for a particular keyword anywhere within the book...and then buy or download the book immediately.
Anyhow, recently Google have made a new tool available to search the Google Books database - the Ngram Viewer - which allows tracking of the use of particular words throughout history in published material. It's quite a bit of fun to use, and rather illuminating as well in various ways. As a simple example, and one that I'm sure nobody would doubt too much - usage of the word "vampire" over the last few hundred years, leading up to the modern fixation with this horror archetype:

But it's interesting to find out that other terms were, surprisingly, not written about much until modern times (or perhaps, were discussed via different terminology). For example, "afterlife":

Another interesting exercise is to compare terms. Say for example that we wanted to see the most common spelling of the word describing the field of UFO studies, which tends to always end in disputes - ufology, UFOlogy, or Ufology. Google's Ngram Viewer shows that though it started out as "UFOlogy", since the 1970s "ufology" has become the most used term - though in recent years its usage has fallen off with the variant spellings closing the gap:

Lots of other fun to be had, so head on over and get to it - and report back on any intriguing results in the comments! Science vs religion is particularly illuminating - try plugging in other terms like "Great Pyramid", "Illuminati", and "LSD" to find intriguing spikes during certain decades which point to historical moments and discoveries. Remember to vary your time frames as needed, and be careful how you enter words as they are case sensitive.


Comments
21 December 2010
1 year 21 weeks
Don't you have to statistically adjust any graph
to compensate for the exponential increase in the
number of books published in the last 50 years or
so?
Nearly every searched word would show a dramatic
increase simply because so many more books have
been published recently.
-=R
21 December 2010
1 year 21 weeks
OK. It's explained onsite.
Interesting!
-=R
21 December 2010
1 year 11 weeks
Just performed a search comparing the rise of hypnotism versus mesmerism. Peaks occur in Victorian/Edwardian society for hypnotism, but much earlier for mesmerism from 1840-60. We in the 21st century aren't that much interested in either anymore, except for a resurgence in hypnotism coincident with World War 2 and lasting to circa 1960.
21 December 2010
1 year 21 weeks
Just made an account specifically to respond to this. While hypnotism trails off in the 1960's, mind control picks up almost exactly where hypnotism leaves off and steadily increases to this day. While we did seem to lose interest in older forms of mind control, we remain quite interested in the subject to this very day. Other forms remain in usage as well. Propaganda, advertising, and particularly social engineering all remain in usage and have to varying degrees. My basic point is that we just got a tad more scientific about it. If you want what I find to be a particularly interesting data point; try 'industrial psychology.' Overall a fascinating tool Google has come up with.
6 May 2004
30 weeks 3 hours
I found this tren interesting
http://ngrams.googlelabs.com/graph?conte...
22 November 2004
4 weeks 6 days
The usage or "war" and "peace" seems aligned with ongoing or historically near big wars. Hardly surprising.
What is peculiar though - after Vietnam, there is no big spike in usage or "war", and there is a general down trend on "war". However there is an up trend of "War", since 1850, during the times when "war" usage is declining.
Hmm.
----
We are the cat.
12 April 2007
1 min 19 sec
Always on the rise :)
It's not the depth of the rabbit hole that bugs me...
It's all the rabbit SH*T you stumble over on your way down!!!
Red Pill Junkie
22 November 2004
4 weeks 6 days
A jump in the use of "lose" and "sink" just before 1800, around the time of some big revolutions, why would that be?
http://ngrams.googlelabs.com/graph?conte...
----
We are the cat.