Word Mining with Google's Ngram Viewer

Google has taken on plenty of ambitious projects over the past decade, perhaps none more so than Google Books: the internet giant has so far managed to scan more than 5 million books into digital format. This has - for obvious reasons - generated much controversy regarding copyright and other aspects. Though it has bewildered me for a number of years how most people haven't understood the positive side of this incredible undertaking - basically making the corpus of all printed literature keyword searchable! Forget the old days of searching your local library for a title using the Dewey Decimal system - you can now search a mega-libary instantaneously for a particular keyword anywhere within the book...and then buy or download the book immediately.

Anyhow, recently Google have made a new tool available to search the Google Books database - the Ngram Viewer - which allows tracking of the use of particular words throughout history in published material. It's quite a bit of fun to use, and rather illuminating as well in various ways. As a simple example, and one that I'm sure nobody would doubt too much - usage of the word "vampire" over the last few hundred years, leading up to the modern fixation with this horror archetype:

Vampire Ngram

But it's interesting to find out that other terms were, surprisingly, not written about much until modern times (or perhaps, were discussed via different terminology). For example, "afterlife":

Afterlife Ngram

Another interesting exercise is to compare terms. Say for example that we wanted to see the most common spelling of the word describing the field of UFO studies, which tends to always end in disputes - ufology, UFOlogy, or Ufology. Google's Ngram Viewer shows that though it started out as "UFOlogy", since the 1970s "ufology" has become the most used term - though in recent years its usage has fallen off with the variant spellings closing the gap:

Ufology Ngram

Lots of other fun to be had, so head on over and get to it - and report back on any intriguing results in the comments! Science vs religion is particularly illuminating - try plugging in other terms like "Great Pyramid", "Illuminati", and "LSD" to find intriguing spikes during certain decades which point to historical moments and discoveries. Remember to vary your time frames as needed, and be careful how you enter words as they are case sensitive.

Comments

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.
Robinamw's picture
Member since:
21 December 2010
Last activity:
1 year 21 weeks

Don't you have to statistically adjust any graph
to compensate for the exponential increase in the
number of books published in the last 50 years or
so?

Nearly every searched word would show a dramatic
increase simply because so many more books have
been published recently.

-=R

Robinamw's picture
Member since:
21 December 2010
Last activity:
1 year 21 weeks

OK. It's explained onsite.
Interesting!
-=R

werewolf's picture
Member since:
21 December 2010
Last activity:
1 year 11 weeks

Just performed a search comparing the rise of hypnotism versus mesmerism. Peaks occur in Victorian/Edwardian society for hypnotism, but much earlier for mesmerism from 1840-60. We in the 21st century aren't that much interested in either anymore, except for a resurgence in hypnotism coincident with World War 2 and lasting to circa 1960.

De.Carabas's picture
Member since:
21 December 2010
Last activity:
1 year 21 weeks

Just made an account specifically to respond to this. While hypnotism trails off in the 1960's, mind control picks up almost exactly where hypnotism leaves off and steadily increases to this day. While we did seem to lose interest in older forms of mind control, we remain quite interested in the subject to this very day. Other forms remain in usage as well. Propaganda, advertising, and particularly social engineering all remain in usage and have to varying degrees. My basic point is that we just got a tad more scientific about it. If you want what I find to be a particularly interesting data point; try 'industrial psychology.' Overall a fascinating tool Google has come up with.

ice9melting's picture
Member since:
6 May 2004
Last activity:
30 weeks 3 hours

I found this tren interesting
http://ngrams.googlelabs.com/graph?conte...

earthling's picture
Member since:
22 November 2004
Last activity:
4 weeks 6 days

The usage or "war" and "peace" seems aligned with ongoing or historically near big wars. Hardly surprising.

What is peculiar though - after Vietnam, there is no big spike in usage or "war", and there is a general down trend on "war". However there is an up trend of "War", since 1850, during the times when "war" usage is declining.

Hmm.

----
We are the cat.

red pill junkie's picture
Member since:
12 April 2007
Last activity:
1 min 19 sec

Always on the rise :)

It's not the depth of the rabbit hole that bugs me...
It's all the rabbit SH*T you stumble over on your way down!!!

Red Pill Junkie

earthling's picture
Member since:
22 November 2004
Last activity:
4 weeks 6 days

A jump in the use of "lose" and "sink" just before 1800, around the time of some big revolutions, why would that be?
http://ngrams.googlelabs.com/graph?conte...

----
We are the cat.