Interdisciplinarity

Analyzing Culture with Google Books: Is It Social Science?

January 4, 2012 1898

In a recent opinion piece in Miller-McCune Magazine, argues that discovering fun facts by graphing terms found among the 5 million volumes of the Google Books project sure is amusing — but this pursuit dubbed ‘culturomics’ is not the same as being an historian.

Earlier this year, a group of scientists — mostly in mathematics and evolutionary psychology — published an article in Science titled“Quantitative Analysis of Culture Using Millions of Digitized Books.”The authors’ technique, called “culturomics,” would, they said, “extend the boundaries of rigorous quantitative inquiry to a wide array of new phenomena spanning the social sciences and the humanities.” The authors employed a “corpus” of more than 5 million books — 500 billion words — that have been scanned by Google as part of the Google Books project. These books, the authors assert, represent about 4 percent of all the books ever published, and will allow the kind of statistically significant analysis common to many sciences.

This sounds impressive. The authors point out that 500 billion words are more than any human could reasonably read in a lifetime. Their main method of analysis is to count the number of times a particular word or phrase (referred to as an n-gram) occurs over time in this corpus. (Try your own hand at n-grams here.) Their full data set includes over 2 billion such “culturomic trajectories.” One of the examples the authors give is to trace the usage of the year “1951.” They note that “1951” was not discussed much before the actual year 1951, that it appeared a lot in 1951, and that its usage dropped off after 1951. They call this evidence of collective memory.

I initially reacted to this article with skepticism. As I read more — including a recent piece (one might call it a puff piece) in Nature on one of the co-authors, Erez Lieberman Aiden, in which he was dubbed “the prophet of digital humanities” — my skepticism became stronger. I think culturomics is a nifty tool, but we need to be cautious and critical about this kind of digital data and about claims that culturomics could make “much of what [historians] do trivially easy.” Historians do much more than follow trajectories, so I am not so sure that culturomics will lead to a new way of doing historical work. It’s not the game-changer it’s been claimed to be.

I would not call myself a Luddite — I use digital resources all the time, in my research and my teaching. I have hundreds of PDFs of books I have downloaded from a variety of online sources — Early English Books Online,Eighteenth Century Collections OnlineGallica (the digital service of the French National Library), and yes, Google Books — that I use in my research.

But when I read the Science article, I was immediately struck by what seems to me to be a fundamental flaw in its methodology: its reliance on Google Books for its sample….

Read the rest Here

One of Library Journal’s Best Magazines of 2008, Miller-McCune not only identifies policy issues of global important but provides evidence-based solutions offered by academic research and real-world models. Through excellent but understandable writing and proven judgment in what to cover, the nonprofit Miller-McCune has received a surprising amount of acclaim and, more importantly, a large and growing audience interested in the social and natural sciences.

View all posts by Pacific-Standard Magazine

Related Articles

Less Academic Freedom Will Mean Fewer Collaborative Breakthroughs
News
November 20, 2025

Less Academic Freedom Will Mean Fewer Collaborative Breakthroughs

Read Now
Vaccination: A Child’s Right?
Public Policy
November 17, 2025

Vaccination: A Child’s Right?

Read Now
New Guide Recognizes the Value of Good Curation
Bookshelf
October 29, 2025

New Guide Recognizes the Value of Good Curation

Read Now
The Musée des Confluences: Celebrating Secularism and the Sciences
Public Engagement
October 13, 2025

The Musée des Confluences: Celebrating Secularism and the Sciences

Read Now
Public Health and American Exceptionalism: Part II Raw Milk

Public Health and American Exceptionalism: Part II Raw Milk

‘Blessed are the cheesemakers’ – but not, it seems, in the US. Some years ago, I was at a conference in Madison, […]

Read Now
Public Health and American Exceptionalism: Part I – Vaccine Mandates

Public Health and American Exceptionalism: Part I – Vaccine Mandates

The hullabaloo over COVID-19 vaccine recommendations in the U.S. raises some interesting questions about other areas where public health elites have been […]

Read Now
CDC – Meltdown or Hissy Fit?

CDC – Meltdown or Hissy Fit?

At the time of writing, there is a new stand-off between the Centers for Disease Control and Prevention and the Trump administration […]

Read Now
0 0 votes
Article Rating
Subscribe
Notify of
guest

This site uses Akismet to reduce spam. Learn how your comment data is processed.

0 Comments
Newest
Oldest Most Voted
Inline Feedbacks
View all comments