The Ngram tool

Michel et al. (2011) produced the online tool Ngram.

Ngram produces a graph in which the y-axis shows how many times a phrase occurs in a corpus of books (making up about 6% of all books ever printed; Lin et al. 2012) relative to all phrases composed of the same number of words (i.e., its relative frequency), and the x-axis shows time.

Michel et al. (2011) and Lin et al. (2012) provide a detailed account of Ngram. An application guide is also available online.
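As a rough illustration of the relative frequency plotted on the y-axis, the Python sketch below divides the yearly count of a phrase by the yearly total of all phrases with the same number of words, and expresses the result as a percentage. The function name and the counts are hypothetical and chosen only to show the arithmetic; they are not taken from the Ngram corpus, and the tool itself may apply further processing (such as smoothing).

```python
def relative_frequency_percent(phrase_count, total_ngram_count):
    """Share of same-length phrases in one year, as a percentage."""
    if total_ngram_count <= 0:
        raise ValueError("total_ngram_count must be positive")
    return 100 * phrase_count / total_ngram_count


# Hypothetical counts for a two-word name (illustrative, not real data):
# year -> (occurrences of the name, total two-word phrases that year)
hypothetical_counts = {
    1990: (50, 1_000_000_000),
    2000: (250, 1_000_000_000),
}

for year, (name_count, total) in sorted(hypothetical_counts.items()):
    print(year, relative_frequency_percent(name_count, total))
```

Because the count is divided by the total for the same year, the value stays comparable across years even though the number of books in the corpus differs from year to year.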

Ngram has been successfully used in many fields (Michel et al. 2011, Stergiou 2017), e.g.:

  • social sciences
  • humanities
  • linguistics
  • literature
  • accounting
  • computer science
  • environmental sciences
  • ethics
  • estimating university reputation rankings.

Fame

We can objectively quantify fame, or reputation (i.e., what is said or reported about a name), from the frequency with which the name of a person or object appears in various sources, one of which is books (Michel et al. 2011, Stergiou 2017).

I stress that books are only one source that can be used to assess fame and reputation.

Other sources that can also be used for assessing fame include, among others, newspapers, magazines, media, blogs and social networks (e.g., see here).

Bruce Willis, Arnold Schwarzenegger, Sylvester Stallone

Here I use Ngram to investigate who is the most famous: Bruce Willis, Arnold Schwarzenegger, or Sylvester Stallone.

I do this by comparing the rate of appearance of their names in the corpus of digitized English books published between 1800 and 2010.

As a unit of fame I use (as defined in Stergiou 2017):

  • the famon (Greek fími, fame, from which the Latin fama, fame, is derived)
  • with 1 famon = 1/1 000 000 of a percent (%) of relative Ngram frequency.
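The conversion behind this unit is simple arithmetic, sketched below in Python. Since 1 famon is one millionth of a percent, a relative Ngram frequency given in percent is multiplied by 1 000 000 to obtain famons. The function and constant names are made up for illustration (hypothetical), and the input frequency is an example value, not a reading from the tool.

```python
FAMONS_PER_PERCENT = 1_000_000  # 1 famon = 1/1 000 000 of a percent


def percent_to_famons(relative_frequency_percent):
    """Convert a relative Ngram frequency (in %) to famons."""
    return relative_frequency_percent * FAMONS_PER_PERCENT


def famons_to_percent(famons):
    """Convert famons back to a relative Ngram frequency (in %)."""
    return famons / FAMONS_PER_PERCENT


# Illustrative value only: a name with a relative frequency of 0.000025 %
print(round(percent_to_famons(0.000025), 6))
```

In other words, a relative frequency of 0.000025% corresponds to about 25 famons, which makes the very small percentages on the Ngram y-axis easier to read and compare.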

And the most famous is …

Arnold and Sylvester first appear in books in the early 1970s, whereas Bruce, who is almost 10 years younger than them, first appears in the early 1980s.

Up to the late 1980s Sylvester was somewhat more famous than both Bruce and Arnold.

Sylvester’s fame increases up to the 1995-2000 period as a result of all his great, classic movie hits that established him as an action hero, e.g.:

  • Rocky franchise
  • Rambo franchise
  • Tango and Cash
  • Cliffhanger
  • Demolition Man
  • The Specialist, etc.

Arnold’s fame starts to increase from the early 1970s, as a result of his unique bodybuilding career during the 1970-1980 period. His fame increases almost exponentially until 1995, because of his great movie hits that also established him as an action hero, e.g.:

  • Conan the Barbarian
  • the Terminator franchise
  • Commando
  • Predator
  • Red Heat
  • Total Recall
  • True Lies
  • Eraser, etc.

After reaching a plateau in the late 1990s, Arnold’s fame stands at about 25 famons, which is about 2.5 times higher than that of Bruce and Sylvester, who enjoy similar fame levels of about 10 famons after 2000. Arnold’s high fame level after 2000 is related to his political career as Governor of California during 2003-2011.

Bruce’s fame increases almost exponentially from about 1983 until 2000. Since then it has remained stable. This is attributed to the great success of the Moonlighting series (which started in 1985) and of the Die Hard franchise, which also established him as an action hero, as well as to the success of his many other great movie hits:

  • The Last Boy Scout
  • Striking Distance
  • Pulp Fiction
  • 12 Monkeys
  • Mercury Rising
  • The Fifth Element
  • The Sixth Sense
  • Unbreakable, etc.

But how famous is Arnold when compared to other people or entities? For instance, the goldfish, which is the most famous fish, has enjoyed a fame of 125 famons in recent years, i.e., 5 times that of Arnold.

I will compare the fame of Arnold, Bruce and Sylvester to those of other actors, politicians, authors and scientists in later posts.


Kostas Stergiou

Kostas Stergiou is a Professor at the Aristotle University of Thessaloniki. He is a former Director of the Institute of Marine Biological Resources and Inland Waters of HCMR (2013-2021). His research interests include fish and fisheries ecology, modeling and forecasting, ecosystem management, and bibliometrics. He has contributed more than 200 papers to peer-reviewed journals, along with several other publications (see https://scholar.google.com/citations?user=k8hb4pIAAAAJ). He took up home cinema as a hobby in 2008 and smart homes in 2015, and has installed different home cinema setups in two different houses, both of which have lately been converted into smart homes.
