Energy Twitter Dataset


link to file; to technical report

To preview the dataset, you can use TwEx, which provides an excellent visualization and analysis of the dataset ….:

AIL role desc

In collaboration w/

The dataset contains tweets that include any of the 121 eco-linguistic terms being studied by the Stanford ARPAe project. Overall there are 2.47 million tweets, containing 18,338 hashtags (df_t>3).
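As an illustration of the df_t>3 threshold, here is a minimal Python sketch. It assumes that df_t is the number of tweets in which hashtag t appears; the source does not define the notation. The function names, the hashtag regular expression, and the sample tweets are hypothetical, and none of this is the project's own processing code.

```python
import re
from collections import Counter

# Assumption: a hashtag is "#" followed by word characters.
HASHTAG_RE = re.compile(r"#(\w+)")


def hashtag_document_frequency(tweets):
    """Count, for each hashtag, the number of tweets that contain it."""
    df = Counter()
    for text in tweets:
        # A set ensures a hashtag repeated within one tweet is counted once.
        tags = {tag.lower() for tag in HASHTAG_RE.findall(text)}
        df.update(tags)
    return df


def frequent_hashtags(tweets, min_df=3):
    """Keep hashtags whose document frequency is strictly greater than min_df."""
    df = hashtag_document_frequency(tweets)
    return {tag: count for tag, count in df.items() if count > min_df}


# Hypothetical sample data, not taken from the dataset.
sample_tweets = [
    "Saving #energy at home #Energy",
    "New #energy bill arrived",
    "#energy audit today, thinking about #solar",
    "Smart meters and #energy use",
    "#solar panels installed",
]

kept = frequent_hashtags(sample_tweets)
```

With this sample, `energy` appears in four tweets and is kept, while `solar` appears in only two and is dropped.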

The eco-linguistic terminology used to crawl and collect the tweets was developed at Stanford University by Drs. June Flora, Carrie Armel, and Martha Russell, under sponsorship from the US Advanced Research Projects Agency for Energy, and Media X at Stanford University. A preliminary analysis of this data was presented at BECC2010, November 19, 2010, by Martha Russell and Camilla Yu.  For information about Understanding the Role of Social Media in Changing Consumer’s Energy Behavior, contact


