download PDF - Imaging Notes
download PDF - Imaging Notes
download PDF - Imaging Notes
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
…THE LONG TAIL OF SCIENCE DATA<br />
<br />
RICHARD HEIMANN<br />
Analytics Coordinator<br />
Data Tactics Corportation<br />
McLean, Va.<br />
www.data-tactics-corp.com<br />
THE 20TH CENTURY was seminal for the natural sciences, with discoveries such<br />
as penicillin (Fleming, 1945), the polio vaccine (Salk, 1952), the double helix structure<br />
of DNA (Watson & Crick, 1953), and the first complete DNA sequence of an<br />
organism (Sanger et al., 1977), all of which advanced human understanding and<br />
human welfare. The advent of the OpenWeb and a seemingly endless amount of<br />
new science data have the potential to do for the computational social sciences in<br />
the 21st century what other measurement tools did for the natural sciences of the<br />
20th century – advancing further human understanding and human welfare. Big<br />
Data will be central to that pursuit.<br />
“Data is the new oil” is a phrase coined by Clive Humby and embraced by the<br />
World Economic Forum in 2011 as it considered data as an economic asset like oil.<br />
Every day, we create 2.5 quintillion bytes of data – so much that 90% of the data<br />
in the world today has been created in the last two years alone. (IBM) These data<br />
come from everywhere: hard sensors used to gather information, the social web,<br />
transaction records, and cell phone GPS signals, to name a few. The amount of such<br />
data is big and by every account growing exponentially. These facts, however, give<br />
no hint to where the largest growth is, or comparatively speaking, where the greatest<br />
rewards lie for researchers. It is difficult to believe that data are increasing uniformly.<br />
34<br />
IMAGING NOTES // WINTER 2013 // WWW.IMAGINGNOTES.COM