In 2000 he received a U.S. government grant to begin exploring large datasets using visualization technology.
FORBES: Tableau Software's Pat Hanrahan on "What Is a Data Scientist?"
The sons saw search as an interesting problem in organizing very large datasets.
Many companies now collect large datasets on consumer behavior, be it online search patterns or user demographics.
Other applications might be more robust internal database search for e-discovery or for scientific analysis of large datasets.
FORBES: HP Might Use Autonomy to Build a Search Engine - For Everything
Most of his customers are not creative content or news publishers, but rather companies protecting large datasets or databases.
If we gave away large datasets that cost a lot of money to collect, the data would degenerate over time.
Facebook and Twitter to being a data transfer complex for large datasets, such as those used by the Large Hadron Collider.
FORBES: Netflix: "We Kill Piracy!"; BitTorrent: "Yeah, So, About That Piracy Thing?"
It might be nice to think that people are, for one reason or another, different and not subject to the same rules those that govern other large datasets.
FORBES: Nate Silver, Jonah Goldberg And Conservatism's Intellectual Decline
One of the pioneers in this field is Britain's National Institute for Health and Clinical Excellence, which uses large datasets to investigate the cost and benefit of new drugs and existing expensive treatments.
"It's not the first time something in large epidemiological datasets just didn't work out clinically, " Ferris says, noting that statin medications, anti-inflammatory drugs, and estrogen therapy have all failed to live up to their initial promise in preventing or treating Alzheimer's.
That said, as more academic researchers become interested in examining large-scale datasets (on the order of Twitter or Facebook), many of the technical skills of data science will have to be acquired by academics.
The 1000 Genomes Project in the Cloud offers a showcase for how technology can transform global research around large, heavily used datasets.
应用推荐