The Extract, Transformation and Load (ETL) process that cleans up the data and ensures data integrity, the data warehouse that acts as a scalable and reliable repository for data derived from many sources, and some form of search acceleration for large data volumes are all still needed, and an enlightened IT group is needed to install and maintain them.
FORBES: Tableau Software's Pat Hanrahan on "What Is a Data Scientist?"