In addition, Pentaho Data Integration (PDI) can access raw, unstructured data such as tweets, apply pattern matching to uncover its structure, and perform sentiment analysis.
FORBES: Ideas for Solving the 'Data' Problem First, the 'Big' Problem Second: The Pentaho Way