The latest deduplication advances automate the process of hunting down multiple files at a very granular level and apply specialized compression algorithms to what remains to shrink your data even further.
FORBES: Turn Big Data into Little Data With De-Duplication