E. you read a simple input csv, split it into training and test set, run a simple algorithm with readily-tuneable or explorable hyperparameters, and a simple output of relevant statistics.
Because there are no comparable set of statistics for the Internet, there is no way of answering the simple question: Is using the Web safer today than it was a few years ago?