I think it's much more helpful to have an idea of what you want to get out of yo...

vonmoltke · on Feb 21, 2012

Random data or mashups of public datasets are good for learning the mechanics of specific processing frameworks, but you really need a clear objective guiding the analysis to understand the concepts behind processing big data.

Random data is good for the how and the with what (to an extent), but not for the when and the why.