Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Data collected from devices and it is large, but not big. Around 40-60TB and very repetitive data. Find some open set of data that interests you and just do something to get familiar with the tools.

I think most data sets could be handled via RDBMS and Big Data is just another choice. The more interesting thing to me is what you accomplish and if a new tech can get you there faster or cheaper, etc.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: