Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

What is missed in the article and many of these comments is that Hadoop isn't always going the best tool for one job. It shines in its multitenancy- when many users are running many jobs-each developed in their favorite framework or language(bash/awk pipeline? No problem) running over datasets bigger than single machines can handle.

It also comes in handy when your dataset grows dramatically in size.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: