What is missed in the article and many of these comments is that Hadoop isn't always going the best tool for one job. It shines in its multitenancy- when many users are running many jobs-each developed in their favorite framework or language(bash/awk pipeline? No problem) running over datasets bigger than single machines can handle.
It also comes in handy when your dataset grows dramatically in size.
It also comes in handy when your dataset grows dramatically in size.