This is somewhat similar to a site I have been working on (http://www.newsdive.net). I filter based on story content, whereas this seems to rely more on tags and categories.
At the moment my filtering is pretty simple matching based on a few rules, but a machine learning version is in development. Google, by contrast, appears to be using human editors.
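
For anyone curious what rule-based matching on story content can look like, here is a minimal sketch. It is purely illustrative: the `Rule` class, the `matching_rules` function, and the example keywords are all made up for this post and are not the actual newsdive.net code.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Rule:
    """Hypothetical rule: fires when every required word is present
    in the story and no excluded word is."""
    name: str
    required: frozenset
    excluded: frozenset = field(default_factory=frozenset)


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def matching_rules(story, rules):
    """Return the names of all rules that match the story text, in rule order."""
    words = tokenize(story)
    return [
        rule.name
        for rule in rules
        if rule.required <= words and not (rule.excluded & words)
    ]


# Example rules (assumed, for illustration only).
rules = [
    Rule("python", frozenset({"python"}), frozenset({"snake"})),
    Rule("ml", frozenset({"machine", "learning"})),
]
```

Calling `matching_rules(story_text, rules)` gives the list of rule names a story falls under. Matching on word sets like this ignores word order and context, which is the kind of limitation a machine learning approach would be meant to address.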