My Bullshit Detector software is quite difficult to build. I'm searching through text and URLs to produce an index number (kind of like a reverse PageRank) - but it is difficult to detect context. For instance, I consider the words "Deepak Chopra" to be a good sign that "here be bullshit". But how to differentiate between a page that mindlessly repeats his spiritualist twaddle and a page that rips it to shreds? Basically, this is a good case for a lot of testing and iteration. 
Currently, I'm covering a wide variety of topics in bullshit detection - pseudoscience (things like creationism and "quantum vibrations"), religion (new age, dark age and revival tent style), alternative medicine (homeopathy and the suchlike), conspiracy theories (chemtrails, Holocaust denial) and the folks who tend to trade in bullshit (Ann Coulter, Prince Charles, Pat Robertson etc.). The other main problem with developing a bullshit detector is that you have to basically don the thickest damn waders you can and climb right in and hunt that bullshit. Currently, I have about 278 terms which I look for (often with variations - I'm using Perl-compatible regex) in order to detect whether the page is bullshit or not. Most of these are names of bullshiters, but also some of the many silly words, organisations and practices they use to foist their rubbish off on us under. 
I've built it, and am going to be demoing it tonight to a few friends. If they like it, I'll publish it tomorrow (and, of course, roll their suggestions in to the iterative development process. I'm using microformat-style formatting in order to make it in to a very basic API too. I might also make it so that you can choose to run it in "reporting" mode, which would basically add the results to a database - so I could then output a directory of websites based on their Bullshit Rating. 
There is an important task in this - which is to try and shake people in to building web applications that take a stand on things, rather than just trying to remain agnostic. Part of "users and developers party together" is the fact that developers are users too. 
Comments | TrackBack 