2007.04.29

History of the Bagel is a long, scholarly article about - yeah, you guessed it - bagels. I'm still trying to find a bagel shop in the West End that produces edible and cheap bagels. All I can find is the ruddy Bagel Factory. Smile and a wink 2007-04-29T17:57:30ZUntitled entry permalink

Jesus and Mo are concerned about bigotophobia. 2007-04-29T16:11:59ZUntitled entry permalink

New RSS and pinging now in PHP OPML blogger 2007-04-29T16:15:41ZTitled entry permalink

I have updated my PHP-based OPML blog renderer to support RSS better. I was having a bug whereby if the 'created' attribute wasn't set properly, a mangled link URI would be put in to the RSS. This has been fixed.

I have also had a number of times where blank entries have gone out. I have fixed this by rendering my RSS to a static file. This has been switched over automatically and you should be seeing no difference on your end.

In the process, a convenient place arrived for me to put pingers for Weblogs.com, Feedburner etc. This means that when I put 'build RSS' in my outliner or post from my mobile blogging service, pings should be sent out automatically to the relevant places. I would add more services - or just use Pingomatic, but there doesn't seem to be any documentation of their methods... I can presume they are modelled on the same behaviour as Weblogs.com, but it would be useful to at least have the method names, arguments and suchlike viewable...

Building your own blogging application is fun, and teaches you to do stuff like XML-RPC. The public version will be available soon - I just haven't got time to clean up the code at the moment.

Comments | TrackBack

Standards compliance is context sensitive 2007-04-29T16:47:06ZTitled entry permalink

Shelley Powers has weighed in on XHTML and strict parsing. At risk of getting flamed, I'll give some personal opinions. Remember: reasonable people can disagree, and I Am Not A Heretic.

Personally, I think that it's not a clear cut thing. I think that if an XHTML page is not well-formed, it should still render using tag soup, even if it is served as application/xhtml+xml. If I'm reading someone's crappy MySpace page, I don't care about validation.

Now, if I'm getting financial information that may have a huge significance on, say, whether I have a roof over my head next week - like a legal declaration or contract or something equally important, you bet your arse I want my computer to flash a big red light if the XML isn't well-formed. This is an important piece of context-sensitivity.

I don't care about the MIME type, I care about the fact that there are a lot of things which really ought to be well-formed to avoid ambiguity. If I get a file containing my student loan agreement over the web, you can bet I want it 'conservatively' parsed. One misplaced apostrophe in the legal world where English is the lingua franca can cost you thousands, and online, one misplaced semi-colon or unquoted ampersand sign can also cost you thousands. XML - whatever it's shortcomings - solves this. It can quite easily be the lingua franca of the Web.

Why don't we go further? If 'forgiveness' or 'liberal parsing' is acceptable in HTML, maybe even XML, why not JSON? Surely, not everyone gets their JSON right all the time, so shouldn't we be a little bit more tolerant about it? Let's have some smart algorithm that works out how the object should have been serialised and rearrange it for us. And as for C++, let's just let you type anything. Let's have fault-tolerant compilers. That'd make the software experience just dandy for everybody. Don't worry yourself about the fact that leaving that semi-colon off line 42 means that Carl's computer in Accounts now burps on alternate Thursdays. Error? What error? If I don't see an error, it doesn't exist!

The problem with the discourse about XHTML is that people look at IE, Safari, Firefox and Mozilla and think "that's all there is". There is a long tail of applications which parse the web, both for human consumption - screen readers and other tools to aid accessibility - but for 'web of data' consumption too.

A healthy platform is one where it's as easy to write a parser as it is to write a document. Parsing HTML is bloody hard work, and we have to use inelegant hacks like Tidy, Hpricot and BeautifulSoup to get data out of HTML, when we should be using XML-based methods like XPath, XQuery and XSLT which are quicker, sexier and lead to far less hair loss.

The argument that we shouldn't bother with XHTML because there's so much tag soup out there is, frankly, a bad argument. It's a bit like saying "well, we shouldn't bother to punish criminals because there's so much crime out there". The fact that something is either popular or unpopular doesn't mean it's worth or not worth doing.

Lowering the barrier on making tools to use content on the Web is a valuable thing, but the average web designer doesn't see it as one. Even if we don't get to a well-formed web, it's still worth striving to do so.

Human authors are only part of the picture. It's quite easy for tool makers to spend an extra few minutes and provide XHTML. The argument about browsers is a distraction from the key part of this. Well-formed XML is good in and of itself because it makes the process of making tools to consume data easier.

Perhaps there is another motive why the browser manufactuers (whose interests guide the W3C) are kicking their heels on XHTML adoption - because it would make competing with the currently established interests a bit easier. (Then again, as an 'XML guy', perhaps there are motives for me too...)

When I ask for XHTML, I'm not saying that having pages break if they have a misplaced ampersand is a good thing. I'm saying that the eventual replacement of the SGML-based web with an XML-based one would be a Good Thing because every webpage could become an API and not just a document. There are going to be engineering problems involved in that (and a boatload of clueless folks in suits getting het up about stuff), but not switching to XML is putting off the inevitable. We need some blue sky thinking in the standards area, some vision of where we are going - otherwise we get stuck in the trap of minutiae.

Comments | TrackBack

 

Login with your OpenID:
Le Web '08, Paris, December 9th and 10th, Register Now! 20% off.
No. 545
Tom Morris
Currently in: East Sussex, England
Usually in: East Sussex, United Kingdom
AIM: tommorris
YIM: tom.morris

I am a , an , like to code in and noodle about with and the . I also have a BA in philosophy from London, and am studying for an MA. My philosophical interests are in Victorian-era German philosophy, Kierkegaard, Robert Nozick, hermeneutics and current approaches to the demarcation problem in the philosophy of science. Musically, I like jazz fusion, soul and P-Funk. My musical nirvana would be a mixture of Beethoven, Miles Davis and George Clinton topped with a side-serving of Erykah, Jill and Angie.

I also write for the Citizendium, an online encyclopedia project. If you know about stuff, you should join in.

Elsewhere:

  • GPG Key
  • del.icio.us
  • Flickr
  • Twitter
  • digg
  • Jaiku
  • LinkedIn
  • ma.gnolia
  • blip.tv
  • upcoming.org
  • MetaFilter
  • LiveJournal
  • CiteULike
  • Technorati Profile

RSS Feed Subscribe:

RDF

« April 2007 »
SuMoTuWeThFrSa
1234567
891011121314
15161718192021
22232425262728
2930 

View in month context

On this day in: 2006 2008