Technorati cosmos
What's the point in XHTML?
So, why do people use XHTML?
Mark talks about the XHTML vs. RSS debate, in somewhat disparaging terms, stating that "Syndication is not Publication".
The entire success of RSS is predicated on the principle that "you can keep doing whatever messed up stuff you've always done on your web pages... oh, and do this other thing too. Look, it's simple, you can code it up in an hour with a few print statements and an escape function." By contrast, this latest XHTML-as-syndication movement seems to be based on the principle that "syndication is so incredibly important that you must immediately stop whatever you're doing with your web pages, upgrade to XHTML, validate your markup, restructure your home page to include all and only the content you're willing to syndicate, and by the way, would you please unlearn that ugly nasty presentational page layout language you've been using for years and learn this wonderful happy structured semantic markup language instead?"
It should be obvious to any rational observer that this will go nowhere fast. A syndication format that requires valid semantic XHTML markup? Spare me. 9 out of 10 bloggers can't even spell XHTML.
One of the issues here is that 9 out of 10 bloggers can't spell RSS either. In fact, a reasonable propertion of bloggers don't even know what RSS is, or that their weblogging tool generates it for them. That so many pages are invalid is not the fault of these 9 out of 10 bloggers, it's the fault of their tool of choice. Imagine if a weblogging tool checked your built pages for XML well-formedness. (Obviously this only works for baked systems, rather than fried.) Not validation against a DTD or anything, just well-formedness. It's not going to solve all problems overnight, I admit, but being able to use XHTML in pursuit of one of its major wins -- that it's XML as well, and therefore can be parsed by the vast suite of XML tools which insist on a minimum level of compliance (by which I mean well-formedness) and then derive power from that (or are there, say, XPath tools for HTML 4.01 that I don't know about? -- would make life a lot easier. If your intention, O XHTML user, is to have your pages marked up in XHTML in pursuit of this goal, then why do you need RSS or similar formats? If your aim is not to use XHTML for this purpose, what is your purpose? Mark's "It's like...semantic and stuff" remark isn't a flappy-handed meaningless contention on the part of XHTML pushers, it's the point of the whole exercise.
He does make a good point about bandwidth, I freely admit. That, however, reduces RSS from being the Right Thing to being an optimisation away from correctness in order to live in the real world. RSS makes your bandwidth bill cheaper; true. And that's quite possibly a major motivating factor for almost everyone, because real world issues come above principles to some extent. It doesn't make RSS right, though.
Moreover, the further points about corner cases such as Dorothea's Latin dates are entirely valid. That backs up the core contention that publication and syndication are not the same thing, assuming that you actually need an accurate date on a post for anything. Those of you who use RSS newsreaders, do you tend to look at "all the latest posts from all my feeds" (in which case a date is obviously vital) or "all feeds with new content" and then read that feed in isolation (in which case you don't need a date at all, although the newsreader does need to cache what it last saw to check for updates rather than caching the last time you read a feed to see whether there are posts since -- an MD5 hash of the feed contents is not much larger than a stored date). Quoting Dean Allen's use of hand-crafted descriptions in RSS misses the point; I fail to see how any hand-crafted description could be a better description of the post than the post itself is! I'm open to correction on this point; Mark quotes Shelley's desire to not have her whole feed in RSS to avoid republication -- I can't see how those who would want to republish her content couldn't just rip it off her website, so I'm unclear on the point of this.
I mean, I've set explicit excerpts on posts just for my RSS feed. But not because I think that this is a great way to summarise my message. Because I want to use entities like — in my HTML, and it doesn't work in RSS. I have to remember to use " instead of " in all my posts to avoid breaking my RSS feed. This is a hindrance, not the easy path to syndication.
This is in danger of descending into the age old argument between attempting to change the world to make the Right Thing the thing that works, and stepping away from the Right Thing so that your implementation works in the world. Pushing the former too far leads to zealotry. Pushing the latter too far leads to Internet Explorer v 4. The real truth is somewhere in the middle, and once again some people are trying to decide where that line which they won't step over actually lies.
More on XHTML vs. RSS
Visions of the future
Minor layout change
Entities and whatnot in RSS
EUCD victory
XHTML instead of RSS
All CSS hacks, all the time
Mozilla largeness
Little things for big minds
Code and HTML and code, oh my!
Catch-ups
Drone: Okay Sir, Could we have @DETAILS?-----
Me: @DETAILS
Links between other places
Universal search engine
No RAND in W3C standards
"The community now needs to be heard supporting this policy so that it is not undone during the public input and W3C Advisory Council phase. Address your comments to www-patentpolicy-comment@w3.org."This is absolutely true; if we show support then the W3C will have it affirmed that there are people out there who care about more than patent royalties. They do a good job; show them you appreciate it. -----
hr.rec.tv.buffy
I on postoji, iako bi većina obozavatelja vjerojatno radije htjela da ne postoji. Joss Whedon autor je scenarija, ali nije imao previse utjecaja na film, pa se konačna verzija razlikuje od njegovog viđenja filma. I glumi Luke Perry, sto je dovoljno zastrasujuće. Film je već nekoliko puta emitirala Hrvatska televizija.I'm sure I recognise one or two of those names :)
RISC OS revolution
RISC OS is very revolutionary. The Desktop is largely unchanged for 14 years because it was designed correctly in the first place!It is, too. A RISC OS desktop today looks pretty similar to a Windows desktop, or a Mac desktop, or a KDE desktop, or a Gnome desktop. The point is that it looked the same as that when I started using it a decade ago. Well done the RISC OS team. -----
Project Genesis
When good interfaces go crufty
Hiding CSS from NN4
Pay for all your email
Joel Spolsky suggests that, to counter spam, someone sets up a service where sending email costs a penny, and everyone allows all mail that comes from that server through their spam filters, because you know that spammers wouldn't (and couldn't) pay money for the vast volumes of mail they send. In principle, not a bad idea. Sadly, though, Joel then seems to put his corporate head back on and ruins the whole effect:
Eventually, if it caught on, you wouldn't need a spam filter: just put all the free email in a suspect folder, and check it once a week in case some old school holdouts insist on sending you email without paying.Yeah, that's right. Because then we'll be at a stage where we've managed to make people pay money to use a service that was formerly free, and now you can look down on those people who don't want to pay because they think that it should be free. This I consider to be blind corporate thinking -- it's the reason that Linux is not tried in more businesses. (Note: not the reason it's not used more; I have no problem at all with firms who try it and decide that it's not for them. But failing to investigate an alternative that might be better and might save you money just because you don't like the idea of it, because it's "free" and therefore worthless, is blinkered.)
Funky caching
The Day Alertbox Died
Remember, remember November
Instantaneous web services
A census of blogspace
- Do you have an educated guess?
- Not even remotely. No idea. I could pick a figure, but it could be out by two orders of magnitude.
- Do you know of any prior work in this area?
- Not that I'm aware of, I must admit, although it could well have passed me by.
- Can you think of a methodology or two to create useful measures of the number of bloggers and the number of weblogs?
- Google. Google is the best way for queries about all of the net, because it indexes all of the net. You could get a rough estimate of the number of webloggers by making a few simplifying assumptions: all webloggers either have their own domain or are using one of a few weblogging hosts (blogspot, Livejournal, etc, it's a fairly short list), getting user counts from each of the major hosts, and then searching Google for the word "permalink" and extracting the number of unique domains. That'll be a low estimate, because there are multiple weblogs on some domains, and because not all weblogs use the word permalink, but it'd be a figure to begin working with.
The other alternative is to assume that all weblogs are interconnected (see the next question), start at one place, and link-crawl yourself, counting as you go. You'd need rules of what constituted a weblog, which is something not well-defined for a person looking at one, never mind an automated process, but hey. - What related questions would you want answered?
- How many different "islands" are there in the interconnected map of weblogs? Can you navigate from any given blog to any other blog by merely travelling links between weblogs? What does the map look like? What's the most connected node? Which node is at the centre of the map? Lots of questions about the map of links, really.
- How might you use this information?
- Blimes, I dunno. It'd be interesting to look at :) I could do a "six steps to as days pass by" thing, or something.
- Pitfalls to avoid?
- No idea, guv. At this stage, where there's no data at all, any data is better than none, so make assumptions, guess figures, and so on. We can refine the data later.
- Would you join a BlogCensus.org to provide and share stats?
- Suppose so, but I always find that sort of thing fairly silly, because the audience is self-selecting. The Linux Counter is much the same principle, and it's pretty useless in terms of information.