On Tue, Mar 13, 2012 at 01:33:28AM -0700, Brian Dolbec wrote: > ... and just when I was beginning to think no one actually cared :) ... I specifically wanted to avoid any special regex to pull data out of the XML. Merging fields is acceptable, splitting them based on regex isn't.
> The proper form of an irc url is in my example > "irc://irc.gentoo.org/gentoo-guis" and I took it from gentoo's irc > channel page at http://www.gentoo.org/main/en/irc.xml . The '#' is debated in the URL scheme specs. The last RFC draft I saw for it was: http://tools.ietf.org/html/draft-butcher-irc-url-04 Earlier drafts did explicitly call for dropping the '#', but that lead to trouble distinguishing between a user with the same name as a channel. > That would mean limiting a single <irc></irc> field to just valid url's > just like the <homepage></homepage> field. We can allow 0 or more irc fields in the DTD... > Personally I would find it quite simple to use a reg expression to > extract a valid irc url from a mixture of written text and url. > <irc>#gentoo-guis on the freenode IRC network, > irc://irc.gentoo.org/gentoo-guis</irc> Don't use a regex on XML. Actually connect it properly. > Would it be better that I create 2 irc sub data types then? > > <irc> > <description>#gentoo-guis on the freenode IRC network</description> > <url>irc://irc.gentoo.org/gentoo-guis</url> > </irc> No, that's really bloated. > So far it seems many/most systems do not come setup to recognize and > take proper action for irc:// mime types like they do for http:// It's not a mime type. It's URL scheme. Docbook/GuideXML style: Option 1a) <irc link="irc://irc.gentoo.org/#gentoo-guis" /> Option 1b) <irc link="irc://irc.gentoo.org/#gentoo-guis"> For GUI issues in Gentoo </irc> HTML style: Option 2a) <a rel="irc" href="irc://irc.gentoo.org/#gentoo-guis" /> Option 2b) <a rel="irc" href="irc://irc.gentoo.org/#gentoo-guis"> For GUI issues in Gentoo </a> -- Robin Hugh Johnson Gentoo Linux: Developer, Trustee & Infrastructure Lead E-Mail : robb...@gentoo.org GnuPG FP : 11ACBA4F 4778E3F6 E4EDF38E B27B944E 34884E85