XML Isn’t Enough

A lot of this is in my XML Server presentation at the Innovative Users Group conference in a couple weeks…

Jenny Levine is an outspoken advocate for the use of RSS in libraries. One example she cites is posting lists of new acquisitions to library websites. She estimates that folks in the 77 libraries of her library system spend 924 hours per year on that one activity, time that could be used elsewhere if automated by RSS. So it’s easy to see why I wanted to mention her in my presentation.

The problem is that even though RSS is XML, XML isn’t always as useful as RSS. Quickly, what is RSS? RSS is an XML schema for content syndication that has broad client and server software support.

XML, however has no schema. It’s a standard, but so is ASCII. As a practical matter, ASCII data must delimited in some way to be useful as a form for information exchange between computers. XML without a schema is like ASCII without those pesky delimiters. Here it is from Wikipedia:

XML provides a surface syntax for structured documents, but imposes no semantic constraints on the meaning of these documents.

Data exchange requires all the standards of the networks the data travels on, plus standard ways of reading and understanding the information once received. XML provides a standard and flexible way of delimiting the data, but relies on schemas or DTDs layered on top of it to make the data it contains meaningful.

When done right, however, XML-formatted data are the basic building blocks for the Semantic Web, “a project that intends to create a universal medium for information exchange by giving meaning (semantics) in a manner understandable by machines, to the content of documents on the Web.”

XML schema standards are still new, and Wikipedia admits that the specification is “difficult to understand and implement,” but a few standard schemas are emerging. One useful standard for libraries is MARC XML (example). XML representations of Dublin Core are also standardizing.

Amazon deployed their XML Web Services some time ago and in doing so created one of the first standards for exchanging catalog information. Fortunately, their documentation is good and the schema is well designed. They’ve made it easy to build storefronts that push their content, and more than few people are doing just that.

Amazon is also pushing OpenSearch, an RSS-like XML schema that aims to create standards for metasearch. RSS has shown us how useful news aggregators can be, Amazon is making their A9 search engine into a sort of “search aggregator.” (There are too many terms for this: metasearch, federated search, broadcast searching; but now I’d like to add “aggregated search”).

Try it out now, you can search the Seattle Public Library, then click a button to bring up Wikipedia and the web right next to it.

So what do these other XML schemas offer that RSS doesn’t? RSS’s simplicity is also a barrier to more complex uses. It’s easy to embed book covers, titles, and descriptions of new books along with a link in an RSS feed, but on it’s own RSS is at a loss to express these bibliographic details in a way another computer can understand. (Yes, RDF solves some of these problems at the cost of the simplicity that made RSS so popular in the first place). This same list of new books in MARC XML would have richer detail, perhaps allowing the user to re-sort the list by author, publisher, call number, or any other field in the MARC XML data. Computers need to be told “this is the publisher, this is the call number,” and MARC XML allows that.

All of these technologies depend on XML and a schema. And they’re all changing the way we consume and interact with information.

[update:] it looks like Richard Wallis over at Talis was thinking the same thing I was thinking in response to Jenny Levine’s post on OPACs and XML.

No Comments Yet

No comments yet.

Comments RSS TrackBack Identifier URI

Leave a comment

 

User contributed tags for this post:

ascii devil horns (58) - ascii art devil horns (33) - devil horns ascii (30) - video de (14) - horns ascii (10) - ASCII HORNS (8) - opensearch XML schema (8) - devil horn ascii (7) - wikipedia xml schema (6) - xml semantic web isn t enough (4) - opensearch schema (3) - ASCII images devil horns (3) - devil horns ascii art (3) - devils horns ascii images (3) - devils horns ascii (3) - ASCII devils (2) - ascii devil sign (2) - The Semantic Web project is a project that intends to c (2) - wikipedia xml schema standards (2) - Mark Schutte seattle (2) - embed blogger (2) - blogger xml schema (2) - brewery xml schema (2) - Library Federated Search (2) - delimiting comments xml (2) - google earth XML (2) - new acquisitions with xml (2) - google earth schema (2) - opensearch (2) - XML Schema Wikipedia (2) - libraries (2) - how to make wikipedia xml search (2) - embed blogger xml and frontpage (2) - google earth xml schema (2) - wikipedia schema (2) - schema wikipedia (2) - delimiting in xml (2) - how to make devils horns in ascii (1) - ascii hornz (1) - devil horns ASCII images (1) - devil horn ascii art (1) - devil horns in ascii (1) - ASCII image devil horns (1) - ascii devil horns sign (1) - ascii devil's horn (1) - devil horns syntax (1) - devil's horns ascii (1) - ascii devil (1) - ASCII DEVILS HORNS (1) - MARCXML book (1) - devil sign ascii (1) - ascii devil horns symbol (1) - EVIL HORN ASCII (1) - ascii devil devil horns (1) - devil's horn ascii (1) - ascii art devil (1) - ascii devil horn (1) - ascii art devil's horns (1) - ASCII symbols devil horns (1) - ascii devil's horns (1) - sign horns ascii (1) - devil horns ascci (1) - devil ascii mark (1) - ascii art devil horn (1) - ascii pictures devil horns (1) - standards for metasearch (1) - embed blogger rss feed (1) - devils horn ascii (1) - wikipedia XMLSchema (1) - library websites semantic web (1) - 5955709157713 (1) - when RSS isn t enough (1) - KEYWORD (1) - Google Earth XML Search (1) - wikipedia XML (1) - amazon (1) - library (1) - wikipedia OR mediawiki schema comments OR comment OR fl (1) - xml isn't enough (1) - Tag Cloud RSS schema (1) - translate wikipedia xml data (1) - data exchange XML review wikipedia (1) - schemas wikipedia (1) - ascii pictures devils horns (1) - mark xml (1) - ascii devils horn (1) - wikipedia xml sample (1) - devil s horn ascii (1) - m devils horns ascii (1) - RSS feed XML schema (1) - s funnyfree net (1) - xml schema for opensearch (1) - Colorful XML my presentation (1) - library 2 0 xmlschema standards library (1) - wordpress (1) -