search

Web Search Re-Imagined: Searchme iPhone App

Re-imagined a bit, anyway. Why browse a vertical list of results when you can flip through them like pages in a book (or album covers in iTunes). Searchme on the iPhone and iPod touch does just that. As you type your search term, icons representing rough categories appear, allowing you to target your search and […] » about 300 words

DeWitt Clinton On The Birth of OpenSearch

OpenSearch is a common way of querying a database for content and returning the results. The idea is that it brings sanity to the proliferation of search APIs, but a realistic view would have to admit that we’ve been trying to do that since before the development of z39.50 in libraries decades ago, and the […] » about 900 words

Cataloging Errors

A bibliographic instruction quiz we used to use asked students how many of Dan Brown’s books could be found in our catalog. The idea was that attentive students would dutifully search by author for “brown, dan,” get redirected to “Brown, Dan 1964-,” and find three books. Indeed, the expected answer was “three.”

As it turns out, my library has all four of Dan Brown’s published books, including the missing Digital Fortress. The problem is that three books are cataloged under the more common Brown, Dan 1964-, but Fortress was cataloged under Brown, Danielle.

The problem is that cataloging is imperfect.

Yeah, it takes some marbles to say that, but the fact is that cataloging is a human endeavor. Humans make mistakes. The challenge we face is to build systems that tolerate error, and then make it easy to fix those errors when discovered.

OpenSearch In A Nutshell

OpenSearch is a standard way of querying a database for content and returning the results. The official docs note simply: “Any website that has a search feature can make their results available in OpenSearch format,” then adds: “Publishing your search results in OpenSearch™ format will draw more people to your content, by exposing it to […] » about 300 words

Context, Language, Systems

“Bagged products” is little better than “cookery.” I’m gonna bet that no customer has ever asked the sales people for “bagged products,” that nobody’s ever checked the yellow pages for “bagged products,” and without context, nobody would come close to answering a question on what the heck “bagged products” are all about. But we do […] » about 300 words

Talking ‘Bout Library 2.0

Users want a rich pool from which to search, simplicity, and satisfaction. One does not have to take a 50-minute instruction session to order from Amazon. Why should libraries continue to be so difficult for our users to master?

— from page 8 of the The University of California Libraries Bibliographic Services Task Force Final Report. I find a new gem every time I look at it.

Standards Cage Match

I prefaced my point about how the standards we choose in libraries isolate us from the larger stream of progress driving development outside libraries with the note that I was sure to get hanged for it. It’s true. I commented that there were over 140,00 registered Amazon API developers and 365 public OpenSearch targets (hey […] » about 1000 words

Ryan Eby’s Pursuit of Live-Search

Ryan Eby gets excited over LiveSearch. And who can blame him? I mention the preceding because it explains the following: two links leading to some good examples of livesearch in the wild.

Inquisitor is a livesearch plugin for OS X’s Safari web browser. It gives the top few hits, spelling suggestions where appropriate, and links to jump to other search engines.

Garrett Murray’s ManiacalRage is an interesting blog on its own, but he’s also doing some good AJAX on his search interfaces. Look first at the archive search. But also take some time to appreciate the new content search. Sure, you’ll have some complaints, but it’s his site and not yours and there are some ideas there that are pretty interesting and useful.

OpenSearch Spec Updated

I just received this email from the A9 OpenSearch team:

We have just released OpenSearch 1.1 Draft 2. We hope to declare it the final version shortly, and it is already supported by A9.com. Uprading from a previous version should only take a few minutes…

OpenSearch 1.1 allows you to specify search results in HTML, Atom, or any other format (or multiple formats) in addition to just RSS. In addition, OpenSearch 1.1 will be supported by Internet Explorer 7, among other software, so we strongly recommend that you upgrade. Also new is the ability to specify suggested searches, such as spelling suggestions and related queries. (link and emphasis addded)

Woot! I’ll be doing something with this soon.

What’s In A Web Search?

Sometimes the answer isn’t as interesting as the question. Consider this note from Yahoo Buzz:

On Sunday, the day before the nomination became official, [searches for] Alito sprang up a sudden 320%.

Did searches for Alito spike on tips White House staffers, or were White House Staffers vetting their nominee via the search engines?

Findability, The Google Economy, and Libraries

Peter Morville, author of Ambient Findability, stirred up the web4lib email list with a message about Authority and Findability. His message is about how services like Wikipedia and Google are changing our global information architecture and the meaning of “authority.” The reaction was quick, and largely critical, but good argument tests our thinking and weeds […] » about 400 words

Must Read: Ambient Findability

Peter Morville‘s Ambient Findability sold out at Amazon today on the first day of release. There’s a reason: it’s good. Morville’s work is the most appropriate follow-on to the usability concepts so well promoted by Steven Krug in his Don’t Make Me Think and Jakob Nielsen in Designing Web Usability. Findability, Morville argues, is a […] » about 300 words

Ambient Findability And The Google Economy

I’m only just getting into Peter Morville‘s Ambient Findability, but I’m eating it up. In trying to prep the reader to understand his thesis — summed up on the front cover as “what we find changes who we become” — Morville relates his difficulty in finding authoritative, non-marketing information about his daughter’s newly diagnosed peanut […] » about 500 words

Search, Findability, The Google Economy: How It Shapes Us

Just when I was beginning to feel a little on my own with my talk about the Google Economy here, I see two related new books are coming out. The first is Peter Morville’s Ambient Findability. The second is John Battelle’s The Search.

Findability appears to ask the big question that I’ve been pushing toward. From the description at Amazon:

Are we truly at a critical point in our evolution where the quality of our digital networks will dictate how we behave as a species? Is findability indeed the primary key to a successful global marketplace in the 21st century and beyond?

Here, as always when thinking about information, think about “marketplace” in broader terms than pure commercial, pure profit. This is the Google Economy.

Doing Relevance Ranked Full-Text Searches In MySQL

I’m going out on a limb to say MySQL’s full-text indexing and searching features are underused. They appeared in MySQL 3.23.23 (most people are using 4.x, and 5 is in development), but it’s been news to most of the people I know.

Here’s the deal, the MATCH() function can search a full-text index for a string of text (one or more words) and return relevance-ranked results. It’s at the core of the list of related links at the bottom of every post here.

For that query, I put all the tag names into a single variable that might look like this:

$keywords = “mysql database php select full-text search full-text searching docs documentation”

Then I do a select that looks something like this:

SELECT * FROM wp_posts WHERE MATCH(post_title,post_content) AGAINST(‘$keywords’);

The docs give a lot more detail, including how to do boolean searches.

The Google Economy Will Beat You With A Stick

Call it a law, or dictum, or just a big stick, but it goes like this:

The value and influence of an idea or piece of information is limited by the extent that the information provider has embraced the Google Economy; unavailable or unfindable information buried on the second or tenth page of search results might as well be hidden in a cave.

The Google Economy — The Wikipedia Entry

I’m rather passionate about the Google Economy, so it shouldn’t be too much of a surprise to learn that I just wrote about it in my first ever Wikipedia entry. Here it is: http://en.wikipedia.org/wiki/Google_economy “Google Economy” identifies the concept that the value of a resource can be determined by the way that resource is linked […] » about 600 words

Tags Tags Tags

David Weinberger at Many-to-Many pointed me to Tom Coates’ post about different schools of thought regarding tags. Coates has been thinking about tags as keywords, annotations. Thats how I’ve been using and thinking about tags too, but some people have different ideas.

…At the end of the argument I said to Joshua that it was almost like he was treating tags as folders. And he replied, exasperated, that this was exactly what they were.

Exasperation aside, Coates is pretty sure that Joshua’s view is loosing currency and the keywords view is growing.

Wienberger offers this explanation: we use tags as folders to organize things for ourselves, but we use tags as keywords as a way to contribute to the social understanding of things. That’s what Yahoo’s Social Search is trying to leverage.

Related: Google’s War On Hierarchy.

The Google Economy Vs. Libraries

Roger over at Electric Forest is making some arguments about the value of open access to information. Hopefully he’ll forgive me for my edit of his comment (though readers check the original to make sure I preserved the original meaning):

…keep the [information] under heavy protection and you will find that people ignore this sheltered content in favor of the sources that embrace the web and make everything accessible… [Open and accessible resources] will become the influential authorities, not because they are more trustworthy, or more authoritative, or better written, but because they are more accessible.

I’ve been calling this the “Google Economy,” where the value of information is directly proportional to its accessibility. This is a foreign land to libraries, where isolation and division of information is the norm (just count the number of unrelated search boxes linked on your library site), but it’s something I see a few people working to overcome. Kudos to Roger and others for a lot of great work.