Libraries & Networked Information

Tips To Publishers From Google News

It turns out that there are a lot of differences between Google’s regular web crawler and the Google News crawler. And though very few of us will find our content included in Google News, it still seems like a good idea to make our content conform to their technical requirements. Here are a few of them:

  • In order for our crawler to correctly gather your content, each article needs to link to a page dedicated solely to that article. We’re unable to index articles from news sections which consist of one long page rather than a series of links that lead to articles on individual pages.
  • If your articles are located in a drop down box, we won’t be able to crawl them. Google News is unable to crawl articles only accessible through a drop down menu.
  • Google News does not recognize or follow Flash, graphic/image or JavaScript links which link to articles. Our automated crawler is best able to crawl plain text HTML links.
  • Google News doesn’t crawl articles in PDF format, although this content is included on Google Web Search. Our automated crawler is currently best able to crawl plain text HTML sites.

Facebook’s Favorite Metadata

[Facebook’s guide to sharing][1] details some meta tags to make that sharing work better:

In order to make sure that the preview is always correctly populated, you should add the tags shown below to your html. An example news story could have the following:

> 
> As shown, title contains the preview title, description contains the preview summary and image_src contains the preview image. Please make sure that none of the content fields contain any html markup because it will be stripped out. For consistency’s sake, please use the 
> 
> <meta />
> tag to provide text data for the preview, and the 
> 
> <link />
> tag for any source urls.
> 
> The title and summary tags are the minimum requirements for any preview, so make sure to include these two.

 [1]: http://www.facebook.com/share_partners.php "Facebook | Share Partners"

WiFi Is Critical To Academia, The WiFi Alliance Says

study sponsored by the WiFi alliance reveals the following:

WiFi and college choice

  • 90% of college students say Wi-Fi access is as essential to education as classrooms and computers
  • 57% say they wouldn’t go to a college that doesn’t have free Wi-Fi
  • 79% say that without Wi-Fi access, college would be a lot harder
  • 60% agree that widely available Wi-Fi on campus is an indication that a school cares about its students

WiFi and where they use it

  • 55% have connected from coffee shops and restaurants
  • 47% from parks
  • 24% from in their cars

WiFi in the classroom

  • 55% have checked Facebook™ or MySpace™ and sent or received e-mail while using their laptop in class
  • 47% have sent instant messages to a friend during class
  • 44% used Wi-Fi to get a head start on an assignment before a class was finished

WiFi and linkbaiting statistics

  • If forced to choose, 48% would give up beer before giving up Wi-Fi

Survey methodology: “In conjunction with the Wi-Fi Alliance, Wakefield Research surveyed 501 U.S. college students in September 2008. The sampling variation in this survey is plus or minus 4.3 percentage points.”

Juice Your OPAC

Richard Wallace’s Juice project (Javascript User Interface Componentised Extensions) is a “simple componentised framework constructed in Javascript to enable the sharing of Ajax Stye extensions to a web interface.”

WordPress or Scriblio users might do well to think about it as a way to put widgets on systems that don’t support widgets, though as Richard points out, “the framework is applicable to any environment which, via identifiers contained within a html page, needs to link to or embed external resources.”

Way Cooler Than A Catalog

I got a little excited when Shirley Lincicum wrote to the NGC4Lib mail list: [O]ne of the most frustrating things for me about Next Generation Catalog systems as they currently exist is that they seem wholly focused on the user interface and can, in fact, actually hold libraries back from designing or implementing improved “back […] » about 500 words

Scriblio Theater

Flickr Video Flickr Video

I should have done screencasts like the above long ago. It’s not that they’re great, but they are a wonderful excuse to use the canned lounge music I’ve got. Those videos are now on the front page of the official Scriblio site, and I did five more to demo the installation and configuration. Big thanks go to Collingswood NJ Public Library Director Brett Bonfield who let me use his library like this.

Scriblio 2.7 Released

My slides for my presentation yesterday at code4lib are available both as a 2.7MB QuickTime and a 7.8 MB PDF, while the gist of talk went something like this: Scriblio is an open source WordPress plugin that adds the ability to search, browse, and create structured data to the the popular blog/content management platform. And […] » about 500 words

Is Internet Linking Legal?

You’d think the top search results on the matter would be newer than 1999, but that’s where you’ll find this NYT article and PubLaw item story, both from precambrian times. Worse, both of those articles suggest that my links to them may not be entirely kosher.

The problem is probably that US courts have not spoken clearly on such a case. A Danish court in 2006 did, but I think that no case in the US has gone far enough to actually set a precedent. Another chance at settling this issue was lost earlier this month when BlockShopper settled, rather than continue a costly defense of such a case. The EFF is confident BlockShopper could have won, but that means little when the legal bills come in.

Related at EFF: Kelly v. Arriba Soft and Unlawful Internet Gambling Enforcement Act.

Web Search Re-Imagined: Searchme iPhone App

Re-imagined a bit, anyway. Why browse a vertical list of results when you can flip through them like pages in a book (or album covers in iTunes). Searchme on the iPhone and iPod touch does just that. As you type your search term, icons representing rough categories appear, allowing you to target your search and […] » about 300 words

Declaration of Metadata Independance

Declaration of Metadata Independance: We hold these truths to be self-evident, that Metadata is essential to all Users, and that the Creation of Metadata endows certain inalienable Rights, that among these are the right to collect, the right to share and the pursuit of Happiness through the reuse of the Metadata… (read more) Via. » about 100 words

Do WordPress Pages Better With bSuite

WordPress‘ Pages feature makes the popular blogging platform a sophisticated CMS. bSuite adds a few features to make it even better. Write excerpts, tag, and categorize your pages WordPress excerpts are an underused but powerful feature that allow you to explain to your readers why they should read the page you wrote. Tagging and categorization […] » about 300 words

My DevCamp Lightning Talk

Hi, I’m Casey. I developed Scriblio, which is really just a faceted search and browse plugin for WordPress that allows you to use it as a library catalog or digital library system (or both). I’m not the only one to misuse WordPress that way. Viddler is a cool YouTube competitor built atop WordPress that allows […] » about 400 words