Jessamyn pointed out the dust up over the dissapearing of PaperOfRecord.com, a historical newspaper archive.
Tips To Publishers From Google News
It turns out that there are a lot of differences between Google’s regular web crawler and the Google News crawler. And though very few of us will find our content included in Google News, it still seems like a good idea to make our content conform to their technical requirements. Here are a few of them:
- In order for our crawler to correctly gather your content, each article needs to link to a page dedicated solely to that article. We’re unable to index articles from news sections which consist of one long page rather than a series of links that lead to articles on individual pages.
- If your articles are located in a drop down box, we won’t be able to crawl them. Google News is unable to crawl articles only accessible through a drop down menu.
- Google News does not recognize or follow Flash, graphic/image or JavaScript links which link to articles. Our automated crawler is best able to crawl plain text HTML links.
- Google News doesn’t crawl articles in PDF format, although this content is included on Google Web Search. Our automated crawler is currently best able to crawl plain text HTML sites.
Facebook’s Favorite Metadata
[Facebook’s guide to sharing][1] details some meta tags to make that sharing work better:
In order to make sure that the preview is always correctly populated, you should add the tags shown below to your html. An example news story could have the following:
>
> As shown, title contains the preview title, description contains the preview summary and image_src contains the preview image. Please make sure that none of the content fields contain any html markup because it will be stripped out. For consistency’s sake, please use the
>
> <meta />
> tag to provide text data for the preview, and the
>
> <link />
> tag for any source urls.
>
> The title and summary tags are the minimum requirements for any preview, so make sure to include these two.
[1]: http://www.facebook.com/share_partners.php "Facebook | Share Partners"
WiFi Is Critical To Academia, The WiFi Alliance Says
A study sponsored by the WiFi alliance reveals the following:
WiFi and college choice
- 90% of college students say Wi-Fi access is as essential to education as classrooms and computers
- 57% say they wouldn’t go to a college that doesn’t have free Wi-Fi
- 79% say that without Wi-Fi access, college would be a lot harder
- 60% agree that widely available Wi-Fi on campus is an indication that a school cares about its students
WiFi and where they use it
- 55% have connected from coffee shops and restaurants
- 47% from parks
- 24% from in their cars
WiFi in the classroom
- 55% have checked Facebook™ or MySpace™ and sent or received e-mail while using their laptop in class
- 47% have sent instant messages to a friend during class
- 44% used Wi-Fi to get a head start on an assignment before a class was finished
WiFi and linkbaiting statistics
- If forced to choose, 48% would give up beer before giving up Wi-Fi
Survey methodology: “In conjunction with the Wi-Fi Alliance, Wakefield Research surveyed 501 U.S. college students in September 2008. The sampling variation in this survey is plus or minus 4.3 percentage points.”
Juice Your OPAC
Richard Wallace’s Juice project (Javascript User Interface Componentised Extensions) is a “simple componentised framework constructed in Javascript to enable the sharing of Ajax Stye extensions to a web interface.”
WordPress or Scriblio users might do well to think about it as a way to put widgets on systems that don’t support widgets, though as Richard points out, “the framework is applicable to any environment which, via identifiers contained within a html page, needs to link to or embed external resources.”
Way Cooler Than A Catalog
I got a little excited when Shirley Lincicum wrote to the NGC4Lib mail list: [O]ne of the most frustrating things for me about Next Generation Catalog systems as they currently exist is that they seem wholly focused on the user interface and can, in fact, actually hold libraries back from designing or implementing improved “back […] » about 500 words
Scriblio Theater
I should have done screencasts like the above long ago. It’s not that they’re great, but they are a wonderful excuse to use the canned lounge music I’ve got. Those videos are now on the front page of the official Scriblio site, and I did five more to demo the installation and configuration. Big thanks go to Collingswood NJ Public Library Director Brett Bonfield who let me use his library like this.
Scriblio 2.7 Released
My slides for my presentation yesterday at code4lib are available both as a 2.7MB QuickTime and a 7.8 MB PDF, while the gist of talk went something like this: Scriblio is an open source WordPress plugin that adds the ability to search, browse, and create structured data to the the popular blog/content management platform. And […] » about 500 words
Is Internet Linking Legal?
You’d think the top search results on the matter would be newer than 1999, but that’s where you’ll find this NYT article and PubLaw item story, both from precambrian times. Worse, both of those articles suggest that my links to them may not be entirely kosher.
The problem is probably that US courts have not spoken clearly on such a case. A Danish court in 2006 did, but I think that no case in the US has gone far enough to actually set a precedent. Another chance at settling this issue was lost earlier this month when BlockShopper settled, rather than continue a costly defense of such a case. The EFF is confident BlockShopper could have won, but that means little when the legal bills come in.
Related at EFF: Kelly v. Arriba Soft and Unlawful Internet Gambling Enforcement Act.
Firefox Improved RDF Browsing
lbjay uses both the Tabulator and Semantic Radar Firefox plugins to do magic with RDF in his browser.
LCSH Linked Data
lcsh.info is gone, but there’s a lot to learn from this paper. I wish I’d seen that earlier.
Web Search Re-Imagined: Searchme iPhone App
Re-imagined a bit, anyway. Why browse a vertical list of results when you can flip through them like pages in a book (or album covers in iTunes). Searchme on the iPhone and iPod touch does just that. As you type your search term, icons representing rough categories appear, allowing you to target your search and […] » about 300 words
WordPress Education Mail List
wp-edu, the WordPress for education mail list has launched. Join up, catch up on the archives, and set it up at your school.
Declaration of Metadata Independance
Declaration of Metadata Independance: We hold these truths to be self-evident, that Metadata is essential to all Users, and that the Creation of Metadata endows certain inalienable Rights, that among these are the right to collect, the right to share and the pursuit of Happiness through the reuse of the Metadata… (read more) Via. » about 100 words
Wikipedia API?
I’ve wanted a Wikipedia API for a while. Now I might’ve stumbled into one: commons.wikimedia.org/w/api.php. It doesn’t do exactly what I want, but it might yet be useful.
xFruits: “Compose Your Information System”
Is xFruits a worthy replacement for Yahoo! Pipes?
Libraries vs. IT Departments
The Chronicle‘s Tech Therapy podcast last week featured Libraries vs. IT Departments. (Via.)
How Wikipedia Works
When Phoebe Ayers isn’t hanging out at ROFLcon she’s probably doing something related to Wikipedia, so I’m looking forward to reading How Wikipedia Works: And How You Can Be a Part of It. Extra points: Phoebe and her co-authors somehow convinced their publisher to release the entire work under the GFDL, the same license Wikipedia […] » about 100 words
Tracking Aircraft Movements
From Justin: real-time flight tracking. You can even overlay it on Google Earth. None of them as pretty as Aaron Koblin’s Flightplan, though.
Do WordPress Pages Better With bSuite
WordPress‘ Pages feature makes the popular blogging platform a sophisticated CMS. bSuite adds a few features to make it even better. Write excerpts, tag, and categorize your pages WordPress excerpts are an underused but powerful feature that allow you to explain to your readers why they should read the page you wrote. Tagging and categorization […] » about 300 words
What Is Social Media?
Social Media in Plain English and RSS In Plain English, among others from Common Craft among the best explanations you’ll find. » about 100 words
My DevCamp Lightning Talk
Hi, I’m Casey. I developed Scriblio, which is really just a faceted search and browse plugin for WordPress that allows you to use it as a library catalog or digital library system (or both). I’m not the only one to misuse WordPress that way. Viddler is a cool YouTube competitor built atop WordPress that allows […] » about 400 words
The URL Is The Citation
From Jessamyn: “don’t toss up a bunch of bibliographic citations when a decent URL will do. You’re online, act like you’re online.”
Open Source Citation Extractors For Non-Structured Data
hmm-citation-extractor, ParsCit and FreeCite (not to be confused with FreeCite, the F/OSS EndNote-like app). FreeCite is available as a service and a download.
Still, wouldn’t a simple URL be easier than all these unstructured citation formats?
CommentPress Comments
The rights to my Library Technology Report on Open-Source Software for Libraries have reverted back to me, so I’m posting the text online under a CC-BY-SA license. More importantly, I’m using it as an opportunity to play with how longer-than-blog texts can be represented online. The Institute for the Future of the Book has spent […] » about 400 words