Prediction: we’ll talk about Google Print until they debut the beta, then we’ll talk about it more.
Copyfight posted a follow-up to Google’s announcement earlier this week. Of note was a quote from Michael Madison:
A first thought: It’s one more example, and a pretty important one, of the fading of the lines separating copyright law from communications law. Is Google Print an information conduit? A massive, rogue P2P technology? Is it a contributory infringer? A publisher? From whom, if anyone, does it need licenses, and who, if anyone, should regulate it, and how, if at all?
TeleRead started talking about how Google Print will be presented:
My understanding, which may be wrong, is that Google will OCR the page scans, but do only cursory machine cleanup of the raw unstructured text that results (which I call “raw digital text” or RDT), and use the still-error-laden RDT in their search system to pull up the page scans.
You can see this approach now in the way Amazon presents results of its “search inside this book” feature. The text is indexed for searching, but clicking on the results brings up the scanned, bitmapped pages. When available, the feature is incredibly useful, but I feel cheated when I try to copy and paste the text.
TeleRead points out that this is also how the University of Michigan’s Making of America collection works.
MoA scanned the books, placed the scanned page images online, and built a search engine to search the resulting RDT from OCR. Then, one by one they have been converting the RDT from selected books to highly-proofed SDT (structured digital text) using human proofers and TEI (I think) for structuring. So, the scans came first, and then the cleanup was (and is being) done at a later time.
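The pattern described above (search the uncorrected OCR text, then show the page scan) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how Google, Amazon, or MoA actually built their systems: the file names, the sample text, and the functions `tokenize`, `build_index`, and `search` are all hypothetical.

```python
import re
from collections import defaultdict

# Hypothetical data: raw OCR text per page, keyed by the scanned image file.
# The misreading "wisdorn" (for "wisdom") is left in on purpose, the way
# uncorrected "raw digital text" would keep it.
PAGES = {
    "scans/page-001.png": "It was the best of times, it was the worst of times",
    "scans/page-002.png": "it was the age of wisdorn, it was the age of foolishness",
}

def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())

def build_index(pages):
    """Map each word to the set of page images whose OCR text contains it."""
    index = defaultdict(set)
    for image, text in pages.items():
        for word in tokenize(text):
            index[word].add(image)
    return index

def search(index, query):
    """Return the sorted page images whose OCR text has every query word."""
    words = tokenize(query)
    if not words:
        return []
    hits = set.intersection(*(index.get(word, set()) for word in words))
    return sorted(hits)

index = build_index(PAGES)
print(search(index, "worst of times"))  # matches the first page scan
print(search(index, "wisdom"))          # no match: the OCR text says "wisdorn"
```

The reader only ever gets back an image path, which is why copy and paste does not work, and the failed search for "wisdom" shows what proofreading the text would fix.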
The excitement here, for TeleRead, is that Google might end up contributing to efforts like Project Gutenberg and could benefit greatly from the Distributed Proofreaders volunteers.
Posted December 17, 2004 by Casey Bisson
Categories: Books, Movies, Music.