Prediction: we’ll talk about Google Print until they debut the beta, then we’ll talk about it more.
Copyfight posted a follow-up on Google’s announcement from earlier this week. Of note was a quote from Michael Madison:
A first thought: It’s one more example, and a pretty important one, of the fading of the lines separating copyright law from communications law. Is Google Print an information conduit? A massive, rogue P2P technology? Is it a contributory infringer? A publisher? From whom, if anyone, does it need licenses, and who, if anyone, should regulate it, and how, if at all?
TeleRead started talking about how Google Print will be presented:
My understanding, which may be wrong, is that Google will OCR the page scans, but do only cursory machine cleanup of the raw unstructured text that results (which I call “raw digital text” or RDT), and use the still-error-laden RDT in their search system to pull up the page scans.
You can see this approach now in the way Amazon presents results of its “search inside this book” feature. The text is indexed for searching, but clicking on the results brings up the scanned, bitmapped pages. When available, the feature is incredibly useful, but I feel cheated when I try to copy and paste the text.
TeleRead points out that this is also how the University of Michigan’s Making of America collection works.
MoA scanned the books, placed the scanned page images online, and built a search engine to search the resulting RDT from OCR. Then, one by one they have been converting the RDT from selected books to highly-proofed SDT (structured digital text) using human proofers and TEI (I think) for structuring. So, the scans came first, and then the cleanup was (and is being) done at a later time.
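The scan-first approach described above is easy to picture in code. What follows is only a rough sketch of the idea, not how Google, Amazon, or MoA actually build their systems: the page records, file names, and the functions `tokenize`, `build_index`, and `search` are all made up for illustration. Uncorrected OCR text is indexed word by word, and a search returns the page images rather than the text.

```python
import re
from collections import defaultdict

# Hypothetical sample data: each scanned page has an image file and the raw,
# uncorrected OCR text (the "RDT") extracted from it. OCR errors are left in.
PAGES = [
    {"image": "book1/page_001.png", "rdt": "Tbe quick brown fox jumps ovcr the lazy dog"},
    {"image": "book1/page_002.png", "rdt": "A lazy afternoon in the libraiy"},
    {"image": "book2/page_001.png", "rdt": "The fox and the hound"},
]


def tokenize(text):
    """Lowercase the text and split it into runs of letters and digits."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Map each word found in the raw OCR text to the page images containing it."""
    index = defaultdict(set)
    for page in pages:
        for word in tokenize(page["rdt"]):
            index[word].add(page["image"])
    return index


def search(index, query):
    """Return the page images whose OCR text contains every word in the query."""
    words = tokenize(query)
    if not words:
        return []
    hits = set.intersection(*(index.get(word, set()) for word in words))
    return sorted(hits)


index = build_index(PAGES)
print(search(index, "lazy"))     # both book1 pages
print(search(index, "the fox"))  # one page from each book
print(search(index, "library"))  # nothing: the OCR read it as "libraiy"
```

The last query shows the cost of searching uncorrected text: a word the OCR misread is invisible to the index until someone proofs the page, which is exactly where human proofreading would pay off.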
The excitement here, for TeleRead, is that Google might end up contributing to efforts like Project Gutenberg and could benefit greatly from the Distributed Proofreaders volunteers.
Posted December 17, 2004 by Casey
Categories: Books, Movies, Music