3 posts tagged with internetarchive.
A Better Wayback Machine
The Internet Archive Wayback Machine, an archive of historical web pages dating back to 1996, has been greatly improved. The index is now updated every hour, and there is a new front page, a new Availability API, a 404 handler that webmasters can use, and many other improvements. I think the most important new feature is the ability to instantly save a page and receive a permanent URL for use in citations.
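As a rough illustration of the Availability API mentioned above, here is a minimal Python sketch. It assumes the publicly documented endpoint at https://archive.org/wayback/available, which takes `url` and optional `timestamp` query parameters and returns JSON containing an `archived_snapshots.closest` entry. The helper names and the sample response below are made up for illustration, and the sketch only builds the query and parses a response rather than making a network call.

```python
import json
from urllib.parse import urlencode

# Assumed endpoint of the Wayback Availability API.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def build_availability_url(page_url, timestamp=None):
    """Build the query URL asking whether page_url has an archived snapshot.

    timestamp is an optional string such as "20060101" (YYYYMMDD...) used to
    request the snapshot closest to that date.
    """
    params = {"url": page_url}
    if timestamp is not None:
        params["timestamp"] = timestamp
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def closest_snapshot(response_text):
    """Return the URL of the closest available snapshot, or None if absent."""
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


# Hypothetical response body, shaped like the API's JSON output.
SAMPLE_RESPONSE = """
{"archived_snapshots": {"closest": {
    "available": true,
    "url": "http://web.archive.org/web/20060101000000/http://example.com/",
    "timestamp": "20060101000000",
    "status": "200"}}}
"""

if __name__ == "__main__":
    print(build_availability_url("example.com", "20060101"))
    print(closest_snapshot(SAMPLE_RESPONSE))
```

To query the live service, the URL from `build_availability_url` could be fetched with `urllib.request.urlopen` and the decoded body passed to `closest_snapshot`. A page with no capture comes back with an empty `archived_snapshots` object, which the function reports as `None`.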
Autocrop algorithm for scanned books
A couple of years ago I wrote an autocrop algorithm for the Internet Archive, but I never had time to optimize it, and it was never deployed because it was too slow to run in real time. The algorithm works well for books imaged using the archive.org Scribe book scanner. Here is example output of the autocrop algorithm, with a comparison of the same book cropped and deskewed by a human.
The Internet Archive Book Drive
The Internet Archive has launched a project to make books available to the print-disabled community in the DAISY format. Donate books to IA and we will turn them into DAISY books, which will be made available on the relaunched Open Library site.