Photos of old books available on Flickr

Millions of images are moving from the Internet Archive to the Flickr Commons


Images from the 600 million pages of old books digitized by the nonprofit Internet Archive, unused until now, are gradually being uploaded to Flickr, thanks to the work of academic Kalev Leetaru. It is estimated that Yahoo's photo hosting site will be flooded with 12 million historical images dating from 1500 to 1922, which have passed into the public domain and are considered common property, with no restrictions on their use.

The images come from public library books that the Internet Archive has been digitizing for years. Until now, however, they ended up in PDF or plain-text files, with no way to search for the images.

Unlike optical character recognition (OCR) software, Kalev Leetaru's software does not skip the images. In fact, it exploits this weakness of OCR: it assumes that whatever the OCR skips is an image and saves it as a JPEG image file. It also attempts to accompany each image file with explanatory text in the form of a caption, selecting the text that the OCR read before and after the image on the scanned page.
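The idea described above can be illustrated with a minimal Python sketch. This is not Leetaru's actual code, which the article does not show. The `Block` and `ExtractedImage` structures and the `extract_images` function are hypothetical names invented for this example, and the sketch assumes the OCR output is already available as a list of page regions in reading order, where a region with no recognized text is one the OCR skipped. Cropping the region and saving it as a JPEG would need an image library and is left out.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Block:
    """One region of a scanned page, in reading order (hypothetical structure)."""
    top: int                # upper edge of the region, in pixels
    bottom: int             # lower edge of the region, in pixels
    text: Optional[str]     # None means the OCR produced no text for this region


@dataclass
class ExtractedImage:
    """A region assumed to be an image, plus a caption built from nearby text."""
    top: int
    bottom: int
    caption: str


def extract_images(blocks: List[Block]) -> List[ExtractedImage]:
    """Treat every region the OCR skipped as an image and caption it
    with the nearest recognized text before and after it."""
    images = []
    for i, block in enumerate(blocks):
        if block.text is not None:
            continue  # the OCR read text here, so this is not an image
        # Nearest text block before the skipped region, if any.
        before = next((b.text for b in reversed(blocks[:i]) if b.text is not None), "")
        # Nearest text block after the skipped region, if any.
        after = next((b.text for b in blocks[i + 1:] if b.text is not None), "")
        caption = " ".join(part for part in (before, after) if part)
        images.append(ExtractedImage(block.top, block.bottom, caption))
    return images


# Example page: text, a region the OCR skipped, then more text.
page = [
    Block(0, 100, "The old mill stood by the river."),
    Block(100, 400, None),
    Block(400, 450, "Fig. 3. The mill in 1887."),
]
images = extract_images(page)
for image in images:
    print(image)
```

In this sketch the caption is simply the two neighboring text blocks joined together. A real pipeline would likely need to trim that text and handle pages with several adjacent images.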

The universality of the Internet

Professor Leetaru's ambition is for these images, 2.6 million of which have already been uploaded to Flickr, to be used by Wikipedia's authors to enrich its content, especially in entries about historical events. He also appears willing to distribute his code to libraries around the world so that they can extract images from the books they are digitizing, the BBC reports.

However, Flickr users complain that since July, when the Internet Archive joined the service, its images have flooded the site and appear very often, with no option for users to exclude them.

Source: tovima.gr


Written by Dimitris
