Sunday 20 February 2011

Metadata creation

In traditional libraries, the ability to find works of interest was directly related to how well they were catalogued. While cataloguing electronic works digitized from a library's existing holding may be as simple as copying moving a record for the print to the electronic item, with complex and born-digital works requiring substantially more effort. To handle the growing volume of electronic publications, new tools and technologies have to be designed to allow effective automated semantic classification and searching. While full text search can be used for some searches, there are many common catalog searches which cannot be performed using full text, including:

    * finding texts which are translations of other texts
    * linking texts published under pseudonyms to the real authors (Samuel Clemens and Mark Twain, for example)
    * differentiating non-fiction from parody (The Onion from The New York Times, for example)

No comments:

Post a Comment