A Browse-able Audiovisual Database Derived from Catalog Data



  1. A browse-able audiovisual database derived from catalog data
     Mark Dahl and Laura Ayling, Lewis & Clark College
     http://www.lclark.edu/~dahl/presentations
  2. A two-part presentation
       • The searchable/browse-able A/V database (Mark)
       • A genre headings enrichment project undertaken by our catalogers (Laura)
  3. Audiovisual lists
       • Faculty and students like to browse videos (by language) and CDs (by artist)
       • The video/CD collection is kept behind the circulation desk
       • The III OPAC offers no easy way to browse CDs and videos/DVDs by language and genre
  4. Old system
       • Use create lists in III to generate lists
       • Output lists in comma-delimited format
       • Manipulate them into web pages using search/replace
       • We provided:
           • Lists of videos in certain foreign languages
           • Lists of CDs by artist
       • Manually updated
  5. New system
       • Allows browsing our collection of:
           • CDs
           • DVDs & Videos
           • Videos
           • DVDs
       • Browse by:
           • Title
           • Person (author, artist, film director, etc.)
  6. New system
       • The following pre-limits may be made on these browses:
           • Language (Videos & DVDs only)
           • Film genre (Videos & DVDs only)
           • Musical genre (CDs only)
       • Have a look: http://library.lclark.edu/dynamic/videoscds
       • Detailed genre browse for CDs (new):
           • http://library.lclark.edu/dynamic/videoscds/detailbrowse.htm
       • Detailed genre browse for films on DVD/Video (new):
           • http://library.lclark.edu/dynamic/videoscds/detailfilmbrowse.htm
  7. The technology
       • Extracting data via create lists/Expect script (weekly)
       • Loading data into PostgreSQL database via Perl (weekly)
       • Searching/browsing SQL database via PHP script
  8. System flow (diagram labels)
       • Expect script (running as a cron job) logs into the ILS and instructs the system to produce a text file list and FTP it to an external server
       • ILS: text list FTPed to the server in delimited text format
       • Perl script opens the text file and loads it into the external SQL database
       • SQL database on external server
       • Web form sends query to PHP page
       • PHP page queries SQL database
       • PHP page produces HTML with results of query
  9. Extracting the data
       • The Expect scripting language lets you automate creating lists, outputting results, and FTPing results to a server
       • Expect acts like a robot that goes into Innopac and does what you tell it
       • Expect scripts can run on Linux/Unix machines that have the Expect language installed
  10. Searching for records
       • We have itypes for videos, DVDs and CDs
       • We search for bibliographic records that have an item record with one of these itypes
  11. Searching for audiovisual stuff by itype
  12. Data extracted in create lists (from bib record)
       • Title (t-field group, repeatable)
       • Author (a-field group, repeatable)
       • Subject, including local genre headings in 655 (d-field group, repeatable)
       • Format (from itype)
       • Language
       • .b number
  13. Output user selected format
       • Output FTPed in comma-delimited format to an account on the Linux server
  14. Database loader
       • Written in Perl
       • Depends on Perl-PostgreSQL module
       • Runs as cron job (weekly, daily)
       • Takes comma-delimited file from III and loads data into database
       • Recreates database from scratch when run
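The loader described above can be sketched roughly as follows. This is not the authors' Perl script: it is a minimal Python illustration that uses the standard-library sqlite3 module in place of PostgreSQL so it runs without a server. The column order of the export, the ';' separator for repeatable fields, the table and column names, and the sample rows are all assumptions made up for the example.

```python
import csv
import io
import sqlite3

def load(conn, text):
    """Rebuild all tables from a comma-delimited export (hypothetical layout)."""
    # Recreate everything from scratch on each run, as the slide describes.
    conn.executescript("""
        DROP TABLE IF EXISTS bib;
        DROP TABLE IF EXISTS titles;
        DROP TABLE IF EXISTS authors;
        DROP TABLE IF EXISTS subjects;
        CREATE TABLE bib (bnumber TEXT, format TEXT, language TEXT);
        CREATE TABLE titles (title TEXT, bnumber TEXT);
        CREATE TABLE authors (author TEXT, bnumber TEXT);
        CREATE TABLE subjects (subject TEXT, bnumber TEXT);
    """)
    for row in csv.reader(io.StringIO(text)):
        # Assumed column order: .b number, format, language, titles, authors, subjects.
        bnumber, fmt, language, titles, authors, subjects = row
        conn.execute("INSERT INTO bib VALUES (?, ?, ?)", (bnumber, fmt, language))
        # Assumed: repeatable fields arrive joined with ';' inside one column.
        for table, values in (("titles", titles), ("authors", authors),
                              ("subjects", subjects)):
            for value in filter(None, (v.strip() for v in values.split(";"))):
                conn.execute(f"INSERT INTO {table} VALUES (?, ?)", (value, bnumber))
    conn.commit()

# Made-up sample rows, for illustration only.
SAMPLE = '''"b1000001","CD","eng","Kind of blue","Davis, Miles","Jazz -- 1951-1960;Jazz"
"b1000002","DVD","fre","Amelie","Jeunet, Jean-Pierre","Feature films"
'''

conn = sqlite3.connect(":memory:")
load(conn, SAMPLE)
load(conn, SAMPLE)  # a second run replaces the data instead of duplicating it
bib_count = conn.execute("SELECT COUNT(*) FROM bib").fetchone()[0]
subject_count = conn.execute("SELECT COUNT(*) FROM subjects").fetchone()[0]
```

Dropping and recreating the tables keeps the loader simple: deleted or recataloged records disappear on the next run without any change tracking.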
  15. SQL relational database structure
       • Bib table: .b number, format, language
       • Titles table: title, .b number
       • Authors table: author, .b number
       • Subjects table: subject, .b number
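As an illustration of this four-table layout, here is a minimal sketch of one possible schema. It uses Python's sqlite3 module as a stand-in for PostgreSQL, and the table names, column names, and types are assumptions rather than the presenters' actual definitions.

```python
import sqlite3

# Hypothetical schema: one bib row per record, plus repeatable titles,
# authors, and subjects that point back to it through the .b number.
SCHEMA = """
CREATE TABLE bib      (bnumber TEXT PRIMARY KEY, format TEXT, language TEXT);
CREATE TABLE titles   (title   TEXT, bnumber TEXT REFERENCES bib(bnumber));
CREATE TABLE authors  (author  TEXT, bnumber TEXT REFERENCES bib(bnumber));
CREATE TABLE subjects (subject TEXT, bnumber TEXT REFERENCES bib(bnumber));
"""

def create_schema(conn):
    """Create the four tables on an open connection."""
    conn.executescript(SCHEMA)

conn = sqlite3.connect(":memory:")
create_schema(conn)
tables = [row[0] for row in conn.execute(
    "SELECT name FROM sqlite_master WHERE type='table' ORDER BY name")]
```

Splitting titles, authors, and subjects into their own tables is what lets a single bib record carry several of each, matching the repeatable field groups in the extract.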
  16. PHP search/browse
       • PHP is built for querying databases and turning results into web pages
       • Similar to ColdFusion or ASP
       • Functions within PHP are particularly good at talking to databases, including PostgreSQL
  17. PHP search/browse
       • Query class (does all the talking to database)
       • Browse class extends query class
           • Allows you to browse collection
           • Jump to letter of alphabet
           • Accepts limits (format, genre, language)
       • Search class extends query class
           • Allows you to enter search term
           • You may choose format
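The class layout above might look something like the following. The original is PHP; this is a hedged Python sketch of the same shape, with sqlite3 standing in for PostgreSQL. The method names, the schema, and the sample rows are invented for illustration, and only the format limit is shown.

```python
import sqlite3

class Query:
    """Base class: owns the connection and does all talking to the database."""
    BASE = "SELECT t.title FROM titles t JOIN bib b ON b.bnumber = t.bnumber WHERE "

    def __init__(self, conn):
        self.conn = conn

    def titles(self, condition, params, fmt=None):
        sql, params = self.BASE + condition, list(params)
        if fmt is not None:  # optional format limit
            sql += " AND b.format = ?"
            params.append(fmt)
        sql += " ORDER BY lower(t.title)"
        return [row[0] for row in self.conn.execute(sql, params)]

class Browse(Query):
    """Alphabetical browse that can jump to a letter of the alphabet."""
    def from_letter(self, letter, fmt=None):
        return self.titles("lower(t.title) >= lower(?)", [letter], fmt)

class Search(Query):
    """Search for a term entered by the user, optionally limited by format."""
    def matching(self, term, fmt=None):
        return self.titles("t.title LIKE ?", [f"%{term}%"], fmt)

# Made-up sample data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE bib (bnumber TEXT, format TEXT, language TEXT);
    CREATE TABLE titles (title TEXT, bnumber TEXT);
    INSERT INTO bib VALUES ('b1', 'DVD', 'fre'), ('b2', 'DVD', 'eng'), ('b3', 'CD', 'eng');
    INSERT INTO titles VALUES ('Amelie', 'b1'), ('Blue velvet', 'b2'), ('Kind of blue', 'b3');
""")
browse_from_b = Browse(conn).from_letter("b")
browse_dvds_from_b = Browse(conn).from_letter("b", fmt="DVD")
search_blue_cds = Search(conn).matching("blue", fmt="CD")
```

Keeping the SQL plumbing in the base class means the browse and search subclasses only differ in the condition they pass down.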
  18. Browse system
       • Format pre-limiting is achieved by SQL querying for certain itype codes
       • Language pre-limiting is achieved by SQL querying for certain language codes
       • Genre pre-limiting is achieved by SQL querying for the presence of certain words within the SQL table subject entries
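The three pre-limits can be pictured as optional WHERE conditions added to one browse query. The sketch below is an assumption-laden illustration in Python with sqlite3, not the presenters' PHP: the schema, the function name browse_sql, the use of format labels instead of real itype codes, and the sample rows are all hypothetical. The language codes are the ones listed on the Languages slide.

```python
import sqlite3

# Display name -> language code, as listed on the Languages slide.
LANGUAGE_CODES = {
    "English": "eng", "French": "fre", "German": "ger", "Spanish": "spa",
    "Japanese": "jpn", "Chinese": "chi", "Russian": "rus",
}

def browse_sql(fmt=None, language=None, genre=None):
    """Build a title browse query with optional pre-limits (hypothetical schema)."""
    where, params = [], []
    if fmt:
        where.append("b.format = ?")
        params.append(fmt)
    if language:
        where.append("b.language = ?")
        params.append(LANGUAGE_CODES[language])
    if genre:
        # Genre limit: the word must appear somewhere in a subject entry.
        where.append("EXISTS (SELECT 1 FROM subjects s WHERE s.bnumber = b.bnumber "
                     "AND lower(s.subject) LIKE ?)")
        params.append(f"%{genre.lower()}%")
    sql = "SELECT t.title FROM titles t JOIN bib b ON b.bnumber = t.bnumber"
    if where:
        sql += " WHERE " + " AND ".join(where)
    return sql + " ORDER BY t.title", params

# Made-up sample data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE bib (bnumber TEXT, format TEXT, language TEXT);
    CREATE TABLE titles (title TEXT, bnumber TEXT);
    CREATE TABLE subjects (subject TEXT, bnumber TEXT);
    INSERT INTO bib VALUES ('b1', 'DVD', 'fre'), ('b2', 'DVD', 'eng'), ('b3', 'CD', 'eng');
    INSERT INTO titles VALUES ('Amelie', 'b1'), ('Blue velvet', 'b2'), ('Kind of blue', 'b3');
    INSERT INTO subjects VALUES ('Feature films', 'b1'), ('Feature films', 'b2'), ('Jazz', 'b3');
""")

sql, params = browse_sql(fmt="DVD", language="French")
french_dvds = [row[0] for row in conn.execute(sql, params)]
sql, params = browse_sql(genre="jazz")
jazz_titles = [row[0] for row in conn.execute(sql, params)]
```

User-supplied values go in as bound parameters rather than being pasted into the SQL string, which is worth keeping in any real version of this.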
  19. Languages
       • "English" => "eng"
       • "French" => "fre"
       • "German" => "ger"
       • "Spanish" => "spa"
       • "Japanese" => "jpn"
       • "Chinese" => "chi"
       • "Russian" => "rus"
  20. Genre limiting
       • Genre limit is based on an SQL query that looks at the first part of the subject/genre heading
       • If we limit on "jazz" we include records with any of these headings:
           • Jazz – 1971-1980
           • Jazz – Instruction and Study
           • Jazz vocals
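Matching on the first part of a heading corresponds to a prefix pattern in SQL. Below is a small sketch of that idea using Python and sqlite3; the table layout and the helper name genre_pattern are assumptions, and the "Acid jazz" row is a made-up heading added only to show what a prefix match excludes.

```python
import sqlite3

def genre_pattern(genre):
    """Turn a genre word into a LIKE pattern that matches heading prefixes."""
    return genre.lower() + "%"

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE subjects (subject TEXT, bnumber TEXT)")
conn.executemany("INSERT INTO subjects VALUES (?, ?)", [
    ("Jazz -- 1971-1980", "b1"),
    ("Jazz -- Instruction and study", "b2"),
    ("Jazz vocals", "b3"),
    ("Acid jazz", "b4"),  # made up: contains "jazz" but does not start with it
])

matches = [row[0] for row in conn.execute(
    "SELECT bnumber FROM subjects WHERE lower(subject) LIKE ? ORDER BY bnumber",
    (genre_pattern("jazz"),))]
```

With the wildcard only at the end, subdivided headings and longer phrases that begin with the genre word are picked up, while headings that merely contain it are not.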
  21. Genre synonyms
       • The system also allows for defining synonyms to make more inclusive genre browses
           • "songs" => "songs,vocal music,song cycles"
           • "rap/hip-hop" => "rap"
           • "techno" => "underground,techno"
           • "history" => "%history"
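One way such a synonym table could feed the genre query is to expand each entry into several prefix patterns joined with OR. The sketch below is written in Python rather than the original PHP and rests on two assumptions: that the comma separates alternative heading prefixes, and that a leading "%" (as in "%history") is an SQL wildcard meaning "match anywhere in the heading". The function names are hypothetical.

```python
# Synonym table as shown on the slide.
GENRE_SYNONYMS = {
    "songs": "songs,vocal music,song cycles",
    "rap/hip-hop": "rap",
    "techno": "underground,techno",
    "history": "%history",
}

def genre_patterns(genre):
    """Expand a genre label into LIKE patterns, one per synonym."""
    terms = GENRE_SYNONYMS.get(genre, genre).split(",")
    # A trailing % makes each term a prefix match; a term that already
    # starts with % therefore matches anywhere in the heading.
    return [term.strip().lower() + "%" for term in terms]

def genre_where(genre):
    """Return an SQL condition plus its bound parameters for a genre limit."""
    patterns = genre_patterns(genre)
    clause = " OR ".join("lower(subject) LIKE ?" for _ in patterns)
    return f"({clause})", patterns

songs_clause, songs_params = genre_where("songs")
history_clause, history_params = genre_where("history")
```

A genre with no entry in the table falls back to a plain prefix match on its own name.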
  22. Web interface
       • Browse parameters (format, field (person or title), language, genre) chosen by:
           • Web form
           • Link with parameters
  23. By web form:
  24. http://library.lclark.edu/dynamic/videoscds/detailbrowse.php
      <a href="mediabrowse.php?field=person&format=CDs&encsubject=Big%2BBand">Big-Band</a>
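A parameterized link like the one above can be generated rather than typed by hand. The sketch below uses Python's standard library and is only a guess at how the encsubject value was produced: it assumes the subject was URL-encoded once ("Big Band" becomes "Big+Band") and then encoded again when placed in the query string, which yields the Big%2BBand seen on the slide. The function name browse_link is hypothetical.

```python
import html
from urllib.parse import quote_plus, urlencode

def browse_link(field, fmt, subject, label):
    """Build an HTML link to the browse page with its parameters in the URL."""
    # Pre-encode the subject once ("Big Band" -> "Big+Band"); urlencode then
    # escapes the "+" as %2B, giving the double-encoded encsubject value.
    query = urlencode({
        "field": field,
        "format": fmt,
        "encsubject": quote_plus(subject),
    })
    # Escape "&" for use inside an HTML attribute.
    return f'<a href="mediabrowse.php?{html.escape(query)}">{html.escape(label)}</a>'

link = browse_link("person", "CDs", "Big Band", "Big-Band")
```

The only difference from the slide's markup is that the ampersands are written as `&amp;`, which browsers read back as plain `&` when following the link.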
  25. Alternative systems
       • III scope
       • Featured lists (can use Expect to update)
       • Custom search interface to III OPAC using web forms
       • III XML server
       • Direct queries to Oracle database
  26. Reflections
       • Low-cost technology is available to build online databases/browsing systems
       • These technologies are very flexible
           • Index what YOU want
           • Display it how YOU want it
  27. Reflections
       • The idea of a library catalog is changing
           • A single search interface isn't enough
           • Nice to search/browse within smaller, more manageable segments of your collection
           • Browsing electronically (not just in person) is desirable
           • People are developing new ways of browsing library collections
               • http://belmont.antarcti.ca/
               • High level browse concept (map LC classes to locally defined disciplines to facilitate browsing new books)
  28. http://www.lclark.edu/~dahl/presentations