SlideShare a Scribd company logo
1 of 6
Download to read offline
VISUAL COLLAGING OF MUSIC IN A DIGITAL LIBRARY

          David Bainbridge                             Sally Jo Cunningham                       J. Stephen Downie
      Dept. of Computer Science                      Dept. of Computer Science               Graduate School of Library
        University of Waikato                          University of Waikato                  and Information Science
               Hamilton                                       Hamilton                          University of Illinois
             New Zealand                                    New Zealand                     Urbana-Champaign, IL, USA.


                           ABSTRACT                                         currently. Images already in the visualisation fade over
                                                                            time and as new images are retrieved from the various
   This article explores the role visual browsing can play                  sites, they are added in to the visualisation on top of what
within a digital music library. The context to the work                     is already there. If at any stage a displayed image sparks
is provided through a review of related techniques drawn                    some interest, the user can click on it and have a new web
from the fields of digital libraries and human computer in-                  browser open up displaying the web page the image comes
teraction. Implemented within the open source digital li-                   from.
brary toolkit Greenstone, a prototype system is described                       This paper is organized as follows: Section 2 reviews
that combines images located through textual metadata                       related work in collaging and browsing in digital libraries,
with a visualisation technique known as collaging to pro-                   and the “laid back” approach to interacting with informa-
vide a leisurely, undirected interaction with a music col-                  tion collections; Section 3 presents a walkthrough of a
lection. Emphasis in the article is given to the augmenta-                  prototype music collection that includes collaging; Sec-
tions of the basic technique to work in the musical domain.                 tion 4 describes implementation issues for creating the
                                                                            prototype; and Section 5 is a summary of our work.
                     1. INTRODUCTION

Looking at patterns of use in real music collections—phy-                                    2. RELATED WORK
sical libraries, music stores—we see people (of their own
volition) spending a significant amount of time looking,                     Previous graphics-based browsing tools for music docu-
browsing, sifting through the music/documents [5]. This                     ment collections primarily support visualization of entire
is in stark contrast to observed behavior in digital libraries;             collections; for example, as a self-organising map [14] or
transaction log analysis reveals brief sessions, few searches,              a 3D vector space [15] showing documents clustered by
short queries, and few documents viewed [7, 12]. Is this                    genre. While these visualisations can give an immedi-
a true reflection of how users, in general, wish to interact                 ate, powerful overview of relative size of groupings and
with digital repositories or merely an artifact of browsing                 a sense of distance between clusters, they are less effec-
mechanisms that are ineffectual or too cumbersome to be                     tive in supporting browsing of the documents themselves,
used? This issue has led to various research initiatives de-                since the idea of an overview means that the documents
veloping richer forms of support for browsing with text-                    must be minimised and represented at the top (browsing)
based digital libraries, for instance [3, 15, 14]. Given a                  level in a relatively impoverished fashion (for example, by
strong desire shown by people to browse real music col-                     a single point in a 3D space) [3].
lections then the ability to support this activity within a                    Earlier work applying collaging to information seeking
digital music library seems especially important.                           focused on using the resulting collage as a search tool, for
    This provided the motivation for the work described in                  example to build a collection of Web documents based on
this paper: the adaptation of a visual technique known as                   a CollageMachine search/browse over Google [8]. Based
collaging to the music domain. Collaging is a technique                     on user evaluation studies, collaging seems ill-suited to
devised by Kerne [9] for use in the more general environ-                   supporting more formal information seeking activities such
ment of the web. In one example of its use, a user keys                     as direct search; users objected to the lack of predictabil-
in several web domains—say mtv.com, archive.org/audio,                      ity in the collaging, and the lack of transparency in the
and ceolas.org—presses go and then observes a visualisa-                    relationship between the search terms that seed the col-
tion of images drawn from crawling these web sites con-                     lage and the collaged “results.” Collaging appears best
                                                                            suited to providing a browsing experience that is engag-
Permission to make digital or hard copies of all or part of this work for
                                                                            ing and interesting, but that does not require the direc-
personal or classroom use is granted without fee provided that copies       tion and focus on the part of the user that is characteristic
are not made or distributed for profit or commercial advantage and that      of searching or even of other types of browsing (for ex-
copies bear this notice and the full citation on the first page.             ample, browsing by metadata or by genre). This type of
 c 2004 Universitat Pompeu Fabra.
                                                                            activity, where the system selects information for viewing
based on little (or even no!) control from the user, has been   on a particular item will result in the song being played.
termed “laid back,” to distinguish it from the more active          The ability to search document content is one of the
and user-directed “sit forward” information seeking activ-      pillars upon which digital libraries are based, and although
ities [6].                                                      Greenstone does not currently include support for direct
    The utility of the laid back approach to browsing has       content searching of raw audio files, 1 in this context the
supporting evidence in the HCI, cognitive science and psy-      metadata stored in ID3 tags makes for a useful foundation
chology of memory literatures. It is now a well-known           upon which to base textual queries. The digital library
trope in these literatures that “recognition” is greater than   toolkit is flexible over what indexes can be formed: for
“recall” [13]. Since laid back browsing is premised on the      this collection it was decided to index titles, artist, and
user recognising an association presented by the collag-        a conglomeration of all text fields extracted from a file
ing interface rather than having to recall a specific fact,      (akin to full-text search). Field searching is also supported
melody, or set of appropriate search terms, it follows that     through an advanced search page.
the collaging technique, and its laid back approach, should         Depending on their source, not all MP3 files have their
provide a beneficial browsing enhancement for music seek-        metadata fields filled out appropriately. Quality control
ing users.                                                      is a hallmark of digital libraries that distinguishes them
    Previous implementations of collaging have been di-         from the web at large. To help with selection and main-
rected at using extracted images or text snippets to explore    tenance of documents in a digital library, Greenstone pro-
or search document collections that are primarily text [8,      vides a graphical interface for adding and editing existing
10] or images [2]. How appropriate is this technique likely     metadata, along with managing other workflow aspects of
to be when applied to collections of audio documents? In        a digital library lifecycle [1]. Using the graphical tool a
answer, we note that the visual collage-based browsing          user can shape their music digital library to their taste.
we describe in this paper is surprisingly close to a com-       Metadata need not be restricted to the fields present in
mon browsing activity observed in music stores—flicking          the MP3 files. A system of metadata sets is supported
through horizontal displays of music CDs. In this latter        that allows the use of existing standards, such as Dublin
activity, shoppers glance at the CD cover images as they        Core, as well as ad hoc sets developed by a user or com-
are flicked past, occasionally pausing to look more closely      munity of users. Through the augmentation of metadata
at an item of interest, perhaps to read its cover notes and     fields, richer searching and browsing structures can be de-
to “hear the song in my head” [5].                              veloped, for instance yielding a subject hierarchy for the
    The use of visual surrogates for audio files in a col-       collection.
lage also seems appropriate given evidence that appear-             Figure 2 shows a more novel form of browsing the col-
ance plays a significant role in the organization and dis-       lection. During the building phase of the digital library,
play of personal music collections—the attractiveness of        title and artist information contained in each MP3 file has
the CD covers, the arrangement of CDs on shelves, and           been used to locate two images deemed to be relevant to
the general effect achieved by prominently positioning the      the musical work. Collectively, these form the basis of the
collection as the centrepiece of a room [4]. Collaging is       collage. In broad brush strokes, one image at a time is ran-
designed to support the display of documents in an inter-       domly taken from the set and placed at a random location
esting and attractive manner, with the added benefit that        within the visualisation. As time passes, the image begins
the juxtaposition of documents in a novel and serendip-         to fade until, eventually, it can no longer be seen, at which
itous process may create compositions that the browser          point it is removed from the visualisation and added back
finds meaningful [11].                                           in to the set of possible choices. In parallel with this, new
                                                                images are being added at regular intervals. The selec-
                                                                tion method of acquiring images is heuristic based, which
                    3. PROTOTYPE
                                                                we explain more fully in the next section. Album covers,
                                                                photographs of the the actual vinyl disc—some younger
For pedagogical purposes a Greenstone digital library col-
                                                                users of the digital library collection may need this format
lection was built based on a user’s (modest) personal col-
                                                                explained!—and publicity shots of artists are most com-
lection of 80 MP3 files. The result can be seen in Fig-
                                                                mon.
ures 1 through 4. In addition to the commonplace brows-
                                                                    In Figure 2 the most prominent images in the collage
ing structures—such as songs arranged alphabetically by
                                                                are two record covers: The Beatles/1967–1970 (colloqui-
Title or grouped Artist, for instance—the music digital li-
                                                                ally known as the blue album) and in front of that the sin-
brary can provide additional functionality and form. In
                                                                gle’s cover for Another one bites the dust by Queen; to
a moment we will look at one of the more novel ideas,
                                                                the right of these there is also a picture of David Bowie
collaging tailored to music; here we will briefly mention
                                                                live in concert, although this is not so clearly reproduced
some more prosaic features that come from the existing
                                                                as a monochrome figure. If the user clicks on a particular
textual support within the digital library toolkit.
                                                                image, then a new browser window opens up, as is shown
    Figure 1 indeed shows the familiar “browse by artist”
                                                                in Figure 3 where the aforementioned David Bowie pic-
listing. Next to the bookshelf icon in the figure is dis-
played in brackets the number of songs by that artist, which       1 Greenstone does however support direct content querying of sym-

if clicked upon opens up to show the list. Further clicking     bolic music.
Figure 1. Browsing collection by artist




                                               Figure 2. Collaging in action.


ture has been targeted. The new window is a metaphor for        ing that there is an MP3 logo that links to the original file.
the selected record, and shows the two images associated        This allows a user to download the file if they wish or play
with the track (fashioned on the idea of opening up the         it using a different application. This last detail in itself is
CD cover) while the audio is played through an embed-           not especially astounding, but does help emphasize that
ded MP3 applet.                                                 digital libraries need to integrate with other aspects of the
                                                                user’s computer environment.
   Additional resources relating to the musical track are
hyperlinked through this page. If the user clicks on either
of the images, they are taken to the web where the im-                          4. IMPLEMENTATION
age was sourced from. In Figure 4 the user has clicked
on the right-hand image of the David Bowie “record” and         For rapid prototyping, Greenstone—a versatile open source
brought up series of interviews with the artist. In the case    digital library toolkit that the authors have considerable
of Take a chance on me by ABBA, to take another random          experience with—was chosen as an implementation frame-
example, the images take you to a discography site for the      work [16]. The toolkit incorporates a flexible architecture
band. The title of the song, displayed below the images, is     for digesting new document formats and customising how
also hyperlinked and is used to automatically formulate a       they are indexed, browsed and presented, making it ideal
query on Google Images to find, with a certain degree of         for our needs [17]. The architecture has four main compo-
accuracy, related items. There is also a small embedded         nents: 1) an internal XML document format to which all
Java applet that plays the track, primed to start once the      documents are converted when they are imported into the
page loads and to stream the audio if possible. Follow-         system (native media such as an MPEG video is bound
Figure 3. Viewing the work: two images, title and the
audio playing


to the format through a linking mechanism); 2) a set of
                                                                Figure 4. Original web page from which second David
parsers called plugins that process document and meta-
                                                                Bowie image sourced.
data formats; 3) a set of modules called classifiers that are
used to build browsing structures; and 4) a scheme for de-
signing individual digital library collections by providing     plugin that used the title and artist information to perform
a configuration file that specifies what kind of documents         a search on Google Images. To be precise, two exact
and metadata they are to contain, and what searching and        phrases are formed based on the file’s artist and title meta-
browsing facilities they provide.                               data and a search, restricted to JPEG files only, is issued
   To develop the music based prototype, a plugin for the       to Google Images. If for some reason the file is missing
MP3 format was needed; its primary purpose to bind the          either of these fields, an alternative search is generated
source MP3 file and digest the metadata stored in its ID3        based on the audio track’s filename, which due to informal
tags into the XML representation. To support the collag-        conventions that have grown up around raw audio files,
ing metaphor, a new classifier module was also required          often encodes similar title and artist information. Elemen-
along with an applet to drive the visualisation. We de-         tary manipulation of the filename is performed, such as
scribe each of these in turn.                                   converting underscores ( ) to spaces which are often used
                                                                as a substitute in filenames.
4.1. An MP3 Plugin                                                  For the resulting images identified by the Google search,
                                                                the first two valid matches are mirrored into the collec-
Plugins in Greenstone are written in Perl, and are accessed     tion’s document import area. Since the web is inherently
when a digital library collection is built. A substantial ar-   a noisy medium, even if the matching image is available,
ray of plugins already exists, processing formats as varied     it is sometimes the case that it is still not a valid JPEG
as HTML, Word, PowerPoint, PDF, Images, Email mes-              image. To help spot such anachronisms, we use identify,
sages, Excel spreadsheets and MARC data to name a few.          a command-line utility included in ImageMagick’s suite
There is also a plugin known as the UnknownPlug which           of open source image manipulation tools (see imagemag-
allows a digital library designer to digest, in a rudimentary   ick.org for more details). The necessary Perl code to im-
fashion, documents of a type that do not have a specificly       plement this totalled around 300 lines. Similar work could
targeted plugin by setting basic metadata such as the doc-      be done for comparable audio formats such as Ogg Vorbis
ument’s filename and file size, and by binding the native         and Windows Media Audio (WMA).
file into the XML format.
   UnknownPlug was the starting point for writing the
                                                                4.2. A Collaging Classifier
MP3 plugin, which—inheriting from this plugin—is es-
sentially a tailored version of this, where more detailed       To develop the collaging facility within Greenstone a mix-
metadata information, such as song title and artist, is ex-     ture of Perl and Java code was needed. The Perl code
tracted from the file. The lion’s share of this work, in         implemented a new classifier that linked document identi-
turn, is implemented by the MP3::Info package, avail-           fiers of musical works in the digital library to the associ-
able through the Comprehensive Perl Archive Network             ated images located through Google Images. This was not
(www.cpan.org). Retrieving an associate array of values         an unduly complex task as much of the required function-
through this package, each entry is added into the Green-       ality resides in a base class for classifiers. Roughly 200
stone system as metadata bearing the same tag label, pre-       lines of code was needed to develop the necessary struc-
fixed by ‘mp3’ as an XML namespace.                              ture. Care was given to capping the number of document
   With an eye on supporting collaging at runtime within        identifiers clustered together as one block, for runtime ef-
the digital library, an additional routine was added to the     ficiency. If the blocks get too large, then there is a no-
ticeable delay detectable in the user interface between the     An anecdotal example of this in use was with one user
information being retrieved and the resulting images being      who, on test-driving the prototype, remarked they’d al-
displayed.                                                      ways wondered if a particular song by The Beatles was I
   The Java code took the form of an applet to provide,         want to hold your hand or I wanna hold your hand. On
within the context of a web-browser based digital library,      seeing the two sample images displayed while the song
the necessary fine-grained interaction. The applet was           was playing, one of which was the single’s cover, they
multi-threaded, with one principle thread accessing the in-     were instantly able to satisfy their curiosity. Intrigued,
formation stored by the Collage classifier: once furnished       they then clicked on the image to explore further.
with a document’s identifier, the applet can make subse-
quent calls to access metadata and associated files. Since
                                                                4.4. Integration
the images associated with each MP3 file reside on the
Greenstone server, these need to be downloaded locally          The majority of the components written for the prototype—
by the applet for use in the visualisation. This too took       the plugin, classifier, and format statements—are natu-
place in a thread. Finally, a third thread was used to man-     rally drawn into Greenstone through a collection’s con-
age the graphic canvas in which the collage visualisation       figuration file. To operate with the extended functionality
was presented, set to refresh at regular intervals.             it is merely a matter of ensuring the modules of code are
   Alpha values—a technique in graphics to control the          placed in the correct directories.
transparency of pixels within an image—was used to im-              Integrating the collaging applet into the Greenstone en-
plement the fading of images over time. Earlier we de-          vironment required the editing of macro files. This is a
scribed the placement of images as random. This is an           mechanism within the toolkit that blends a digital library
over-simplification. To make better use of the real es-          site’s look and feel with the specification of functional as-
tate available within the visualization, a data-structure is    pects of the web page based interface, to precisely cater
maintained recording the areas currently occupied by the        for this sort of extension. Macro files typically set the
images in the display and how faded they are. When se-          global form within the digital library; the aforementioned
lecting a new “random” position to place a fresh image,         format statements override this for more local customisa-
several random positions are calculated and the one that        tions.
resides in the most dormant area is chosen.                         With this done a user can create a new collection, set
   Queues are kept of the downloaded and currently dis-         the collection configuration file to use the specialist plu-
played images. Parameters to the applet control aspects         gin and classifier, populate it with MP3 files and issue the
such as the dimensions and background colour of the can-        build command. Greenstone even provides a “base this
vas, and for conservation of memory (if desired), how           collection on” feature that means that once a baseline col-
large the various queues can grow. Compared with the            lection of this type has been established, future collections
code written for the new plugin and classifier, the applet       can inherit there configuration settings from this. Then the
was a more substantial—but still modest in the larger scheme    task becomes: start a new collection, populate it with doc-
of things—undertaking requiring 2,500 lines of code.            uments, and then build.

4.3. Runtime
                                                                                   5. CONCLUSION
The final component for developing the prototype within
Greenstone was to specify how “the works” (our MP3              The development of novel browsing tools in music dig-
files) were to be displayed. This is an aspect of digital        ital libraries represents a relatively overlooked research
library design that varies greatly depending on the types       domain. Empirical data suggest that browsing should be
of documents contained in a collection. Control of this in      seen as a fundamental aspect of music information re-
Greenstone is exerted through format statements, a mech-        trieval and music digital library use. To address this short-
anism for customising—on a collection by collection basis—      coming, we have developed and presented a proof-of-concept
presentation and functionality.                                 prototype that applies collaging, a novel form of digital li-
   For the display of MP3 target documents, it was de-          brary browsing, to musical data.
cided to rekindle the notion of a person perusing the al-           There are many directions in which to extend and de-
bum cover sleeve while listening to the music, albeit in a      velop the work described here. For example, we plan to
simplistic form. We have already seen the result of this in     explore the utility of affording users the ability to “drag
Figure 3. It was accomplished by writing a format state-        and drop” from the presented images to create a user-
ment that sets out a table in HTML with the two sample          defined playlist in future versions of this tool. The playlists
images (which, incidently, are stored as metadata for the       could actually form the bases for user-personalised sub-
document by the MP3 plugin at build-time) in the first row,      collections or mini-libraries drawn from larger, more het-
and the artist, title (more metadata) and embedded MP3          erogeneous music digital libraries.
Java applet in the second.                                          Beyond browsing, but closely related to the person-
   Additional syntax was included to hyperlink images           alization functions mentioned above, the music collag-
and title information to external resources on the web. The     ing technique can be used in conjunction with traditional
intention here is to engage the listener in related material.   search tools. Under this scenario, traditional search tools
could be enhanced to allow users to conduct deliberately        [9] A. Kerne. CollageMachine: an interactive agent
“broad” or “fuzzy” searches (i.e., music from the Haight-           of web recombination. Leonardo, 33(5):347–350,
Ashbury period, horror film music, etc.). The search re-             2000.
sults from these broad sets would then be presented via
the music collaging technique for “laid back” perusal by       [10] A. Kerne, M. Khandelwal, and V. Sundaram. Pub-
the user.                                                           lishing evolving metadocuments on the web. In
   We also envision implementations of the music col-               Proc. of the 14th ACM Conference on Hypertext
laging system that are presented on large, touch-sensitive          and Hypermedia, pages 104–105, Nottingham, UK,
screens located in record stores and music libraries. On            2003.
the opposite end of the spectrum, the music collaging tech-    [11] A. Kerne and V. Sundaram. A recombinant infor-
nique could equally run as a Personal Digital Assistant             mation space. In Proc. Computational Semiotics
(PDA) enhancement for users to access their MP3 collec-             in Games and New Media (CoSIGN), pages 48–57,
tions.                                                              2003.

                   6. REFERENCES                               [12] J. R. McPherson and D. Bainbridge. Usage of the
                                                                    MELDEX digital music library. In Proc. of the Sec-
 [1] D. Bainbridge, J. Thompson, and I. H. Witten. As-              ond Annual International Symposium on Music In-
     sembling and enriching digital library collections.            formation Retrieval: ISMIR, pages 19–20, 2001.
     In Proc. of the Third ACM and IEEE Joint Confer-
                                                               [13] J. Preece, Y. Rogers, H. Sharp, D. Benyon, S. Hol-
     ence on Digital Libraries, pages 323–334, Houston,
                                                                    land, and T. Carey. Human-Computer Interaction.
     Texas, 2003.
                                                                    Addison-Wesley, Wokingham, UK, 1994.
 [2] M. Chang and J. Leggett. Collection understanding
                                                               [14] A. Rauber, E. Pampalk, and D. Merkl. Using
     through streaming collage. In Proc. of the Informa-
                                                                    psycho-acoustic models and self-organizing maps to
     tion Visualization Interfaces for Retrieval and Anal-
                                                                    create a hierarchical structuring of music by sound
     ysis (IVARA) Workshop, associated with the Joint
                                                                    similarity. In Proc. of the Third International Con-
     Conference on Digital Libraries, Houston, Texas,
                                                                    ference on Music Information Retrieval: ISMIR,
     USA, May 2003.
                                                                    pages 71–80, 2002.
 [3] H. Chen, A. Houston, R. Sewell, and B. Schatz. In-
                                                               [15] G. Tzanetakis, G. Essl, and P. Cook. Automatic mu-
     ternet browsing and searching: User evaluations of
                                                                    sical genre classification of audio signals. In Proc. of
     category map and concept space techniques. Jour-
                                                                    the Second Annual International Symposium on Mu-
     nal of the American Society for Information Science,
                                                                    sic Information Retrieval: ISMIR, pages 205–210,
     49(7):582–603, 1998.
                                                                    2001.
 [4] S. Cunningham, M. Jones, and S. Jones. Organizing
                                                               [16] I. H. Witten and D. Bainbridge. How to build a digi-
     music for use: personal music collections. Technical
                                                                    tal library. Morgan Kaufmann, San Francisco, 2003.
     report, Department of Computer Science, University
     of Waikato, Hamilton, New Zealand, 2004.                  [17] I. H. Witten, D. Bainbridge, G. W. Paynter, and S. J.
                                                                    Boddie. Importing documents and metadata into
 [5] S. Cunningham, N. Reeves, and M. Britland. An
                                                                    digital libraries: Requirements analysis and an ex-
     ethnographic study of music information seeking:
                                                                    tensible architecture. In Proceedings of the Euro-
     implications for the design of a music digital library.
                                                                    pean Conference on Digital Libraries, pages 390–
     In Proc. of the Joint Conference on Digital Libraries,
                                                                    405, Sept. 2002.
     pages 5–16, Houston, Texas, USA, May 2003.
 [6] M. Jones, P. Jain, G. Buchanan, and G. Marsden. Us-
     ing a mobile device to vary the pace of search. In In-
     ternational Symposium on Mobile HCI, pages 8–11,
     Udine, Italy, September 2003.
 [7] S. Jones, S. Cunningham, R. McNab, and S. Boddie.
     A transaction log analysis of a digital library. Inter-
     national Journal on Digital Libraries, 3(2):152–169,
     2000.
 [8] U. Karadkar, A. Kerne, R. Furuta, L. Francisco-
     Revilla, F. Shipman, and J. Wang. Connecting in-
     terface metaphors to support creation of hyperme-
     dia collections. In Proc. European Conf Digital Li-
     braries, pages 338–349, 2003.

More Related Content

Viewers also liked

My paperimage
My paperimageMy paperimage
My paperimagePete Go
 
Soergel digitallibrariesandknowledgeorganization
Soergel digitallibrariesandknowledgeorganizationSoergel digitallibrariesandknowledgeorganization
Soergel digitallibrariesandknowledgeorganizationPete Go
 
New asia
New asiaNew asia
New asiaPete Go
 
Tf annual 03_final
Tf annual 03_finalTf annual 03_final
Tf annual 03_finalPete Go
 
Annual report 2011
Annual report 2011Annual report 2011
Annual report 2011Pete Go
 
5 s Production and Operation Management
5 s Production and Operation Management5 s Production and Operation Management
5 s Production and Operation Managementbhandariniharika
 

Viewers also liked (10)

Scan222
Scan222Scan222
Scan222
 
My paperimage
My paperimageMy paperimage
My paperimage
 
Regge
ReggeRegge
Regge
 
Scan222
Scan222Scan222
Scan222
 
ขอสอบ
ขอสอบขอสอบ
ขอสอบ
 
Soergel digitallibrariesandknowledgeorganization
Soergel digitallibrariesandknowledgeorganizationSoergel digitallibrariesandknowledgeorganization
Soergel digitallibrariesandknowledgeorganization
 
New asia
New asiaNew asia
New asia
 
Tf annual 03_final
Tf annual 03_finalTf annual 03_final
Tf annual 03_final
 
Annual report 2011
Annual report 2011Annual report 2011
Annual report 2011
 
5 s Production and Operation Management
5 s Production and Operation Management5 s Production and Operation Management
5 s Production and Operation Management
 

Similar to P072 page-397-paper223

Digital library services and the changing environment
Digital library services and the changing environmentDigital library services and the changing environment
Digital library services and the changing environmentJohn MacColl
 
Library discovery: past, present and some futures
Library discovery: past, present and some futuresLibrary discovery: past, present and some futures
Library discovery: past, present and some futureslisld
 
Cultural Objects in the Age of Digital Access
Cultural Objects in the Age of Digital AccessCultural Objects in the Age of Digital Access
Cultural Objects in the Age of Digital AccessFrancesco Spagnolo
 
Audio in a social Web of linked data
Audio in a social Web of linked dataAudio in a social Web of linked data
Audio in a social Web of linked dataEduserv Foundation
 
Discovery event peter burnhill (aggregation as tactic)
Discovery event peter burnhill (aggregation as tactic)Discovery event peter burnhill (aggregation as tactic)
Discovery event peter burnhill (aggregation as tactic)RDTF-Discovery
 
The network reconfigures the catalog
The network reconfigures the catalogThe network reconfigures the catalog
The network reconfigures the cataloglisld
 
4 technology trends every librarian needs to know
4 technology trends every librarian needs to know4 technology trends every librarian needs to know
4 technology trends every librarian needs to knowFacet Publishing
 
The rise of platforms could see off the web
The rise of platforms could see off the webThe rise of platforms could see off the web
The rise of platforms could see off the webKen Chad Consulting Ltd
 
Pratt Sils LIS653 4 Fall 2007
Pratt Sils LIS653 4 Fall 2007Pratt Sils LIS653 4 Fall 2007
Pratt Sils LIS653 4 Fall 2007PrattSILS
 
Bibliographic paradigm and the digital information resources
Bibliographic paradigm and the digital information resourcesBibliographic paradigm and the digital information resources
Bibliographic paradigm and the digital information resourcesNurhazman Abdul Aziz
 
The Inside Out Library.
The Inside Out Library. The Inside Out Library.
The Inside Out Library. lisld
 
Knowledge Isles in an Open Archipelago. The Open Archipelago Project
Knowledge Isles in an Open Archipelago. The Open Archipelago ProjectKnowledge Isles in an Open Archipelago. The Open Archipelago Project
Knowledge Isles in an Open Archipelago. The Open Archipelago ProjectUgo Eccli
 
Digital libraries: successfully designing developing and implementing your d...
Digital libraries:  successfully designing developing and implementing your d...Digital libraries:  successfully designing developing and implementing your d...
Digital libraries: successfully designing developing and implementing your d...Beatrice Amollo
 
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...University of Bari (Italy)
 
Increasing NUS Libraries' Visibility in the Virtual World - Updated
Increasing NUS Libraries' Visibility in the Virtual World - UpdatedIncreasing NUS Libraries' Visibility in the Virtual World - Updated
Increasing NUS Libraries' Visibility in the Virtual World - UpdatedKC Tan
 

Similar to P072 page-397-paper223 (20)

Metadata and Scotland’s information environment: potential benefits of Web 2.0
Metadata and Scotland’s information environment: potential benefits of Web 2.0Metadata and Scotland’s information environment: potential benefits of Web 2.0
Metadata and Scotland’s information environment: potential benefits of Web 2.0
 
confernece paper
confernece paperconfernece paper
confernece paper
 
Aggregation as tactic sm new
Aggregation as tactic sm newAggregation as tactic sm new
Aggregation as tactic sm new
 
Aggregation as Tactic
Aggregation as TacticAggregation as Tactic
Aggregation as Tactic
 
Digital library services and the changing environment
Digital library services and the changing environmentDigital library services and the changing environment
Digital library services and the changing environment
 
Digital library softaware greenstone & dsapce
Digital library softaware greenstone & dsapceDigital library softaware greenstone & dsapce
Digital library softaware greenstone & dsapce
 
Library discovery: past, present and some futures
Library discovery: past, present and some futuresLibrary discovery: past, present and some futures
Library discovery: past, present and some futures
 
Cultural Objects in the Age of Digital Access
Cultural Objects in the Age of Digital AccessCultural Objects in the Age of Digital Access
Cultural Objects in the Age of Digital Access
 
Audio in a social Web of linked data
Audio in a social Web of linked dataAudio in a social Web of linked data
Audio in a social Web of linked data
 
Discovery event peter burnhill (aggregation as tactic)
Discovery event peter burnhill (aggregation as tactic)Discovery event peter burnhill (aggregation as tactic)
Discovery event peter burnhill (aggregation as tactic)
 
The network reconfigures the catalog
The network reconfigures the catalogThe network reconfigures the catalog
The network reconfigures the catalog
 
4 technology trends every librarian needs to know
4 technology trends every librarian needs to know4 technology trends every librarian needs to know
4 technology trends every librarian needs to know
 
The rise of platforms could see off the web
The rise of platforms could see off the webThe rise of platforms could see off the web
The rise of platforms could see off the web
 
Pratt Sils LIS653 4 Fall 2007
Pratt Sils LIS653 4 Fall 2007Pratt Sils LIS653 4 Fall 2007
Pratt Sils LIS653 4 Fall 2007
 
Bibliographic paradigm and the digital information resources
Bibliographic paradigm and the digital information resourcesBibliographic paradigm and the digital information resources
Bibliographic paradigm and the digital information resources
 
The Inside Out Library.
The Inside Out Library. The Inside Out Library.
The Inside Out Library.
 
Knowledge Isles in an Open Archipelago. The Open Archipelago Project
Knowledge Isles in an Open Archipelago. The Open Archipelago ProjectKnowledge Isles in an Open Archipelago. The Open Archipelago Project
Knowledge Isles in an Open Archipelago. The Open Archipelago Project
 
Digital libraries: successfully designing developing and implementing your d...
Digital libraries:  successfully designing developing and implementing your d...Digital libraries:  successfully designing developing and implementing your d...
Digital libraries: successfully designing developing and implementing your d...
 
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
 
Increasing NUS Libraries' Visibility in the Virtual World - Updated
Increasing NUS Libraries' Visibility in the Virtual World - UpdatedIncreasing NUS Libraries' Visibility in the Virtual World - Updated
Increasing NUS Libraries' Visibility in the Virtual World - Updated
 

Recently uploaded

भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 

Recently uploaded (20)

भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 

P072 page-397-paper223

  • 1. VISUAL COLLAGING OF MUSIC IN A DIGITAL LIBRARY David Bainbridge Sally Jo Cunningham J. Stephen Downie Dept. of Computer Science Dept. of Computer Science Graduate School of Library University of Waikato University of Waikato and Information Science Hamilton Hamilton University of Illinois New Zealand New Zealand Urbana-Champaign, IL, USA. ABSTRACT currently. Images already in the visualisation fade over time and as new images are retrieved from the various This article explores the role visual browsing can play sites, they are added in to the visualisation on top of what within a digital music library. The context to the work is already there. If at any stage a displayed image sparks is provided through a review of related techniques drawn some interest, the user can click on it and have a new web from the fields of digital libraries and human computer in- browser open up displaying the web page the image comes teraction. Implemented within the open source digital li- from. brary toolkit Greenstone, a prototype system is described This paper is organized as follows: Section 2 reviews that combines images located through textual metadata related work in collaging and browsing in digital libraries, with a visualisation technique known as collaging to pro- and the “laid back” approach to interacting with informa- vide a leisurely, undirected interaction with a music col- tion collections; Section 3 presents a walkthrough of a lection. Emphasis in the article is given to the augmenta- prototype music collection that includes collaging; Sec- tions of the basic technique to work in the musical domain. tion 4 describes implementation issues for creating the prototype; and Section 5 is a summary of our work. 1. INTRODUCTION Looking at patterns of use in real music collections—phy- 2. RELATED WORK sical libraries, music stores—we see people (of their own volition) spending a significant amount of time looking, Previous graphics-based browsing tools for music docu- browsing, sifting through the music/documents [5]. This ment collections primarily support visualization of entire is in stark contrast to observed behavior in digital libraries; collections; for example, as a self-organising map [14] or transaction log analysis reveals brief sessions, few searches, a 3D vector space [15] showing documents clustered by short queries, and few documents viewed [7, 12]. Is this genre. While these visualisations can give an immedi- a true reflection of how users, in general, wish to interact ate, powerful overview of relative size of groupings and with digital repositories or merely an artifact of browsing a sense of distance between clusters, they are less effec- mechanisms that are ineffectual or too cumbersome to be tive in supporting browsing of the documents themselves, used? This issue has led to various research initiatives de- since the idea of an overview means that the documents veloping richer forms of support for browsing with text- must be minimised and represented at the top (browsing) based digital libraries, for instance [3, 15, 14]. Given a level in a relatively impoverished fashion (for example, by strong desire shown by people to browse real music col- a single point in a 3D space) [3]. lections then the ability to support this activity within a Earlier work applying collaging to information seeking digital music library seems especially important. focused on using the resulting collage as a search tool, for This provided the motivation for the work described in example to build a collection of Web documents based on this paper: the adaptation of a visual technique known as a CollageMachine search/browse over Google [8]. Based collaging to the music domain. Collaging is a technique on user evaluation studies, collaging seems ill-suited to devised by Kerne [9] for use in the more general environ- supporting more formal information seeking activities such ment of the web. In one example of its use, a user keys as direct search; users objected to the lack of predictabil- in several web domains—say mtv.com, archive.org/audio, ity in the collaging, and the lack of transparency in the and ceolas.org—presses go and then observes a visualisa- relationship between the search terms that seed the col- tion of images drawn from crawling these web sites con- lage and the collaged “results.” Collaging appears best suited to providing a browsing experience that is engag- Permission to make digital or hard copies of all or part of this work for ing and interesting, but that does not require the direc- personal or classroom use is granted without fee provided that copies tion and focus on the part of the user that is characteristic are not made or distributed for profit or commercial advantage and that of searching or even of other types of browsing (for ex- copies bear this notice and the full citation on the first page. ample, browsing by metadata or by genre). This type of c 2004 Universitat Pompeu Fabra. activity, where the system selects information for viewing
  • 2. based on little (or even no!) control from the user, has been on a particular item will result in the song being played. termed “laid back,” to distinguish it from the more active The ability to search document content is one of the and user-directed “sit forward” information seeking activ- pillars upon which digital libraries are based, and although ities [6]. Greenstone does not currently include support for direct The utility of the laid back approach to browsing has content searching of raw audio files, 1 in this context the supporting evidence in the HCI, cognitive science and psy- metadata stored in ID3 tags makes for a useful foundation chology of memory literatures. It is now a well-known upon which to base textual queries. The digital library trope in these literatures that “recognition” is greater than toolkit is flexible over what indexes can be formed: for “recall” [13]. Since laid back browsing is premised on the this collection it was decided to index titles, artist, and user recognising an association presented by the collag- a conglomeration of all text fields extracted from a file ing interface rather than having to recall a specific fact, (akin to full-text search). Field searching is also supported melody, or set of appropriate search terms, it follows that through an advanced search page. the collaging technique, and its laid back approach, should Depending on their source, not all MP3 files have their provide a beneficial browsing enhancement for music seek- metadata fields filled out appropriately. Quality control ing users. is a hallmark of digital libraries that distinguishes them Previous implementations of collaging have been di- from the web at large. To help with selection and main- rected at using extracted images or text snippets to explore tenance of documents in a digital library, Greenstone pro- or search document collections that are primarily text [8, vides a graphical interface for adding and editing existing 10] or images [2]. How appropriate is this technique likely metadata, along with managing other workflow aspects of to be when applied to collections of audio documents? In a digital library lifecycle [1]. Using the graphical tool a answer, we note that the visual collage-based browsing user can shape their music digital library to their taste. we describe in this paper is surprisingly close to a com- Metadata need not be restricted to the fields present in mon browsing activity observed in music stores—flicking the MP3 files. A system of metadata sets is supported through horizontal displays of music CDs. In this latter that allows the use of existing standards, such as Dublin activity, shoppers glance at the CD cover images as they Core, as well as ad hoc sets developed by a user or com- are flicked past, occasionally pausing to look more closely munity of users. Through the augmentation of metadata at an item of interest, perhaps to read its cover notes and fields, richer searching and browsing structures can be de- to “hear the song in my head” [5]. veloped, for instance yielding a subject hierarchy for the The use of visual surrogates for audio files in a col- collection. lage also seems appropriate given evidence that appear- Figure 2 shows a more novel form of browsing the col- ance plays a significant role in the organization and dis- lection. During the building phase of the digital library, play of personal music collections—the attractiveness of title and artist information contained in each MP3 file has the CD covers, the arrangement of CDs on shelves, and been used to locate two images deemed to be relevant to the general effect achieved by prominently positioning the the musical work. Collectively, these form the basis of the collection as the centrepiece of a room [4]. Collaging is collage. In broad brush strokes, one image at a time is ran- designed to support the display of documents in an inter- domly taken from the set and placed at a random location esting and attractive manner, with the added benefit that within the visualisation. As time passes, the image begins the juxtaposition of documents in a novel and serendip- to fade until, eventually, it can no longer be seen, at which itous process may create compositions that the browser point it is removed from the visualisation and added back finds meaningful [11]. in to the set of possible choices. In parallel with this, new images are being added at regular intervals. The selec- tion method of acquiring images is heuristic based, which 3. PROTOTYPE we explain more fully in the next section. Album covers, photographs of the the actual vinyl disc—some younger For pedagogical purposes a Greenstone digital library col- users of the digital library collection may need this format lection was built based on a user’s (modest) personal col- explained!—and publicity shots of artists are most com- lection of 80 MP3 files. The result can be seen in Fig- mon. ures 1 through 4. In addition to the commonplace brows- In Figure 2 the most prominent images in the collage ing structures—such as songs arranged alphabetically by are two record covers: The Beatles/1967–1970 (colloqui- Title or grouped Artist, for instance—the music digital li- ally known as the blue album) and in front of that the sin- brary can provide additional functionality and form. In gle’s cover for Another one bites the dust by Queen; to a moment we will look at one of the more novel ideas, the right of these there is also a picture of David Bowie collaging tailored to music; here we will briefly mention live in concert, although this is not so clearly reproduced some more prosaic features that come from the existing as a monochrome figure. If the user clicks on a particular textual support within the digital library toolkit. image, then a new browser window opens up, as is shown Figure 1 indeed shows the familiar “browse by artist” in Figure 3 where the aforementioned David Bowie pic- listing. Next to the bookshelf icon in the figure is dis- played in brackets the number of songs by that artist, which 1 Greenstone does however support direct content querying of sym- if clicked upon opens up to show the list. Further clicking bolic music.
  • 3. Figure 1. Browsing collection by artist Figure 2. Collaging in action. ture has been targeted. The new window is a metaphor for ing that there is an MP3 logo that links to the original file. the selected record, and shows the two images associated This allows a user to download the file if they wish or play with the track (fashioned on the idea of opening up the it using a different application. This last detail in itself is CD cover) while the audio is played through an embed- not especially astounding, but does help emphasize that ded MP3 applet. digital libraries need to integrate with other aspects of the user’s computer environment. Additional resources relating to the musical track are hyperlinked through this page. If the user clicks on either of the images, they are taken to the web where the im- 4. IMPLEMENTATION age was sourced from. In Figure 4 the user has clicked on the right-hand image of the David Bowie “record” and For rapid prototyping, Greenstone—a versatile open source brought up series of interviews with the artist. In the case digital library toolkit that the authors have considerable of Take a chance on me by ABBA, to take another random experience with—was chosen as an implementation frame- example, the images take you to a discography site for the work [16]. The toolkit incorporates a flexible architecture band. The title of the song, displayed below the images, is for digesting new document formats and customising how also hyperlinked and is used to automatically formulate a they are indexed, browsed and presented, making it ideal query on Google Images to find, with a certain degree of for our needs [17]. The architecture has four main compo- accuracy, related items. There is also a small embedded nents: 1) an internal XML document format to which all Java applet that plays the track, primed to start once the documents are converted when they are imported into the page loads and to stream the audio if possible. Follow- system (native media such as an MPEG video is bound
  • 4. Figure 3. Viewing the work: two images, title and the audio playing to the format through a linking mechanism); 2) a set of Figure 4. Original web page from which second David parsers called plugins that process document and meta- Bowie image sourced. data formats; 3) a set of modules called classifiers that are used to build browsing structures; and 4) a scheme for de- signing individual digital library collections by providing plugin that used the title and artist information to perform a configuration file that specifies what kind of documents a search on Google Images. To be precise, two exact and metadata they are to contain, and what searching and phrases are formed based on the file’s artist and title meta- browsing facilities they provide. data and a search, restricted to JPEG files only, is issued To develop the music based prototype, a plugin for the to Google Images. If for some reason the file is missing MP3 format was needed; its primary purpose to bind the either of these fields, an alternative search is generated source MP3 file and digest the metadata stored in its ID3 based on the audio track’s filename, which due to informal tags into the XML representation. To support the collag- conventions that have grown up around raw audio files, ing metaphor, a new classifier module was also required often encodes similar title and artist information. Elemen- along with an applet to drive the visualisation. We de- tary manipulation of the filename is performed, such as scribe each of these in turn. converting underscores ( ) to spaces which are often used as a substitute in filenames. 4.1. An MP3 Plugin For the resulting images identified by the Google search, the first two valid matches are mirrored into the collec- Plugins in Greenstone are written in Perl, and are accessed tion’s document import area. Since the web is inherently when a digital library collection is built. A substantial ar- a noisy medium, even if the matching image is available, ray of plugins already exists, processing formats as varied it is sometimes the case that it is still not a valid JPEG as HTML, Word, PowerPoint, PDF, Images, Email mes- image. To help spot such anachronisms, we use identify, sages, Excel spreadsheets and MARC data to name a few. a command-line utility included in ImageMagick’s suite There is also a plugin known as the UnknownPlug which of open source image manipulation tools (see imagemag- allows a digital library designer to digest, in a rudimentary ick.org for more details). The necessary Perl code to im- fashion, documents of a type that do not have a specificly plement this totalled around 300 lines. Similar work could targeted plugin by setting basic metadata such as the doc- be done for comparable audio formats such as Ogg Vorbis ument’s filename and file size, and by binding the native and Windows Media Audio (WMA). file into the XML format. UnknownPlug was the starting point for writing the 4.2. A Collaging Classifier MP3 plugin, which—inheriting from this plugin—is es- sentially a tailored version of this, where more detailed To develop the collaging facility within Greenstone a mix- metadata information, such as song title and artist, is ex- ture of Perl and Java code was needed. The Perl code tracted from the file. The lion’s share of this work, in implemented a new classifier that linked document identi- turn, is implemented by the MP3::Info package, avail- fiers of musical works in the digital library to the associ- able through the Comprehensive Perl Archive Network ated images located through Google Images. This was not (www.cpan.org). Retrieving an associate array of values an unduly complex task as much of the required function- through this package, each entry is added into the Green- ality resides in a base class for classifiers. Roughly 200 stone system as metadata bearing the same tag label, pre- lines of code was needed to develop the necessary struc- fixed by ‘mp3’ as an XML namespace. ture. Care was given to capping the number of document With an eye on supporting collaging at runtime within identifiers clustered together as one block, for runtime ef- the digital library, an additional routine was added to the ficiency. If the blocks get too large, then there is a no-
  • 5. ticeable delay detectable in the user interface between the An anecdotal example of this in use was with one user information being retrieved and the resulting images being who, on test-driving the prototype, remarked they’d al- displayed. ways wondered if a particular song by The Beatles was I The Java code took the form of an applet to provide, want to hold your hand or I wanna hold your hand. On within the context of a web-browser based digital library, seeing the two sample images displayed while the song the necessary fine-grained interaction. The applet was was playing, one of which was the single’s cover, they multi-threaded, with one principle thread accessing the in- were instantly able to satisfy their curiosity. Intrigued, formation stored by the Collage classifier: once furnished they then clicked on the image to explore further. with a document’s identifier, the applet can make subse- quent calls to access metadata and associated files. Since 4.4. Integration the images associated with each MP3 file reside on the Greenstone server, these need to be downloaded locally The majority of the components written for the prototype— by the applet for use in the visualisation. This too took the plugin, classifier, and format statements—are natu- place in a thread. Finally, a third thread was used to man- rally drawn into Greenstone through a collection’s con- age the graphic canvas in which the collage visualisation figuration file. To operate with the extended functionality was presented, set to refresh at regular intervals. it is merely a matter of ensuring the modules of code are Alpha values—a technique in graphics to control the placed in the correct directories. transparency of pixels within an image—was used to im- Integrating the collaging applet into the Greenstone en- plement the fading of images over time. Earlier we de- vironment required the editing of macro files. This is a scribed the placement of images as random. This is an mechanism within the toolkit that blends a digital library over-simplification. To make better use of the real es- site’s look and feel with the specification of functional as- tate available within the visualization, a data-structure is pects of the web page based interface, to precisely cater maintained recording the areas currently occupied by the for this sort of extension. Macro files typically set the images in the display and how faded they are. When se- global form within the digital library; the aforementioned lecting a new “random” position to place a fresh image, format statements override this for more local customisa- several random positions are calculated and the one that tions. resides in the most dormant area is chosen. With this done a user can create a new collection, set Queues are kept of the downloaded and currently dis- the collection configuration file to use the specialist plu- played images. Parameters to the applet control aspects gin and classifier, populate it with MP3 files and issue the such as the dimensions and background colour of the can- build command. Greenstone even provides a “base this vas, and for conservation of memory (if desired), how collection on” feature that means that once a baseline col- large the various queues can grow. Compared with the lection of this type has been established, future collections code written for the new plugin and classifier, the applet can inherit there configuration settings from this. Then the was a more substantial—but still modest in the larger scheme task becomes: start a new collection, populate it with doc- of things—undertaking requiring 2,500 lines of code. uments, and then build. 4.3. Runtime 5. CONCLUSION The final component for developing the prototype within Greenstone was to specify how “the works” (our MP3 The development of novel browsing tools in music dig- files) were to be displayed. This is an aspect of digital ital libraries represents a relatively overlooked research library design that varies greatly depending on the types domain. Empirical data suggest that browsing should be of documents contained in a collection. Control of this in seen as a fundamental aspect of music information re- Greenstone is exerted through format statements, a mech- trieval and music digital library use. To address this short- anism for customising—on a collection by collection basis— coming, we have developed and presented a proof-of-concept presentation and functionality. prototype that applies collaging, a novel form of digital li- For the display of MP3 target documents, it was de- brary browsing, to musical data. cided to rekindle the notion of a person perusing the al- There are many directions in which to extend and de- bum cover sleeve while listening to the music, albeit in a velop the work described here. For example, we plan to simplistic form. We have already seen the result of this in explore the utility of affording users the ability to “drag Figure 3. It was accomplished by writing a format state- and drop” from the presented images to create a user- ment that sets out a table in HTML with the two sample defined playlist in future versions of this tool. The playlists images (which, incidently, are stored as metadata for the could actually form the bases for user-personalised sub- document by the MP3 plugin at build-time) in the first row, collections or mini-libraries drawn from larger, more het- and the artist, title (more metadata) and embedded MP3 erogeneous music digital libraries. Java applet in the second. Beyond browsing, but closely related to the person- Additional syntax was included to hyperlink images alization functions mentioned above, the music collag- and title information to external resources on the web. The ing technique can be used in conjunction with traditional intention here is to engage the listener in related material. search tools. Under this scenario, traditional search tools
  • 6. could be enhanced to allow users to conduct deliberately [9] A. Kerne. CollageMachine: an interactive agent “broad” or “fuzzy” searches (i.e., music from the Haight- of web recombination. Leonardo, 33(5):347–350, Ashbury period, horror film music, etc.). The search re- 2000. sults from these broad sets would then be presented via the music collaging technique for “laid back” perusal by [10] A. Kerne, M. Khandelwal, and V. Sundaram. Pub- the user. lishing evolving metadocuments on the web. In We also envision implementations of the music col- Proc. of the 14th ACM Conference on Hypertext laging system that are presented on large, touch-sensitive and Hypermedia, pages 104–105, Nottingham, UK, screens located in record stores and music libraries. On 2003. the opposite end of the spectrum, the music collaging tech- [11] A. Kerne and V. Sundaram. A recombinant infor- nique could equally run as a Personal Digital Assistant mation space. In Proc. Computational Semiotics (PDA) enhancement for users to access their MP3 collec- in Games and New Media (CoSIGN), pages 48–57, tions. 2003. 6. REFERENCES [12] J. R. McPherson and D. Bainbridge. Usage of the MELDEX digital music library. In Proc. of the Sec- [1] D. Bainbridge, J. Thompson, and I. H. Witten. As- ond Annual International Symposium on Music In- sembling and enriching digital library collections. formation Retrieval: ISMIR, pages 19–20, 2001. In Proc. of the Third ACM and IEEE Joint Confer- [13] J. Preece, Y. Rogers, H. Sharp, D. Benyon, S. Hol- ence on Digital Libraries, pages 323–334, Houston, land, and T. Carey. Human-Computer Interaction. Texas, 2003. Addison-Wesley, Wokingham, UK, 1994. [2] M. Chang and J. Leggett. Collection understanding [14] A. Rauber, E. Pampalk, and D. Merkl. Using through streaming collage. In Proc. of the Informa- psycho-acoustic models and self-organizing maps to tion Visualization Interfaces for Retrieval and Anal- create a hierarchical structuring of music by sound ysis (IVARA) Workshop, associated with the Joint similarity. In Proc. of the Third International Con- Conference on Digital Libraries, Houston, Texas, ference on Music Information Retrieval: ISMIR, USA, May 2003. pages 71–80, 2002. [3] H. Chen, A. Houston, R. Sewell, and B. Schatz. In- [15] G. Tzanetakis, G. Essl, and P. Cook. Automatic mu- ternet browsing and searching: User evaluations of sical genre classification of audio signals. In Proc. of category map and concept space techniques. Jour- the Second Annual International Symposium on Mu- nal of the American Society for Information Science, sic Information Retrieval: ISMIR, pages 205–210, 49(7):582–603, 1998. 2001. [4] S. Cunningham, M. Jones, and S. Jones. Organizing [16] I. H. Witten and D. Bainbridge. How to build a digi- music for use: personal music collections. Technical tal library. Morgan Kaufmann, San Francisco, 2003. report, Department of Computer Science, University of Waikato, Hamilton, New Zealand, 2004. [17] I. H. Witten, D. Bainbridge, G. W. Paynter, and S. J. Boddie. Importing documents and metadata into [5] S. Cunningham, N. Reeves, and M. Britland. An digital libraries: Requirements analysis and an ex- ethnographic study of music information seeking: tensible architecture. In Proceedings of the Euro- implications for the design of a music digital library. pean Conference on Digital Libraries, pages 390– In Proc. of the Joint Conference on Digital Libraries, 405, Sept. 2002. pages 5–16, Houston, Texas, USA, May 2003. [6] M. Jones, P. Jain, G. Buchanan, and G. Marsden. Us- ing a mobile device to vary the pace of search. In In- ternational Symposium on Mobile HCI, pages 8–11, Udine, Italy, September 2003. [7] S. Jones, S. Cunningham, R. McNab, and S. Boddie. A transaction log analysis of a digital library. Inter- national Journal on Digital Libraries, 3(2):152–169, 2000. [8] U. Karadkar, A. Kerne, R. Furuta, L. Francisco- Revilla, F. Shipman, and J. Wang. Connecting in- terface metaphors to support creation of hyperme- dia collections. In Proc. European Conf Digital Li- braries, pages 338–349, 2003.