This document summarizes the CyberStacks prototype, which applies traditional library classification and organization methods to facilitate access to science and technology resources on the World Wide Web. Specifically, it uses the Library of Congress classification system to organize selected Internet resources into categories. This allows users to browse broad subjects and drill down into more specific topics. The goals are to reduce cognitive load for users and provide context among related resources. Key elements include selecting high-quality reference materials, classifying resources based on subject coverage, and presenting descriptive annotations to help users evaluate resources. The document argues this approach can make Internet resources easier to navigate for users familiar with traditional library systems and services.
W3C Library Linked Data Incubator Group: Review of the Final ReportF. Tim Knight
This report is a snapshot describing the current state of library data management. It outlines the potential benefits of publishing library data as Linked Data and provides recommendations for library standards bodies, data and systems designers, librarians and archivists, and library leaders.
There are two supplementary reports that provide additional detail. The first is the "Use Cases" describing library applications that take advantage of the benefits of adopting Linked Data standards and principles involved in publishing things like bibliographic data, concept schemes, and authority files. The second supplementary report "Datasets, Value Vocabularies, and Metadata Element Sets" provides a list of resources available for creating library Linked Data . There are several additional documents available on the W3C's Semantic Web wiki <http: /> and there is discussion list public-lld <http: />, which are both open to interested members of the public.
An adaptive clustering and classification algorithm for Twitter data streamin...TELKOMNIKA JOURNAL
On-going big data from social networks sites alike Twitter or Facebook has been an entrancing
hotspot for investigation by researchers in current decades as a result of various aspects including
up-to-date-ness, accessibility and popularity; however anyway there may be a trade off in accuracy.
Moreover, clustering of twitter data has caught the attention of researchers. As such, an algorithm which
can cluster data within a lesser computational time, especially for data streaming is needed. The presented
adaptive clustering and classification algorithm is used for data streaming in Apache spark to overcome
the existing problems is processed in two phases. In the first phase, the input pre-processed twitter data is
viably clustered utilizing an Improved Fuzzy C-means clustering and the proposed clustering is additionally
improved by an Adaptive Particle swarm optimization (PSO) algorithm. Further the clustered data
streaming is assessed utilizing spark engine. In the second phase, the input pre-processed Higgs data is
classified utilizing the modified support vector machine (MSVM) classifier with grid search optimization.
At long last the optimized information is assessed in spark engine and the assessed esteem is utilized to
discover an accomplished confusion matrix. The proposed work is utilizing Twitter dataset and Higgs
dataset for the data streaming in Apache Spark. The computational examinations exhibit the superiority of
presented approach comparing with the existing methods in terms of precision, recall, F-score,
convergence, ROC curve and accuracy.
Presented at the Northern Ohio Technical Services Librarians' meeting, November 22, 2013. Describes why libraries should move toward a linked data future to enable their resources to be discoverable on the open web, and includes lessons learned from developing the eXtensible Catalog at the University of Rochester.
Odam an optimized distributed association rule mining algorithm (synopsis)Mumbai Academisc
This document proposes ODAM, an optimized distributed association rule mining algorithm. It aims to discover rules based on higher-order associations between items in distributed textual documents that are neither vertically nor horizontally distributed, but rather a hybrid of the two. Modern organizations have geographically distributed data stored locally at each site, making centralized data mining infeasible due to high communication costs. Distributed data mining emerged to address this challenge. ODAM reduces communication costs compared to previous distributed ARM algorithms by mining patterns across distributed databases without requiring data consolidation.
RDAP 16 Poster: A Proposed Course Model for Integrating RDM with Research Rep...ASIS&T
Research Data Access and Preservation Summit, 2016
Atlanta, GA
May 4-7, 2016
Poster session (Wednesday, May 4)
Poster available at: https://repository.library.brown.edu/studio/item/bdr:650020/
Presenters:
Andrew Creamer, Brown University
Hope Lappen, Brown University
John Santiago, Brown University
Presenters:
Ag Data Commons: Agricultural research metadata and dataCyndy Parr
The document proposes the Ag Data Commons as a solution to address challenges with agricultural research data by creating a central repository to host metadata and data according to federal directives for public access. It outlines the goals of the Ag Data Commons to support public access mandates through a sustainable platform for hosting and sharing agricultural research data and metadata in both human and machine-readable formats. The document also provides details on the workflow for submitting and publishing data on the Ag Data Commons to ensure standardized metadata and compliance with best practices.
Implementing web scale discovery services: special reference to Indian Librar...Nikesh Narayanan
Web scale Discovery services arebecoming the widely adopted Information Retrieval solution in libraries across the world to connect its patrons with the relevant information they seek. In lieu with the world trend, Resources Discovery Solution implementation is gathering momentum in Indian libraries also.
Considering the Indian Libraries scenario, this paper attempts to provide an overview of Library Web Scale Discovery solutions, its need in Indian Libraries, important parameters to be considered for evaluation of Discovery Services, essential factors to be considered prior to implementation, stages of implementation and finally some thoughts on post implementation analysis for measuring the success.
Web scale Discovery services are becoming the most sought after solution for Libraries to connect its patrons with the relevant information they seek. Many studies show that these services are getting wide acceptance from users as well as Library staff and making revolution in Library Information retrieval arena. Given such broad implications, selecting a new discovery service for libraries is an important undertaking. Library professionals should carefully evaluate options to meet their goal of finding the best potential match for their library. This Paper attempts to provide a comprehensive overview of Library Web Scale Discovery solutions by depicting various facets of Web Scale Discovery, how it differs from federated searching and highlights the important parameters to be considered for taking an informed and confident decision on selecting discovery service.
W3C Library Linked Data Incubator Group: Review of the Final ReportF. Tim Knight
This report is a snapshot describing the current state of library data management. It outlines the potential benefits of publishing library data as Linked Data and provides recommendations for library standards bodies, data and systems designers, librarians and archivists, and library leaders.
There are two supplementary reports that provide additional detail. The first is the "Use Cases" describing library applications that take advantage of the benefits of adopting Linked Data standards and principles involved in publishing things like bibliographic data, concept schemes, and authority files. The second supplementary report "Datasets, Value Vocabularies, and Metadata Element Sets" provides a list of resources available for creating library Linked Data . There are several additional documents available on the W3C's Semantic Web wiki <http: /> and there is discussion list public-lld <http: />, which are both open to interested members of the public.
An adaptive clustering and classification algorithm for Twitter data streamin...TELKOMNIKA JOURNAL
On-going big data from social networks sites alike Twitter or Facebook has been an entrancing
hotspot for investigation by researchers in current decades as a result of various aspects including
up-to-date-ness, accessibility and popularity; however anyway there may be a trade off in accuracy.
Moreover, clustering of twitter data has caught the attention of researchers. As such, an algorithm which
can cluster data within a lesser computational time, especially for data streaming is needed. The presented
adaptive clustering and classification algorithm is used for data streaming in Apache spark to overcome
the existing problems is processed in two phases. In the first phase, the input pre-processed twitter data is
viably clustered utilizing an Improved Fuzzy C-means clustering and the proposed clustering is additionally
improved by an Adaptive Particle swarm optimization (PSO) algorithm. Further the clustered data
streaming is assessed utilizing spark engine. In the second phase, the input pre-processed Higgs data is
classified utilizing the modified support vector machine (MSVM) classifier with grid search optimization.
At long last the optimized information is assessed in spark engine and the assessed esteem is utilized to
discover an accomplished confusion matrix. The proposed work is utilizing Twitter dataset and Higgs
dataset for the data streaming in Apache Spark. The computational examinations exhibit the superiority of
presented approach comparing with the existing methods in terms of precision, recall, F-score,
convergence, ROC curve and accuracy.
Presented at the Northern Ohio Technical Services Librarians' meeting, November 22, 2013. Describes why libraries should move toward a linked data future to enable their resources to be discoverable on the open web, and includes lessons learned from developing the eXtensible Catalog at the University of Rochester.
Odam an optimized distributed association rule mining algorithm (synopsis)Mumbai Academisc
This document proposes ODAM, an optimized distributed association rule mining algorithm. It aims to discover rules based on higher-order associations between items in distributed textual documents that are neither vertically nor horizontally distributed, but rather a hybrid of the two. Modern organizations have geographically distributed data stored locally at each site, making centralized data mining infeasible due to high communication costs. Distributed data mining emerged to address this challenge. ODAM reduces communication costs compared to previous distributed ARM algorithms by mining patterns across distributed databases without requiring data consolidation.
RDAP 16 Poster: A Proposed Course Model for Integrating RDM with Research Rep...ASIS&T
Research Data Access and Preservation Summit, 2016
Atlanta, GA
May 4-7, 2016
Poster session (Wednesday, May 4)
Poster available at: https://repository.library.brown.edu/studio/item/bdr:650020/
Presenters:
Andrew Creamer, Brown University
Hope Lappen, Brown University
John Santiago, Brown University
Presenters:
Ag Data Commons: Agricultural research metadata and dataCyndy Parr
The document proposes the Ag Data Commons as a solution to address challenges with agricultural research data by creating a central repository to host metadata and data according to federal directives for public access. It outlines the goals of the Ag Data Commons to support public access mandates through a sustainable platform for hosting and sharing agricultural research data and metadata in both human and machine-readable formats. The document also provides details on the workflow for submitting and publishing data on the Ag Data Commons to ensure standardized metadata and compliance with best practices.
Implementing web scale discovery services: special reference to Indian Librar...Nikesh Narayanan
Web scale Discovery services arebecoming the widely adopted Information Retrieval solution in libraries across the world to connect its patrons with the relevant information they seek. In lieu with the world trend, Resources Discovery Solution implementation is gathering momentum in Indian libraries also.
Considering the Indian Libraries scenario, this paper attempts to provide an overview of Library Web Scale Discovery solutions, its need in Indian Libraries, important parameters to be considered for evaluation of Discovery Services, essential factors to be considered prior to implementation, stages of implementation and finally some thoughts on post implementation analysis for measuring the success.
Web scale Discovery services are becoming the most sought after solution for Libraries to connect its patrons with the relevant information they seek. Many studies show that these services are getting wide acceptance from users as well as Library staff and making revolution in Library Information retrieval arena. Given such broad implications, selecting a new discovery service for libraries is an important undertaking. Library professionals should carefully evaluate options to meet their goal of finding the best potential match for their library. This Paper attempts to provide a comprehensive overview of Library Web Scale Discovery solutions by depicting various facets of Web Scale Discovery, how it differs from federated searching and highlights the important parameters to be considered for taking an informed and confident decision on selecting discovery service.
This document discusses peer review and quality assurance in the age of author self-archiving. It presents arguments from Stevan Harnad in support of author self-archiving and open access. Harnad believes self-archiving maximizes research access, use and impact. However, he also argues that peer review is still essential to ensure quality control and certification of research. The document then discusses various problems with conventional peer review processes and proposes that there are "invisible hands" such as personal reputation, institutional review, and self-correcting dynamics that can help assure quality in an environment of author self-archiving.
The document discusses alternative models for peer review in scholarly publishing. It presents several models, including neo-classical peer review, certification-based review, open peer review, commentary-based review, collaboratively-filtered review, institution-based review, citation-based review, index-based review, metadata-based review, computer-assisted review, no peer review, moderator-based review, and tier-based review. Each model is briefly described in one or two sentences with examples provided for some models. The document aims to explore new possibilities for peer review in the digital age beyond the traditional closed peer review process.
The document discusses a hypothetical panel discussion on copyright at Iowa State University Library. It describes monthly activities like exhibits, databases, and leisure reading. It then provides details of a screening of two videos on the Wright Brothers to commemorate the 100th anniversary of their first flights. The screening will be held on December 17, 2003 for only the Iowa State community in the library. The document discusses relevant sections of the U.S. Copyright Code on fair use and exemption of certain performances. It lists conditions for classroom use of videocassettes and poses the question of whether showing the videos is permissible under fair use.
This document provides summaries of several upcoming conferences, training programs, videoconferences, and workshops related to metadata and digital libraries. It also summarizes two ongoing projects: the development of a MARC 21 XML schema by the Library of Congress to facilitate the communication and conversion of MARC records, and the Metadata Encoding and Transmission Standard (METS) being developed by the Library of Congress as a standard for encoding metadata about digital library objects.
This document discusses the potential for electronic journals to go beyond traditional text-based formats by embedding multimedia content like audio, video, and 3D models. It presents several examples of e-journals that have begun incorporating multimedia and proposes categories for an "EJI" registry to track innovative e-journals. The document advocates for a vision of online information that provides more freedom by allowing any information to be connected to any other information.
The document summarizes Gerry McKiernan's presentation on alternative peer review models for 21st century scholarship. It discusses some of the problems with traditional peer review, such as subjectivity, bias, abuse, and delay. It then profiles several alternative peer review models, including neo-classical, certification-based, open peer review, commentary-based, collaboratively-filtered, institution-based, citation-based, index-based, and computer-assisted models. The presentation argues that these alternative models could help address some of the weaknesses of traditional peer review.
1. The document discusses how MOOCs (Massive Open Online Courses) could disrupt higher education through innovative business models and flexible learning approaches.
2. It analyzes MOOCs using the theory of disruptive innovation, finding that MOOCs initially target non-consumers of higher education but may eventually compete with traditional universities by improving performance over time.
3. While MOOCs have potential to disrupt higher education, the education sector is complex with many players and regulations, so the impacts are difficult to predict and universities may not be displaced in the same way as other disrupted industries.
The NASA Astrophysics Data System Abstract Service provides comprehensive access to bibliographic information and select full-text articles in astronomy, astrophysics, and related fields. It offers customizable search options, automatic inclusion of synonyms, and links to citing articles and related papers. The service had provided over 1 billion records since 1993. NEC Research Institute's ResearchIndex uses fully automated processes to extract citations from electronic papers and identify relationships between citations. ePrints.org advocates for open access to scholarly articles and preprints through either new open access journals or author self-archiving in publicly accessible websites.
Android is an open-source software stack that includes an operating system, middleware, and apps. It was initially developed by Android Inc. which was later acquired by Google. The document discusses the history and growth of Android, including its increasing market share and number of available apps. It provides details on specific Android phones, the Android Market app store, and apps for books, instruction, and research.
The document discusses the benefits of exercise for mental health. Regular physical activity can help reduce anxiety and depression and improve mood and cognitive functioning. Exercise boosts blood flow, releases endorphins, and promotes changes in the brain which help enhance one's emotional well-being and mental clarity.
The document describes several e-print servers across various scientific disciplines, including chemistry, clinical medicine, cognitive sciences, and mathematics. It provides details on the purpose, contents, search features, and organizational sponsors of each server. The Chemistry Preprint Server is highlighted as the first profiled, providing a freely available archive for chemistry research with 270 e-prints as of July 2001. Users must register with ChemWeb.com to access the server and submissions are converted to PDF format.
The document discusses social networking services and Facebook. It provides an overview of Facebook, describing its features such as profiles, friends, groups, pages, and privacy settings. It also discusses the growth of Facebook at Iowa State University and outlines plans to create an organizational social network called ESC-Net for the Engineering Staff Council at ISU. The presentation aims to educate about using social media to facilitate collaboration.
This document provides a summary of a presentation on quality assurance in the age of author self-archiving. It discusses self-archiving and how it allows authors to deposit pre-prints and post-prints of their work openly online. It also discusses peer review and some of its problems and limitations. Additionally, it covers "invisible hands" that can help assure quality in self-archived works, such as personal reputation, institutional review, and feedback from other scholars.
This document summarizes an article about new interfaces for electronic journals. It describes several novel technologies that take advantage of the digital format to further facilitate use of electronic collections, including higher-level access options beyond typical search features. These include automatically generating terms and semantic relationships to represent document topics, and visualizing these concepts and relationships to reduce the cognitive load of conventional searching. The visualization allows interactive exploration and filtering of document collections through concept mapping.
The document summarizes the arXiv.org e-print archive, which is hosted by the Los Alamos National Laboratory. It originated in 1991 as a way for physicists to share preprints electronically. It has since expanded to include the fields of mathematics, non-linear sciences, and computer science. The archive allows authors to submit preprints and replace versions. Users can search and retrieve full-text preprints and subscribe to receive new submissions. It has become a primary means of communicating ongoing research within certain scientific communities more quickly than traditional journals.
This document summarizes ebrary, a digital library that provides interactive access to over 40,000 academic books and documents. It utilizes PDF technology to allow users to interact with text at the word level. Ebrary employs streaming delivery of documents page by page without needing to download full works. It offers collections in various academic subjects as well as collections from over 180 publishers. The majority of the collection was published in 2000 or later.
Abilene Christian University (ACU) has implemented a comprehensive mobile learning initiative that provides mobile devices like iPhones and iPads to students and explores their educational benefits through research projects. ACU aims to understand how mobile technologies can enhance learning and has committed resources to empirically study their impact on student outcomes. The university supports faculty mobile learning projects and hosts an annual conference on mobile education topics. ACU's initiative has expanded with funding from AT&T to develop new research and teaching facilities.
BlackBerry smartphones allow users to access communications, information, and applications through their mobile operating system and app store, BlackBerry App World. Research in Motion developed the BlackBerry and its proprietary BlackBerry OS. While BlackBerry once dominated the US smartphone market, its global market share has declined in recent years as Android and iOS have grown. BlackBerry App World provides access to over 10,000 apps, though its catalog remains smaller than those of Android and iOS.
Reuse of Structured Data: Semantics, Linkage, and Realizationandrea huang
In order to increase the reuse value of existing datasets, it is now becoming a general practice to add semantic links among the records in a dataset, and to link these records to external resources. The enriched datasets are published on the web for both human and machine to consume and re‐purpose.
In this paper, we make use of publicly available structured records from a digital archive catalogue, and we demonstrate a principled approach to converting the records into semantically rich and interlinked resources for all to reuse. While exploring the various issues involved in the process of reusing and
re‐purposing existing datasets, we review the recent progress in the field of Linked Open Data (LOD), and examine twelve well‐known knowledge bases built with a Linked Data approach.
We also discuss the general issues of data quality, metadata vocabularies, and data provenance. The concrete outcome
of this research work is the following:
(1) a website data.odw.tw that hosts more than 840,000
semantically enriched catalogue records across multiple subject areas,
(2) a lightweight ontology voc4odw for describing data reuse and provenance, among others, and
(3) a set of open source software tools available to all to perform the kind of data conversion and enrichment we did in this research. We have used and extended CKAN (The Comprehensive Knowledge Archive Network) as a platform to host and publish Linked Data. Our extensions to CKAN is open sourced as well.
As the records we drawn from the originally catalogue are released under the Creative Commons licenses, the semantically enriched resources we now re‐publish on the Web are free for all to reuse as well.
The document discusses technical services roles in supporting the FocusOn Search and CategoryMap systems. It outlines how technical services can expand content selection to unstructured data on the web, leverage named entity resolution, build data filters and reformatting/metadata conversion utilities, and evaluate change management strategies. Technical services also has roles in parallel development, ensuring global access to data, user behavior analysis, leadership/collaboration, appropriate cataloging, metadata reuse, and integrating search systems.
COLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLESijcsit
This paper proposes a Bibliographic system intends to exchange bibliographic information of survey/review articles by relying on Web service technology. It allows researchers and university students
to interact with system via single service using platform-independent standard named Web service to add,
search and retrieve bibliographic information of review articles in various science and technology fields
and build-up a dedicated database for these articles in each science and technology field. Additionally,
different implementation scenarios of the proposed system are presented and described, andrich features
that offered by such system are studied and described. However, this paper explains the proposed system
using computing area due to the existence of detailed taxonomy of this area, which allows defining the
system, their functionalities and features provided.However, the proposed system is not only confined to
computing area, it can support any other science and technology area without any need to modify this
system.
This document discusses peer review and quality assurance in the age of author self-archiving. It presents arguments from Stevan Harnad in support of author self-archiving and open access. Harnad believes self-archiving maximizes research access, use and impact. However, he also argues that peer review is still essential to ensure quality control and certification of research. The document then discusses various problems with conventional peer review processes and proposes that there are "invisible hands" such as personal reputation, institutional review, and self-correcting dynamics that can help assure quality in an environment of author self-archiving.
The document discusses alternative models for peer review in scholarly publishing. It presents several models, including neo-classical peer review, certification-based review, open peer review, commentary-based review, collaboratively-filtered review, institution-based review, citation-based review, index-based review, metadata-based review, computer-assisted review, no peer review, moderator-based review, and tier-based review. Each model is briefly described in one or two sentences with examples provided for some models. The document aims to explore new possibilities for peer review in the digital age beyond the traditional closed peer review process.
The document discusses a hypothetical panel discussion on copyright at Iowa State University Library. It describes monthly activities like exhibits, databases, and leisure reading. It then provides details of a screening of two videos on the Wright Brothers to commemorate the 100th anniversary of their first flights. The screening will be held on December 17, 2003 for only the Iowa State community in the library. The document discusses relevant sections of the U.S. Copyright Code on fair use and exemption of certain performances. It lists conditions for classroom use of videocassettes and poses the question of whether showing the videos is permissible under fair use.
This document provides summaries of several upcoming conferences, training programs, videoconferences, and workshops related to metadata and digital libraries. It also summarizes two ongoing projects: the development of a MARC 21 XML schema by the Library of Congress to facilitate the communication and conversion of MARC records, and the Metadata Encoding and Transmission Standard (METS) being developed by the Library of Congress as a standard for encoding metadata about digital library objects.
This document discusses the potential for electronic journals to go beyond traditional text-based formats by embedding multimedia content like audio, video, and 3D models. It presents several examples of e-journals that have begun incorporating multimedia and proposes categories for an "EJI" registry to track innovative e-journals. The document advocates for a vision of online information that provides more freedom by allowing any information to be connected to any other information.
The document summarizes Gerry McKiernan's presentation on alternative peer review models for 21st century scholarship. It discusses some of the problems with traditional peer review, such as subjectivity, bias, abuse, and delay. It then profiles several alternative peer review models, including neo-classical, certification-based, open peer review, commentary-based, collaboratively-filtered, institution-based, citation-based, index-based, and computer-assisted models. The presentation argues that these alternative models could help address some of the weaknesses of traditional peer review.
1. The document discusses how MOOCs (Massive Open Online Courses) could disrupt higher education through innovative business models and flexible learning approaches.
2. It analyzes MOOCs using the theory of disruptive innovation, finding that MOOCs initially target non-consumers of higher education but may eventually compete with traditional universities by improving performance over time.
3. While MOOCs have potential to disrupt higher education, the education sector is complex with many players and regulations, so the impacts are difficult to predict and universities may not be displaced in the same way as other disrupted industries.
The NASA Astrophysics Data System Abstract Service provides comprehensive access to bibliographic information and select full-text articles in astronomy, astrophysics, and related fields. It offers customizable search options, automatic inclusion of synonyms, and links to citing articles and related papers. The service had provided over 1 billion records since 1993. NEC Research Institute's ResearchIndex uses fully automated processes to extract citations from electronic papers and identify relationships between citations. ePrints.org advocates for open access to scholarly articles and preprints through either new open access journals or author self-archiving in publicly accessible websites.
Android is an open-source software stack that includes an operating system, middleware, and apps. It was initially developed by Android Inc. which was later acquired by Google. The document discusses the history and growth of Android, including its increasing market share and number of available apps. It provides details on specific Android phones, the Android Market app store, and apps for books, instruction, and research.
The document discusses the benefits of exercise for mental health. Regular physical activity can help reduce anxiety and depression and improve mood and cognitive functioning. Exercise boosts blood flow, releases endorphins, and promotes changes in the brain which help enhance one's emotional well-being and mental clarity.
The document describes several e-print servers across various scientific disciplines, including chemistry, clinical medicine, cognitive sciences, and mathematics. It provides details on the purpose, contents, search features, and organizational sponsors of each server. The Chemistry Preprint Server is highlighted as the first profiled, providing a freely available archive for chemistry research with 270 e-prints as of July 2001. Users must register with ChemWeb.com to access the server and submissions are converted to PDF format.
The document discusses social networking services and Facebook. It provides an overview of Facebook, describing its features such as profiles, friends, groups, pages, and privacy settings. It also discusses the growth of Facebook at Iowa State University and outlines plans to create an organizational social network called ESC-Net for the Engineering Staff Council at ISU. The presentation aims to educate about using social media to facilitate collaboration.
This document provides a summary of a presentation on quality assurance in the age of author self-archiving. It discusses self-archiving and how it allows authors to deposit pre-prints and post-prints of their work openly online. It also discusses peer review and some of its problems and limitations. Additionally, it covers "invisible hands" that can help assure quality in self-archived works, such as personal reputation, institutional review, and feedback from other scholars.
This document summarizes an article about new interfaces for electronic journals. It describes several novel technologies that take advantage of the digital format to further facilitate use of electronic collections, including higher-level access options beyond typical search features. These include automatically generating terms and semantic relationships to represent document topics, and visualizing these concepts and relationships to reduce the cognitive load of conventional searching. The visualization allows interactive exploration and filtering of document collections through concept mapping.
The document summarizes the arXiv.org e-print archive, which is hosted by the Los Alamos National Laboratory. It originated in 1991 as a way for physicists to share preprints electronically. It has since expanded to include the fields of mathematics, non-linear sciences, and computer science. The archive allows authors to submit preprints and replace versions. Users can search and retrieve full-text preprints and subscribe to receive new submissions. It has become a primary means of communicating ongoing research within certain scientific communities more quickly than traditional journals.
This document summarizes ebrary, a digital library that provides interactive access to over 40,000 academic books and documents. It utilizes PDF technology to allow users to interact with text at the word level. Ebrary employs streaming delivery of documents page by page without needing to download full works. It offers collections in various academic subjects as well as collections from over 180 publishers. The majority of the collection was published in 2000 or later.
Abilene Christian University (ACU) has implemented a comprehensive mobile learning initiative that provides mobile devices like iPhones and iPads to students and explores their educational benefits through research projects. ACU aims to understand how mobile technologies can enhance learning and has committed resources to empirically study their impact on student outcomes. The university supports faculty mobile learning projects and hosts an annual conference on mobile education topics. ACU's initiative has expanded with funding from AT&T to develop new research and teaching facilities.
BlackBerry smartphones allow users to access communications, information, and applications through their mobile operating system and app store, BlackBerry App World. Research in Motion developed the BlackBerry and its proprietary BlackBerry OS. While BlackBerry once dominated the US smartphone market, its global market share has declined in recent years as Android and iOS have grown. BlackBerry App World provides access to over 10,000 apps, though its catalog remains smaller than those of Android and iOS.
Reuse of Structured Data: Semantics, Linkage, and Realizationandrea huang
In order to increase the reuse value of existing datasets, it is now becoming a general practice to add semantic links among the records in a dataset, and to link these records to external resources. The enriched datasets are published on the web for both human and machine to consume and re‐purpose.
In this paper, we make use of publicly available structured records from a digital archive catalogue, and we demonstrate a principled approach to converting the records into semantically rich and interlinked resources for all to reuse. While exploring the various issues involved in the process of reusing and
re‐purposing existing datasets, we review the recent progress in the field of Linked Open Data (LOD), and examine twelve well‐known knowledge bases built with a Linked Data approach.
We also discuss the general issues of data quality, metadata vocabularies, and data provenance. The concrete outcome
of this research work is the following:
(1) a website data.odw.tw that hosts more than 840,000
semantically enriched catalogue records across multiple subject areas,
(2) a lightweight ontology voc4odw for describing data reuse and provenance, among others, and
(3) a set of open source software tools available to all to perform the kind of data conversion and enrichment we did in this research. We have used and extended CKAN (The Comprehensive Knowledge Archive Network) as a platform to host and publish Linked Data. Our extensions to CKAN is open sourced as well.
As the records we drawn from the originally catalogue are released under the Creative Commons licenses, the semantically enriched resources we now re‐publish on the Web are free for all to reuse as well.
The document discusses technical services roles in supporting the FocusOn Search and CategoryMap systems. It outlines how technical services can expand content selection to unstructured data on the web, leverage named entity resolution, build data filters and reformatting/metadata conversion utilities, and evaluate change management strategies. Technical services also has roles in parallel development, ensuring global access to data, user behavior analysis, leadership/collaboration, appropriate cataloging, metadata reuse, and integrating search systems.
COLLABORATIVE BIBLIOGRAPHIC SYSTEM FOR REVIEW/SURVEY ARTICLESijcsit
This paper proposes a Bibliographic system intends to exchange bibliographic information of survey/review articles by relying on Web service technology. It allows researchers and university students
to interact with system via single service using platform-independent standard named Web service to add,
search and retrieve bibliographic information of review articles in various science and technology fields
and build-up a dedicated database for these articles in each science and technology field. Additionally,
different implementation scenarios of the proposed system are presented and described, andrich features
that offered by such system are studied and described. However, this paper explains the proposed system
using computing area due to the existence of detailed taxonomy of this area, which allows defining the
system, their functionalities and features provided.However, the proposed system is not only confined to
computing area, it can support any other science and technology area without any need to modify this
system.
This document provides information about enumerative and faceted classification schemes. It discusses enumerative classification schemes, noting that they involve assigning unique notations or symbols to predefined categories to create a hierarchical structure for organizing resources. Examples provided include the Library of Congress Classification and Dewey Decimal Classification. The benefits of enumerative schemes are also summarized, including efficient retrieval, consistency, subject browsing, and facilitating information organization. Faceted classification is then introduced as a system that organizes information based on multiple independent attributes or facets, allowing for more precise categorization. The components and benefits of faceted classification are outlined.
The document discusses the relevance of classification and indexing for organizing internet resources. It argues that while the internet has grown without formal organization tools like classification schemes and vocabularies, bringing such library techniques to the web could help address the problem of information being difficult to find, access, and retrieve from the vast, unorganized data available online. Applying concepts from knowledge organization, like subject descriptors and relationships between resources, is an approach taken by the semantic web to help machines better understand and process information on the web.
The document proposes creating a digital library at Anonymous University using the Dublin Core metadata standard and Greenstone digital library software. It recommends training library staff on Dublin Core, the controlled vocabularies LCNAF and DCT, and assigning roles for the project such as project manager, digital manager, curator, and digitization staff. It also outlines plans for metadata elements, training procedures, collection assessment, and ensuring quality control of the digital library materials and records.
Navigation through citation network based on content similarity using cosine ...Salam Shah
The rate of scientific literature has been increased in the past few decades; new topics and information is added in the form of articles, papers, text documents, web logs, and patents. The growth of information at rapid rate caused a tremendous amount of additions in the current and past knowledge, during this process, new topics emerged, some topics split into many other sub-topics, on the other hand, many topics merge to formed single topic. The selection and search of a topic manually in such a huge amount of information have been found as an expensive and workforce-intensive task. For the emerging need of an automatic process to locate, organize, connect, and make associations among these sources the researchers have proposed different techniques that automatically extract components of the information presented in various formats and organize or structure them. The targeted data which is going to be processed for component extraction might be in the form of text, video or audio. The addition of different algorithms has structured information and grouped similar information into clusters and on the basis of their importance, weighted them. The organized, structured and weighted data is then compared with other structures to find similarity with the use of various algorithms. The semantic patterns can be found by employing visualization techniques that show similarity or relation between topics over time or related to a specific event. In this paper, we have proposed a model based on Cosine Similarity Algorithm for citation network which will answer the questions like, how to connect documents with the help of citation and content similarity and how to visualize and navigate through the document.
The document summarizes 5 innovative electronic journals, indexes, or services that go beyond conventional print publications by providing additional features and functionalities in their online offerings. It profiles the Astronomy and Astrophysics index, the Internet Journal of Chemistry, ResearchIndex, TheScientificWorld, and NEC Research Institute ResearchIndex. Each profile describes the purpose, features, and functionalities of the resource, including the ability to search literature, embed interactive content, and customize displays. The resources aim to enhance access and interaction with scientific literature through their online environments.
This document summarizes five innovative electronic journals, indexes, or services that go beyond conventional online publications by providing novel features and functionalities. It profiles the Astronomy and Astrophysics index from the Strasbourg Astronomical Observatory, which uses a self-organizing map to organize journal articles into a clickable graphical interface. It also summarizes the Internet Journal of Chemistry, an electronic-only journal that encourages authors to incorporate interactive elements like animations and molecular structures to enhance reader comprehension. The document discusses how these resources aim to fully utilize the digital environment and empower readers through customization options.
This document summarizes five innovative electronic journals, indexes, or services that go beyond conventional online publications by providing novel features and functionalities. It profiles the Astronomy and Astrophysics index from the Strasbourg Astronomical Observatory, which uses a self-organizing map to organize journal articles into a clickable graphical interface. It also summarizes the Internet Journal of Chemistry, an electronic-only journal that encourages authors to incorporate interactive elements like animations and molecular structures to enhance reader comprehension. The document discusses how these resources aim to fully utilize the digital environment and empower readers through customization options.
Cloud web scale discovery services landscape an overviewNikesh Narayanan
Abstract
The impact of Internet and Google like search engines radically influenced the information behavior of Net Generation users. They expect same environment in library services such that all their required information make available in a single set of results through unified search across all the available resources. Libraries have been striving to respond to this challenge for years. Until recently, federated search technology of the past decade was the better attempt in this area to meet these user expectations. But federated search solution is marked by the drawbacks of its slowness as it searches each database on the fly. New Generation cloud based Library Web scale discovery technology is a promising entrant in this landscape. This Paper attempts to provide a comprehensive overview of Library Web Scale Discovery solutions by depicting various facets of Web Scale Discovery solutions such as its importance to Library field, their possible role as the starting point for research, content coverage, and finally analyses the competition at the discovery front by comparing the services of major players. The comparative analysis shows that all the major service providers are extending competitive features and services, but varies in some areas and the adoption choice depends on the concerned library’s preferences and the cost involved.
Text Mining in Digital Libraries using OKAPI BM25 ModelEditor IJCATR
The emergence of the internet has made vast amounts of information available and easily accessible online. As a result, most libraries have digitized their content in order to remain relevant to their users and to keep pace with the advancement of the internet. However, these digital libraries have been criticized for using inefficient information retrieval models that do not perform relevance ranking to the retrieved results. This paper proposed the use of OKAPI BM25 model in text mining so as means of improving relevance ranking of digital libraries. Okapi BM25 model was selected because it is a probability-based relevance ranking algorithm. A case study research was conducted and the model design was based on information retrieval processes. The performance of Boolean, vector space, and Okapi BM25 models was compared for data retrieval. Relevant ranked documents were retrieved and displayed at the OPAC framework search page. The results revealed that Okapi BM 25 outperformed Boolean model and Vector Space model. Therefore, this paper proposes the use of Okapi BM25 model to reward terms according to their relative frequencies in a document so as to improve the performance of text mining in digital libraries.
To appreciate the paradigm shift involved in the next generation search systems one needs to look back at the traditional approach to resource discovery and compare to the new trends. Here I focus on three aspects:
• Databases versus search engines
• Federated versus integrated search
• Integrated versus modular architecture.
The document discusses digital libraries, including their architecture and design. It defines a digital library as a collection of documents available electronically on the internet or CD-ROM. Digital libraries use technology to break down traditional rules for archives by describing archived materials individually and allowing for reproduction. The document also discusses different types of metadata, including structural and descriptive metadata, and different metadata schemes.
From Bibliometrics to Cybermetrics - a book chapter by Nicola de BellisXanat V. Meza
Disclaimer: All original texts and images belong to their rightful owners.
Chapter 8 of the book "Bibliometrics and Citation Analysis" by Nicola de Bellis.
Academic Essays Database. Craven Community CollegeAndrea Briggs
This document discusses the challenges of writing an essay on academic essay databases. It notes that such an essay must delve into both the technical aspects of databases as well as the broader implications for academic research. It also must examine how databases impact research methods, information access, and scholarly communication. Additionally, the essay writer must consider the dynamic nature of the field and stay up-to-date on emerging technologies and standards. In conclusion, composing an essay on this topic demands an approach that integrates technical knowledge with a broader understanding of academic systems and their influence on scholarly work.
A Framework for Content Preparation to Support Open-Corpus Adaptive HypermediaKillian Levacher
This document proposes a framework for preparing open-corpus content to support adaptive hypermedia systems. The framework includes three components: 1) structural analysis to segment web pages and remove redundant information, 2) statistical analysis to extract concepts using techniques like hidden Markov models, and 3) intelligent slicing to fulfill specific information requests from adaptive systems by retrieving and tailoring open-corpus content. The goal is to leverage existing open web content for adaptive systems by automatically preparing and enriching content with metadata in a format agnostic to any specific system.
Researcher Reliance on Digital Libraries: A Descriptive AnalysisIJAEMSJORNAL
The digital library is an information technology that is structured as a digital knowledge resource, or can be alluded to a medium that stores information for a huge scope and is teamed up with the information the board gadget equipped for showing the information or information required by the client. Digital libraries can be extensively characterized as an information stockpiling and recovery frameworks that control digital information in the media (text, pictures, sound, static or dynamic) on the web. The main aim of this study is to study the awareness and using pattern of digital library by the researchers, to analyse the influence of digital library on researchers’ efficiency, analyse the purpose of using Digital Library Consortium, decide the effect of problems and motivational components of the digital library on the users, evaluate the satisfaction level of users with coverage of journals and perspectives on training and awareness programs and propose the available resources for effective utilization of the Digital Library.
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...University of Bari (Italy)
The current abundance of electronic documents requires automatic techniques that support the users in understanding their content and extracting useful information. To this aim, improving the retrieval performance must necessarily go beyond simple lexical interpretation of the user queries, and pass through an understanding of their semantic content and aims. It goes without saying that any digital library would take enormous advantage from the availability of effective Information Retrieval techniques to provide to their users. This paper proposes an approach to Information Retrieval based on a correspondence of the domain of discourse between the query and the documents in the repository. Such an association is based on standard general-purpose linguistic resources (WordNet and WordNet Domains) and on a novel similarity assessment technique. Although the work is at a preliminary stage, interesting initial results suggest to go on extending and improving the approach.
This document summarizes the CyberStacks prototype, which applies traditional library classification and organization methods to facilitate access to science and technology resources on the World Wide Web. The prototype uses the Library of Congress classification system to organize selected Internet resources and provides browsable access through broad subject categories. It aims to enhance discovery and use of relevant resources by applying established selection criteria and presenting descriptive annotations. The document outlines the philosophy, organization, selection process, and access features of the CyberStacks prototype, arguing that familiar library structures and conventions can help users navigate Internet resources more effectively.
The Astrophysics Data System (ADS) is an online database funded by NASA that provides bibliographic records and full text articles in astronomy, astrophysics, and related fields. It contains over 2.3 million abstracts as of 2001. The ADS includes searchable abstracts from major journals, conference proceedings, technical reports, books, and dissertations. Users can search by author, title, keywords, publication date, and other filters. Search results are ranked by relevance and users can view brief records or full details of publications from the search results. The ADS is a widely used resource with over 100,000 users from 112 countries accessing it monthly in 2001.
Alexander Street Press provides online collections in the humanities and social sciences through its website and mobile apps. It offers collections of music scores, recordings, videos and reference materials that can be accessed on smartphones and tablets through its mobile-friendly website and apps for iOS and Android devices. Alexander Street Press partners with major content producers and plans to release an iPhone app.
The document summarizes recent events and projects in the fields of digital preservation, metadata, cataloging standards and practices. It describes a forum held by RLG on converging standards for digital preservation, ALCTS regional institutes on metadata and cataloging rules, training courses offered by Rare Book School, an Open Archives workshop in Portugal, and the annual NASIG conference in Virginia. It also provides overviews of the Diffuse Project standards information source and the DLIST digital library for information science and technology.
Gerry McKiernan is a popular science and technology librarian blogger from Iowa State University. He runs several blogs focused on emerging technologies and their applications for libraries. McKiernan has been blogging since 2003 and his blogs regularly rank in the top 1% worldwide according to Technorati. He gains professional benefits from blogging through opportunities to present at conferences and staying up to date on new developments relevant to his work.
Roddy MacLeod is an award-winning information professional at the cutting edge of the industry. He has been blogging since 2005 and runs multiple blogs, including the Heriot-Watt Library blog Spineless and the JISC-funded TicTocs project blog. Blogging helps Roddy market the library, build interest in projects, and stay on top of trends in the field. It also expands his network and improves his writing skills.
This document summarizes several free international web-based patent sites:
- The USPTO and European Patent Office websites provide free access to patent information as well as other intellectual property resources. Other non-Western nations also offer free patent databases.
- Key free patent database services highlighted include DEPATISnet, the Industrial Property Digital Library (Japan), and the Intellectual Property Digital Library hosted by WIPO. Each provides search access to large collections of patent documents and bibliographic information.
Gerry McKiernan is a popular science and technology librarian blogger from Iowa State University. He runs several blogs on topics like online social networks, bioenergy, geothermal energy, and wind energy that receive thousands of subscribers. McKiernan has been blogging since 2003 and credits it with helping him stay on top of developments in his fields and getting invitations to present at conferences. He enjoys the feedback from colleagues who find his blogs professionally valuable.
The document discusses several free international web-based patent sites, including those run by the USPTO, European Patent Office, China Patent Database, DEPATISnet, and the Industrial Property Digital Library. It describes the types of patent information available on each site, such as bibliographic data, abstracts, and in some cases full-text patent documents. It also summarizes the different search interfaces provided, including basic, advanced, and Boolean searching options available to users.
The document discusses several free international web-based patent sites, including those run by the USPTO, European Patent Office, China Patent Database, DEPATISnet, and Industrial Property Digital Library. It describes the types of patent information available on each site, such as bibliographic data, abstracts, and in some cases full-text patent documents. It also summarizes the search interfaces and capabilities of each site.
This document summarizes several free international web-based patent sites:
- The USPTO and European Patent Office websites provide free access to patent information as well as other intellectual property resources. Other non-Western nations also offer free patent databases.
- Key free patent database services described include DEPATISnet, the Industrial Property Digital Library (Japan), and the Intellectual Property Digital Library hosted by WIPO. Each provides search access to large collections of patent documents and bibliographic information.
This document provides details about the Energy Citations Database (ECD) produced by the U.S. Department of Energy. It describes the content, search features, and record format of the ECD. Key points include:
- The ECD contains over 2 million bibliographic records related to energy and energy-related research funded by the DOE.
- Records include fields for title, authors, publication date, report numbers, subject terms, and abstracts. Links to fulltext are provided for some newer publications.
- Searches can be done through a basic search box searching specific fields, or through an advanced search interface. Searching supports boolean operators and wildcard characters.
- Results can be sorted
The document discusses free web-based patent databases available through esp@cenet. Esp@cenet provides access to over 45 million patent documents from the European Patent Office and other patent offices around the world. It allows users to perform basic and advanced searches. The presentation also reviews other international patent databases from countries and regions like China, Germany, Japan, and WIPO.
The document summarizes esp@cenet®, a free web-based patent search service provided by the European Patent Office. Esp@cenet® provides access to over 42.5 million patent documents from over 70 countries. It allows users to search for patent information via three gateways - the EPO gateway, national patent office gateways of EPO member states, or the European Commission gateway. Basic "Quick Searches" can be done on title/abstract or by patent number. Search results provide bibliographic data and English abstracts when available.
EnergyFiles is a virtual library of energy-related scientific and technical information created by the US Department of Energy. It provides access to various collections including journal articles, technical reports, electronic preprints, and bibliographic databases containing over 2 million records. The Energy Citations Database within EnergyFiles contains citations and abstracts of literature from fields related to the work of the Department of Energy. Users can search individual databases or search across multiple collections simultaneously through the EnergyFiles portal.
This document summarizes five e-print servers across different academic disciplines:
1) The Chemistry Preprint Server (CPS) is a freely available archive for rapid distribution of chemistry research. Authors can submit manuscripts and works-in-progress.
2) Clinical Medicine & Health Research NetPrints is a repository for non-peer reviewed medical and health research. It contains 45 e-prints as of July 2001.
3) CogPrints is an e-print server for research in cognitive sciences like psychology and neuroscience containing over 4,000 e-prints as of July 2001.
This document discusses the increasing use of multimedia in electronic journals (e-journals) as the capabilities of digital technologies have advanced. It notes that while early e-journals were limited to text and basic images, the development of technologies like PDF, HTML, and the World Wide Web enabled incorporation of multimedia like audio, video, animations and more. The document reviews common types of multimedia being used in e-journals, such as animated GIFs, MIDI files, MPEG videos, and VRML. It also provides examples of e-journals across various academic fields that employ multimedia.
This newsletter issue introduces several new contributors and articles about library technology. It includes articles on pricing trends for personal computers, open source software in libraries, educating users about copyright and fair use, and a case study of library automation at the University of Valencia, Spain. Conference reports are provided from SITE, ACRL, E-Books 2001, ETD2001, and TechEd. New columns explore e-books and the SLAC e-conference proceedings archive. The issue also launches a new global case studies column.
This article discusses how e-journals are evolving to incorporate more innovative features by transcending limitations of print. It examines how e-journals are integrating a variety of electronic content like multimedia and being customized. It explores enhancements like interactive features and new formats of scholarly communication. The article also analyzes how e-journals offer personalized alerting services, tables of contents, and ability to select topics of interest to reduce information overload for readers.
This article discusses how e-journals are evolving to incorporate more innovative features by transcending limitations of print. It examines how e-journals are integrating a variety of electronic content like multimedia and being customized. It explores enhancements like interactive features and new formats of scholarly communication. The article also analyzes how e-journals offer personalized alerting services, tables of contents, and ability to select topics of interest to reduce information overload for readers.
1. The New/Old World Wide Web Order:
The Application
of "Neo-Conventional" Functionality
to Facilitate Access and Use
of a.WWW Database
of Scienceand Technology
Internet Resources
Gerry McKiernan
ABSTRACT. While there are numerous information sources that
are available on the World Wide Web (W),the identification of
significant Net resources is often not as efficient nor as effective as
many desire. In 1995, a demonstration prototype sewicdyber-
Stacks(sm) <h~:/h.pblic.iaFtaie.edu/-CYBE~CXS/,,was for-
mally established at lowa State University with the intent of enhancing
access and use of selected Internet resources in science, technology
and related areas through the application of the Library of Congress
classification system as an o anizational framework. As the proto-
type was refined, other metho% for identifying and selecting relevant
resources were subsequently incorporated withii its scheme.
This paper reviews the creation of the CyberStacks(sm) prototype,
describes the develo~mentand ~otentialusefulness of its matrix of
access options, and driscussesthe' applicability of traditional and con-
ventional library selection and organizational philosophies, practices,
-
Geny McKieman, AB, MS, is Coordinator, Scienceand Technology Section,
Reference and Instructional Services Department, Iowa State University, 152
Parks Library,Ames, IA 50011 (genymck@iastate.edu).
The author gratefully acknowledges the financial support provided by the
Travel and Research Funds Committee of the lowa State University Library for
enhancementof the CyberStacks(sm)prototype.
Journal of Internet Cataloging,Vol. l(1) 1997
O 1997by The Haworth Press, Inc. All rights reserved.
2. 48 JOURNAL OF INTERNET CATALOGING
methods and techniques for facilitating access to Web resources.
[Article copies available for a fie fmm me Haworth Document Delivery
Service: 1-800-342-9678.E-mail address:getinfo@hawrth.~
KEYWORDS. CyberStacks, Internet organization and access, sci-
ence and technology Internet resources
"EVERYTHING OLD IS NEW AGAIN"
Background
The Library of Congress classification system is a well-established
scheme that has been used for generations by libraries worldwide for
organizing a variety of publications and media. Within its schedules, this
classification system not only denotes subject coverage and content, but
information format and conceptual relationships as well. It is believed that
a classification system with the features found within the Library of Con-
gress scheme offers appropriate context and structure that can facili-
tatelidentification of relevant WWW and other Internet resources.'
In Fall 1995, CyberStacks(sm), ~hfrp://www.public.iastare.edu/
-CYBEkW4CKW a demonstration prototype database of selected Internet
resources, was established on the home page server at Iowa State Univer-
sity using the Library of Congress Classification schedules as an organiza-
tional framework.
Philosophy
Underlying the overall CyberStacks(sm) approach is the philosophy
that effective management of WWW and other Internet resources is best
achieved by establishing a defined collection of selected resources. While
thc CyberStacks(sm)model can be applied to any and all subjects, for any
and all media, we have sought to develop and refine its functionality by
defining it as a collection of significant Internet resources in science,
technology and related areas with Reference value. In associated docu-
ments linked to the CyberStacks(sm) home page, we have concisely artic-
ulated our views on each of these component features.
In general, we have applied the same philosophy and general criteria
commonly used in the selection of non-Internet Reference resources2 to
the selection of World Wide Web (WWW) and other appropriate Net
resources for the CyberStacks(sm) collection. These include authority of
3. source, accuracy of the information, clarity of presentation, uniqueness
within the context of the total collection, recency or timeliness, favorable
reviews, and community needs.
Organization
At the core of the CyberStacks(sm) scheme is a hypertext-enhanced
outline of the Library of Congress Classification ~chedules.~It is our
belief that the inherent structure, organization, context and level of speci-
ficity and description provided by this established classification system
can enhance the overall efficiency and effectiveness of using Web and
other Internet resources.
Unlike many current efforts that merely list resources alphabetically
within broad classes, CyberStacks(sm) has applied the Library of Con-
gress scheme to a far deeper level of specificity and provides a more
detailed description of incorporated resources than is commonly provided.
Through a series of broad, less broad and more specific categories, users
are presented with a series of options, which not only suggest search
topics, but, we believe, through the process, reduce the cognitive-load
often associated with command-based, direct searching.
The benefit of using a browsable scheme of categorized resources
within CyberStacks(sm) is well-supported by the observations of Chang
and Rice in their excellent review article on browsing as a significant
information seeking strategy. Particularly supportive of the Cyber-
Stacks(sm) approach is their clear articulation of the uncertainty that some
users experience in the conventional search process:
. . . [Ulsers are often in an anomalous state of knowledge as they
initiate a search . . . do not have predefined search criteria .. .and
may alter their interests during a ~earch.~
In citing the work of Hemer, Bates and others, they recognize the
importance of browsing in human information seeking and highlight the
need for information scientists to incorporate this approach into their
system designs. Their summary of critiques of 'specific, direct searching'
supports the need for browsable interfaces:
[There are] some unrealistic assumptions ab.out users and the nature
of information seeking-e.g., that users have unbounded rationality,
have static and well-defined information needs, know what they
want and are output driven ....5
4. 50 JOURNAL OFINTERNET CATALOGING
Within CyberStacks(sm), users can not only browse from general to
other general categories (e.g., Science (Q) to Medicine (R) to Agriculture
(S)), they can also browse from general to specific categories (e.g., Sci-
ence (Q) to Mathematics (QA)) or from within a specific category to a
related subcategory (e.g., Natural History (General) (QH 1-74)to Nature
Conservation (QH 75-77). Within subcategories, users also have the abil-
ity to select and browse related subject subclasses (e.g., QA, QB, QC, QD,
QE, QH, QK. QL, QM, QP, QR).
Through any of these browsable schemes, users can scan broader or
narrower subject categories at will, and then if desired, connect directly to
a selected resource. Thus CyberStacks(sm) seeks not to be a digital library
per se, but a 'virtual' library where like materials metaphorically 'stand
together' in the stacks.
The capability to browse from broad to less broad categories and from
within these to related categories is complemented by an alphabetical
Cross-Classification Index.
This index offers the user direct access to a separate, combined collec-
tion of more specific subject categories that are to be found elsewhere
under relevant broad and less broad classifications within the general
CyberStacks(sm) scheme along with the call number range associated
with each subcategory. With a duplicate, reconfigured set of subcategories,
users are presented with additional, more specific search topics for consid-
eration. Upon selection, they are linked to the broader class that includes
the specific subcategory selected. Here, they have the option of reviewing
conceptually-related subcategories or can link to resources assigned to
their originally selected subcategory.
Selection
The foundation of the CyberStacks(sm) approach is a collection devel-
opment practice that seeks to identify, isolate, and select Net resources that
are discrete document-like objects. The philosophy of selection of
resources for inclusion within the CyberStacks(sm) scheme is well-articu-
lated by Dernas, McDonald and Lawrence in a recent article:
. . . [Tlitle by title selection of high quality resources is one of the
most important values librarians can add in providing access to
information resources, including those accessible via the Internet. A
5. Gerry McKiernm 51
careful selection of resources is the touchstone o'f the electronic
library6
To facilitate the selection of such resourcesin this phase of the project,
we have intentionally sought to locate Internet resources that are the elec-
tronic equivalent or analog of standard Reference works, as well as
resources that may be so considered. Abstracting and indexing services,
bibliographies, biographical sources, dictionaries, directories, handbooks,
guides to the literature,maps, and standards,are but a few of these publica-
tion types. These may take the form of monographic or serial works, files,
databases, or search services, or other conventional information formats.
While CyberStacks(sm)does include entire Web sites within its scheme,
only those that are considered well-organized collections of significant
Internet resources are selected for incorporation. In general such sites are
viewed as either directories or encyclopedias, depending on their struc-
ture, organization and content, and are categorized and classified as such.
Over the years, to assist users in managing the ever-increasing volume
of information, librarians and others have developed or applied a variety
of selection and organizational tools and techniques. In a recent review
article, Hopkins provides a concise summary of the issue of information
overload and succinctly profiles a number of the methods used in counter-
ing the problem.' Among the conventional tools that librarians have
created to assist users in managing information overload have been guides,
handbooks, review articles, literature reviews, abridgments and rankings,
as well as indexes, digests, abstracts, and other similar services.
The intentional selection of sources of a Reference nature that serve to
index, abstract, guide or review significant and relevant information
sources for inclusion in the initial phase of the CyberStacks(sm) collection
is particularly appropriate in view of the historical role that such works
have played in assisting users to manage information.
When and where possible, resources are assigned the identical basic
classification number of their print or other electronic equivalent or ana-
log. If a resource does not have a counterpart, its general and specific
subject coverage is determined by a detailed examination of the resource.
If the reference nature of a specific resource is not indicated in its title, it is
assigned the most appropriate form from a listing of standard reference
publication types.
Subsequently, the local OPAC is searched to identify similar works of
the same form. Candidate classification numbers are noted, and later final-
6. 52 JOURNAL OF INTERNET CATALOGING
ized after reviewing the appropriate printed Library of Congress classifi-
cation schedules. If the local OPAC does not provide a candidate classifi-
cation number, the OCLC Online Union Catalog, or LOCIS, the Library of
Congress online catalog, is also searched.
After a majority of the resources identified to date are fully incorpo-
rated within CyberStacks(sm), we plan to refine the classification of
resources through a participatory resource database development project.
As out~ined,~this initiative will ask users to suggest the most appropriate
category or categories where an incorporated resource should be classified
or may also be classified. The intent of this project is to classify resources
in categories where users themselves would expect to find them, in addi-
tion to placing them in categories where librarians believe resources
should be classified according to professional practice, preference or tradi-
tion. It is believed that such user-assisted categorization will hrther facili-
tate identification and use of selected resources.
While its primary approach to providing access to World Wide Web and
other Internet resources is from within a hypertext-enhanced classification
system, CyberStacks(sm) has complemented this approach with a brows-
able index of selected or candidate titles.
Within this Etle Index, users may either browse an entire alphabetical
listing of titles, or select from an alphabetical group. From this index, users
may either connect directly to a listed resource, or if they desire, link to an
available profile of the resource within the CyberStacks(sm) collection to
review the resource's structure, content and coverage first. Titles fully
incorporated within CyberStacks(sm) are indicated with an 'information'
icon to the right of an entry. Selection of this latter option provides the user
with appropriate and sufficient information that we believe can facilitate
more efficient searching of the Internet. By providing users with a sum-
mary of the resource, as well as succinct instruction on access and use,
users know in advance the nature of the resource and how it may effec-
tively be navigated.
Selection of a resource profile will concurrently place the user within a
specific location within the CyberStacks(sm) outline in which resources of
a similar nature may be found, or access provided to a related resource.
The selection of a known itcm from the ?FtleIndexnot only offers the user
direct access to a specific potentially relevant resource, but through the
structure of the CyberStacks(sm) scheme, presents to users, either directly
or indirectly, related resources for consideration.
7. Gerry McKiernan
'Hyper-Thesaurus'
A proposed 'hyper-thesaurus' of assigned Library of Congress Subject
Headings (LCSH) would provide access to more specific subject topics.
As envisioned, such a thesaurus, created as an additional access function,
would outline all the entries assigned to resources within the Cyber-
Stacks(sm) database, indicate the broader, narrower or related relation-
ships between assigned subject headings, and provide a hypertext link
directly to those records assigned a specific subject heading.
In cases where a given subject heading has been assigned to more than
one resource, users would be provided with a menu of options, including
an alphabetical listing of all resource titles assigned a specific heading.
From this listine. a selection would link users directlv to the associated
profile within ~Gber~tacks(sm).Research on the designof next generation
online public access catalogs as it relates to such value-added functional-
ity, notably that of ~icco?;s currently being reviewed.
Presenting
In current Internet catalogs, it is not uncommon for record data to be
displayed in a labelled format or a MARC format. While the latter format
primarily serves the needs and interests of catalogers and other librarians,
and the former is intended to serve the needs of the user as well as the
librarian, end-users may not necessarily require all of the data provided in
either of these standard records formats to determine the relevance or
significance of a given Net resource.1°
While all fields and their associated data are necessary to uniquely
identify a resource or required for record manipulation within standard
information storage and retrieval systems, all fields and all data need not
be fully displayed as the primaly presentation format for a collection of
Internet resources. As clearly articulated by Hildreth:
System designers, especially designers of user interfaces must take
into account the primary tasks to be performed with the system and
the characteristics brought to the tasks by users of the system. An
understanding of these tasks and characteristics will inform the
design of appropriate information scarch, presentation, review,
selection, and related decision making facilities.''
We strongly endorse this viewpoint and believe that in the Age of
HyperText, catalogers and other librarians need to consider more fully the
8. 54 JOURNAL OF INTERNET CATALOGING
'presentation form' of a resource as well as the formats needed for high-
quality, consistent bibliographical control. Within CyberStacks(sm), we
do not delineate all relevant elements in describing a resource, but seek to
characterizeit suficiently so that users can judge its potential usefulness
and to simultaneously integrate the resource within an appropriate frame-
work that offers a context in which the relationship to other resources in its
collection is indicated.'=
Within the hyper-environmentof the Web, efforts to catalog Net resources
must consider the Foreground as well as the Background. In Cyber-
Stacks(sm), we seek to synthesize available information about a selected
resource and to present the equivalent of a descriptive annotation that
provides users with sufficient information about the subject coverage, size,
record structure, special features and origin to enable users to judge its
potential usefulness.
Icons
Likewise, the use of appropriate icons within CyberStacks(sm) in associa-
tion with broad and general subjects, at primary and secondary levels, is
intended to offer users visual clues that can facilitate access to categories
of resources of potential relevance within its collection.
Nomination
Through a variety of links, users themselves have thc option of nomi-
nating resources for inclusion within CyberStacks(sm), or to request that a
specific type of publication be made available as a Web publication within
it. Thus, as with other proactive collection development policies, users
themselves are involved with the enhancement of a collection they use and
consult.
Conclusion
In its effort to manage and enhance access to Internet resources, CyberS-
tacks(sm) has employed traditional libraryphilosophies, practices, methods,
techniques and structures for selection, organization and access. These
approaches have been implemented in both conventionaland conventional-
like ways by utilizing the inherent versatility of HTML-the HyperText
Markup Language, and the Web browser environment,more fblly.
We believe that familiar structures, access features, as well as enhance-
ments to familiar functions found integrated within CyberStacks(sm) can
greatly facilitateeffective and efficient use of lntemet resources.
9. Gerry McKiernan
Disclaimers
CyberStacks(sm) is an ongoing personal research project and its avail-
ability does not constitute an endorsement by Iowa State University, the
Iowa State University Computation Center, nor the Iowa State University
Library or Library of Congress.
NOTES
I. McKieman,Gerry, "CyberStacks(sm): A 'Library-Organized' Virtual Sci-
ence and Technology Reference Collection, D-Lib Magazine, December (1995).
Internet WWW Page, at URL: <http://www.dlib.org/dlib/decernber95/briefingsl
IZcyber.html> (version current at 10 February 1996).
2. American Library Association. Reference Collection Development and
Evaluation Committee, Reference Collection Development: A Manual (Chicago:
American Library Association, Referenceand Adult Services, 1992).
3. LC Classification Outline (Washington, D.C.: Library of Congress, 1990).
4. Chang, San-Ju and Ronald E. Rice, "Browsing: A Multidimensional Frame-
work," Annual Reviov of Information Science and Technology28 (1993): 233.
5. Ibid., 233.
6. Demas, Samuel G., Peter McDonald, and Gregory Lawrence, "The lnternet
and Collection Development: Mainstreaming Selection of lntemet Resources,"
Library Resourcesand Technical Services 39, no. 3 (July 1995):280.
7. Hopkins, Richard L., Countering Information Overload: The Role of the
Librarian," Reference Librarian 49/50 (1995): 305-333.
8. McKieman, Gerry. "Participatory WWW Database Development and
Resource Classification," bit.listserv.asis-l (13 December 1995).
9. Micco. Mary, "The Next Generation of Online Public Access Catalogs: A
New Look at Subject Access Using Hypermedia, Cataloging & Clussijication
Quarterly 13. nos. 3 4 (1991): 103-129.
10. McKieman, Gerry, "two-dimensional limitationd3-D Possibilitiedyber-
Stacks(sm): An Alternative Model for Selecting(Organizing(PresentinglAccessing
WWW Resources: A Position Paper Prepared for the OCLC Internet Cataloging
Project Colloquium." Intemet WWW Page, at URL: <http://www.public.iastate.
edul-CYBERSTACKSIOCLC-P.htm>(versioncurrent at 2 February 1996).
I I. Hildreth, Charles R., "The GUI OPAC: Approach with Caution," m e
Public-Access Computer Systems Review 6, no. 5 (1995):Z.O.
12. McKieman,Gerry, "two-dimensional limitations/3-DPossibilities," lnternet
WWW Page, at URL: <http://www.public.iastate.eddCYBERSTACKS/WLC-
P.htm>(version current at 2 February 1996).