OR2012, The 7th international conference on Open Repositories
09 - 13/Jul/2012, the University of Edinburgh, UK
RF3: Pecha Kucha – National Infrastructures, 11/Jul/2012: 11:00am – 12:30pm
Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories
1. Researcher Identifiers and National Federated Search Portal for Japanese Institutional Repositories
Kei Kurakawa (1), Hideaki Takeda (1), Ryo Shiozaki (1), Shun Morimoto (2), and Hideki Uchijima (2)
(1) National Institute of Informatics, Japan
(2) Kanazawa University Library, Japan
OR2012, The 7th international conference on Open Repositories
09 - 13/Jul/2012, the University of Edinburgh, UK
RF3: Pecha Kucha – National Infrastructures, 11/Jul/2012: 11:00am – 12:30pm
2. Name disambiguation
• The name ambiguity problem is caused by
– identical family and first names, maiden name usage, pen names,
– character variants in Kanji (for Japanese names),
– and the variety of name formats used by different journals.
• Assigning an identifier to each researcher is one answer.
• If this problem is solved, it becomes possible to
– compile an exact list of a researcher's achievements,
– assess the researcher's contributions exactly,
– trace the researcher's development path exactly,
– and exchange exact information about the researcher in various scholarly communication settings.
3. Assigning identifiers for authors of repository items
• DAI (Digital Author Identifier), the SURF foundation, the Netherlands
• Names Project, Mimas, University of Manchester, UK
• Networking Names, OCLC
• ORCID, ORCID Inc.
• Researcher Name Resolver, NII, Japan
4. Objective of the Researcher Name Resolver (RNR)
• Basic function: providing
– identifiers for Japanese researchers on the Web
– Web resource links for each researcher
• Implementations
– Researcher URI
– Identifier management
– Researcher pages http://rns.nii.ac.jp
– Web services
• Applications
– Name authority for external systems
– Researcher resource link resolver
5. Researcher identifier scheme and management
• The identifier is
– a 13-digit number,
– called the Researcher Name Resolver ID,
– and used in a URI of the form http://rns.nii.ac.jp/nr/xxxxxxxxxxxxx
• Identifier management
– Multiple identifiers of one researcher are grouped and aliased to a representative identifier
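The identifier scheme above can be sketched in a few lines of Python. This is only an illustration: the function names and the `aliases` dictionary are hypothetical, and the only facts taken from the slide are the 13-digit form, the URI prefix, and the idea that several identifiers are aliased to one representative.

```python
import re

# URI prefix and 13-digit form as shown on the slide.
RNR_BASE = "http://rns.nii.ac.jp/nr/"
_ID_PATTERN = re.compile(r"[0-9]{13}")


def is_valid_rnr_id(rnr_id: str) -> bool:
    """Return True if the string is exactly 13 ASCII digits."""
    return _ID_PATTERN.fullmatch(rnr_id) is not None


def researcher_uri(rnr_id: str) -> str:
    """Build the researcher URI for a syntactically valid ID."""
    if not is_valid_rnr_id(rnr_id):
        raise ValueError(f"not a 13-digit identifier: {rnr_id!r}")
    return RNR_BASE + rnr_id


def resolve_alias(rnr_id: str, aliases: dict[str, str]) -> str:
    """Follow a hypothetical alias table until a representative ID is reached.

    The `seen` set guards against accidental cycles in the table.
    """
    seen = set()
    while rnr_id in aliases and rnr_id not in seen:
        seen.add(rnr_id)
        rnr_id = aliases[rnr_id]
    return rnr_id


# Placeholder identifiers, not real researchers.
aliases = {"0000000000002": "0000000000001"}
print(researcher_uri(resolve_alias("0000000000002", aliases)))
```

Whether the real service stores aliases as a flat table like this is not stated on the slide; the dictionary is just the simplest structure that expresses "grouped and aliased to a representative".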
6. Prior registration
• To prime the pump,
– researchers registered in KAKEN, the database of Grants-in-Aid for Scientific Research in Japan, were pre-registered in RNR
– 200,882 researchers were registered as of 2012-07-06
• Extracted data attributes
– Names (Japanese Kanji characters, Japanese phonetic characters, English characters)
– Affiliated institution (most recent), department, job title
– Grants-in-Aid for Scientific Research Researcher Number
– (Research fields of Grants-in-Aid for Scientific Research)
– (Research keywords of Grants-in-Aid for Scientific Research)
7. Web resource links for researcher
Direct links to external systems (diagram; some links are marked "Planned"):
• Database of Grants-in-Aid for Scientific Research
• Academic information resource portal and link system of Japan
• The biggest researcher directory of Japan
• The biggest article search published in Japan
• Campus Directories
8. Batch processing approach
• To make links to external systems, the RNR
– loads profile data,
• ResearcherID-based researcher profiles, including a list of external identifiers
– uses identifier mapping rules,
• the RNR identifier implicitly reuses the KAKEN identifier
• the CiNii identifier implicitly reuses the KAKEN identifier
– and loads external identifier mapping lists
• J-GLOBAL identifier to KAKEN identifier
• ReaD&Researchmap identifier to KAKEN identifier
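One way to picture the last step, loading external mapping lists, is to invert each "external identifier to KAKEN identifier" list into a single per-researcher table. The sketch below assumes the lists arrive as plain dictionaries; the function name, the system keys, and all identifier values are made up for illustration, and the exact rule by which RNR and CiNii identifiers reuse the KAKEN identifier is not modeled because the slide does not spell it out.

```python
from collections import defaultdict


def invert_mapping_lists(mapping_lists: dict[str, dict[str, str]]) -> dict[str, dict[str, str]]:
    """Turn {system: {external_id: kaken_id}} into {kaken_id: {system: external_id}}.

    Each inner dictionary plays the role of one external mapping list.
    """
    by_kaken: dict[str, dict[str, str]] = defaultdict(dict)
    for system, pairs in mapping_lists.items():
        for external_id, kaken_id in pairs.items():
            by_kaken[kaken_id][system] = external_id
    return dict(by_kaken)


# Hypothetical mapping lists with placeholder identifiers.
mapping_lists = {
    "j-global": {"JG-0001": "00000001", "JG-0002": "00000002"},
    "researchmap": {"rm-alpha": "00000001"},
}

links = invert_mapping_lists(mapping_lists)
print(links["00000001"])
```

Keying everything on the KAKEN identifier mirrors the slide, where every mapping list points to the KAKEN identifier.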
9. Crawling approach
• To build direct links to campus directories, the RNR
– crawled campus directories,
– built a list of researcher pages with names,
– and identified those researchers among RNR registrants.
• To identify a researcher, the RNR checks whether
– the Kanji name is the same,
– the institution is the same,
– and the name is unique within the institution.
• In practice,
– 226 university campus directories were crawled.
– 102,949 researcher pages were indexed.
– RNR has 23,252 links to those campus directories.
– Precision is high, but recall is low.
– Machine learning is a promising approach for improvement.
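The three matching conditions can be sketched as follows. This is a minimal illustration rather than the RNR implementation: the `Person` record and `match_pages` function are hypothetical, names and institutions are compared as exact strings, and "unique within the institution" is interpreted here as unique among both the registrants and the crawled pages.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Person:
    kanji_name: str
    institution: str
    ref: str  # RNR ID for a registrant, page URL for a crawled page


def _key(person: Person) -> tuple[str, str]:
    return (person.kanji_name, person.institution)


def match_pages(registrants: list[Person], pages: list[Person]) -> dict[str, str]:
    """Link a registrant to a crawled page only when the (name, institution)
    pair is identical and occurs exactly once on each side."""
    registrant_counts = Counter(_key(r) for r in registrants)
    page_counts = Counter(_key(p) for p in pages)
    registrant_by_key = {_key(r): r for r in registrants}

    links = {}
    for page in pages:
        key = _key(page)
        if registrant_counts[key] == 1 and page_counts[key] == 1:
            links[registrant_by_key[key].ref] = page.ref
    return links


# Invented sample data: two registrants share a name at the same institution,
# so that name is skipped as ambiguous.
registrants = [
    Person("山田太郎", "Univ A", "0000000000001"),
    Person("鈴木一郎", "Univ A", "0000000000002"),
    Person("鈴木一郎", "Univ A", "0000000000003"),
]
pages = [
    Person("山田太郎", "Univ A", "http://example.org/yamada"),
    Person("鈴木一郎", "Univ A", "http://example.org/suzuki"),
]
print(match_pages(registrants, pages))
```

Skipping every ambiguous name is what keeps precision high at the cost of recall, which is consistent with the trade-off reported on the slide.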
10. Researcher page
(Screenshot of a researcher page with the following parts labeled)
• Researcher basic information: researcher name (Kanji, Katakana, English), Researcher Name Resolver ID, Researcher URI
• Direct links
• Search links
• Research fields of Grants-in-Aid for Scientific Research
• Research keywords of Grants-in-Aid for Scientific Research
• URI of this document
11. Web services
• Researcher URI
• OpenSearch
• Linked data structure
• RDF export
• URL Redirection
– Redirection service URL for a researcher
– Ex. http://rns.nii.ac.jp/services/redirect?source=resolver&id=1000080252831&target=kaken
(Diagram labels: External direct links for the researcher; Campus Directories)
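The redirection URL in the example can be assembled programmatically. The sketch below reuses only the endpoint and the three query parameters visible in the slide's example (`source`, `id`, `target`); the function name is hypothetical, and which other `source` or `target` values the service accepts is not stated, so only `resolver` and `kaken` are used here.

```python
from urllib.parse import urlencode

# Endpoint taken from the example URL on the slide.
REDIRECT_ENDPOINT = "http://rns.nii.ac.jp/services/redirect"


def redirect_url(rnr_id: str, target: str, source: str = "resolver") -> str:
    """Build a redirection service URL for a researcher identifier."""
    query = urlencode({"source": source, "id": rnr_id, "target": target})
    return f"{REDIRECT_ENDPOINT}?{query}"


# Reproduces the example URL from the slide.
print(redirect_url("1000080252831", "kaken"))
```

Using `urlencode` rather than string concatenation keeps the URL valid if an identifier from another system contains characters that need escaping.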
12. Application for JAIRO
• JAIRO is the national federated search portal.
• RNR serves as the name authority for JAIRO.
• What repository managers need to do is
– embed an “id” attribute in the “creator” field of the metadata that JAIRO (IRDB) harvests, then either
1. put an RNR URI in the “id” attribute value, or
2. put any URI for the researcher in the “id” attribute value, and upload researcher profiles with a table of external ids to RNR
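Option 1 above might look like this when a repository generates its metadata. The sketch uses Python's standard `xml.etree.ElementTree` and assumes a bare `creator` element; the real harvested metadata format, its namespaces, and its surrounding elements are not shown on the slide and are omitted here. The researcher name and identifier are placeholders.

```python
import xml.etree.ElementTree as ET


def creator_element(name: str, researcher_uri: str) -> ET.Element:
    """Create a creator element carrying the researcher URI in an "id" attribute."""
    element = ET.Element("creator", {"id": researcher_uri})
    element.text = name
    return element


# Placeholder researcher with a syntactically valid but invented RNR URI.
element = creator_element("Example Researcher", "http://rns.nii.ac.jp/nr/0000000000000")
print(ET.tostring(element, encoding="unicode"))
```

For option 2 the same function applies with a different URI in the attribute; the additional profile upload to RNR is a separate step that this sketch does not cover.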
15. How does an author identifier change the UI?
• JAIRO is an example of a tool for exploring scholarly knowledge.
• We assume that there are two modes of knowledge exploration, i.e.
– basically, we search by keyword, and
– sometimes, we search by author name.
• When we search by author id,
– we expect to get an exact list of the researcher's articles,
– and therefore we can trace the researcher's development path.
• A new UI ought to be designed, depending on whether the search conditions include an author identifier.
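The difference between searching by author name and by author id can be shown on a toy record set. Everything below is invented for illustration: the record layout, the field names `creator` and `creator_id`, and the two researchers who happen to share a name.

```python
# Toy bibliographic records; two different researchers share one name.
RECORDS = [
    {"title": "Paper A", "creator": "Sato Hiroshi",
     "creator_id": "http://rns.nii.ac.jp/nr/0000000000001"},
    {"title": "Paper B", "creator": "Sato Hiroshi",
     "creator_id": "http://rns.nii.ac.jp/nr/0000000000002"},
    {"title": "Paper C", "creator": "Sato Hiroshi",
     "creator_id": "http://rns.nii.ac.jp/nr/0000000000001"},
]


def search_by_name(records: list[dict], name: str) -> list[str]:
    """Name search: returns every record whose creator string matches."""
    return [r["title"] for r in records if r["creator"] == name]


def search_by_id(records: list[dict], researcher_uri: str) -> list[str]:
    """Identifier search: returns only the records of one researcher."""
    return [r["title"] for r in records if r.get("creator_id") == researcher_uri]


print(search_by_name(RECORDS, "Sato Hiroshi"))
print(search_by_id(RECORDS, "http://rns.nii.ac.jp/nr/0000000000001"))
```

The name search mixes the two researchers, while the identifier search returns one researcher's list, which is the property a UI built around author identifiers could rely on.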
18. Privacy and data protection
• When dealing with researcher profiles and identifiers, we should pay attention to privacy laws.
• Related acts in Japan
– Act on the Protection of Personal Information (in effect since 2005)
• Act on the Protection of Personal Information Held by Administrative Organs
• Act on the Protection of Personal Information Held by Independent Administrative Agencies, etc.
• Regulations on the Protection of Personal Information
• Phase of practice Exclusion from application
– Academic research (feasibility study)
– Business
• Current status in Japan
– There has not been enough discussion of privacy and data protection for researcher profiles in repositories.
– Licensing of researcher profiles for repositories is not explicit.
19. Future work
• To link RNR with external global services, e.g. ORCID and Microsoft Academic Search
• To improve the precision and recall of linkage
• To clarify privacy and data protection matters
• To widely promote the practice of putting author identifiers in bibliographic metadata
20. Summary
• Researcher identifiers are required to solve the name ambiguity problem.
• The Researcher Name Resolver deals with identifiers for researchers in Japan.
– It manages researcher identifiers.
– It shows researcher profiles on researcher pages.
– It gives links on researcher pages to external systems.
– It provides relevant web services.
• When RNR functions as the name authority for JAIRO, it enables search by author identifier.