The document discusses problems with discovery and access across libraries and publishers due to siloed systems and metadata. Specifically, it notes there are too many separate systems that cannot be logically mapped, leading to low precision and recall. This is partly due to limitations in metadata, including ambiguity of terms and a focus on individual instances rather than abstract concepts. This makes it difficult to discover related information about prolific authors across different publications and editions. Overall, the document argues the current landscape needs improvement to better serve users.
sms text message marketing - internet marketing servicesmsmarket12
Marketers have a unique opportunity to harness the power of this interactive and personal medium. Brands can now reach consumers anywhere and anytime with a relevant message and track responses in real-time. SMS is a powerful marketing tool which effectively targets audiences who are difficult to reach through traditional media channels.
sms text message marketing - internet marketing servicesmsmarket12
Marketers have a unique opportunity to harness the power of this interactive and personal medium. Brands can now reach consumers anywhere and anytime with a relevant message and track responses in real-time. SMS is a powerful marketing tool which effectively targets audiences who are difficult to reach through traditional media channels.
TIC Magazine est le premier magazine marocain francophone dédié aux Technologies de l’Information et la communication, destiné aux professionnels du secteur au Maroc
Plus d'infos sur http://www.ticmagazine.net
I talk about the evolution of digital content into services, the role of sensors in the future of the web, about the idea of man-machine collaboration in internet services, and about the role of social networking in building content.
This presentation was given during the February 2009 Coos Library Directors' meeting to convince them to look into adopting an open source integrated library system (ILS) such as Koha or Evergreen. The Coos County Library Service District is located on the southern Oregon coast.
Slides from the presentation at Munich in 2010. Slide have been presented to digital humanities experts and scholars participating the COST A32 action (2006-2010)
Interpretation, Context, and Metadata: Examples from Open ContextEric Kansa
Presentation given at the International Data Curation Conference (#IDCC!6) in Amsterdam, at the "A Context-driven Approach to Data Curation for Reuse" workshop (organized by Ixchel Faniel and Elizabeth Yakel) on Monday, February 22, 2015
A importância dos dados em sua arquitetura... uma visão muito além do SQL Ser...Alexandre Porcelli
Nos últimos 30 anos temos vivido a hegemonia dos bancos de dados relacionais, a grande bala de prata da TI. O armazenamento de dados se tornou tão comoditizado, que nem mesmo nos questionamos se o modelo relacional é adequado as nossas necessidades. Mas será que o armazenamento de dados se resume ao modelo relacional? Será que as técnicas tradicionais de normalização ou ferramentas de produtividade como ORM são realmente adequadas? Será que você está tratando seus dados com a devida atenção?
Nesta palestra respondemos estas e outras perguntas sobre tratamento e armazenamento de dados. Colocamos o "dedo na ferida" e apresentamos uma nova escola de pensamento bem como algumas ferramentas que suportam esta nova realidade.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
TIC Magazine est le premier magazine marocain francophone dédié aux Technologies de l’Information et la communication, destiné aux professionnels du secteur au Maroc
Plus d'infos sur http://www.ticmagazine.net
I talk about the evolution of digital content into services, the role of sensors in the future of the web, about the idea of man-machine collaboration in internet services, and about the role of social networking in building content.
This presentation was given during the February 2009 Coos Library Directors' meeting to convince them to look into adopting an open source integrated library system (ILS) such as Koha or Evergreen. The Coos County Library Service District is located on the southern Oregon coast.
Slides from the presentation at Munich in 2010. Slide have been presented to digital humanities experts and scholars participating the COST A32 action (2006-2010)
Interpretation, Context, and Metadata: Examples from Open ContextEric Kansa
Presentation given at the International Data Curation Conference (#IDCC!6) in Amsterdam, at the "A Context-driven Approach to Data Curation for Reuse" workshop (organized by Ixchel Faniel and Elizabeth Yakel) on Monday, February 22, 2015
A importância dos dados em sua arquitetura... uma visão muito além do SQL Ser...Alexandre Porcelli
Nos últimos 30 anos temos vivido a hegemonia dos bancos de dados relacionais, a grande bala de prata da TI. O armazenamento de dados se tornou tão comoditizado, que nem mesmo nos questionamos se o modelo relacional é adequado as nossas necessidades. Mas será que o armazenamento de dados se resume ao modelo relacional? Será que as técnicas tradicionais de normalização ou ferramentas de produtividade como ORM são realmente adequadas? Será que você está tratando seus dados com a devida atenção?
Nesta palestra respondemos estas e outras perguntas sobre tratamento e armazenamento de dados. Colocamos o "dedo na ferida" e apresentamos uma nova escola de pensamento bem como algumas ferramentas que suportam esta nova realidade.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
How to Get CNIC Information System with Paksim Ga.pptxdanishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
7. Monday, November 21, 11
Old Silos
We in the library and publishing trades force readers, some of them who are authors as
well, to search iteratively for information they want or need or thinks might exist, in
many different silos, using many different search engines, forms, and vocabularies. We
do not make it easy for them to discover what is locally available, what is more or less
easy to get, or everything that might be available.
No wonder the young and foolish depend upon and believe in Google’s searches.
Google is quick...and in terms of search terms of relevance, very, very dirty.
8. Monday, November 21, 11
We give them better interfaces, ones that permit refinement of results, to our holdings at
the title level, BUT...
9. Monday, November 21, 11
Simulateneously, we show them many other tools, each excellent in some ways, to
continue their exploration of the literature. No single tool is comprehensive. We do not
refer our clients to the Web, at least not on our own web sites! // Our OPACs refer to our
holdings. While Indices and abstracts refer our readers to articles in journals to which
we may have licensed. SFX and similar provide readers with links to titles revealed to
which we have subscribed. Neither our opacs nor the secondary databases directly to
more than a tiny, percentage of the vast collection of pages that is the World Wide Web.
The Web, of course, refers in fragmentary fashion to information resources we might, I
emphasize, MIGHT have on hand for our readers.
10. Monday, November 21, 11
And the results of using other, often very good, discovery tools differ in relevance
ranking, format, and options than the ones we provide for our OPAcs, thus adding
confusion.
11. Monday, November 21, 11
some of us provide our readers with lots of databases to search. Too many really, for all
but a few are not forensic-level scholars.
12. Monday, November 21, 11
Selecting a licensed data base is an art in itself!
Once again notice that we rarely offer a web search engine as an option, and for good
reasons. Nevertheless, the discoverable relevant information resources on the web
apparently are not part of our repertory.
13. !!!
Monday, November 21, 11
We have not conspired to make the search for relevant information objects difficult. We
just have not yet had the tools, the methods, the vision, and yes, the gumption to try
something new.
14. ATLAS at LHC -- 150*106 sensors
Ntl Cntr for
Biotech Info
NSF CyberInfrastructure
quake engineering simulation
Monday, November 21, 11
Here’s a teensy slice of the information and communication environment in which our
faculty and students find themselves. And it gets more complex every day. Alas the
larger the number of websites indexed by Bing or Google or whatever search engine du
jour, the more likely it is that the relevance of the returns will be less pointed and
precisely matched to what the searcher hoped to find.
17. One size fits all???
17
Monday, November 21, 11
Does
one
size
fit
all?
18. 18
Monday, November 21, 11
Not
quite.
Even
Google
has
silos
and
uses,
as
do
others,
clever
interfaces
to
hide
the
fact
of
the
silos.
19. Monday, November 21, 11
Given all these silos and search engines, our users, our authors, and readers, and
teachers, and students, people on the street, our nations...need us to find a better way.
Facts about the information objects we have acquired or leased, facts about books,
articles, films, and so forth that we have published need to be found in the wild, on the
web. Ideally, we, librarians and publishers will get the facts about what we have and
what we are making public, for fun or profit, discoverable on the Web.
20. Discovery & Access
... the problems
Monday, November 21, 11
Let’s dwell on the problems
briefly...
21. 1. Too many stovepipe systems
2. Too little precision
with inadequate recall
3
3. Too far removed from W
Web
Wide
World
Monday, November 21, 11
22. 1. Too many stovepipe systems
Monday, November 21, 11
23. 1. Too many stovepipe systems
The landscape of discovery & access
services is a shambles
Monday, November 21, 11
24. 1. Too many stovepipe systems
The landscape of discovery & access
services is a shambles
It can’t be mapped in any logical way
Monday, November 21, 11
25. 1. Too many stovepipe systems
The landscape of discovery & access
services is a shambles
It can’t be mapped in any logical way
• not by us (the supposed information pros)
• not by the faculty & students who must navigate the chaos
Monday, November 21, 11
26. 1. Too many stovepipe systems
The landscape of discovery & access
services is a shambles
It can’t be mapped in any logical way
• not by us (the supposed information pros)
• not by the faculty & students who must navigate the chaos
This state of affairs shouldn’t be a surprise
Monday, November 21, 11
27. 2. Too little precision
with inadequate recall
Monday, November 21, 11
28. 2. Too little precision
with inadequate recall
Some of the problem ... too many stovepipe systems
Monday, November 21, 11
29. 2. Too little precision
with inadequate recall
Some of the problem ... too many stovepipe systems
• dumbing-down effects of federation often hinder explicit searches
• each interface has its own search-refinement tricks
• numerous, overlapping discovery paths hamper full recall
Monday, November 21, 11
30. 2. Too little precision
with inadequate recall
Some of the problem ... too many systems
• dumbing down effects of federation often hinder explicit searches
• each interface has its own search-refinement tricks
• numerous, overlapping discovery paths hamper full recall
Most of the problem ...
limitations in the design & execution of infrastructure
that supports discovery & access
Monday, November 21, 11
32. the 1st limiting factor ... ambiguity
Most of our metadata uses a string of bytes
to label a semantic entity [people, places, things, events, ...]
Monday, November 21, 11
33. the 1st limiting factor ... ambiguity
Most of our metadata uses a string of bytes
to label a semantic entity [person, place, thing, event, ...]
• discovery based on matching text labels
• not on the gist of semantic entities
Monday, November 21, 11
34. the 1st limiting factor ... ambiguity
Most of our metadata uses a string of bytes
to label a semantic entity [person, place, thing, event, ...]
• discovery based on matching text labels
• not on the gist of semantic entities
For libraries, the fix is authorities
• authoritative forms of strings
(names, organization, titles, places, events, topics, etc.)
Monday, November 21, 11
35. the 1st limiting factor ... ambiguity
Most of our metadata uses a string of bytes
to label a semantic entity [person, place, thing, event, ...]
• discovery based on matching text labels
• not on the gist of semantic entities
For libraries, the fix is authorities
• authoritative forms of strings (names, organization, titles,
places, events, topics, etc.) work to improve precision and recall
hold on
... what about cases where no one-to-one relationship exists
between a string-of-text label & the underlying semantic entity
Monday, November 21, 11
36. the 1st limiting factor ... ambiguity
Most of our metadata uses a string of bytes
to label a semantic entity [person, place, thing, event, ...]
• discovery based on matching text labels
• not on the gist of semantic entities
For libraries, the fix is authorities
• authoritative forms of strings (names, organization, titles,
places, events, topics, etc.) work to improve precision and recall
hold on
... what about cases where no one-to-one relationship exists
between a string-of-text label & the underlying semantic entity
Take for example the text string: jaguar
byte string: 4a 61 67 75 61 72
Monday, November 21, 11
37. ... a rose is a rose is a rose
company
Ltd.
cars
XK series, in pro-
duction since 1996
E-Type (UK) or
XK-E (US) mftg
1961 to 1974
etc.
hardware & software
Atari video
game console
Macintosh
OS X 10.2
John Giannandrea, CTO, Metaweb
Monday, November 21, 11
Imagine this keyword search and realize the ambiguity of the term “jaquar”
inspired by John Giannandrea, CTO, Metaweb ... from his presentation at PARC in
April, 2008
38. ... a rose is a rose is a rose
company music
Ltd. heavy metal band formed
in Bristol, England. Dec 1979
cars
Fender electric guitar,
XK series, in pro- introduced in 1962
duction since 1996
Philadelphia-based
singer/songwriter
E-Type (UK) or Jaguar Wright
XK-E (US) mftg
1961 to 1974
etc. military
type 140 Jaguar
class fast attack
craft [torpedo],
hardware & software Germany WWII
Atari video
game console Anglo-French ground
attack aircraft
Macintosh XF10F prototype swing-wing
OS X 10.2 fighter, early 1950s, Grumman
John Giannandrea, CTO, Metaweb
Monday, November 21, 11
inspired by John Giannandrea, CTO, Metaweb
... from his presentation at PARC in April, 2008
39. ... a rose is a rose is a rose
company music
Ltd. heavy metal band formed
in Bristol, England. Dec 1979
cars
Fender electric guitar, heros
XK series, in pro- introduced in 1962
duction since 1996
The Jaguar is a superhero
published by Archie Comics
Philadelphia-based
singer/songwriter
E-Type (UK) or Jaguar Wright
XK-E (US) mftg
1961 to 1974 DC Comics' Impact series,
... loosely based on Archie
Comics' character
etc. military
type 140 Jaguar
class fast attack pro footbal
craft [torpedo],
hardware & software Germany WWII
Jacksonville
Atari video
game console Anglo-French ground
attack aircraft
Macintosh XF10F prototype swing-wing
OS X 10.2 fighter, early 1950s, Grumman
John Giannandrea, CTO, Metaweb
Monday, November 21, 11
inspired by John Giannandrea, CTO, Metaweb
... from his presentation at PARC in April, 2008
40. Prrrrr
... a rose is a rose is a rose
company music
Ltd. heavy metal band formed
in Bristol, England. Dec 1979
cars
Fender electric guitar, heros
XK series, in pro- introduced in 1962
duction since 1996
The Jaguar is a superhero
published by Archie Comics
Philadelphia-based
singer/songwriter
E-Type (UK) or Jaguar Wright
XK-E (US) mftg
1961 to 1974 DC Comics' Impact series,
... loosely based on Archie
Comics' character
etc. military
type 140 Jaguar
class fast attack pro footbal
craft [torpedo],
hardware & software Germany WWII
Jacksonville
Atari video
game console Anglo-French ground
attack aircraft
Macintosh XF10F prototype swing-wing
OS X 10.2 fighter, early 1950s, Grumman
John Giannandrea, CTO, Metaweb
Monday, November 21, 11
inspired by John Giannandrea, CTO, Metaweb
... from his presentation at PARC in April, 2008
41. the 2nd limiting factor
... instance-based metadata
Monday, November 21, 11
42. the 2nd limiting factor
... instance-based metadata
Most of our metadata uses focuses
on publication artifacts
• identify responsibility for its creation
• list topical headings
Monday, November 21, 11
43. the 2nd limiting factor
... instance-based metadata
Most of our metadata uses focuses
on publication artifacts
• identify responsibility for its creation
• list topical headings
For simple cases ... few worries
• as with ambiguity, one-to-one relationships pose few problems
• things work for authors with a few books in several editions
Monday, November 21, 11
44. the 2nd limiting factor
... instance-based metadata
Most of our metadata uses focuses
on publication artifacts
• identify responsibility for its creation
• list topical headings
For simple cases ... few worries
• as with ambiguity, one-to-one relationships pose few problems
• things work for authors with a few books in several editions
But, as complexity increases,
precision & recall suffer
Monday, November 21, 11
45. Prolific authors ... search:
Shakespeare’s Hamlet
Wading thru search results for authors 811 entries
like Shakespeare shows clearly the
effects that instance-based metadata
has on precision & recall
Monday, November 21, 11
A Socrates (Stanford Libraries OPAC) keyword search for the terms shakespeare and
hamlet
46. Prolific authors ... search:
Shakespeare’s Hamlet
Wading thru search results for authors 811 entries
like Shakespeare shows clearly the
effects that instance-based metadata
has on precision & recall
Unflagging patience marks the task of
flipping back & forth between hundreds
of brief and full records to sort thru
the varied instances of a single entity
Monday, November 21, 11
47. Prolific authors ... search:
Shakespeare’s Hamlet
Wading thru search results for authors 811 entries
like Shakespeare shows clearly the
effects that instance-based metadata
has on precision & recall
Unflagging patience marks the task of
flipping back & forth between hundreds
of brief and full records to sort thru
the varied instances of a single entity, e.g.
• critical editions based on primary sources
• 18th & 19th century collections of the plays
• social, historical and literary essays
• histories & critiques of such writings
• video and audio recordings of performances
• reviews and indices of the same
• treatments of stagecraft, costumes, music
• life & works of notables associated with the
plays (e.g., performers, directors)
• other art forms inspired by the plays
Monday, November 21, 11
48. 3
3. Too far removed from W
Web
Wide
World
Monday, November 21, 11
49. 3
3. Too far removed from W
Web
Wide
World
Together, our metadata & collections
make up a big chunk of the “dark web”
[ info resources that search-engine spiders can’t see ]
Monday, November 21, 11
50. 3
3. Too far removed from W
Web
Wide
World
Together, our metadata & collections
make up a big chunk of the “dark web”
[ info resources that search-engine spiders can’t see ]
It’s clear that visibility on the web promotes
dramatic increases in discovery and access
Monday, November 21, 11
51. 3
3. Too far removed from W
Web
Wide
World
Together, our metadata & collections
make up a big chunk of the “dark web”
[ info resources that search-engine spiders can’t see ]
It’s clear that visibility on the web promotes
dramatic increases in discovery and access
• Library of Congress & Smithsonian images (FLICKR)
Monday, November 21, 11
52. 3
3. Too far removed from W
Web
Wide
World
Together, our metadata & collections
make up a big chunk of the “dark web”
[ info resources that search-engine spiders can’t see ]
It’s clear that visibility on the web promotes
dramatic increases in discovery and access
• Library of Congress & Smithsonian images (FLICKR)
• SULAIR’s Highwire Press ( > 2x increase via Google)
Monday, November 21, 11
53. 3
3. Too far removed from W
Web
Wide
World
Together, our metadata & collections
make up a big chunk of the “dark web”
[ info resources that search-engine spiders can’t see ]
It’s clear that visibility on the web promotes
dramatic increases in discovery and access
• Library of Congress & Smithsonian images (FLICKR)
• SULAIR’s Highwire Press ( > 2x increase via Google)
The state of affairs is well known ...
Monday, November 21, 11
55. academy
publisher
pr
od
ce u
library
pr
Scholars
ov
&
students
e id
Monday, November 21, 11
Here is a schematic to suggest how our ecosystem works. It is more complex, of
course, but the basics are embodied here.
56. Once
upon
a
&me…the
Internet
internet
Monday, November 21, 11
And here is the way the e-discovery and e-communication environment is developing.
First there was the Internet. Prophets such as Vannevar Bush, Ted Nelson, and Doug
Englebart showed us the way.
57. Then…the
World
Wide
Web
web
of
pages
internet
Monday, November 21, 11
Thanks to another profit, Tim Berners-Lee, the Internet, a network of communicating
computers, became a web of pages of information. Scholarly journal publishers and some
librarians realized early on that there were functional advantages to scholarship and to
publishing in the web of pages. Yahoo, Google, and others realized that mining the web o
pages by words on those pages, could make the rapidly growing web of pages reveal mor
through indexing and cataloging the web. Indexing won out as we now know over catalog
The next thing is the subject of this talk. It is the web of data. It is the web of relationships
constructed and expressed so that both computers and humans can identify and understa
relationships in that web. The web of data lives with the web of pages and is carried on th
Internet, the global carrier.
58. web
Under
construc&on
of
data
web
of
pages
internet
Monday, November 21, 11
This web of data is the next big thing in discovering relevant information objects and the n
big thing in empowering individuals, communities, and industries in making better use of
information that they or others create. What distinguishes this web of data, this linked dat
environment, is the principal of identifying entities, virtual & real by statements of relations
and descriptions in machine readable form. More about this as we go along.
59. web
Under
construc&on
of
data
web
of
pages
internet
aka Linked Data
Monday, November 21, 11
We
are
calling
this
next
phase
the
Linked
Data
phase,
because
it
is
enGrely
dependent
upon
statements
of
relaGonships
and
descripGons
in
machine
readable
form,
but
this
phase
may
be
onl
a
pre-‐cursor
to
another,
more
complex
and
more
difficult
web
world
to
engineer.
The
next
phase
i
the
SemanGc
Web,
which
in
theory
allows
the
machine
readable
relaGonships
and
descripGons
to
interoperate
to
saGsfy
a
person’s
requirements,
albeit
without
constant
interacGon.
In
short,
in
th
SemanGc
Web,
the
machines
will
understand
meaning
and
presumably
act
on
it.
Scarey,
eh?
60. ConstrucGon
Tools
60
Monday, November 21, 11
How
to
we
work
to
alleviate
our
problems
as
informaGon
professionals,
librarians
and
publishers?
61. Recipe
for
crea+ng
the
web
of
data
• identify people, places, things, events,
and other entities embedded in the
knowledge resources that a research
university consumes and produces
Monday, November 21, 11
62. Recipe
for
crea+ng
the
web
of
data
• identify people, places, things, events,
and other entities embedded in the
knowledge resources that a research
university consumes and produces
• tie those facts together with
named connections
Monday, November 21, 11
63. Recipe
for
crea+ng
the
web
of
data
• identify people, places, things, events,
and other entities embedded in the
knowledge resources that a research
university consumes and produces
• tie those facts together with
named connections
• publish the relationships as
crawl-able links on the web
Monday, November 21, 11
64. Recipe
for
crea+ng
the
web
of
data
• identify people, places, things, events,
and other entities embedded in the
knowledge resources that a research
university consumes and produces
• tie those facts together with
named connections
• publish the relationships as
crawl-able links on the web
Build/use apps supporting discovery
via the web of data
Monday, November 21, 11
65. 65
Monday, November 21, 11
Here
is
a
pile
of
words
represenGng
all
the
words
on
the
web
that
most
search
engines
index
constantly.
Good
search
engines
today
can
do
a
lot
with
this
pile.
BUT,
the
search
engines
create
the
percepGon
of
relaGonships,
not
based
on
meaning,
but
on
other
factors,
such
as
number
of
links
to
a
site
containing
the
words
of
interest
OR
the
traffic
to
a
site.
66. From
this
pile
of
words,
structure! 66
Monday, November 21, 11
The
Linked
Data
approach
aSempts
to
structure
the
pile
in
anGcipaGon
of
the
need
for
discovery.
That
structure
is
based
on
meaning,
on
relaGonships.
I
will
make
this
clearer
in
the
next
slides.
67. 67
Monday, November 21, 11
Here’s
a
graph
of
a
very
few
relaGonships
to
Yo
Yo
Ma,
the
great
‘cellist.
68. Linked
Data
Web 68
Monday, November 21, 11
Here’s
a
graph
of
relaGonships
to
Haggis,
just
a
fun
one
I
could
not
resist
throwing
in.
Meaning
is
provided
by
understanding
relaGonships.
69. RDF$triples$&$URIs$
• RDF$triples$=$subject$–$object$–$predicate$
– A$way$to$describe$objects$or$even$ideas$on$the$web$
– An$object$or$idea$might$have$many$RDF$triples$describing$it$
– Objects$or$ideas$need$not$exist$on$the$web!$
• URIs$=$Uniform$Resource$IdenDfiers$
– Allows$machine$interacDon$among$Web$objects$
– Various$syntacDcal$schemes$&$protocols$used$to$construct$
URIs$
– At$least$3$needed$to$support$an$RDF$(subject$–$objectJ$
predicate)$
69
Monday, November 21, 11
Geek
ingredients
to
the
construcGon
of
the
Linked
DAta
Web.
RDF
means
Resource
DescripGon
Framework,
always
expressed
as
a
simple
sentence,
though
mulGple
such
statements
might
aSach
to
a
single
enGty.
In
fact,
we
need
mulGple
RDFs
in
this
scheme.
71. The Linked Data Principles
1. Use Resource Description Frameworks as
names of things (people, places, times, objects,
ideas...anything really)
2. Use HTTP URIs so that people can look up
those names
3. When someone looks up a URI, provide useful
RDF information
4. Include RDF statements that link to other
URIs so that they can discover related things
71
Monday, November 21, 11
The
really
great
aspect
of
RDFs
is
that
they
can
refer
to
ideas,
not
just
to
physical
or
virtual
enGGes.
Any
kind
of
idea
could
be
treated.
72. Library'Metadata'
• Library'metadata'standards'closed'
• '
“Passive”'metadata,'searchable,'but…'
• In'Silos ''
• Readable,'but'not'ac=onable'
• Search'results'refinable,'but'final'
72
Monday, November 21, 11
These
are
some
of
the
edges
of
the
problem
of
library
metadata.
73. Library'Metadata' Seman/c'Web'Metadata'
Library'Metadata' Seman/c'Web'Metadata'
• Library'metadata'standards' • Open'
• Library'metadata'standards' • Open'
closed'
closed'
• “Passive”'metadata,' • Dynamic,'Contextualized'
• “Passive”'metadata,' • Dynamic,'Contextualized'
searchable,'but…'
searchable,'but…'
• In'Silos '' • In'the'wild'
• In'Silos '' • In'the'wild'
• Readable,'but'not' • Interac<ve,'Responsive'
• Readable,'but'not' • Interac<ve,'Responsive'
ac<onable'
ac<onable'
• Search'results'refinable,'but'
• Search'results'refinable,'but' • Leading'to'other'queries'&'
final' • Leading'to'other'queries'&'
final' views'
views'
73
Monday, November 21, 11
And
here
is
the
comparison
between
the
library
metadata
scene
now
and
the
one
we
advocate
for
the
Linked
Data/SemanGc
Web.
Library
metadata
in
the
Linked
Data
Web
should
be
freely
available,
constantly
updated,
o[en
reconciled
with
RDF
triple
statements
from
non-‐library
sources.
Library
Linked
Data
should
be
enGrely
open
on
the
web.
74. Make
Library
bibliographic
facts
in
to
RDFs
&
URIs;
Release
them
into
the
wild.
Make
Library
Linked
Data
OPEN.
74
Monday, November 21, 11
I
should
add
that
accounGng
for
physical
objects
in
our
collecGons,
locaGng
them,
making
our
collecGons
auditable,
and
managing
our
collecGons
seems
to
be
possible
using
Linked
Data
too,
at
least
in
principal.
76. Publishers*&*Socie/es**
making*use*of*Linked*Data*
• Aggregate*content*in*their*own*realms*&*beyond*
• Aggregate*informa/on*about*
– Conferences*
– Career*building*&*employment*opportuni/es*
– Communi/es*in*collabora/on*
– Commercial*&*other*services*suppor/ng*research*with*
specimens,*source*material,*processing,*trials*
– Produc/ve*rela/onships*with*others*
• Provide*ac/onable,*constantly*updated*links*in*
support*of*scholars,*teachers,*and*learners*
• Provide*compelling*services*tying*users*to*them*
76
Monday, November 21, 11
Libraries
too
can
use
Linked
Data
to
reveal
and
adverGse
compelling
services
offered
to
their
clients.
77. Seman4c
Web
adopters 77
Monday, November 21, 11
Here
are
some
of
the
big
players
in
the
Linked
Data
/
SemanGc
Web
world.
The
BriGsh
Library
has
released
RDFs/URIs
for
the
enGre
BriGsh
NaGonal
Bibliography.
The
Library
of
Congress
has
released
the
same
for
LCSH
&
Name
Authority
Files.
LCSH
includes
links
to
AGROVOC,
RAMEAU,
DNB,
GLIN
Subject
Thesaurus,
and
the
NaGonal
Agriculture
Library's
Subject
Index.
Every
Personal
and
Corporate
entry
in
LC/NAF
links
to
VIAF,
the
Virtual
InternaGonal
Authority
File
based
at
OCLC.
The
N
Y
Times
18
months
ago
made
all
500,000
(and
growing)
of
its
index
terms
available
in
the
wild
as
RDFs
and
URIs.
78. 78
Monday, November 21, 11
For
publishers
and
libraries...though
we
should
not
neglect
services.
79. ...if
users
can
find
it
in
their
own
context
79
Monday, November 21, 11
80. Context
Users Content
Users
=
readers,
authors,
teachers,
students 80
Monday, November 21, 11
81. Context
Users Content
Publishers
must
make
content
VISIBLE 81
Monday, November 21, 11
I
am
using
the
imperaGve
here,
because
invisible
published
content
means
invisible
benefit
to
the
author
and/or
the
publisher.
82. 82
Monday, November 21, 11
Here
is
a
recent
PLoS
arGcle
from
PLoS
Neglected
Tropical
Diseases.
83. 83
Monday, November 21, 11
And
here
is
the
semanGcally
enhanced
version
of
this
arGcle,
enhancements
provided
by
David
ShoSen
et
al.
in
the
form
of
links
to
further
informaGon,
interacGve
figures,
re-‐orderable
reference
list,
citaGons
in
context
and
tag
trees.
These
enhancements
took
10
man
weeks
in
2009!
However,
with
the
growing
ecology
of
linked
data,
much
of
this
could
be
accomplished
by
auto-‐tagging
and
algorithmic
construcGon
of
the
basic
RDFs
&
URIs
for
the
unique
arGcle.
Microdata
submiSed
by
some
publishers
and
their
supporGng
services
to
schema.org
lead
to
these
exciGng
possibiliGes.
84. aggrega+on
84
Monday, November 21, 11
AggregaGon
counts,
but
think
how
much
more
we
would
get
if
we
could
aggregate
from
libraries,
publishers,
and
the
wild
and
weird
variety
of
sources
on
the
web?
86. Disambigua4on
86
Monday, November 21, 11
RDFs
and
URIs
can
operate
in
many
languages
and
relaGonships
can
be
expressed
across
languages,
a
potenGal
big
benefit
to
research
and
collaboraGon
in
research.
87. Web
of
Data
Progress
87
Monday, November 21, 11
88. 2007
88
Monday, November 21, 11
FOAF
=
Friend
of
a
Friend.
Hundreds
of
millions
of
RDFs/URIs.
Fortunately
they
do
not
take
much
space
in
memory!
89. 89
Monday, November 21, 11
This
is
the
2011
graph
of
enGGes
supplying
RDFs
and
URIs.
Now
the
populaGon
is
in
the
hundreds
of
billions,
heading
to
trillions.
90. 2011
90
hSp://inkdroid.org/lod-‐graph/
Monday, November 21, 11
92. Linked'Open'Data'Value'Proposi4on'
• Linked'open'data'(LOD)'puts'informa4on'where'people'are'looking'for'it'–'on'
the'Web;''
• LOD'can'expands'discoverability'of'our'content;''
• LOD'opens'opportuni4es'for'crea4ve'innova4on'in'digital'scholarship'and'
par4cipa4on;''
• LOD'allows'for'open'con4nuous'improvement'of'data;''
• LOD'creates'a'store'of'machineDac4onable'data'on'which'improved'services'can'
be'built;''
• Library'linked'open'data'might'facilitate'the'break'down'the'tyranny'of'domain'
silos;''
• LOD'can'provide'direct'access'to'data'in'ways'that'are'not'currently'possible;''
• LOD'provides'unan4cipated'benefits'that'will'emerge'later'as'the'stores'of'LOD'
expand'exponen4ally.''
'
A"product"of"the"Stanford/CLIR"Linked"Data"Workshop"June"2011."
92
Monday, November 21, 11
25
ParGcipants
from
the
BriGsh
Library,
the
Bibliothèque
naGonale
de
France,
the
Deutsch
NaGonalbibliothek,
the
Royal
Library
of
Denmark,
Aalto
University
in
Finland,
the
Library
of
Congress,
the
Bibliotheca
Alexandrina,
the
NaGonal
InsGtute
of
InformaGcs
of
Japan,
Google,
Seme4,
Emory,
University
of
Virginia,
University
of
Michigan,
California
Digital
Library,
Knowledge
MoGfs,
CLIR,
and
Stanford.
93. Google
using
Stanford
bib
facts
+
web
resources 93
Monday, November 21, 11
This
is
a
movie
of
a
live
interacGon
with
Freebase
using
bibliographic
facts
from
Stanford,
and
linked
informaGon
resources
from
the
web.
It
shows
in
a
limited
way
the
potenGal
for
discovery
and
retrieval
in
the
Linked
Data
Web.
94. BnF
using
data
only
from
its
catalogs
&
Gallica
94
Monday, November 21, 11
This
is
another
movie
of
the
Linked
Data
prototype
based
enGrely
on
bibliographic
facts
from
the
BnF
catalogs
and
digital
texts
in
Gallica.
There
are
no
other
web
resources
drawn
into
this
prototype...yet.
97. Value
Proposi-on
for
LAM’s
We
in
the
cultural
heritage
and
knowledge
management
institutions
are
discovering
better
ways
of
publishing,
sharing,
and
using
information
by
linking
data
and
helping
others
do
the
same.
Through
this
work,
we
have
come
to
value
and
to
promote
the
following
practices:
1.
Publishing
data
on
the
web
for
discovery
and
use,
rather
than
preserving
it
in
dark,
more
or
less
unreachable
archives
that
are
often
proprietary
and
pro?it
driven;
2.
Continuously
improving
data
and
Linked
Data,
rather
than
waiting
to
publish
“perfect”
data;
3.
Structuring
data
semantically,
rather
than
preparing
?lat,
unstructured
data;
4.
Collaborating,
rather
than
working
alone;
5.
Adopting
Web
standards,
rather
than
domain
speci?ic
ones;
6.
Using
open,
commonly
understood
licenses,
rather
than
closed
and/or
local
licenses.
from
the
Stanford/CLIR
Workshop
on
Linked
Data,
June
2011
97
Monday, November 21, 11
In
each
couplet,
we
emphasize
the
second
half,
a[er
“rather
than”,
admitng
that
someGmes
the
first
half
of
the
couplet
has
to
be
operaGve.
98. DARPA
Internet
98
Monday, November 21, 11
This
is
where
we
started
2.5
decades
ago.
99. World
Wide
Web 99
Monday, November 21, 11
Thanks
to
Tim
Berners-‐Lee
and
many
others,
we
advanced
in
this
environment
from
the
early
1990s
unGl
today.
100. SOCIAL
WEB
100
Monday, November 21, 11
We
cannot
ignore
the
social
web
that
exists
in
the
current
WWW,
but
think
how
much
more,
some
of
it
scarey,
could
be
done
in
the
Linked
Data
Web
with
the
behaviors
of
the
Social
Web.
101. Linked
Data
Web 101
Monday, November 21, 11
Just
that
funny
reminder
of
the
fundamental
nature
of
the
Linked
Data
Web:
expressing
machine
acGonable
relaGonships.
102. Seman+c
Web 102
Monday, November 21, 11
And
in
the
next
web,
the
SemanGc
Web,
who
knows
what
may
be
possible.
103. Ubiquitous
compu+ng
103
Monday, November 21, 11
To
the
progression
of
network
types,
we
need
to
add
a
couple
of
enormously
important
environmental
factors.
Ubiquitous
compuGng
is
a
very
important
one.
Having
lots
of
computers
on
the
net
makes
the
possibility
of
an
open
global
linked
data
web
very
strong.
104. Mobility
104
Monday, November 21, 11
And
our
ability
to
communicate
by
voice
(how
about
that
Siri?)
and
by
bits/bytes
from
everywhere,
is,
perhaps,
just
another
aspect
of
ubiquitous
compuGng.
105. Ubiquitous
Compu4ng
Linked
Web
M
o
b
i
l
e
Web
Social
Web
Internet
105
Monday, November 21, 11
The
black
box
in
the
upper
right
corner
is
the
SemanGc
Web,
a
level
of
sophisGcaGon
yet
to
be
achieved.
The
linked
data
web
is
at
hand,
though.
Will
Librarians
and
Publishers
join
the
development
of
the
Linked
Open
Data
web?
I
certainly
think
we
should.
107. W3C Library Linked Data Incubator
Group
http://www.w3.org/2005/Incubator/lld/
A Bibliographic Framework
Initiative General Plan for the
Digital Age (October 31, 2011)
http://www.loc.gov/marc/
transition/news/
framework-103111.html
Linked
Data
Survey
&
Workshop
June
2011
hSp://www.clir.org/pubs/archives/linked-‐data-‐
survey/ 107
Monday, November 21, 11