SlideShare a Scribd company logo
1 of 14
The Mosaic Search Engine Mark van Harmelen Hedtek Ltd markvanharmelen@gmail.comhedtek.com
Aim Provide a proof of concept that  Users can have personalised search results according to their place and stage of studies Users can adopt other personas or points-of-view to explore academic resources We can exploit ‘mass’ attention data as revealed by library circulation information So far only working with ISBN identified books
HEI circulation data build Solr index anonymise partial Copac records annotated with use and reading list data reading lists Solr HEI anonymise front-end HEI anonymise
Anonymisation Level 1: Current prototype, enables faceting Level 2: With extra information, enables“people who borrowed this also borrowed”and“people who borrowed this went on to borrow” Anonymisationutility provided DPA compliant, can also use fair processing agreements
Augmenting Solr’s index Solr’s search index is loaded with items and any associated use information Use information is:   institution  course  progression level   year of use count of number of uses in that year Use information enables faceting Also add reading list info to items
Solr OPAC resultset itemquery item data query client-side front-end (browser)
Narrowing and broadening Thoughts (NB, ‘thoughts’) of narrowing of choice led to two features to broaden choice Don’t believe that the Mosaic demo in itself narrows when used for browsing Broadening features More like this link Reading lists
The Harry Potter ‘problem’ and scale The Harry Potter ‘problem’: Balderdash! We can control this using Library of Congress subject categories and Dewey Decimal shelfmarks Paul Miller raises questions of scale Dave Pattern has shown success use of use data at a single (small) institution We want to leverage reasonably large scale: 3.5-4M students in HE, over say the last five years
User context and attention Has been relatively simple to parameterise an open source search engine with user context Institution, course, progression level, academic year This is only part of the user context, can add Location Attention data, e.g., search history Further social search information
Disclaimer The next slide is independent of any decisions on a pure data approach Could be a pure data approach in there Or maybe not
Where is this going? A personal view Bind together ,[object Object]
Mosiac searchpersonalised/point-of-view search Massively parallel search for blindingly fast response times Data mining for library ‘stewardship’ We have prototypes for the first two, and we’re about to start experimenting with parallel search using Hadoop+Lucene
Building institutional contributions Propose union-cat-local: Search in local library Mosaic-like search utilises local loan data if it is available Two ways to encourage library contribution of loan data (thoughts in progress) Narrow: Libraries which contribute loan data to the pool get Mosaic search over the pool Broad: Offer the contextual/PoV search available everywhere; users will agitate if they don’t see local data
This is a Just Do It moment A national union catalogue with contextual search and local library interfaces Relatively cheap to do Potentially massive gains for learners, teachers and researchers Portends the development of shared services across the library domain and large cost savings Doesn’t preclude / agnostic on an open data approach Could incorporate a pure data service approach and/or a centralised service

More Related Content

What's hot

Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...ijsrd.com
 
Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Margaret Wang
 
Semantic Search Tutorial at SemTech 2012
Semantic Search Tutorial at SemTech 2012 Semantic Search Tutorial at SemTech 2012
Semantic Search Tutorial at SemTech 2012 Thanh Tran
 
Overbeeke
OverbeekeOverbeeke
Overbeekeanesah
 
Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Emily Kolvitz
 
Semantic Search on the Rise
Semantic Search on the RiseSemantic Search on the Rise
Semantic Search on the RisePeter Mika
 
Personalized search
Personalized searchPersonalized search
Personalized searchToine Bogers
 
Evaluation of Web Scale Discovery Services
Evaluation of Web Scale Discovery ServicesEvaluation of Web Scale Discovery Services
Evaluation of Web Scale Discovery ServicesNikesh Narayanan
 
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLPA NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLPijnlc
 
Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Peter Mika
 

What's hot (11)

Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...Extracting and Reducing the Semantic Information Content of Web Documents to ...
Extracting and Reducing the Semantic Information Content of Web Documents to ...
 
Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461
 
Semantic Search Tutorial at SemTech 2012
Semantic Search Tutorial at SemTech 2012 Semantic Search Tutorial at SemTech 2012
Semantic Search Tutorial at SemTech 2012
 
Overbeeke
OverbeekeOverbeeke
Overbeeke
 
Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...
 
Semantic Search on the Rise
Semantic Search on the RiseSemantic Search on the Rise
Semantic Search on the Rise
 
Personalized search
Personalized searchPersonalized search
Personalized search
 
Evaluation of Web Scale Discovery Services
Evaluation of Web Scale Discovery ServicesEvaluation of Web Scale Discovery Services
Evaluation of Web Scale Discovery Services
 
H0314450
H0314450H0314450
H0314450
 
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLPA NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
A NOVEL APPROACH FOR INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
 
Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012
 

Viewers also liked

Waiting For The Babel Fish
 Waiting For The Babel Fish Waiting For The Babel Fish
Waiting For The Babel Fishsbooth
 
Externally Hosted Web 2.0 Services
Externally Hosted Web 2.0 ServicesExternally Hosted Web 2.0 Services
Externally Hosted Web 2.0 Servicesmarkvanharmelen
 
Blogging and Internal Communications
Blogging and Internal CommunicationsBlogging and Internal Communications
Blogging and Internal Communicationssbooth
 
Elgg at the University of Brighton -- Stanier
Elgg at the University of Brighton -- StanierElgg at the University of Brighton -- Stanier
Elgg at the University of Brighton -- Staniermarkvanharmelen
 
Internet Is Fun
Internet Is FunInternet Is Fun
Internet Is Funngkaihoe
 
Introduction to CS60171 (2009)
Introduction to CS60171 (2009)Introduction to CS60171 (2009)
Introduction to CS60171 (2009)markvanharmelen
 
Teenagers and Blogs
Teenagers and BlogsTeenagers and Blogs
Teenagers and Blogssbooth
 
Enterprise2
Enterprise2Enterprise2
Enterprise2ngkaihoe
 
La scuola siamo noi: Matteucci Garibaldi
La scuola siamo noi: Matteucci Garibaldi La scuola siamo noi: Matteucci Garibaldi
La scuola siamo noi: Matteucci Garibaldi Alessandro Giorni
 
Il Web del futuro: dati strutturati e semantici in XHTML con un click - RDFa
Il Web del futuro: dati strutturati e semantici in XHTML con un click - RDFaIl Web del futuro: dati strutturati e semantici in XHTML con un click - RDFa
Il Web del futuro: dati strutturati e semantici in XHTML con un click - RDFaSimone Onofri
 
Web 2.0 and Learning and Teaching
Web 2.0 and Learning and TeachingWeb 2.0 and Learning and Teaching
Web 2.0 and Learning and Teachingmarkvanharmelen
 
Training Program
Training ProgramTraining Program
Training Programngkaihoe
 

Viewers also liked (20)

Waiting For The Babel Fish
 Waiting For The Babel Fish Waiting For The Babel Fish
Waiting For The Babel Fish
 
Externally Hosted Web 2.0 Services
Externally Hosted Web 2.0 ServicesExternally Hosted Web 2.0 Services
Externally Hosted Web 2.0 Services
 
Blogging and Internal Communications
Blogging and Internal CommunicationsBlogging and Internal Communications
Blogging and Internal Communications
 
Italy and rome's geography
Italy and rome's geographyItaly and rome's geography
Italy and rome's geography
 
Rev1,1
Rev1,1Rev1,1
Rev1,1
 
Manpower
ManpowerManpower
Manpower
 
Elgg at the University of Brighton -- Stanier
Elgg at the University of Brighton -- StanierElgg at the University of Brighton -- Stanier
Elgg at the University of Brighton -- Stanier
 
Hea.Keynote
Hea.KeynoteHea.Keynote
Hea.Keynote
 
Internet Is Fun
Internet Is FunInternet Is Fun
Internet Is Fun
 
Introduction to CS60171 (2009)
Introduction to CS60171 (2009)Introduction to CS60171 (2009)
Introduction to CS60171 (2009)
 
Jh Student Handbook 09 10
Jh Student Handbook 09 10Jh Student Handbook 09 10
Jh Student Handbook 09 10
 
Teenagers and Blogs
Teenagers and BlogsTeenagers and Blogs
Teenagers and Blogs
 
Enterprise2
Enterprise2Enterprise2
Enterprise2
 
La scuola siamo noi: Matteucci Garibaldi
La scuola siamo noi: Matteucci Garibaldi La scuola siamo noi: Matteucci Garibaldi
La scuola siamo noi: Matteucci Garibaldi
 
giornalino3M, terzo numero
giornalino3M, terzo numerogiornalino3M, terzo numero
giornalino3M, terzo numero
 
Il Web del futuro: dati strutturati e semantici in XHTML con un click - RDFa
Il Web del futuro: dati strutturati e semantici in XHTML con un click - RDFaIl Web del futuro: dati strutturati e semantici in XHTML con un click - RDFa
Il Web del futuro: dati strutturati e semantici in XHTML con un click - RDFa
 
Policy and Strategy
Policy and StrategyPolicy and Strategy
Policy and Strategy
 
Revelation 1st
Revelation 1stRevelation 1st
Revelation 1st
 
Web 2.0 and Learning and Teaching
Web 2.0 and Learning and TeachingWeb 2.0 and Learning and Teaching
Web 2.0 and Learning and Teaching
 
Training Program
Training ProgramTraining Program
Training Program
 

Similar to Mosiac Search Engine

Inteligent Catalogue Final
Inteligent Catalogue FinalInteligent Catalogue Final
Inteligent Catalogue Finalguestcaef1d
 
Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Bradley Allen
 
How discovery impacts of users' experiences
How discovery impacts of users' experiencesHow discovery impacts of users' experiences
How discovery impacts of users' experiencesKatherine Rose
 
Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind Debra Kolah
 
Recent Trends in Semantic Search Technologies
Recent Trends in Semantic Search TechnologiesRecent Trends in Semantic Search Technologies
Recent Trends in Semantic Search TechnologiesThanh Tran
 
Sem tech2013 tutorial
Sem tech2013 tutorialSem tech2013 tutorial
Sem tech2013 tutorialThengo Kim
 
Establishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBEstablishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBnw13
 
Falling in and out and in love with Information Architecture
Falling in and out and in love with Information ArchitectureFalling in and out and in love with Information Architecture
Falling in and out and in love with Information ArchitectureLouis Rosenfeld
 
Research4C4U
Research4C4UResearch4C4U
Research4C4Uianmcnee
 
Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3Marianne Sweeny
 
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information ArchitectureUsing Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information ArchitectureLouis Rosenfeld
 
Erl10 web scale-gb-sg
Erl10 web scale-gb-sgErl10 web scale-gb-sg
Erl10 web scale-gb-sgGeorge Boston
 
2008 web-managers-hwilfert-final
2008 web-managers-hwilfert-final2008 web-managers-hwilfert-final
2008 web-managers-hwilfert-finalHallie Wilfert
 
Encore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual MeetingEncore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual MeetingLaura Kohl
 
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)Rafal Kasprowski
 
Investing in a time of desruptive change
Investing in a time of desruptive changeInvesting in a time of desruptive change
Investing in a time of desruptive changeJisc
 
Web scale discovery service
Web scale discovery serviceWeb scale discovery service
Web scale discovery serviceKankana Baishya
 

Similar to Mosiac Search Engine (20)

Inteligent Catalogue Final
Inteligent Catalogue FinalInteligent Catalogue Final
Inteligent Catalogue Final
 
Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)
 
How discovery impacts of users' experiences
How discovery impacts of users' experiencesHow discovery impacts of users' experiences
How discovery impacts of users' experiences
 
Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind
 
Recent Trends in Semantic Search Technologies
Recent Trends in Semantic Search TechnologiesRecent Trends in Semantic Search Technologies
Recent Trends in Semantic Search Technologies
 
Sem tech2013 tutorial
Sem tech2013 tutorialSem tech2013 tutorial
Sem tech2013 tutorial
 
Establishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNBEstablishing the Connection: Creating a Linked Data Version of the BNB
Establishing the Connection: Creating a Linked Data Version of the BNB
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
confernece paper
confernece paperconfernece paper
confernece paper
 
Falling in and out and in love with Information Architecture
Falling in and out and in love with Information ArchitectureFalling in and out and in love with Information Architecture
Falling in and out and in love with Information Architecture
 
Research4C4U
Research4C4UResearch4C4U
Research4C4U
 
Research4C4U
Research4C4UResearch4C4U
Research4C4U
 
Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3Sweeny ux-seo om-cap 2014_v3
Sweeny ux-seo om-cap 2014_v3
 
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information ArchitectureUsing Search Analytics to Diagnose What’s Ailing your Information Architecture
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
 
Erl10 web scale-gb-sg
Erl10 web scale-gb-sgErl10 web scale-gb-sg
Erl10 web scale-gb-sg
 
2008 web-managers-hwilfert-final
2008 web-managers-hwilfert-final2008 web-managers-hwilfert-final
2008 web-managers-hwilfert-final
 
Encore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual MeetingEncore Presentation - ACRL/NEC ITIG Annual Meeting
Encore Presentation - ACRL/NEC ITIG Annual Meeting
 
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
 
Investing in a time of desruptive change
Investing in a time of desruptive changeInvesting in a time of desruptive change
Investing in a time of desruptive change
 
Web scale discovery service
Web scale discovery serviceWeb scale discovery service
Web scale discovery service
 

Recently uploaded

Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftshyamraj55
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandIES VE
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGDSC PJATK
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...FIDO Alliance
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsStefano
 
Your enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jYour enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jNeo4j
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform EngineeringMarcus Vechiato
 
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimaginedpanagenda
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?Mark Billinghurst
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101vincent683379
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfFIDO Alliance
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...CzechDreamin
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfFIDO Alliance
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentationyogeshlabana357357
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfFIDO Alliance
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfFIDO Alliance
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfUK Journal
 
A Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System StrategyA Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System StrategyUXDXConf
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceSamy Fodil
 

Recently uploaded (20)

Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & Ireland
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
PLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. StartupsPLAI - Acceleration Program for Generative A.I. Startups
PLAI - Acceleration Program for Generative A.I. Startups
 
Your enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jYour enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4j
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdfHow Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
How Red Hat Uses FDO in Device Lifecycle _ Costin and Vitaliy at Red Hat.pdf
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdfBreaking Down the Flutterwave Scandal What You Need to Know.pdf
Breaking Down the Flutterwave Scandal What You Need to Know.pdf
 
A Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System StrategyA Business-Centric Approach to Design System Strategy
A Business-Centric Approach to Design System Strategy
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 

Mosiac Search Engine

  • 1. The Mosaic Search Engine Mark van Harmelen Hedtek Ltd markvanharmelen@gmail.comhedtek.com
  • 2. Aim Provide a proof of concept that Users can have personalised search results according to their place and stage of studies Users can adopt other personas or points-of-view to explore academic resources We can exploit ‘mass’ attention data as revealed by library circulation information So far only working with ISBN identified books
  • 3. HEI circulation data build Solr index anonymise partial Copac records annotated with use and reading list data reading lists Solr HEI anonymise front-end HEI anonymise
  • 4. Anonymisation Level 1: Current prototype, enables faceting Level 2: With extra information, enables“people who borrowed this also borrowed”and“people who borrowed this went on to borrow” Anonymisationutility provided DPA compliant, can also use fair processing agreements
  • 5. Augmenting Solr’s index Solr’s search index is loaded with items and any associated use information Use information is: institution course progression level year of use count of number of uses in that year Use information enables faceting Also add reading list info to items
  • 6. Solr OPAC resultset itemquery item data query client-side front-end (browser)
  • 7. Narrowing and broadening Thoughts (NB, ‘thoughts’) of narrowing of choice led to two features to broaden choice Don’t believe that the Mosaic demo in itself narrows when used for browsing Broadening features More like this link Reading lists
  • 8. The Harry Potter ‘problem’ and scale The Harry Potter ‘problem’: Balderdash! We can control this using Library of Congress subject categories and Dewey Decimal shelfmarks Paul Miller raises questions of scale Dave Pattern has shown success use of use data at a single (small) institution We want to leverage reasonably large scale: 3.5-4M students in HE, over say the last five years
  • 9. User context and attention Has been relatively simple to parameterise an open source search engine with user context Institution, course, progression level, academic year This is only part of the user context, can add Location Attention data, e.g., search history Further social search information
  • 10. Disclaimer The next slide is independent of any decisions on a pure data approach Could be a pure data approach in there Or maybe not
  • 11.
  • 12. Mosiac searchpersonalised/point-of-view search Massively parallel search for blindingly fast response times Data mining for library ‘stewardship’ We have prototypes for the first two, and we’re about to start experimenting with parallel search using Hadoop+Lucene
  • 13. Building institutional contributions Propose union-cat-local: Search in local library Mosaic-like search utilises local loan data if it is available Two ways to encourage library contribution of loan data (thoughts in progress) Narrow: Libraries which contribute loan data to the pool get Mosaic search over the pool Broad: Offer the contextual/PoV search available everywhere; users will agitate if they don’t see local data
  • 14. This is a Just Do It moment A national union catalogue with contextual search and local library interfaces Relatively cheap to do Potentially massive gains for learners, teachers and researchers Portends the development of shared services across the library domain and large cost savings Doesn’t preclude / agnostic on an open data approach Could incorporate a pure data service approach and/or a centralised service