SlideShare a Scribd company logo
Kevin Cohn
Chief Operating Officer
@Atypon
Improving Research
Efficiency
Academic Publishing in Europe, Berlin
30 January 2013
User and Content Fingerprinting
• Provider of Software as a Service content
delivery for publishers
• Literatum platform used to deliver 15M journal
articles and 70,000 eBooks
• 1.5 billion user sessions in 2012
About Atypon
3 Improving Research Efficiency
• Research efficiency can be greatly improved if
publishers tap into their huge volume of data to
better connect users to content.
Thesis
4 Improving Research Efficiency
Users don’t want “advanced search...”
...but they do want relevant results.
This is the APE I’m looking for.
Data can drive this behavior.
• Relevancy is the only order that matters
• > 50% of clicks are to the first result
• > 90% of clicks are on the first page
• Filters/facets aren’t used
Observations
9 Improving Research Efficiency
• Give users what they want: a simple, Google-
like search interface
• But use proprietary data to calculate relevancy
for each individual user
Objectives
10 Improving Research Efficiency
Automatic Topic Modeling
11 Improving Research Efficiency
• Based on a statistical model called latent
Dirichlet allocation (LDA)
• Creates “topics:” collections of words that occur
together with great frequency
Topic #1: {mammal, primate, hominoidea}
Topic #2: {academic, publishing, europe}
Automatic Topic Modeling
12 Improving Research Efficiency
13 Improving Research Efficiency
13 Improving Research Efficiency
Topic #1
Topic #2
16 Improving Research Efficiency
16 Improving Research Efficiency
17 Improving Research Efficiency
17 Improving Research Efficiency
17 Improving Research Efficiency
18 Improving Research Efficiency
• My search for “APE” returns results about this
conference, not primates
• The same is true for recommendations
• Better related articles (topics 1 and 2 are not
related, despite sharing “APE”)
Applications
19 Improving Research Efficiency
• Topics are self-updating = low-cost, low-
maintenance
• Flat (not hierarchical) = avoids troublesome
questions about classification
• Probabilistic (not binary) = better at expressing
relevancy to topics
Not a Taxonomy/Ontology...
20 Improving Research Efficiency
21 Improving Research Efficiency
21 Improving Research Efficiency
• Topics are “collections of words that occur
together with great frequency”
• Knowing that “APE” is an acronym for
“Academic Publishing in Europe”
• Knowing that “CC0” and “CC BY” are Creative
Commons license types
...But Is Helped by Them
22 Improving Research Efficiency
• We didn’t invent ATM (or LDA)
• Our implementation started as a collaboration
with academic researchers...
• ...and will require considerable experimentation
and testing to get right
Worth Mentioning
23 Improving Research Efficiency
• Usage is not personally identifiable
• Usage is not shared with third parties
• Users can opt out of personalization
Privacy
24 Improving Research Efficiency
• ATM uses proprietary data to calculate
relevancy for each individual user
• Gives users what they want: a simple, Google-
like search interface
• Improves research efficiency by freeing up
searching time for reading
Summary
25 Improving Research Efficiency
Thank You
26 Improving Research Efficiency
KCohn@Atypon.com
Kevin Cohn
Chief Operating Officer, Atypon

More Related Content

Similar to Improving Research Efficiency: User and Content Fingerprinting

NISO's IOTA Working Group: Creating an Index for Measuring the Quality of Ope...
NISO's IOTA Working Group: Creating an Index for Measuring the Quality of Ope...NISO's IOTA Working Group: Creating an Index for Measuring the Quality of Ope...
NISO's IOTA Working Group: Creating an Index for Measuring the Quality of Ope...
Rafal Kasprowski
 
Evaluating the Quality of OpenURLs Through Analytics (TLA 2012)
Evaluating the Quality of OpenURLs Through Analytics (TLA 2012)Evaluating the Quality of OpenURLs Through Analytics (TLA 2012)
Evaluating the Quality of OpenURLs Through Analytics (TLA 2012)
Rafal Kasprowski
 
Chandran Honour, Nature.com
Chandran Honour, Nature.comChandran Honour, Nature.com
Chandran Honour, Nature.com
Mashery
 
29 cc 2_b_all_speakers
29 cc 2_b_all_speakers29 cc 2_b_all_speakers
29 cc 2_b_all_speakers
Society for Scholarly Publishing
 
ASA Conference - New roles for the Modern Intermediary
ASA Conference - New roles for the Modern IntermediaryASA Conference - New roles for the Modern Intermediary
ASA Conference - New roles for the Modern Intermediary
Publishing Technology
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for Findability
Findwise
 
Online08 stm market-outlook-vcamlek finalv1 (2)
Online08 stm market-outlook-vcamlek finalv1 (2)Online08 stm market-outlook-vcamlek finalv1 (2)
Online08 stm market-outlook-vcamlek finalv1 (2)
rotciv
 
Value stream mapping for complex processes (innovation, Lean, service design)
Value stream mapping for complex processes (innovation, Lean, service design) Value stream mapping for complex processes (innovation, Lean, service design)
Value stream mapping for complex processes (innovation, Lean, service design)
Teemu Toivonen
 
محاضرة برنامج Nails لتحليل الدراسات السابقة د.شروق المقرن
محاضرة برنامج Nails  لتحليل الدراسات السابقة د.شروق المقرنمحاضرة برنامج Nails  لتحليل الدراسات السابقة د.شروق المقرن
محاضرة برنامج Nails لتحليل الدراسات السابقة د.شروق المقرن
مركز البحوث الأقسام العلمية
 
Understanding the Depth of Google Scholar and its Implication for Webometrics...
Understanding the Depth of Google Scholar and its Implication for Webometrics...Understanding the Depth of Google Scholar and its Implication for Webometrics...
Understanding the Depth of Google Scholar and its Implication for Webometrics...
Idowu Adegbilero-Iwari
 
166 sspcc1 b_newman
166 sspcc1 b_newman166 sspcc1 b_newman
ROI In Corporate Libraries
ROI In Corporate LibrariesROI In Corporate Libraries
ROI In Corporate Libraries
George Scotti
 
Apis and scientific publishing
Apis and scientific publishingApis and scientific publishing
Apis and scientific publishing
Chandran Honour
 
We all do better when we work together: The International EconBiz Partner Net...
We all do better when we work together: The International EconBiz Partner Net...We all do better when we work together: The International EconBiz Partner Net...
We all do better when we work together: The International EconBiz Partner Net...
Tamara Pianos
 
IWMW 2002: open source sofware debate: kelly
IWMW 2002: open source sofware debate: kellyIWMW 2002: open source sofware debate: kelly
IWMW 2002: open source sofware debate: kelly
IWMW
 
Improving Search Strategies of Auditors –A Focus Group on Reflection Interven...
Improving Search Strategies of Auditors –A Focus Group on Reflection Interven...Improving Search Strategies of Auditors –A Focus Group on Reflection Interven...
Improving Search Strategies of Auditors –A Focus Group on Reflection Interven...
Angela Fessl
 
Shared book Academicpub.com Publisher Partnership Deck 2011
Shared book Academicpub.com Publisher Partnership Deck 2011Shared book Academicpub.com Publisher Partnership Deck 2011
Shared book Academicpub.com Publisher Partnership Deck 2011
Michael Cairns
 
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
PyData
 
The current oer search dilemma
The current oer search dilemmaThe current oer search dilemma
The current oer search dilemma
Ishan Abeywardena, Ph.D.
 
Elsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing IndustryElsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing Industry
Antonio Gulli
 

Similar to Improving Research Efficiency: User and Content Fingerprinting (20)

NISO's IOTA Working Group: Creating an Index for Measuring the Quality of Ope...
NISO's IOTA Working Group: Creating an Index for Measuring the Quality of Ope...NISO's IOTA Working Group: Creating an Index for Measuring the Quality of Ope...
NISO's IOTA Working Group: Creating an Index for Measuring the Quality of Ope...
 
Evaluating the Quality of OpenURLs Through Analytics (TLA 2012)
Evaluating the Quality of OpenURLs Through Analytics (TLA 2012)Evaluating the Quality of OpenURLs Through Analytics (TLA 2012)
Evaluating the Quality of OpenURLs Through Analytics (TLA 2012)
 
Chandran Honour, Nature.com
Chandran Honour, Nature.comChandran Honour, Nature.com
Chandran Honour, Nature.com
 
29 cc 2_b_all_speakers
29 cc 2_b_all_speakers29 cc 2_b_all_speakers
29 cc 2_b_all_speakers
 
ASA Conference - New roles for the Modern Intermediary
ASA Conference - New roles for the Modern IntermediaryASA Conference - New roles for the Modern Intermediary
ASA Conference - New roles for the Modern Intermediary
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for Findability
 
Online08 stm market-outlook-vcamlek finalv1 (2)
Online08 stm market-outlook-vcamlek finalv1 (2)Online08 stm market-outlook-vcamlek finalv1 (2)
Online08 stm market-outlook-vcamlek finalv1 (2)
 
Value stream mapping for complex processes (innovation, Lean, service design)
Value stream mapping for complex processes (innovation, Lean, service design) Value stream mapping for complex processes (innovation, Lean, service design)
Value stream mapping for complex processes (innovation, Lean, service design)
 
محاضرة برنامج Nails لتحليل الدراسات السابقة د.شروق المقرن
محاضرة برنامج Nails  لتحليل الدراسات السابقة د.شروق المقرنمحاضرة برنامج Nails  لتحليل الدراسات السابقة د.شروق المقرن
محاضرة برنامج Nails لتحليل الدراسات السابقة د.شروق المقرن
 
Understanding the Depth of Google Scholar and its Implication for Webometrics...
Understanding the Depth of Google Scholar and its Implication for Webometrics...Understanding the Depth of Google Scholar and its Implication for Webometrics...
Understanding the Depth of Google Scholar and its Implication for Webometrics...
 
166 sspcc1 b_newman
166 sspcc1 b_newman166 sspcc1 b_newman
166 sspcc1 b_newman
 
ROI In Corporate Libraries
ROI In Corporate LibrariesROI In Corporate Libraries
ROI In Corporate Libraries
 
Apis and scientific publishing
Apis and scientific publishingApis and scientific publishing
Apis and scientific publishing
 
We all do better when we work together: The International EconBiz Partner Net...
We all do better when we work together: The International EconBiz Partner Net...We all do better when we work together: The International EconBiz Partner Net...
We all do better when we work together: The International EconBiz Partner Net...
 
IWMW 2002: open source sofware debate: kelly
IWMW 2002: open source sofware debate: kellyIWMW 2002: open source sofware debate: kelly
IWMW 2002: open source sofware debate: kelly
 
Improving Search Strategies of Auditors –A Focus Group on Reflection Interven...
Improving Search Strategies of Auditors –A Focus Group on Reflection Interven...Improving Search Strategies of Auditors –A Focus Group on Reflection Interven...
Improving Search Strategies of Auditors –A Focus Group on Reflection Interven...
 
Shared book Academicpub.com Publisher Partnership Deck 2011
Shared book Academicpub.com Publisher Partnership Deck 2011Shared book Academicpub.com Publisher Partnership Deck 2011
Shared book Academicpub.com Publisher Partnership Deck 2011
 
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
 
The current oer search dilemma
The current oer search dilemmaThe current oer search dilemma
The current oer search dilemma
 
Elsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing IndustryElsevier - Smart Data and Algorithms for the Publishing Industry
Elsevier - Smart Data and Algorithms for the Publishing Industry
 

Recently uploaded

Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
Rohit Gautam
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 

Recently uploaded (20)

Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 

Improving Research Efficiency: User and Content Fingerprinting

  • 1. Kevin Cohn Chief Operating Officer @Atypon Improving Research Efficiency Academic Publishing in Europe, Berlin 30 January 2013 User and Content Fingerprinting
  • 2.
  • 3. • Provider of Software as a Service content delivery for publishers • Literatum platform used to deliver 15M journal articles and 70,000 eBooks • 1.5 billion user sessions in 2012 About Atypon 3 Improving Research Efficiency
  • 4. • Research efficiency can be greatly improved if publishers tap into their huge volume of data to better connect users to content. Thesis 4 Improving Research Efficiency
  • 5.
  • 6. Users don’t want “advanced search...”
  • 7.
  • 8. ...but they do want relevant results.
  • 9. This is the APE I’m looking for.
  • 10. Data can drive this behavior.
  • 11. • Relevancy is the only order that matters • > 50% of clicks are to the first result • > 90% of clicks are on the first page • Filters/facets aren’t used Observations 9 Improving Research Efficiency
  • 12. • Give users what they want: a simple, Google- like search interface • But use proprietary data to calculate relevancy for each individual user Objectives 10 Improving Research Efficiency
  • 13. Automatic Topic Modeling 11 Improving Research Efficiency
  • 14. • Based on a statistical model called latent Dirichlet allocation (LDA) • Creates “topics:” collections of words that occur together with great frequency Topic #1: {mammal, primate, hominoidea} Topic #2: {academic, publishing, europe} Automatic Topic Modeling 12 Improving Research Efficiency
  • 15. 13 Improving Research Efficiency
  • 16. 13 Improving Research Efficiency
  • 19. 16 Improving Research Efficiency
  • 20. 16 Improving Research Efficiency
  • 21. 17 Improving Research Efficiency
  • 22. 17 Improving Research Efficiency
  • 23. 17 Improving Research Efficiency
  • 24. 18 Improving Research Efficiency
  • 25. • My search for “APE” returns results about this conference, not primates • The same is true for recommendations • Better related articles (topics 1 and 2 are not related, despite sharing “APE”) Applications 19 Improving Research Efficiency
  • 26. • Topics are self-updating = low-cost, low- maintenance • Flat (not hierarchical) = avoids troublesome questions about classification • Probabilistic (not binary) = better at expressing relevancy to topics Not a Taxonomy/Ontology... 20 Improving Research Efficiency
  • 27. 21 Improving Research Efficiency
  • 28. 21 Improving Research Efficiency
  • 29. • Topics are “collections of words that occur together with great frequency” • Knowing that “APE” is an acronym for “Academic Publishing in Europe” • Knowing that “CC0” and “CC BY” are Creative Commons license types ...But Is Helped by Them 22 Improving Research Efficiency
  • 30. • We didn’t invent ATM (or LDA) • Our implementation started as a collaboration with academic researchers... • ...and will require considerable experimentation and testing to get right Worth Mentioning 23 Improving Research Efficiency
  • 31. • Usage is not personally identifiable • Usage is not shared with third parties • Users can opt out of personalization Privacy 24 Improving Research Efficiency
  • 32. • ATM uses proprietary data to calculate relevancy for each individual user • Gives users what they want: a simple, Google- like search interface • Improves research efficiency by freeing up searching time for reading Summary 25 Improving Research Efficiency
  • 33. Thank You 26 Improving Research Efficiency KCohn@Atypon.com Kevin Cohn Chief Operating Officer, Atypon