SlideShare a Scribd company logo
1 of 39
Download to read offline
KOL Analytics from Biomedical Literature
II-SDV Conference
Nice, France
18 - 19 April 2016
Srinivasan Parthiban
Thava Alagu
New York, USA
โ€ข Working with pharmaceutical Medical
Affairs, Clinical, R&D, and commercial
organizations since 2005
โ€ข Working with more than half of the Top 50
Companies, 16 of the top 25 (17, and 18
contracting now!)
โ€ข The only completely integrated Scientific
Information Solution
Provides timely insights and facilitates strategic decision making
from the vast amount of publicly available scientific information
Medmeme
Meme(noun) - An idea or behavior that spreads in a manner analogous to the biological
transmission of genes.
Bottom Up vs. Top Down
โ€ข As each scientific dissemination is captured it is normalized
and disambiguated prior to being placed into the master data
warehouse
โ€ข Matching, tagging and synonyms are added at this stage
โ€ข Data is mapped to all relevant areas of interest:
โ€ข People
โ€ข Places
โ€ข Institutions & Companies
โ€ข Drugs
โ€ข Keywords: Mechanism of Action, treatment paradigms, etc.
Building the Scientific Data Warehouse
Grants
Over 1,128,000
Data Sources
Patents
Over 800,000
Clinical
Trials
Over 280,000
Publications
Over 8,930,000
Abstracts from
5760 journals
Meetings
Over 11,870,000
Abstracts
Monitoring 14,000+
meetings/year
Treatment
Guidelines
Over 36,480
Rolling 10 years ๏ท Continuously Updated ๏ท Scientifically Credible Sources
Aligned to the Scientific Discovery Process โ€“ from Grants to Guidelines
Impactmeme: The ultimate tool for constantly keeping on top of
who is saying what, where. It captures all available scientific
dissemination regardless of source
Profilememe: Complete, detailed profiles of virtually all significant
publishing and presenting activities for up to 10 years โ€“ at oneโ€™s
fingertips and continuously updated
Insightmeme: A virtual medical librarian on a desktop, allows a user
to search on almost any dimension, the entirety of medical journal
contents and congress outputs for the past 10 years up to the past
month โ€“ all normalize and indexed
Conferencememe: The most comprehensive database of medical
congress output available anywhere available to users everywhere.
See trends in content, as well as where the opinion leaders of interest
are presenting
Medmeme Products
โ€ข An Industry term and acronym: KOL = Key Opinion Leader
โ€ข KOLs are influential doctors, physicians and members of
the medical community whoโ€™s opinions are highly regarded
and who influence other doctorโ€™s and physicians.
โ€ข KOLs advise companies as to where unmet medical needs lie,
choose drug targets, help to define potential product profiles
and shape clinical programs, run clinical trials, and may be
involved in a drugโ€™s regulatory or reimbursement review
process.
โ€ข Peer-to-peer relationships with KOLs are maintained by
Medical Science Liaisons (MSL) from Pharma and healthcare
companies. MSLs are therapeutic specialists (e.g., oncology,
cardiology, neurology)
What is a KOL?
Therapeutic Areas
Geographic
Influence
Does the
physician
have to lead
clinical
research
studies?
Is the
physician an
early adopter
of new drugs?
Education
Level
Level of
Annual
Advising
Services
Funding
Level of
Annual Grant
Funding
Tier 1 Global Yes Yes Medical
Doctor
$25,000 to
$50,000
$100,000 to
$250,000
Tier 2 National (US) Yes Yes Medical
Doctor
$10,000 to
$25,000
Less Than
$100,000
Tier 3 Regional No Yes Medical
doctor
Less Than
$10,000
Less Than
$100,000
Tier 4 Local No Not
necessarily
Medical
doctor
Less Than
$10,000
Less Than
$100,000
Tier 5 Local or
National (non-
USA)
No No PharmD Less Than
$10,000
Less Than
$100,000
Different Levels of KOLs
Average Number of Publications per Year by
Thought Leader Tier
8,2
5,7
4,8
2,9
1,7
0
1
2
3
4
5
6
7
8
9
Tier-1 Tier-2 Tier-3 Tier-4 Tier-5
NumberofPublicationsperYear
Thought Leader Tier
Average Years of Clinical Experience by
Thought Leader Tier
12,9
9
7,4 7,3
5,2
0
2
4
6
8
10
12
14
Tier-1 Tier-2 Tier-3 Tier-4 Tier-5
ClinicalExperienceinYears
Thought Leader Tier
Average Number of Promotional Speeches per
Year by Thought Leader Tier
9,2
6
3,6
3,9
2,2
0
1
2
3
4
5
6
7
8
9
10
Tier-1 Tier-2 Tier-3 Tier-4 Tier-5
Speeches
Thought Leader Tier
KOL Profiling
1,85
2,32
7,17
6,79
6,69
20,65
7,38
5,52
2,17
0 5 10 15 20 25
Delivering a Promotional Speech
Delivering a Scientific Speech
Leading an Advisory Panel (Chair)
Moderating an Advisory Panel
Participating in an Advisory Panel
Authoring a manuscript
Authoring an Abstract
Thought Leader Training (General)
Compilance Training
Hours
Average Amount of Hours Spent per
Thought Leader Activity
Growth in PubMed
Three Challenges 1. Synonymy - A single individual may publish under
multiple namesโ€”this includes a) orthographic and spelling
variants, b) spelling errors, c) name changes over time as may
occur with marriage, religious conversion or gender re-
assignment, and d) the use of pen names.
2. Homonymy - Many different individuals have the same name
โ€“ in fact, common names may comprise several thousand
individuals.
3. The necessary metadata are often incomplete or lacking
entirely โ€“ for example, some publishers and bibliographic
databases did not record authorsโ€™ first names, their
geographical locations, or identifying information such as their
degrees or their positions.
Source: https://www.nlm.nih.gov/bsd/authors1.html
โ€ฆmistaken identity has resulted in the wrong
person being invited to work on a project [โ€ฆ] or
to undertake the peer review of an article
Type I error
False Positive: Identify different author
instances as same single author entity. Results
in bigger clusters than what it should be.
Type II error
False Negative: Not able to identify different
author instances of same author. Results in
too many small clusters.
What Can Go Wrong?
Percentage of author names in Medline that includes
full first name instead of an initial
0,0
10,0
20,0
30,0
40,0
50,0
60,0
70,0
80,0
90,0
1995 2000 2005 2010 2015
percentage(%)
Year
72,0
74,0
76,0
78,0
80,0
82,0
84,0
86,0
2000 2002 2004 2006 2008 2010 2012
percentage(%) Year
โ€ข Full names work much better than initials
โ€ข Only 5% of the author names on your institutionโ€™s articles are people in your
instance of Profiles. The rest are former faculty or external collaborators that you
have never heard about.
Can never be
100% accurate
85% is
considered
quite good
Further manual
disambiguation
is optional
Close enough
Who is John Smith and what is he talking about ?
Retrieve all clusters with the same author name
What Do You Want to Know?
Who is this John Smith, the author of Article X?
Retrieve other PubMed ids of the same cluster
Give me top 10 KOLs in the field of Cancer!
DISA Platform retrieves top 10 Unique-Author-IDs.
Each UAID is associated with one cluster (of articles) and
associated Identity information. (Affiliations and E-mails).
DISA uses the keywords associated with articles to pre-index
the authors with associated keywords.
What Do You Want to Know?
โ€ข High Precision and Recall is the goal.
โ€ข Precision
โ€ข Accuracy Ratio โ€“ Be correct in grouping.
โ€ข precision = #of correctly clustered pairs / #of
clustered pairs
โ€ข Stricter the condition, higher the precision
โ€ข Recall
โ€ข Efficiency Ratio - Do not miss the matches.
โ€ข recall = #of correctly clustered pairs / #of
true positive pairs
โ€ข More liberal condition, higher the recall
Disambiguation Goal
โ€ข Total Manual
Disambiguation is infeasible
โ€ข Automation is great, but
canโ€™t be 100%
โ€ข Manual process is hard,
uncertain, subjective
โ€ข Manual after Automation is
Pragmatic
Manual Vs Automated Disambiguation
โ€ข Group all publications into author clusters
โ€ข Match person to clusters
Clustering Methods
Clustering based on similarity probability model
Available factors :
โ€ข Co-authors
โ€ข Affiliation
โ€ข Journal
โ€ข Mesh Terms
โ€ข Publication Date
Automation Approach
โ€ข Self learning system possible โ€“ Learns from Gold Set
โ€ข Creating proper training set is the biggest challenge
โ€ข Manual creation of proper training set is costly
โ€ข Higher the complexity, vulnerable to bugs
โ€ข Main goal is to find relative importance of
the criteria
โ€ข Co-author Vs Affiliation Vs MeshTerms Vs Journal etc.
Machine Learning
โ€ข Extensive affiliation disambiguation is more
challenging
โ€ข Affiliation normalization helps in author
disambiguation
โ€ข Involves recognizing countries, cities and address
normalization into canonical form.
โ€ข Fuzzy matching possible after normalization โ€“ for
smaller buckets only.
Affiliation Disambiguation
โ€ข Remember โ€“ It is costly operation !
โ€ข Scalability Hazard !
โ€ข Algorithms:
โ€ข Monger-Elkan, Jaro-Winkler, Levenstein
based on edit distance.
โ€ข Jaccard, TF-IDF based on token based
multi-sets. (Order of words are not important)
โ€ข Some hybrid techniques are also common.
.
Fuzzy Matching
Article-1 Authors : X, Y
Article-2 Authors : X, Z (1 and 2 seems disconnected)
Article-3 Authors : X, Y, Z (Likely that X is same author
for all 3 articles)
Note: Clustering algorithm recognizes and handles this appropriately.
Transitivity Fixing
Introducing DISA
โ€ข DISA stands for Disambiguation Automated Platform.
โ€ข DISA provides powerful core kernel software system
backed by the author database.
โ€ข DISA enables applications to be developed on this
platform to explore the KOLs based on Pubmed and
Conferences information.
ETL - Extract, Transform and Load
Pubmed Data
Explode To Author Instances
Unique Authors
Rule Based Unification Engine
Author Instances
DISA API Layer For Application Access.
Conference Data
DISA Application
DISA Platform Architecture
DISA Technology Stack
โ€ข Disambiguation restricted to same
last name authors.
โ€ข This โ€œBlockingโ€ mechanism prevents
combinatorial explosion.
โ€ข Still poses problems for common
names
โ€ข Fuzzy algorithms are very expensive
on large buckets/blocks.
Scalability Issues
โ€ข Relatively less researched so far.
โ€ข Need faster updates for delta addition.
โ€ข Reconstruct clusters of given name spaces.
โ€ข Use incremental clustering
โ€ข Embedded database to store and retrieve the
disambiguated author data.
Incremental Disambiguation
โ€ข We need both higher precision and
recall.
โ€ข But precision is more important.
โ€ข Precision errors are more permanent
and harder to fix.
โ€ข Recall misses may be fixed in future or
by manual disambiguation.
Being Conservative : Precision Vs Recall
Can not Fix Impossible Situations
Not possible to identify these without authorโ€™s voluntary disclosures.
ORCID
Voluntary Creation of Unique ID and linking
How to Fix it Going Forward ?
501 7th Avenue, Suite 508
New York, NY, 10018 (USA)
Tel.: 212-725-5992
Fax: 212-725-5993
www.medmeme.com
Thank You

More Related Content

What's hot

effective data sharing for a learning healthcare system
effective data sharing for a learning healthcare systemeffective data sharing for a learning healthcare system
effective data sharing for a learning healthcare systemPaul Houston
ย 
Introduction to FundRef Webinar
Introduction to FundRef WebinarIntroduction to FundRef Webinar
Introduction to FundRef WebinarCrossref
ย 
Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Ben Gardner
ย 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
ย 
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...Kerstin Forsberg
ย 
Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...Valery Tkachenko
ย 
Embase advanced-training-slidespdf (2)
Embase advanced-training-slidespdf (2)Embase advanced-training-slidespdf (2)
Embase advanced-training-slidespdf (2)rosie.dunne
ย 
Introduction to CrossRef for Affiliates
Introduction to CrossRef for AffiliatesIntroduction to CrossRef for Affiliates
Introduction to CrossRef for AffiliatesCrossref
ย 
FAIR Data Knowledge Graphsโ€“from Theory to Practice
FAIR Data Knowledge Graphsโ€“from Theory to PracticeFAIR Data Knowledge Graphsโ€“from Theory to Practice
FAIR Data Knowledge Graphsโ€“from Theory to PracticeTom Plasterer
ย 
Pushing back, standards and standard organizations in a Semantic Web enabled ...
Pushing back, standards and standard organizations in a Semantic Web enabled ...Pushing back, standards and standard organizations in a Semantic Web enabled ...
Pushing back, standards and standard organizations in a Semantic Web enabled ...Kerstin Forsberg
ย 
DataFAIRy bioassays pilot -- lessons learned and future outlook
DataFAIRy bioassays pilot -- lessons learned and future outlookDataFAIRy bioassays pilot -- lessons learned and future outlook
DataFAIRy bioassays pilot -- lessons learned and future outlookIsabella Feierberg
ย 
Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1BigData_Europe
ย 
Biomedical Search
Biomedical SearchBiomedical Search
Biomedical SearchSarvnaz Karimi
ย 
Wimmics seminar--drug interaction knowledge base, micropublication, open anno...
Wimmics seminar--drug interaction knowledge base, micropublication, open anno...Wimmics seminar--drug interaction knowledge base, micropublication, open anno...
Wimmics seminar--drug interaction knowledge base, micropublication, open anno...jodischneider
ย 

What's hot (20)

Register "New Directions in Cataloging and Metadata Creation"
Register "New Directions in Cataloging and Metadata Creation"Register "New Directions in Cataloging and Metadata Creation"
Register "New Directions in Cataloging and Metadata Creation"
ย 
effective data sharing for a learning healthcare system
effective data sharing for a learning healthcare systemeffective data sharing for a learning healthcare system
effective data sharing for a learning healthcare system
ย 
Introduction to FundRef Webinar
Introduction to FundRef WebinarIntroduction to FundRef Webinar
Introduction to FundRef Webinar
ย 
Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)Stratergies for the intergration of information (IPI_ConfEX)
Stratergies for the intergration of information (IPI_ConfEX)
ย 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
ย 
Canadian health census to lod
Canadian health census to lodCanadian health census to lod
Canadian health census to lod
ย 
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspnRSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
RSC ChemSpider Science Commons Symposium Pacific Northwest #scspn
ย 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
ย 
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...A Justification-based Semantic Framework for Representing, Evaluating and Uti...
A Justification-based Semantic Framework for Representing, Evaluating and Uti...
ย 
Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...
ย 
Embase advanced-training-slidespdf (2)
Embase advanced-training-slidespdf (2)Embase advanced-training-slidespdf (2)
Embase advanced-training-slidespdf (2)
ย 
Introduction to CrossRef for Affiliates
Introduction to CrossRef for AffiliatesIntroduction to CrossRef for Affiliates
Introduction to CrossRef for Affiliates
ย 
FAIR Data Knowledge Graphsโ€“from Theory to Practice
FAIR Data Knowledge Graphsโ€“from Theory to PracticeFAIR Data Knowledge Graphsโ€“from Theory to Practice
FAIR Data Knowledge Graphsโ€“from Theory to Practice
ย 
Pushing back, standards and standard organizations in a Semantic Web enabled ...
Pushing back, standards and standard organizations in a Semantic Web enabled ...Pushing back, standards and standard organizations in a Semantic Web enabled ...
Pushing back, standards and standard organizations in a Semantic Web enabled ...
ย 
Funk and Beck "Driving Use: Identifiers and Enhanced Metadata"
Funk and Beck "Driving Use: Identifiers and Enhanced Metadata"Funk and Beck "Driving Use: Identifiers and Enhanced Metadata"
Funk and Beck "Driving Use: Identifiers and Enhanced Metadata"
ย 
DataFAIRy bioassays pilot -- lessons learned and future outlook
DataFAIRy bioassays pilot -- lessons learned and future outlookDataFAIRy bioassays pilot -- lessons learned and future outlook
DataFAIRy bioassays pilot -- lessons learned and future outlook
ย 
Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1Open PHACTS for BDE SC1.1
Open PHACTS for BDE SC1.1
ย 
ChemSpider โ€“ A Community Platform for Chemistry and Resources Supporting the ...
ChemSpider โ€“ A Community Platform for Chemistry and Resources Supporting the ...ChemSpider โ€“ A Community Platform for Chemistry and Resources Supporting the ...
ChemSpider โ€“ A Community Platform for Chemistry and Resources Supporting the ...
ย 
Biomedical Search
Biomedical SearchBiomedical Search
Biomedical Search
ย 
Wimmics seminar--drug interaction knowledge base, micropublication, open anno...
Wimmics seminar--drug interaction knowledge base, micropublication, open anno...Wimmics seminar--drug interaction knowledge base, micropublication, open anno...
Wimmics seminar--drug interaction knowledge base, micropublication, open anno...
ย 

Viewers also liked

PatSeer Introduction
PatSeer IntroductionPatSeer Introduction
PatSeer IntroductionGridlogics
ย 
II-SDV 2016 IRIX Software Engineering
II-SDV 2016 IRIX Software EngineeringII-SDV 2016 IRIX Software Engineering
II-SDV 2016 IRIX Software EngineeringDr. Haxel Consult
ย 
II-SDV Arne Krรผger - Elastic Search & Patent Information @ mtc
II-SDV Arne Krรผger - Elastic Search & Patent Information @ mtcII-SDV Arne Krรผger - Elastic Search & Patent Information @ mtc
II-SDV Arne Krรผger - Elastic Search & Patent Information @ mtcDr. Haxel Consult
ย 
II-SDV 2016 Nils Newman - Sentiment Analysis: What your Choice of Words Says ...
II-SDV 2016 Nils Newman - Sentiment Analysis: What your Choice of Words Says ...II-SDV 2016 Nils Newman - Sentiment Analysis: What your Choice of Words Says ...
II-SDV 2016 Nils Newman - Sentiment Analysis: What your Choice of Words Says ...Dr. Haxel Consult
ย 
II-SDV 2016 Expert System
II-SDV 2016 Expert SystemII-SDV 2016 Expert System
II-SDV 2016 Expert SystemDr. Haxel Consult
ย 
II-SDV 2016 Simon Fitall -
II-SDV 2016 Simon Fitall - II-SDV 2016 Simon Fitall -
II-SDV 2016 Simon Fitall - Dr. Haxel Consult
ย 
II-SDV 2016 - QWAM Content Intelligence
II-SDV 2016 - QWAM Content IntelligenceII-SDV 2016 - QWAM Content Intelligence
II-SDV 2016 - QWAM Content IntelligenceDr. Haxel Consult
ย 
II-SDV Andrew Hinton - Text mining - as normal as data mining?
II-SDV Andrew Hinton - Text mining - as normal as data mining?II-SDV Andrew Hinton - Text mining - as normal as data mining?
II-SDV Andrew Hinton - Text mining - as normal as data mining?Dr. Haxel Consult
ย 
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...Dr. Haxel Consult
ย 
II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...
II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...
II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...Dr. Haxel Consult
ย 
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond SearchII-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond SearchDr. Haxel Consult
ย 
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla AirII-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla AirDr. Haxel Consult
ย 
Monitoring and Analysis of Web Information for Various Business Contexts : Co...
Monitoring and Analysis of Web Information for Various Business Contexts : Co...Monitoring and Analysis of Web Information for Various Business Contexts : Co...
Monitoring and Analysis of Web Information for Various Business Contexts : Co...Dr. Haxel Consult
ย 
Biomedical Annotation - Kevin Livingston
Biomedical Annotation - Kevin LivingstonBiomedical Annotation - Kevin Livingston
Biomedical Annotation - Kevin LivingstonDLFCLIR
ย 
Table mining and data curation from biomedical literature
Table mining and data curation from biomedical literatureTable mining and data curation from biomedical literature
Table mining and data curation from biomedical literatureNikola Milosevic
ย 
II-SDV 2016 Bob Stembridge We have all the Time in the World; a Review of ho...
II-SDV 2016 Bob Stembridge  We have all the Time in the World; a Review of ho...II-SDV 2016 Bob Stembridge  We have all the Time in the World; a Review of ho...
II-SDV 2016 Bob Stembridge We have all the Time in the World; a Review of ho...Dr. Haxel Consult
ย 
II-SDV 2016 Raphael Ilmer, Quentin Ladetto - Optimization of Patent Landscape...
II-SDV 2016 Raphael Ilmer, Quentin Ladetto - Optimization of Patent Landscape...II-SDV 2016 Raphael Ilmer, Quentin Ladetto - Optimization of Patent Landscape...
II-SDV 2016 Raphael Ilmer, Quentin Ladetto - Optimization of Patent Landscape...Dr. Haxel Consult
ย 
II-SDV 2017 in Nice - The International Information Conference on Search, Dat...
II-SDV 2017 in Nice - The International Information Conference on Search, Dat...II-SDV 2017 in Nice - The International Information Conference on Search, Dat...
II-SDV 2017 in Nice - The International Information Conference on Search, Dat...Dr. Haxel Consult
ย 

Viewers also liked (19)

PatSeer Introduction
PatSeer IntroductionPatSeer Introduction
PatSeer Introduction
ย 
II-SDV 2016 IRIX Software Engineering
II-SDV 2016 IRIX Software EngineeringII-SDV 2016 IRIX Software Engineering
II-SDV 2016 IRIX Software Engineering
ย 
II-SDV Arne Krรผger - Elastic Search & Patent Information @ mtc
II-SDV Arne Krรผger - Elastic Search & Patent Information @ mtcII-SDV Arne Krรผger - Elastic Search & Patent Information @ mtc
II-SDV Arne Krรผger - Elastic Search & Patent Information @ mtc
ย 
II-SDV 2016 Nils Newman - Sentiment Analysis: What your Choice of Words Says ...
II-SDV 2016 Nils Newman - Sentiment Analysis: What your Choice of Words Says ...II-SDV 2016 Nils Newman - Sentiment Analysis: What your Choice of Words Says ...
II-SDV 2016 Nils Newman - Sentiment Analysis: What your Choice of Words Says ...
ย 
II-SDV 2016 Expert System
II-SDV 2016 Expert SystemII-SDV 2016 Expert System
II-SDV 2016 Expert System
ย 
II-SDV 2016 Simon Fitall -
II-SDV 2016 Simon Fitall - II-SDV 2016 Simon Fitall -
II-SDV 2016 Simon Fitall -
ย 
II-SDV 2016 - QWAM Content Intelligence
II-SDV 2016 - QWAM Content IntelligenceII-SDV 2016 - QWAM Content Intelligence
II-SDV 2016 - QWAM Content Intelligence
ย 
II-SDV Andrew Hinton - Text mining - as normal as data mining?
II-SDV Andrew Hinton - Text mining - as normal as data mining?II-SDV Andrew Hinton - Text mining - as normal as data mining?
II-SDV Andrew Hinton - Text mining - as normal as data mining?
ย 
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
II-SDV 2016 Irene Kitsara - Patent Landscape Reports and Other WIPO Activitie...
ย 
II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...
II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...
II-SDV 2016 Michael Iarrobino - Improving Text Mining Results with Access to ...
ย 
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond SearchII-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
II-SDV 2016 Manish Sinka - Taking Patent Research platforms beyond Search
ย 
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla AirII-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
II-SDV 2016 Patrick Beaucamp - Data Science with R and Vanilla Air
ย 
Monitoring and Analysis of Web Information for Various Business Contexts : Co...
Monitoring and Analysis of Web Information for Various Business Contexts : Co...Monitoring and Analysis of Web Information for Various Business Contexts : Co...
Monitoring and Analysis of Web Information for Various Business Contexts : Co...
ย 
Biomedical Annotation - Kevin Livingston
Biomedical Annotation - Kevin LivingstonBiomedical Annotation - Kevin Livingston
Biomedical Annotation - Kevin Livingston
ย 
Table mining and data curation from biomedical literature
Table mining and data curation from biomedical literatureTable mining and data curation from biomedical literature
Table mining and data curation from biomedical literature
ย 
II-SDV 2016 Bob Stembridge We have all the Time in the World; a Review of ho...
II-SDV 2016 Bob Stembridge  We have all the Time in the World; a Review of ho...II-SDV 2016 Bob Stembridge  We have all the Time in the World; a Review of ho...
II-SDV 2016 Bob Stembridge We have all the Time in the World; a Review of ho...
ย 
II-SDV 2016 Raphael Ilmer, Quentin Ladetto - Optimization of Patent Landscape...
II-SDV 2016 Raphael Ilmer, Quentin Ladetto - Optimization of Patent Landscape...II-SDV 2016 Raphael Ilmer, Quentin Ladetto - Optimization of Patent Landscape...
II-SDV 2016 Raphael Ilmer, Quentin Ladetto - Optimization of Patent Landscape...
ย 
II-SDV 2016 BizInt
II-SDV 2016 BizIntII-SDV 2016 BizInt
II-SDV 2016 BizInt
ย 
II-SDV 2017 in Nice - The International Information Conference on Search, Dat...
II-SDV 2017 in Nice - The International Information Conference on Search, Dat...II-SDV 2017 in Nice - The International Information Conference on Search, Dat...
II-SDV 2017 in Nice - The International Information Conference on Search, Dat...
ย 

Similar to II-SDV 2016 Srinivasan Parthiban - KOL Analytics from Biomedical Literature

Clinical Epidemiology - Systematic PubMed Searching Workshop
Clinical Epidemiology - Systematic PubMed Searching WorkshopClinical Epidemiology - Systematic PubMed Searching Workshop
Clinical Epidemiology - Systematic PubMed Searching WorkshopRobin Featherstone
ย 
How Semantic Technology Helps Researchers
How Semantic Technology Helps ResearchersHow Semantic Technology Helps Researchers
How Semantic Technology Helps ResearchersDarrell W. Gunter
ย 
AAP/PSP Semantic Publishing Workshop
AAP/PSP Semantic Publishing  WorkshopAAP/PSP Semantic Publishing  Workshop
AAP/PSP Semantic Publishing WorkshopDarrell W. Gunter
ย 
Program of Academic Excellence
Program of Academic ExcellenceProgram of Academic Excellence
Program of Academic ExcellenceDarrell W. Gunter
ย 
Publishing Connect NUI Galway - 31st Jan 2017
Publishing Connect NUI Galway - 31st Jan 2017Publishing Connect NUI Galway - 31st Jan 2017
Publishing Connect NUI Galway - 31st Jan 2017Michaela Kurschildgen
ย 
Week 6PrintAcademic IntegrityCreating change is one of the k.docx
Week 6PrintAcademic IntegrityCreating change is one of the k.docxWeek 6PrintAcademic IntegrityCreating change is one of the k.docx
Week 6PrintAcademic IntegrityCreating change is one of the k.docxlillie234567
ย 
How to Conduct a Systematic Search
How to Conduct a Systematic SearchHow to Conduct a Systematic Search
How to Conduct a Systematic SearchRobin Featherstone
ย 
Final Project Instructions for ANT 2401 Anthropology of Sust
Final Project  Instructions for ANT 2401  Anthropology of SustFinal Project  Instructions for ANT 2401  Anthropology of Sust
Final Project Instructions for ANT 2401 Anthropology of SustChereCheek752
ย 
Searching the medical literature aug 2010
Searching the medical literature aug 2010Searching the medical literature aug 2010
Searching the medical literature aug 2010Robin Featherstone
ย 
Ontologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowOntologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowBarry Smith
ย 
21 minutes agoTami Frazierย RE Discussion - Week 3COLLAPSE.docx
21 minutes agoTami Frazierย RE Discussion - Week 3COLLAPSE.docx21 minutes agoTami Frazierย RE Discussion - Week 3COLLAPSE.docx
21 minutes agoTami Frazierย RE Discussion - Week 3COLLAPSE.docxvickeryr87
ย 
Qualitative analysis boot camp final presentation slides
Qualitative analysis boot camp final presentation slidesQualitative analysis boot camp final presentation slides
Qualitative analysis boot camp final presentation slidesAlexandra Howson MA, PhD, CHCP
ย 
Qualitative analysis boot camp final presentation slides
Qualitative analysis boot camp final presentation slidesQualitative analysis boot camp final presentation slides
Qualitative analysis boot camp final presentation slidesAlexandra Howson MA, PhD, CHCP
ย 
Crime Scene Investigations Workgroup Chair Major Susan .docx
Crime Scene Investigations Workgroup  Chair Major Susan .docxCrime Scene Investigations Workgroup  Chair Major Susan .docx
Crime Scene Investigations Workgroup Chair Major Susan .docxvanesaburnand
ย 
ENGL147N-60265 Modules Week 4 Discussion Source Evaluation!.docx
ENGL147N-60265 Modules Week 4 Discussion Source Evaluation!.docxENGL147N-60265 Modules Week 4 Discussion Source Evaluation!.docx
ENGL147N-60265 Modules Week 4 Discussion Source Evaluation!.docxgidmanmary
ย 
How to do a Literature search for your research and scientific publication
How to do a Literature search for your research and scientific publication How to do a Literature search for your research and scientific publication
How to do a Literature search for your research and scientific publication BhaskarBorgohain4
ย 

Similar to II-SDV 2016 Srinivasan Parthiban - KOL Analytics from Biomedical Literature (20)

Clinical Epidemiology - Systematic PubMed Searching Workshop
Clinical Epidemiology - Systematic PubMed Searching WorkshopClinical Epidemiology - Systematic PubMed Searching Workshop
Clinical Epidemiology - Systematic PubMed Searching Workshop
ย 
How Semantic Technology Helps Researchers
How Semantic Technology Helps ResearchersHow Semantic Technology Helps Researchers
How Semantic Technology Helps Researchers
ย 
AAP/PSP Semantic Publishing Workshop
AAP/PSP Semantic Publishing  WorkshopAAP/PSP Semantic Publishing  Workshop
AAP/PSP Semantic Publishing Workshop
ย 
Program of Academic Excellence
Program of Academic ExcellenceProgram of Academic Excellence
Program of Academic Excellence
ย 
Publishing Connect NUI Galway - 31st Jan 2017
Publishing Connect NUI Galway - 31st Jan 2017Publishing Connect NUI Galway - 31st Jan 2017
Publishing Connect NUI Galway - 31st Jan 2017
ย 
Week 6PrintAcademic IntegrityCreating change is one of the k.docx
Week 6PrintAcademic IntegrityCreating change is one of the k.docxWeek 6PrintAcademic IntegrityCreating change is one of the k.docx
Week 6PrintAcademic IntegrityCreating change is one of the k.docx
ย 
How to Conduct a Systematic Search
How to Conduct a Systematic SearchHow to Conduct a Systematic Search
How to Conduct a Systematic Search
ย 
Cesse July 22 2009
Cesse   July 22 2009Cesse   July 22 2009
Cesse July 22 2009
ย 
Final Project Instructions for ANT 2401 Anthropology of Sust
Final Project  Instructions for ANT 2401  Anthropology of SustFinal Project  Instructions for ANT 2401  Anthropology of Sust
Final Project Instructions for ANT 2401 Anthropology of Sust
ย 
Exercise Science
Exercise ScienceExercise Science
Exercise Science
ย 
Searching the medical literature aug 2010
Searching the medical literature aug 2010Searching the medical literature aug 2010
Searching the medical literature aug 2010
ย 
NURS201 - May 2013
NURS201 - May 2013 NURS201 - May 2013
NURS201 - May 2013
ย 
Tutoriel ssmt
Tutoriel ssmtTutoriel ssmt
Tutoriel ssmt
ย 
Ontologies: What Librarians Need to Know
Ontologies: What Librarians Need to KnowOntologies: What Librarians Need to Know
Ontologies: What Librarians Need to Know
ย 
21 minutes agoTami Frazierย RE Discussion - Week 3COLLAPSE.docx
21 minutes agoTami Frazierย RE Discussion - Week 3COLLAPSE.docx21 minutes agoTami Frazierย RE Discussion - Week 3COLLAPSE.docx
21 minutes agoTami Frazierย RE Discussion - Week 3COLLAPSE.docx
ย 
Qualitative analysis boot camp final presentation slides
Qualitative analysis boot camp final presentation slidesQualitative analysis boot camp final presentation slides
Qualitative analysis boot camp final presentation slides
ย 
Qualitative analysis boot camp final presentation slides
Qualitative analysis boot camp final presentation slidesQualitative analysis boot camp final presentation slides
Qualitative analysis boot camp final presentation slides
ย 
Crime Scene Investigations Workgroup Chair Major Susan .docx
Crime Scene Investigations Workgroup  Chair Major Susan .docxCrime Scene Investigations Workgroup  Chair Major Susan .docx
Crime Scene Investigations Workgroup Chair Major Susan .docx
ย 
ENGL147N-60265 Modules Week 4 Discussion Source Evaluation!.docx
ENGL147N-60265 Modules Week 4 Discussion Source Evaluation!.docxENGL147N-60265 Modules Week 4 Discussion Source Evaluation!.docx
ENGL147N-60265 Modules Week 4 Discussion Source Evaluation!.docx
ย 
How to do a Literature search for your research and scientific publication
How to do a Literature search for your research and scientific publication How to do a Literature search for your research and scientific publication
How to do a Literature search for your research and scientific publication
ย 

More from Dr. Haxel Consult

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementDr. Haxel Consult
ย 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...Dr. Haxel Consult
ย 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...Dr. Haxel Consult
ย 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...Dr. Haxel Consult
ย 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...Dr. Haxel Consult
ย 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...Dr. Haxel Consult
ย 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...Dr. Haxel Consult
ย 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
ย 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...Dr. Haxel Consult
ย 
AI-SDV 2022: Finding the WHAT โ€“ Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT โ€“ Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT โ€“ Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT โ€“ Will AI help? Nils Newman (Search Technology,...Dr. Haxel Consult
ย 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...Dr. Haxel Consult
ย 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...Dr. Haxel Consult
ย 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...Dr. Haxel Consult
ย 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...Dr. Haxel Consult
ย 
AI-SDV 2022: Whereโ€™s the one aboutโ€ฆ? Looney Tunesยฎ Revisited Jay Ven Eman (CE...
AI-SDV 2022: Whereโ€™s the one aboutโ€ฆ? Looney Tunesยฎ Revisited Jay Ven Eman (CE...AI-SDV 2022: Whereโ€™s the one aboutโ€ฆ? Looney Tunesยฎ Revisited Jay Ven Eman (CE...
AI-SDV 2022: Whereโ€™s the one aboutโ€ฆ? Looney Tunesยฎ Revisited Jay Ven Eman (CE...Dr. Haxel Consult
ย 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterDr. Haxel Consult
ย 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPDr. Haxel Consult
ย 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCDr. Haxel Consult
ย 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...Dr. Haxel Consult
ย 
AI-SDV 2022: Big data analytics platform at Bayer โ€“ Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer โ€“ Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer โ€“ Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer โ€“ Turning bits into insight...Dr. Haxel Consult
ย 

More from Dr. Haxel Consult (20)

AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering ManagementAI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
AI-SDV 2022: Henry Chang Patent Intelligence and Engineering Management
ย 
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
AI-SDV 2022: Creation and updating of large Knowledge Graphs through NLP Anal...
ย 
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
AI-SDV 2022: The race to net zero: Tracking the green industrial revolution t...
ย 
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
AI-SDV 2022: Accommodating the Deep Learning Revolution by a Development Proc...
ย 
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
AI-SDV 2022: Domain Knowledge makes Artificial Intelligence Smart Linda Ander...
ย 
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
AI-SDV 2022: Embedding-based Search Vs. Relevancy Search: comparing the new w...
ย 
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
AI-SDV 2022: Rolling out web crawling at Boehringer Ingelheim - 10 years of e...
ย 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
ย 
AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...AI-SDV 2022: Machine learning based patent categorization: A success story in...
AI-SDV 2022: Machine learning based patent categorization: A success story in...
ย 
AI-SDV 2022: Finding the WHAT โ€“ Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT โ€“ Will AI help? Nils Newman (Search Technology,...AI-SDV 2022: Finding the WHAT โ€“ Will AI help? Nils Newman (Search Technology,...
AI-SDV 2022: Finding the WHAT โ€“ Will AI help? Nils Newman (Search Technology,...
ย 
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
AI-SDV 2022: New Insights from Trademarks with Natural Language Processing Al...
ย 
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
AI-SDV 2022: Extracting information from tables in documents Holger Keibel (K...
ย 
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
AI-SDV 2022: Scientific publishing in the age of data mining and artificial i...
ย 
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
AI-SDV 2022: AI developments and usability Linus Wretblad (IPscreener / Uppdr...
ย 
AI-SDV 2022: Whereโ€™s the one aboutโ€ฆ? Looney Tunesยฎ Revisited Jay Ven Eman (CE...
AI-SDV 2022: Whereโ€™s the one aboutโ€ฆ? Looney Tunesยฎ Revisited Jay Ven Eman (CE...AI-SDV 2022: Whereโ€™s the one aboutโ€ฆ? Looney Tunesยฎ Revisited Jay Ven Eman (CE...
AI-SDV 2022: Whereโ€™s the one aboutโ€ฆ? Looney Tunesยฎ Revisited Jay Ven Eman (CE...
ย 
AI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance CenterAI-SDV 2022: Copyright Clearance Center
AI-SDV 2022: Copyright Clearance Center
ย 
AI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IPAI-SDV 2022: Lighthouse IP
AI-SDV 2022: Lighthouse IP
ย 
AI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOCAI-SDV 2022: New Product Introductions: CENTREDOC
AI-SDV 2022: New Product Introductions: CENTREDOC
ย 
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
AI-SDV 2022: Possibilities and limitations of AI-boosted multi-categorization...
ย 
AI-SDV 2022: Big data analytics platform at Bayer โ€“ Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer โ€“ Turning bits into insight...AI-SDV 2022: Big data analytics platform at Bayer โ€“ Turning bits into insight...
AI-SDV 2022: Big data analytics platform at Bayer โ€“ Turning bits into insight...
ย 

Recently uploaded

Pirangut | Call Girls Pune Phone No 8005736733 Elite Escort Service Available...
Pirangut | Call Girls Pune Phone No 8005736733 Elite Escort Service Available...Pirangut | Call Girls Pune Phone No 8005736733 Elite Escort Service Available...
Pirangut | Call Girls Pune Phone No 8005736733 Elite Escort Service Available...SUHANI PANDEY
ย 
WhatsApp ๐Ÿ“ž 8448380779 โœ…Call Girls In Mamura Sector 66 ( Noida)
WhatsApp ๐Ÿ“ž 8448380779 โœ…Call Girls In Mamura Sector 66 ( Noida)WhatsApp ๐Ÿ“ž 8448380779 โœ…Call Girls In Mamura Sector 66 ( Noida)
WhatsApp ๐Ÿ“ž 8448380779 โœ…Call Girls In Mamura Sector 66 ( Noida)Delhi Call girls
ย 
Microsoft Azure Arc Customer Deck Microsoft
Microsoft Azure Arc Customer Deck MicrosoftMicrosoft Azure Arc Customer Deck Microsoft
Microsoft Azure Arc Customer Deck MicrosoftAanSulistiyo
ย 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdfMatthew Sinclair
ย 
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...tanu pandey
ย 
โžฅ๐Ÿ” 7737669865 ๐Ÿ”โ–ป mehsana Call-girls in Women Seeking Men ๐Ÿ”mehsana๐Ÿ” Escorts...
โžฅ๐Ÿ” 7737669865 ๐Ÿ”โ–ป mehsana Call-girls in Women Seeking Men  ๐Ÿ”mehsana๐Ÿ”   Escorts...โžฅ๐Ÿ” 7737669865 ๐Ÿ”โ–ป mehsana Call-girls in Women Seeking Men  ๐Ÿ”mehsana๐Ÿ”   Escorts...
โžฅ๐Ÿ” 7737669865 ๐Ÿ”โ–ป mehsana Call-girls in Women Seeking Men ๐Ÿ”mehsana๐Ÿ” Escorts...nirzagarg
ย 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC
ย 
Call Girls in Prashant Vihar, Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Prashant Vihar, Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort ServiceCall Girls in Prashant Vihar, Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Prashant Vihar, Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service9953056974 Low Rate Call Girls In Saket, Delhi NCR
ย 
( Pune ) VIP Pimpri Chinchwad Call Girls ๐ŸŽ—๏ธ 9352988975 Sizzling | Escorts | G...
( Pune ) VIP Pimpri Chinchwad Call Girls ๐ŸŽ—๏ธ 9352988975 Sizzling | Escorts | G...( Pune ) VIP Pimpri Chinchwad Call Girls ๐ŸŽ—๏ธ 9352988975 Sizzling | Escorts | G...
( Pune ) VIP Pimpri Chinchwad Call Girls ๐ŸŽ—๏ธ 9352988975 Sizzling | Escorts | G...nilamkumrai
ย 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"growthgrids
ย 
All Time Service Available Call Girls Mg Road ๐Ÿ‘Œ โญ๏ธ 6378878445
All Time Service Available Call Girls Mg Road ๐Ÿ‘Œ โญ๏ธ 6378878445All Time Service Available Call Girls Mg Road ๐Ÿ‘Œ โญ๏ธ 6378878445
All Time Service Available Call Girls Mg Road ๐Ÿ‘Œ โญ๏ธ 6378878445ruhi
ย 
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...SUHANI PANDEY
ย 
Al Barsha Night Partner +0567686026 Call Girls Dubai
Al Barsha Night Partner +0567686026 Call Girls  DubaiAl Barsha Night Partner +0567686026 Call Girls  Dubai
Al Barsha Night Partner +0567686026 Call Girls DubaiEscorts Call Girls
ย 
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...SUHANI PANDEY
ย 
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...roncy bisnoi
ย 
๐Ÿ“ฑDehradun Call Girls Service ๐Ÿ“ฑโ˜Ž๏ธ +91'905,3900,678 โ˜Ž๏ธ๐Ÿ“ฑ Call Girls In Dehradun ๐Ÿ“ฑ
๐Ÿ“ฑDehradun Call Girls Service ๐Ÿ“ฑโ˜Ž๏ธ +91'905,3900,678 โ˜Ž๏ธ๐Ÿ“ฑ Call Girls In Dehradun ๐Ÿ“ฑ๐Ÿ“ฑDehradun Call Girls Service ๐Ÿ“ฑโ˜Ž๏ธ +91'905,3900,678 โ˜Ž๏ธ๐Ÿ“ฑ Call Girls In Dehradun ๐Ÿ“ฑ
๐Ÿ“ฑDehradun Call Girls Service ๐Ÿ“ฑโ˜Ž๏ธ +91'905,3900,678 โ˜Ž๏ธ๐Ÿ“ฑ Call Girls In Dehradun ๐Ÿ“ฑ@Chandigarh #call #Girls 9053900678 @Call #Girls in @Punjab 9053900678
ย 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge GraphsEleniIlkou
ย 

Recently uploaded (20)

Russian Call Girls in %(+971524965298 )# Call Girls in Dubai
Russian Call Girls in %(+971524965298  )#  Call Girls in DubaiRussian Call Girls in %(+971524965298  )#  Call Girls in Dubai
Russian Call Girls in %(+971524965298 )# Call Girls in Dubai
ย 
Pirangut | Call Girls Pune Phone No 8005736733 Elite Escort Service Available...
Pirangut | Call Girls Pune Phone No 8005736733 Elite Escort Service Available...Pirangut | Call Girls Pune Phone No 8005736733 Elite Escort Service Available...
Pirangut | Call Girls Pune Phone No 8005736733 Elite Escort Service Available...
ย 
WhatsApp ๐Ÿ“ž 8448380779 โœ…Call Girls In Mamura Sector 66 ( Noida)
WhatsApp ๐Ÿ“ž 8448380779 โœ…Call Girls In Mamura Sector 66 ( Noida)WhatsApp ๐Ÿ“ž 8448380779 โœ…Call Girls In Mamura Sector 66 ( Noida)
WhatsApp ๐Ÿ“ž 8448380779 โœ…Call Girls In Mamura Sector 66 ( Noida)
ย 
Microsoft Azure Arc Customer Deck Microsoft
Microsoft Azure Arc Customer Deck MicrosoftMicrosoft Azure Arc Customer Deck Microsoft
Microsoft Azure Arc Customer Deck Microsoft
ย 
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
20240510 QFM016 Irresponsible AI Reading List April 2024.pdf
ย 
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
ย 
โžฅ๐Ÿ” 7737669865 ๐Ÿ”โ–ป mehsana Call-girls in Women Seeking Men ๐Ÿ”mehsana๐Ÿ” Escorts...
โžฅ๐Ÿ” 7737669865 ๐Ÿ”โ–ป mehsana Call-girls in Women Seeking Men  ๐Ÿ”mehsana๐Ÿ”   Escorts...โžฅ๐Ÿ” 7737669865 ๐Ÿ”โ–ป mehsana Call-girls in Women Seeking Men  ๐Ÿ”mehsana๐Ÿ”   Escorts...
โžฅ๐Ÿ” 7737669865 ๐Ÿ”โ–ป mehsana Call-girls in Women Seeking Men ๐Ÿ”mehsana๐Ÿ” Escorts...
ย 
APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53APNIC Updates presented by Paul Wilson at ARIN 53
APNIC Updates presented by Paul Wilson at ARIN 53
ย 
Call Girls in Prashant Vihar, Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Prashant Vihar, Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort ServiceCall Girls in Prashant Vihar, Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
Call Girls in Prashant Vihar, Delhi ๐Ÿ’ฏ Call Us ๐Ÿ”9953056974 ๐Ÿ” Escort Service
ย 
Low Sexy Call Girls In Mohali 9053900678 ๐ŸฅตHave Save And Good Place ๐Ÿฅต
Low Sexy Call Girls In Mohali 9053900678 ๐ŸฅตHave Save And Good Place ๐ŸฅตLow Sexy Call Girls In Mohali 9053900678 ๐ŸฅตHave Save And Good Place ๐Ÿฅต
Low Sexy Call Girls In Mohali 9053900678 ๐ŸฅตHave Save And Good Place ๐Ÿฅต
ย 
( Pune ) VIP Pimpri Chinchwad Call Girls ๐ŸŽ—๏ธ 9352988975 Sizzling | Escorts | G...
( Pune ) VIP Pimpri Chinchwad Call Girls ๐ŸŽ—๏ธ 9352988975 Sizzling | Escorts | G...( Pune ) VIP Pimpri Chinchwad Call Girls ๐ŸŽ—๏ธ 9352988975 Sizzling | Escorts | G...
( Pune ) VIP Pimpri Chinchwad Call Girls ๐ŸŽ—๏ธ 9352988975 Sizzling | Escorts | G...
ย 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
ย 
All Time Service Available Call Girls Mg Road ๐Ÿ‘Œ โญ๏ธ 6378878445
All Time Service Available Call Girls Mg Road ๐Ÿ‘Œ โญ๏ธ 6378878445All Time Service Available Call Girls Mg Road ๐Ÿ‘Œ โญ๏ธ 6378878445
All Time Service Available Call Girls Mg Road ๐Ÿ‘Œ โญ๏ธ 6378878445
ย 
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
VIP Model Call Girls NIBM ( Pune ) Call ON 8005736733 Starting From 5K to 25K...
ย 
Al Barsha Night Partner +0567686026 Call Girls Dubai
Al Barsha Night Partner +0567686026 Call Girls  DubaiAl Barsha Night Partner +0567686026 Call Girls  Dubai
Al Barsha Night Partner +0567686026 Call Girls Dubai
ย 
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
Ganeshkhind ! Call Girls Pune - 450+ Call Girl Cash Payment 8005736733 Neha T...
ย 
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
ย 
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
Call Girls Sangvi Call Me 7737669865 Budget Friendly No Advance BookingCall G...
ย 
๐Ÿ“ฑDehradun Call Girls Service ๐Ÿ“ฑโ˜Ž๏ธ +91'905,3900,678 โ˜Ž๏ธ๐Ÿ“ฑ Call Girls In Dehradun ๐Ÿ“ฑ
๐Ÿ“ฑDehradun Call Girls Service ๐Ÿ“ฑโ˜Ž๏ธ +91'905,3900,678 โ˜Ž๏ธ๐Ÿ“ฑ Call Girls In Dehradun ๐Ÿ“ฑ๐Ÿ“ฑDehradun Call Girls Service ๐Ÿ“ฑโ˜Ž๏ธ +91'905,3900,678 โ˜Ž๏ธ๐Ÿ“ฑ Call Girls In Dehradun ๐Ÿ“ฑ
๐Ÿ“ฑDehradun Call Girls Service ๐Ÿ“ฑโ˜Ž๏ธ +91'905,3900,678 โ˜Ž๏ธ๐Ÿ“ฑ Call Girls In Dehradun ๐Ÿ“ฑ
ย 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
ย 

II-SDV 2016 Srinivasan Parthiban - KOL Analytics from Biomedical Literature

  • 1. KOL Analytics from Biomedical Literature II-SDV Conference Nice, France 18 - 19 April 2016 Srinivasan Parthiban Thava Alagu New York, USA
  • 2. โ€ข Working with pharmaceutical Medical Affairs, Clinical, R&D, and commercial organizations since 2005 โ€ข Working with more than half of the Top 50 Companies, 16 of the top 25 (17, and 18 contracting now!) โ€ข The only completely integrated Scientific Information Solution Provides timely insights and facilitates strategic decision making from the vast amount of publicly available scientific information Medmeme Meme(noun) - An idea or behavior that spreads in a manner analogous to the biological transmission of genes.
  • 3. Bottom Up vs. Top Down โ€ข As each scientific dissemination is captured it is normalized and disambiguated prior to being placed into the master data warehouse โ€ข Matching, tagging and synonyms are added at this stage โ€ข Data is mapped to all relevant areas of interest: โ€ข People โ€ข Places โ€ข Institutions & Companies โ€ข Drugs โ€ข Keywords: Mechanism of Action, treatment paradigms, etc. Building the Scientific Data Warehouse
  • 4. Grants Over 1,128,000 Data Sources Patents Over 800,000 Clinical Trials Over 280,000 Publications Over 8,930,000 Abstracts from 5760 journals Meetings Over 11,870,000 Abstracts Monitoring 14,000+ meetings/year Treatment Guidelines Over 36,480 Rolling 10 years ๏ท Continuously Updated ๏ท Scientifically Credible Sources Aligned to the Scientific Discovery Process โ€“ from Grants to Guidelines
  • 5. Impactmeme: The ultimate tool for constantly keeping on top of who is saying what, where. It captures all available scientific dissemination regardless of source Profilememe: Complete, detailed profiles of virtually all significant publishing and presenting activities for up to 10 years โ€“ at oneโ€™s fingertips and continuously updated Insightmeme: A virtual medical librarian on a desktop, allows a user to search on almost any dimension, the entirety of medical journal contents and congress outputs for the past 10 years up to the past month โ€“ all normalize and indexed Conferencememe: The most comprehensive database of medical congress output available anywhere available to users everywhere. See trends in content, as well as where the opinion leaders of interest are presenting Medmeme Products
  • 6. โ€ข An Industry term and acronym: KOL = Key Opinion Leader โ€ข KOLs are influential doctors, physicians and members of the medical community whoโ€™s opinions are highly regarded and who influence other doctorโ€™s and physicians. โ€ข KOLs advise companies as to where unmet medical needs lie, choose drug targets, help to define potential product profiles and shape clinical programs, run clinical trials, and may be involved in a drugโ€™s regulatory or reimbursement review process. โ€ข Peer-to-peer relationships with KOLs are maintained by Medical Science Liaisons (MSL) from Pharma and healthcare companies. MSLs are therapeutic specialists (e.g., oncology, cardiology, neurology) What is a KOL?
  • 8. Geographic Influence Does the physician have to lead clinical research studies? Is the physician an early adopter of new drugs? Education Level Level of Annual Advising Services Funding Level of Annual Grant Funding Tier 1 Global Yes Yes Medical Doctor $25,000 to $50,000 $100,000 to $250,000 Tier 2 National (US) Yes Yes Medical Doctor $10,000 to $25,000 Less Than $100,000 Tier 3 Regional No Yes Medical doctor Less Than $10,000 Less Than $100,000 Tier 4 Local No Not necessarily Medical doctor Less Than $10,000 Less Than $100,000 Tier 5 Local or National (non- USA) No No PharmD Less Than $10,000 Less Than $100,000 Different Levels of KOLs
  • 9. Average Number of Publications per Year by Thought Leader Tier 8,2 5,7 4,8 2,9 1,7 0 1 2 3 4 5 6 7 8 9 Tier-1 Tier-2 Tier-3 Tier-4 Tier-5 NumberofPublicationsperYear Thought Leader Tier
  • 10. Average Years of Clinical Experience by Thought Leader Tier 12,9 9 7,4 7,3 5,2 0 2 4 6 8 10 12 14 Tier-1 Tier-2 Tier-3 Tier-4 Tier-5 ClinicalExperienceinYears Thought Leader Tier
  • 11. Average Number of Promotional Speeches per Year by Thought Leader Tier 9,2 6 3,6 3,9 2,2 0 1 2 3 4 5 6 7 8 9 10 Tier-1 Tier-2 Tier-3 Tier-4 Tier-5 Speeches Thought Leader Tier
  • 13. 1,85 2,32 7,17 6,79 6,69 20,65 7,38 5,52 2,17 0 5 10 15 20 25 Delivering a Promotional Speech Delivering a Scientific Speech Leading an Advisory Panel (Chair) Moderating an Advisory Panel Participating in an Advisory Panel Authoring a manuscript Authoring an Abstract Thought Leader Training (General) Compilance Training Hours Average Amount of Hours Spent per Thought Leader Activity
  • 15. Three Challenges 1. Synonymy - A single individual may publish under multiple namesโ€”this includes a) orthographic and spelling variants, b) spelling errors, c) name changes over time as may occur with marriage, religious conversion or gender re- assignment, and d) the use of pen names. 2. Homonymy - Many different individuals have the same name โ€“ in fact, common names may comprise several thousand individuals. 3. The necessary metadata are often incomplete or lacking entirely โ€“ for example, some publishers and bibliographic databases did not record authorsโ€™ first names, their geographical locations, or identifying information such as their degrees or their positions.
  • 17. โ€ฆmistaken identity has resulted in the wrong person being invited to work on a project [โ€ฆ] or to undertake the peer review of an article
  • 18. Type I error False Positive: Identify different author instances as same single author entity. Results in bigger clusters than what it should be. Type II error False Negative: Not able to identify different author instances of same author. Results in too many small clusters. What Can Go Wrong?
  • 19. Percentage of author names in Medline that includes full first name instead of an initial 0,0 10,0 20,0 30,0 40,0 50,0 60,0 70,0 80,0 90,0 1995 2000 2005 2010 2015 percentage(%) Year 72,0 74,0 76,0 78,0 80,0 82,0 84,0 86,0 2000 2002 2004 2006 2008 2010 2012 percentage(%) Year โ€ข Full names work much better than initials โ€ข Only 5% of the author names on your institutionโ€™s articles are people in your instance of Profiles. The rest are former faculty or external collaborators that you have never heard about.
  • 20. Can never be 100% accurate 85% is considered quite good Further manual disambiguation is optional Close enough
  • 21. Who is John Smith and what is he talking about ? Retrieve all clusters with the same author name What Do You Want to Know? Who is this John Smith, the author of Article X? Retrieve other PubMed ids of the same cluster
  • 22. Give me top 10 KOLs in the field of Cancer! DISA Platform retrieves top 10 Unique-Author-IDs. Each UAID is associated with one cluster (of articles) and associated Identity information. (Affiliations and E-mails). DISA uses the keywords associated with articles to pre-index the authors with associated keywords. What Do You Want to Know?
  • 23. โ€ข High Precision and Recall is the goal. โ€ข Precision โ€ข Accuracy Ratio โ€“ Be correct in grouping. โ€ข precision = #of correctly clustered pairs / #of clustered pairs โ€ข Stricter the condition, higher the precision โ€ข Recall โ€ข Efficiency Ratio - Do not miss the matches. โ€ข recall = #of correctly clustered pairs / #of true positive pairs โ€ข More liberal condition, higher the recall Disambiguation Goal
  • 24. โ€ข Total Manual Disambiguation is infeasible โ€ข Automation is great, but canโ€™t be 100% โ€ข Manual process is hard, uncertain, subjective โ€ข Manual after Automation is Pragmatic Manual Vs Automated Disambiguation
  • 25. โ€ข Group all publications into author clusters โ€ข Match person to clusters Clustering Methods
  • 26. Clustering based on similarity probability model Available factors : โ€ข Co-authors โ€ข Affiliation โ€ข Journal โ€ข Mesh Terms โ€ข Publication Date Automation Approach
  • 27. โ€ข Self learning system possible โ€“ Learns from Gold Set โ€ข Creating proper training set is the biggest challenge โ€ข Manual creation of proper training set is costly โ€ข Higher the complexity, vulnerable to bugs โ€ข Main goal is to find relative importance of the criteria โ€ข Co-author Vs Affiliation Vs MeshTerms Vs Journal etc. Machine Learning
  • 28. โ€ข Extensive affiliation disambiguation is more challenging โ€ข Affiliation normalization helps in author disambiguation โ€ข Involves recognizing countries, cities and address normalization into canonical form. โ€ข Fuzzy matching possible after normalization โ€“ for smaller buckets only. Affiliation Disambiguation
  • 29. โ€ข Remember โ€“ It is costly operation ! โ€ข Scalability Hazard ! โ€ข Algorithms: โ€ข Monger-Elkan, Jaro-Winkler, Levenstein based on edit distance. โ€ข Jaccard, TF-IDF based on token based multi-sets. (Order of words are not important) โ€ข Some hybrid techniques are also common. . Fuzzy Matching
  • 30. Article-1 Authors : X, Y Article-2 Authors : X, Z (1 and 2 seems disconnected) Article-3 Authors : X, Y, Z (Likely that X is same author for all 3 articles) Note: Clustering algorithm recognizes and handles this appropriately. Transitivity Fixing
  • 31. Introducing DISA โ€ข DISA stands for Disambiguation Automated Platform. โ€ข DISA provides powerful core kernel software system backed by the author database. โ€ข DISA enables applications to be developed on this platform to explore the KOLs based on Pubmed and Conferences information.
  • 32. ETL - Extract, Transform and Load Pubmed Data Explode To Author Instances Unique Authors Rule Based Unification Engine Author Instances DISA API Layer For Application Access. Conference Data DISA Application DISA Platform Architecture
  • 34. โ€ข Disambiguation restricted to same last name authors. โ€ข This โ€œBlockingโ€ mechanism prevents combinatorial explosion. โ€ข Still poses problems for common names โ€ข Fuzzy algorithms are very expensive on large buckets/blocks. Scalability Issues
  • 35. โ€ข Relatively less researched so far. โ€ข Need faster updates for delta addition. โ€ข Reconstruct clusters of given name spaces. โ€ข Use incremental clustering โ€ข Embedded database to store and retrieve the disambiguated author data. Incremental Disambiguation
  • 36. โ€ข We need both higher precision and recall. โ€ข But precision is more important. โ€ข Precision errors are more permanent and harder to fix. โ€ข Recall misses may be fixed in future or by manual disambiguation. Being Conservative : Precision Vs Recall
  • 37. Can not Fix Impossible Situations Not possible to identify these without authorโ€™s voluntary disclosures.
  • 38. ORCID Voluntary Creation of Unique ID and linking How to Fix it Going Forward ?
  • 39. 501 7th Avenue, Suite 508 New York, NY, 10018 (USA) Tel.: 212-725-5992 Fax: 212-725-5993 www.medmeme.com Thank You