SlideShare a Scribd company logo
Country Long Distance
Australia +61 3 8488 8993
Austria +43 (0) 7 2088 2171
Belgium +32 (0) 42 68 0164
Canada +1 (647) 497-9386
Denmark +45 (0) 89 88 04 43
Finland +358 (0) 931 58 4587
France +33 (0) 182 880 933
Germany +49 (0) 692 5736 7304
Ireland +353 (0) 19 036 186
Text Mining Full Text for Molecular Targets
with George Jiang, Ph.D., M.B.A
Our Webinar will begin in a few minutes
Country Long Distance
Italy +39 0 294 75 15 36
Netherlands +31 (0) 108 080 115
New Zealand +64 (0) 9 801 0293
Norway +47 21 03 72 89
Spain +34 911 23 4247
Sweden +46 (0) 852 500 292
Switzerland +41 (0) 435 0824 40
United Kingdom +44 (0) 330 221 9921
United States +1 (646) 307-1726
TO USE YOUR COMPUTER'S AUDIO: When the webinar begins, you will be connected to audio using
your computer's microphone and speakers (VoIP). A headset is recommended.
--OR--
TO USE YOUR TELEPHONE: If you prefer to use your phone, you must select "Use Telephone" after
joining the webinar and call in using the numbers below.
Dial your country’s number and then use Access Code: 655-028-479
Text Mining Full Text
for Molecular Targets
George Jiang, PhD, MBA
Product Manager, Text Mining
g.jiang@elsevier.com
March 31, 2015
George Jiang
Product Manager
Text Mining
Trained scientist with several years of experience in text analytics, data integration, and
scientific software development
• Currently, Product Manager with Elsevier working on text mining projects and
semantic search products, based out of Rockville, MD
• Previously, worked at US National Center for Biotechnology Information (NCBI)
working on Discovery Initiative to understand users needs and crosslink data and
expose it to make research information more discoverable
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
World Leader in Digital Information Solutions
Published over
330,000 articles
in 2013
Founded over
130 years ago
Work with over
30 million
scientists, students, health
& information professionals
Received over
1 million submissions
in 2013
SOLUTIONS
Over 53 million
items indexed by
Scopus
Elsevier
R+D Solutions
Elsevier
Clinical Solutions
Helps corporate
researchers, R+D
professionals, and
engineers improve how
they interact with, share,
and apply information to
solve problems using
our digital workflow
tools, analytics, and data
Provides universities,
governments, and
research institutions with
the resources and
insights to improve
institutional research
strategy, management,
and performance.
Elsevier
Education
Helps medical
professionals apply
trusted data and
sophisticated tools to
make better clinical
decisions, deliver better
care, and produce
better healthcare
outcomes.
Helps educate
highly-skilled,
effective healthcare
professionals, using
the most advanced
pedagogical tools
and reference
works.
Elsevier
Research Intelligence
CONTENT
CAPABILITIESPLATFORMS
Publishes over
2,200 online
journals & over
26,000 books
(e + print)
Elsevier eBooks, Online
Journals, Databases
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Working With Text is A Big Data Challenge
Text is everywhere! We’ve already covered 100s of terms in this presentation.
 Twitter - 58M tweets/day x 14.98 words/tweet => 868M words/day => 6B
Average journal article = 10, 150, 6000 words in title, abstract , full text
 abstracts – 2.4B words (24M abstracts @ PubMed x 100 words/abstract)
 full text – 144B words ( if comparable set from PubMed, 25M x 6000
The information deluge of scientific content and how to manage
and/or leverage this information is a big data challenge
Information seeking challenges can be
addressed with automation assistance
and text mining for greater insight
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Summary
• Text mining can help to sift through large amounts of scientific literature and other
textual content
• Text mining can help to increase project team efficiency to find precise statements
and relationships
• Full text articles provide richer result sets that can be useful in finding additional
insights that cannot be garnered just using abstracts
• Several hurdles still exist to implement text mining but the value can outweigh costs
Text mining full text can be used to help find molecular targets of
interest quickly that may be missed if relying on abstracts and
keyword searching
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Agenda
• Introduction to Text Mining
• The Value of Full Text Articles
• Illustration of Text Mining Full Text Articles
• Recap
• Q&A
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
What is Text Mining?
Text Mining
• Refers to the process of deriving high-quality structured
A Does B
X Inhibits Y
G Stops D
I Drink T
documents facts
Why Text Mining?
• Text Mining can yield better results, and increase team efficiency
• The application of text mining techniques can be used to solve
business problems
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Example of Getting Structured Information (Facts)
Triple negative breast cancer (TNBC) cells lack receptor expression, are frequently
more aggressive and are resistant to growth factor inhibition
documents
sentence
fact(s)
Tumour cells show greater dependency on glycolysis so providing a sufficient and rapid energy supply for fast growth. In many breast cancers, estrogen, progesterone and
epidermal growth factor receptor-positive cells proliferate in response to growth factors and growth factor antagonists are a mainstay of treatment. However, triple negative
breast cancer (TNBC) cells lack receptor expression, are frequently more aggressive and are resistant to growth factor inhibition. Downstream of growth factor receptors,
signal transduction proceeds via phosphatidylinositol 3-kinase (PI3k), Akt and FOXO3a inhibition, the latter being partly responsible for coordinated increases in glycolysis
and apoptosis resistance. FOXO3a may be an attractive therapeutic target for TNBC. Therefore we have undertaken a systematic review of FOXO3a as a target for breast
cancer therapeutics.
paragraph
TNBC cells lack receptor expression
TNBC cells are more aggressive
TNBC cells resist growth factor inhibition
Excerpt from Taylor et al. Evaluating the
evidence for targeting FOXO3a in breast
cancer: a systematic review.
Wordcloud plotted with Wordle.net
tokens
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Text analytics and
Visualizations
What is Text Mining Being Used For?
Use cases include:
• Target identification and prioritization
• Biomarker discovery
• Drug repurposing
• Drug safety and finding adverse events
• Clinical study design and site selection
• Competitive intelligence
DISCOVERY
PRE-
CLINICAL
CLINICAL
POST-
LAUNCH
Text mining article submissions for curation
assistance in publishing
Basic Research Applied Research
Text mining can be used to support several research and development areas
Information retrieval and analysis
of biomedical literature for target
identification, systematic reviews,
etc.
Searching clinical trial data
or electronic health records
to find signals in patient
populations
Triage of news and papers
for literature curation and
regulatory reporting
Identifying relevant items for
meta-analysis of specific research
results
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
How to Text Mine?
• Content
• Ontology
• Software solution(s)
• Expertise
Several pieces and steps are often needed to get results from text mining
Aggregate1 Structure2
Normalize3
Integrate4
• PDF -> XML
• XML quality differs
• XML uniformity e.g. dealing
with sources, types, etc.
Default or custom ontology
• Text mining the corpus
• Balancing expectations of
precision and recall
1. Aggregate
2. Structure
3. Normalize
4. Integrate
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Text Mining solutions &
Professional Services
Elsevier Offers Several Text Mining Solutions
facts and data out
support downstream
applications and activities
Aggregate
Normalize
Structure
Integrate
1
2
3
4
Journals and
Books
Internal
content
Patents
Other
Software solution
UI / API
Public data
sources
User Questions
Software solutions and Professional Services available for text mining and
semantic searching
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
• Introduction to Text Mining
• The Value of Full Text Articles
• Illustration of Text Mining Full Text Articles
• Recap
• Q&A
Agenda
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Abstracts vs Full Text
• Concise summaries
• Readily accessible
• Relatively uniform
Summary of main differences
• Complete documents
• May not be as accessible
• Information within can vary
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Benefits of Using Full Text
• Distribution of keywords, facts and relations – more keywords, facts
and relations are found in full text
• Concept under-representation in abstracts – specific entities may not
be mentioned in abstracts but primarily in full text sections e.g.
biological functions
• Missing Negative data – often negative results or non-significant data
are missing from abstracts
• Citations per article – full text sections are more cited vs abstracts
• Timeliness – Relevant facts and relationships can be found in full text
first before any mentions in abstracts as researchers surmise in
Full Text provide richer results sets
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Additional Reading
• Information extraction from full text scientific articles: where are the keywords? BMC Bioinformatics. 2003 May 29;4:20. Epub 2003 May 29.
• Beyond genes, proteins, and abstracts: Identifying scientific claims from full-text biomedical articles. J Biomed Inform. 2010
Apr;43(2):173-89. doi: 10.1016/j.jbi.2009.11.001. Epub 2009 Nov 10.
• Do Peers See More in a Paper Than Its Authors? Adv Bioinformatics. 2012;2012:750214. doi: 10.1155/2012/750214. Epub 2012 Nov 27.
• Is searching full text more effective than searching abstracts? Bioinformatics. 2009 Feb 3;10:46. doi: 10.1186/1471-2105-10-46.
• Challenges for automatically extracting molecular interactions from full-text articles. BMC Bioinformatics. 2009 Sep 24;10:311. doi:
10.1186/1471-2105-10-311.
• Semi-Automatic Indexing of Full Text Biomedical Articles. AMIA Annu Symp Proc. 2005:271-5.
• Discovering implicit associations between genes and hereditary diseases. Pac Symp Biocomput. 2007:316-27.
• The structural and content aspects of abstracts versus bodies of full text journal articles are different. BMC Bioinformatics. 2010
Sep 29;11:492. doi: 10.1186/1471-2105-11-492.
• Abstracts in high profile journals often fail to report harm. BMC Med Res Methodol. 2008 Mar 27;8:14. doi: 10.1186/1471-2288-8-14.
• Quality of abstracts of original research articles in CMAJ in 1989. CMAJ. 1991 Feb 15;144(4):449-53.
• Accuracy of data in abstracts of published research articles. JAMA. 1999 Mar 24-31;281(12):1110-1.
Articles highlighting the differences between abstracts and full text
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Abstract vs Full Text Example
Challenges
 Sifting through more information!
 Finding the right results
Concise abstracts cannot contain all details whereas full text will
contain all the relevant information
Significant advances have been made in the treatment of human immunodeficiency virus (HIV) infection over the past two
decades. Improved therapy has prolonged survival and improved clinical outcome for HIV-infected children and adults.
Sixteen antiretroviral (ART) medications have been approved for use in pediatric HIV infection. The Department of Health
and Human Services (DHHS) has issued “Guidelines for the Use of Antiretroviral Agents in Pediatric HIV Infection”, which
provide detailed information on currently recommended antiretroviral therapies (ART). However, consultation with an HIV
specialist is recommended as the current therapy of pediatric HIV therapy is complex and rapidly evolving.
Elvitegravir is a once daily integrase inhibitor being studied in adults.
Children with treatment failure should be evaluated for medication adherence, drug intolerance, and possible drug
interactions which may lessen the efficacy of the therapeutic regimen.
Abstract
Full Text
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
• Introduction to Text Mining
• The Value of Full Text Articles
• Illustration of Text Mining Full Text Articles
• Recap
• Q&A
Agenda
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
• Use Elsevier Text Mining solution to search against corpus of biomedical literature
• Abstracts – MEDLINE/PubMed (24M)
• Full text – PubMed Central, Elsevier and partner publishers (4M)
• Refine results corpus, redefine search / text mining output
• Review and analyze data
• Create visual data reports using other tools available
Methods
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Search against scientific literature corpus for sentences related to efficacy
If looking for details, one really needs to look at the full text results
Text Mining Abstracts vs Full Text
Word clouds suggest insight differences between abstracts and full text
Full textAbstracts Only
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Full text provides insights into the specific mutations implicated in differential enzymatic efficacy of
a particular drug class
Finding Molecular Targets in Full Text
Word clouds illustrating differences in point mutations mentioned
Full TextAbstracts Only
Gives insight into the mutations implicated for
changes in efficacy.
No mutations mentioned in abstracts of
comparable document set.
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Finding Molecular Targets in Full Text
Example searching for cancer immunity checkpoint proteins
Full text provides insights into additional protein targets that may be of interest for cancer
immunology research in cancer checkpoints
Full TextAbstracts Only
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Text Mining Results Can Then Be Used For Analyses
• Review results. Not just keyword matching anymore …
 identifying more relevant documents for review
 identifying relationships and precise statements
 Identifying other targets/content of interest
• Link data to other items of interest
• Analytics, visualization and system/network analysis e.g. Pathway Studio,
Cytoscape
• Integrate text mining data and process into different workflows for project
quality and efficiency
Text mining results can be used to improve scientific research and can be
used to address business problems
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Text Mining Finds Answers Faster & Increases Efficiency
An Example Project Comparison
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Savings:
 Text mining robustly identifies the relevant articles
 Savings of 171 person-days per project
 Allows more projects/higher quality with same staff
Keyword searching: Text Mining:
Finds 1,408 articles
Many of them not relevant
Identifies 142 relevant articles
176 person-days to review
@ 20 min/article
5 person-days to review
@ 20 min/article
VS
24
Writing comprehensive state of the science review article on the chemical toxicity of a particular
substance
Relationship map using Elsevier Text Mining
results into Cytoscape visualization
NLP
Example of Visual Insights of Text Mining Results
Intersecting adverse events between two anti-TNF drugs
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Summary
• Text mining can help to sift through large amounts of scientific literature and other
textual content
• Text mining can help to increase project team efficiency to find precise statements
and relationships
• Full text articles provide richer result sets that can be useful in finding additional
insights that cannot be garnered just using abstracts
• Several hurdles still exist to implement text mining but the value can outweigh costs
Text mining full text can be used to help find molecular targets of
interest quickly that may be missed if relying on abstracts and
keyword searching
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
Thank you for joining our webinar today:
Text Mining Full Text for Molecular Targets
with George Jiang, Ph.D., M.B.A
If you have any questions for our speaker, please type them into
the CHAT window.
If you would like more information you can contact:
George Jiang
g.jiang@elsevier.com
Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang

More Related Content

What's hot

Open science and the individual researcher
Open science and the individual researcherOpen science and the individual researcher
Open science and the individual researcher
Bram Zandbelt
 
Seefeld stats r_bio
Seefeld stats r_bioSeefeld stats r_bio
Seefeld stats r_bio
haramaya university
 
Annotation examples--Fribourg--2019-09-03
Annotation examples--Fribourg--2019-09-03Annotation examples--Fribourg--2019-09-03
Annotation examples--Fribourg--2019-09-03
jodischneider
 
Working with Quertle
Working with QuertleWorking with Quertle
Working with Quertle
Janet Delicata
 
Using Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentUsing Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and development
Eleanor Howe
 
Pikas bibliometricsfor21may2015
Pikas bibliometricsfor21may2015Pikas bibliometricsfor21may2015
Pikas bibliometricsfor21may2015
Christina Pikas
 
ELSS use cases and strategy
ELSS use cases and strategyELSS use cases and strategy
ELSS use cases and strategy
Anton Yuryev
 
Publishing and citing presentation for VLAG graduate school Baarlo
Publishing and citing presentation for VLAG graduate school BaarloPublishing and citing presentation for VLAG graduate school Baarlo
Publishing and citing presentation for VLAG graduate school Baarlo
Hugo Besemer
 
Increasing transparency in Medical Education through Open Data
Increasing transparency in Medical Education through Open Data Increasing transparency in Medical Education through Open Data
Increasing transparency in Medical Education through Open Data
Rebecca Grant
 
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
GigaScience, BGI Hong Kong
 
Recommending Scientific Papers: Investigating the User Curriculum
Recommending Scientific Papers: Investigating the User CurriculumRecommending Scientific Papers: Investigating the User Curriculum
Recommending Scientific Papers: Investigating the User Curriculum
Jonathas Magalhães
 
Towards knowledge maintenance in scientific digital libraries with the keysto...
Towards knowledge maintenance in scientific digital libraries with the keysto...Towards knowledge maintenance in scientific digital libraries with the keysto...
Towards knowledge maintenance in scientific digital libraries with the keysto...
jodischneider
 
Research in the time of Covid: Surveying impacts on Early Career Researchers
Research in the time of Covid: Surveying impacts on Early Career ResearchersResearch in the time of Covid: Surveying impacts on Early Career Researchers
Research in the time of Covid: Surveying impacts on Early Career Researchers
Rebecca Grant
 
Drug Discovery Data Insights with Andrew Leach (ChEMBL), Evan Bolton (PubChem...
Drug Discovery Data Insights with Andrew Leach (ChEMBL), Evan Bolton (PubChem...Drug Discovery Data Insights with Andrew Leach (ChEMBL), Evan Bolton (PubChem...
Drug Discovery Data Insights with Andrew Leach (ChEMBL), Evan Bolton (PubChem...
Lixin Liu
 
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
Rebecca Grant
 
Do Open data badges influence author behaviour? A case study at Springer Nature
Do Open data badges influence author behaviour? A case study at Springer NatureDo Open data badges influence author behaviour? A case study at Springer Nature
Do Open data badges influence author behaviour? A case study at Springer Nature
Rebecca Grant
 
Antibiotic Discovery Webinar
Antibiotic Discovery WebinarAntibiotic Discovery Webinar
Antibiotic Discovery Webinar
Lixin Liu
 
Developing a Replicable Methodology for Automated Identification of Emerging ...
Developing a Replicable Methodology for Automated Identification of Emerging ...Developing a Replicable Methodology for Automated Identification of Emerging ...
Developing a Replicable Methodology for Automated Identification of Emerging ...
University of Michigan Taubman Health Sciences Library
 
Warren-Jones "Using text-mining and summarisation technology to manage the gr...
Warren-Jones "Using text-mining and summarisation technology to manage the gr...Warren-Jones "Using text-mining and summarisation technology to manage the gr...
Warren-Jones "Using text-mining and summarisation technology to manage the gr...
National Information Standards Organization (NISO)
 

What's hot (19)

Open science and the individual researcher
Open science and the individual researcherOpen science and the individual researcher
Open science and the individual researcher
 
Seefeld stats r_bio
Seefeld stats r_bioSeefeld stats r_bio
Seefeld stats r_bio
 
Annotation examples--Fribourg--2019-09-03
Annotation examples--Fribourg--2019-09-03Annotation examples--Fribourg--2019-09-03
Annotation examples--Fribourg--2019-09-03
 
Working with Quertle
Working with QuertleWorking with Quertle
Working with Quertle
 
Using Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and developmentUsing Bioinformatics Data to inform Therapeutics discovery and development
Using Bioinformatics Data to inform Therapeutics discovery and development
 
Pikas bibliometricsfor21may2015
Pikas bibliometricsfor21may2015Pikas bibliometricsfor21may2015
Pikas bibliometricsfor21may2015
 
ELSS use cases and strategy
ELSS use cases and strategyELSS use cases and strategy
ELSS use cases and strategy
 
Publishing and citing presentation for VLAG graduate school Baarlo
Publishing and citing presentation for VLAG graduate school BaarloPublishing and citing presentation for VLAG graduate school Baarlo
Publishing and citing presentation for VLAG graduate school Baarlo
 
Increasing transparency in Medical Education through Open Data
Increasing transparency in Medical Education through Open Data Increasing transparency in Medical Education through Open Data
Increasing transparency in Medical Education through Open Data
 
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
Scott Edmunds: Quantifying how FAIR is Hong Kong: The Hong Kong Shareability ...
 
Recommending Scientific Papers: Investigating the User Curriculum
Recommending Scientific Papers: Investigating the User CurriculumRecommending Scientific Papers: Investigating the User Curriculum
Recommending Scientific Papers: Investigating the User Curriculum
 
Towards knowledge maintenance in scientific digital libraries with the keysto...
Towards knowledge maintenance in scientific digital libraries with the keysto...Towards knowledge maintenance in scientific digital libraries with the keysto...
Towards knowledge maintenance in scientific digital libraries with the keysto...
 
Research in the time of Covid: Surveying impacts on Early Career Researchers
Research in the time of Covid: Surveying impacts on Early Career ResearchersResearch in the time of Covid: Surveying impacts on Early Career Researchers
Research in the time of Covid: Surveying impacts on Early Career Researchers
 
Drug Discovery Data Insights with Andrew Leach (ChEMBL), Evan Bolton (PubChem...
Drug Discovery Data Insights with Andrew Leach (ChEMBL), Evan Bolton (PubChem...Drug Discovery Data Insights with Andrew Leach (ChEMBL), Evan Bolton (PubChem...
Drug Discovery Data Insights with Andrew Leach (ChEMBL), Evan Bolton (PubChem...
 
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...A National Approach to Open Data in Ireland: Publishers and Research Data Man...
A National Approach to Open Data in Ireland: Publishers and Research Data Man...
 
Do Open data badges influence author behaviour? A case study at Springer Nature
Do Open data badges influence author behaviour? A case study at Springer NatureDo Open data badges influence author behaviour? A case study at Springer Nature
Do Open data badges influence author behaviour? A case study at Springer Nature
 
Antibiotic Discovery Webinar
Antibiotic Discovery WebinarAntibiotic Discovery Webinar
Antibiotic Discovery Webinar
 
Developing a Replicable Methodology for Automated Identification of Emerging ...
Developing a Replicable Methodology for Automated Identification of Emerging ...Developing a Replicable Methodology for Automated Identification of Emerging ...
Developing a Replicable Methodology for Automated Identification of Emerging ...
 
Warren-Jones "Using text-mining and summarisation technology to manage the gr...
Warren-Jones "Using text-mining and summarisation technology to manage the gr...Warren-Jones "Using text-mining and summarisation technology to manage the gr...
Warren-Jones "Using text-mining and summarisation technology to manage the gr...
 

Viewers also liked

Text Mining and Visualization
Text Mining and VisualizationText Mining and Visualization
Text Mining and Visualization
Seth Grimes
 
Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text Mining
Minha Hwang
 
Text & Data Mining Licensing Issues
Text & Data Mining Licensing IssuesText & Data Mining Licensing Issues
Text & Data Mining Licensing Issues
Daniel Dollar
 
Visualizing Text: Seth Redmore at the 2015 Smart Data Conference
Visualizing Text: Seth Redmore at the 2015 Smart Data ConferenceVisualizing Text: Seth Redmore at the 2015 Smart Data Conference
Visualizing Text: Seth Redmore at the 2015 Smart Data Conference
sredmore
 
Week12
Week12Week12
Week12
Esha Meher
 
Text mining by examples, By Hadi Mohammadzadeh
Text mining by examples, By Hadi MohammadzadehText mining by examples, By Hadi Mohammadzadeh
Text mining by examples, By Hadi Mohammadzadeh
Hadi Mohammadzadeh
 
OUTDATED Text Mining 1/5: Introduction
OUTDATED Text Mining 1/5: IntroductionOUTDATED Text Mining 1/5: Introduction
OUTDATED Text Mining 1/5: Introduction
Florian Leitner
 
Text and text stream mining tutorial
Text and text stream mining tutorialText and text stream mining tutorial
Text and text stream mining tutorial
mgrcar
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
Edureka!
 
마인즈랩 사업소개 20151031_v1.0
마인즈랩 사업소개 20151031_v1.0마인즈랩 사업소개 20151031_v1.0
마인즈랩 사업소개 20151031_v1.0
Taejoon Yoo
 
i-VOC (Voice of the Customer Big Data Analytics Solution) 고객의소리 분석
i-VOC (Voice of the Customer Big Data Analytics Solution) 고객의소리 분석i-VOC (Voice of the Customer Big Data Analytics Solution) 고객의소리 분석
i-VOC (Voice of the Customer Big Data Analytics Solution) 고객의소리 분석
Taejoon Yoo
 
Big Data & Text Mining
Big Data & Text MiningBig Data & Text Mining
Big Data & Text Mining
Michel Bruley
 
TextMining with R
TextMining with RTextMining with R
TextMining with R
Aleksei Beloshytski
 
Data mining
Data miningData mining
Data mining
Akannsha Totewar
 
Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter DataText Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
Yanchang Zhao
 

Viewers also liked (16)

Text Mining and Visualization
Text Mining and VisualizationText Mining and Visualization
Text Mining and Visualization
 
Introduction to Text Mining
Introduction to Text MiningIntroduction to Text Mining
Introduction to Text Mining
 
Text & Data Mining Licensing Issues
Text & Data Mining Licensing IssuesText & Data Mining Licensing Issues
Text & Data Mining Licensing Issues
 
Visualizing Text: Seth Redmore at the 2015 Smart Data Conference
Visualizing Text: Seth Redmore at the 2015 Smart Data ConferenceVisualizing Text: Seth Redmore at the 2015 Smart Data Conference
Visualizing Text: Seth Redmore at the 2015 Smart Data Conference
 
Week12
Week12Week12
Week12
 
Text mining by examples, By Hadi Mohammadzadeh
Text mining by examples, By Hadi MohammadzadehText mining by examples, By Hadi Mohammadzadeh
Text mining by examples, By Hadi Mohammadzadeh
 
OUTDATED Text Mining 1/5: Introduction
OUTDATED Text Mining 1/5: IntroductionOUTDATED Text Mining 1/5: Introduction
OUTDATED Text Mining 1/5: Introduction
 
Text Mining - Data Mining
Text Mining - Data MiningText Mining - Data Mining
Text Mining - Data Mining
 
Text and text stream mining tutorial
Text and text stream mining tutorialText and text stream mining tutorial
Text and text stream mining tutorial
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
마인즈랩 사업소개 20151031_v1.0
마인즈랩 사업소개 20151031_v1.0마인즈랩 사업소개 20151031_v1.0
마인즈랩 사업소개 20151031_v1.0
 
i-VOC (Voice of the Customer Big Data Analytics Solution) 고객의소리 분석
i-VOC (Voice of the Customer Big Data Analytics Solution) 고객의소리 분석i-VOC (Voice of the Customer Big Data Analytics Solution) 고객의소리 분석
i-VOC (Voice of the Customer Big Data Analytics Solution) 고객의소리 분석
 
Big Data & Text Mining
Big Data & Text MiningBig Data & Text Mining
Big Data & Text Mining
 
TextMining with R
TextMining with RTextMining with R
TextMining with R
 
Data mining
Data miningData mining
Data mining
 
Text Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter DataText Mining with R -- an Analysis of Twitter Data
Text Mining with R -- an Analysis of Twitter Data
 

Similar to Text mining full text for molecular targets

GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GrahamSmith646206
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
nadimissimple
 
Developing core common outcomes for tropical peatland research and management
Developing core common outcomes for tropical peatland research and managementDeveloping core common outcomes for tropical peatland research and management
Developing core common outcomes for tropical peatland research and management
Mark Reed
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
William Gunn
 
Practical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscapePractical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscape
Digital Science
 
Gather evidence to demonstrate the impact of your research
Gather evidence to demonstrate the impact of your researchGather evidence to demonstrate the impact of your research
Gather evidence to demonstrate the impact of your research
IUPUI
 
Research and Commercialisation Challenges
Research and Commercialisation ChallengesResearch and Commercialisation Challenges
Research and Commercialisation Challenges
Dr. Mazlan Abbas
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
CILIP MDG
 
Nov1 webinar intro_slides v
Nov1 webinar intro_slides vNov1 webinar intro_slides v
Nov1 webinar intro_slides v
SC CTSI at USC and CHLA
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
Kristian Norling
 
Not just for STEM: Open and reproducible research in the social sciences
Not just for STEM: Open and reproducible research in the social sciencesNot just for STEM: Open and reproducible research in the social sciences
Not just for STEM: Open and reproducible research in the social sciences
UoLResearchSupport
 
Research methods
Research methodsResearch methods
Research methods
A M
 
Open from beginning to end: addressing barriers to open research - a personal...
Open from beginning to end: addressing barriers to open research - a personal...Open from beginning to end: addressing barriers to open research - a personal...
Open from beginning to end: addressing barriers to open research - a personal...
UoLResearchSupport
 
Let's Talk Research 2015 -Juliet Goldbart - Introduction To Qualitative Metho...
Let's Talk Research 2015 -Juliet Goldbart - Introduction To Qualitative Metho...Let's Talk Research 2015 -Juliet Goldbart - Introduction To Qualitative Metho...
Let's Talk Research 2015 -Juliet Goldbart - Introduction To Qualitative Metho...
NHSNWRD
 
Implementation science and learning health systems: Connecting the dots
Implementation science and learning health systems:  Connecting the dotsImplementation science and learning health systems:  Connecting the dots
Implementation science and learning health systems: Connecting the dots
Department of Learning Health Sciences, University of Michigan Medical School
 
Why is Test Driven Development so hard to implement in an analytics platform?
Why is Test Driven Development so hard to implement in an analytics platform?Why is Test Driven Development so hard to implement in an analytics platform?
Why is Test Driven Development so hard to implement in an analytics platform?
Phil Watt
 
Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)
Katina Toufexis
 
The Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer DatasetThe Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer Dataset
CongChen35
 
Öppen data och forskningens genomslag
Öppen data och forskningens genomslagÖppen data och forskningens genomslag
Öppen data och forskningens genomslag
Kungliga biblioteket National Library of Sweden
 
Rebecca Grant - Publishers and RDM
Rebecca Grant - Publishers and RDMRebecca Grant - Publishers and RDM
Rebecca Grant - Publishers and RDM
dri_ireland
 

Similar to Text mining full text for molecular targets (20)

GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
GSmith Springer Nature Data policies and practices: HKU Open Data and Data Pu...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Developing core common outcomes for tropical peatland research and management
Developing core common outcomes for tropical peatland research and managementDeveloping core common outcomes for tropical peatland research and management
Developing core common outcomes for tropical peatland research and management
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Practical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscapePractical applications for altmetrics in a changing metrics landscape
Practical applications for altmetrics in a changing metrics landscape
 
Gather evidence to demonstrate the impact of your research
Gather evidence to demonstrate the impact of your researchGather evidence to demonstrate the impact of your research
Gather evidence to demonstrate the impact of your research
 
Research and Commercialisation Challenges
Research and Commercialisation ChallengesResearch and Commercialisation Challenges
Research and Commercialisation Challenges
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
 
Nov1 webinar intro_slides v
Nov1 webinar intro_slides vNov1 webinar intro_slides v
Nov1 webinar intro_slides v
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
 
Not just for STEM: Open and reproducible research in the social sciences
Not just for STEM: Open and reproducible research in the social sciencesNot just for STEM: Open and reproducible research in the social sciences
Not just for STEM: Open and reproducible research in the social sciences
 
Research methods
Research methodsResearch methods
Research methods
 
Open from beginning to end: addressing barriers to open research - a personal...
Open from beginning to end: addressing barriers to open research - a personal...Open from beginning to end: addressing barriers to open research - a personal...
Open from beginning to end: addressing barriers to open research - a personal...
 
Let's Talk Research 2015 -Juliet Goldbart - Introduction To Qualitative Metho...
Let's Talk Research 2015 -Juliet Goldbart - Introduction To Qualitative Metho...Let's Talk Research 2015 -Juliet Goldbart - Introduction To Qualitative Metho...
Let's Talk Research 2015 -Juliet Goldbart - Introduction To Qualitative Metho...
 
Implementation science and learning health systems: Connecting the dots
Implementation science and learning health systems:  Connecting the dotsImplementation science and learning health systems:  Connecting the dots
Implementation science and learning health systems: Connecting the dots
 
Why is Test Driven Development so hard to implement in an analytics platform?
Why is Test Driven Development so hard to implement in an analytics platform?Why is Test Driven Development so hard to implement in an analytics platform?
Why is Test Driven Development so hard to implement in an analytics platform?
 
Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)
 
The Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer DatasetThe Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer Dataset
 
Öppen data och forskningens genomslag
Öppen data och forskningens genomslagÖppen data och forskningens genomslag
Öppen data och forskningens genomslag
 
Rebecca Grant - Publishers and RDM
Rebecca Grant - Publishers and RDMRebecca Grant - Publishers and RDM
Rebecca Grant - Publishers and RDM
 

More from Ann-Marie Roche

How predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinarHow predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinar
Ann-Marie Roche
 
Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017
Ann-Marie Roche
 
Oil&Gas Thought Leader Webinar - New Plays for Old Ideas - Dr.Gabor Tari
Oil&Gas Thought Leader Webinar - New Plays for Old Ideas - Dr.Gabor TariOil&Gas Thought Leader Webinar - New Plays for Old Ideas - Dr.Gabor Tari
Oil&Gas Thought Leader Webinar - New Plays for Old Ideas - Dr.Gabor Tari
Ann-Marie Roche
 
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Rob Forkner
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Rob ForknerOil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Rob Forkner
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Rob Forkner
Ann-Marie Roche
 
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Sander Houben
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Sander HoubenOil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Sander Houben
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Sander Houben
Ann-Marie Roche
 
Embase for pharmacovigilance: Search and validation March 22 2017
Embase for pharmacovigilance: Search and validation March 22 2017Embase for pharmacovigilance: Search and validation March 22 2017
Embase for pharmacovigilance: Search and validation March 22 2017
Ann-Marie Roche
 
Literature Management for Pharmacovigilance: Outsource or in-house solution? ...
Literature Management for Pharmacovigilance: Outsource or in-house solution? ...Literature Management for Pharmacovigilance: Outsource or in-house solution? ...
Literature Management for Pharmacovigilance: Outsource or in-house solution? ...
Ann-Marie Roche
 
Finding the right medical device information in embase 11 2016
Finding the right medical device information in embase 11 2016Finding the right medical device information in embase 11 2016
Finding the right medical device information in embase 11 2016
Ann-Marie Roche
 
Medical device reporting 27 sep2016
Medical device reporting 27 sep2016Medical device reporting 27 sep2016
Medical device reporting 27 sep2016
Ann-Marie Roche
 
Eac webinar 09.21.2016
Eac webinar 09.21.2016Eac webinar 09.21.2016
Eac webinar 09.21.2016
Ann-Marie Roche
 
Literature monitoring for pv what are we doing at galderma elsevier webinar
Literature monitoring for pv   what are we doing at galderma elsevier webinarLiterature monitoring for pv   what are we doing at galderma elsevier webinar
Literature monitoring for pv what are we doing at galderma elsevier webinar
Ann-Marie Roche
 
Drug analytics based on triple linking v1.0
Drug analytics based on triple linking v1.0Drug analytics based on triple linking v1.0
Drug analytics based on triple linking v1.0
Ann-Marie Roche
 
Knovel lss webinar
Knovel lss webinarKnovel lss webinar
Knovel lss webinar
Ann-Marie Roche
 
Reaxys rmc unified platform_ webinar_
Reaxys rmc unified platform_ webinar_Reaxys rmc unified platform_ webinar_
Reaxys rmc unified platform_ webinar_
Ann-Marie Roche
 
Pathway studiosymposium lorenzi
Pathway studiosymposium lorenziPathway studiosymposium lorenzi
Pathway studiosymposium lorenzi
Ann-Marie Roche
 
Searching literature databases for post authorisation safety studies (pass)
Searching literature databases for post authorisation safety studies (pass)Searching literature databases for post authorisation safety studies (pass)
Searching literature databases for post authorisation safety studies (pass)
Ann-Marie Roche
 
Julie glanville embase sunrise seminar may 2016
Julie glanville embase sunrise seminar may 2016Julie glanville embase sunrise seminar may 2016
Julie glanville embase sunrise seminar may 2016
Ann-Marie Roche
 
Ian crowlesmith embase retrospective mla 2016
Ian crowlesmith embase retrospective mla 2016Ian crowlesmith embase retrospective mla 2016
Ian crowlesmith embase retrospective mla 2016
Ann-Marie Roche
 
Ivan krstic embase update mla 2016
Ivan krstic embase update mla 2016Ivan krstic embase update mla 2016
Ivan krstic embase update mla 2016
Ann-Marie Roche
 
Kp bloch psm preparedness final rev
Kp bloch psm preparedness final revKp bloch psm preparedness final rev
Kp bloch psm preparedness final rev
Ann-Marie Roche
 

More from Ann-Marie Roche (20)

How predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinarHow predictive models help Medicinal Chemists design better drugs_webinar
How predictive models help Medicinal Chemists design better drugs_webinar
 
Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017Webinar: New RMC - Your lead_optimization Solution June082017
Webinar: New RMC - Your lead_optimization Solution June082017
 
Oil&Gas Thought Leader Webinar - New Plays for Old Ideas - Dr.Gabor Tari
Oil&Gas Thought Leader Webinar - New Plays for Old Ideas - Dr.Gabor TariOil&Gas Thought Leader Webinar - New Plays for Old Ideas - Dr.Gabor Tari
Oil&Gas Thought Leader Webinar - New Plays for Old Ideas - Dr.Gabor Tari
 
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Rob Forkner
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Rob ForknerOil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Rob Forkner
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Rob Forkner
 
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Sander Houben
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Sander HoubenOil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Sander Houben
Oil&Gas Thought-Leader Webinar - New Plays for Old Ideas - Dr. Sander Houben
 
Embase for pharmacovigilance: Search and validation March 22 2017
Embase for pharmacovigilance: Search and validation March 22 2017Embase for pharmacovigilance: Search and validation March 22 2017
Embase for pharmacovigilance: Search and validation March 22 2017
 
Literature Management for Pharmacovigilance: Outsource or in-house solution? ...
Literature Management for Pharmacovigilance: Outsource or in-house solution? ...Literature Management for Pharmacovigilance: Outsource or in-house solution? ...
Literature Management for Pharmacovigilance: Outsource or in-house solution? ...
 
Finding the right medical device information in embase 11 2016
Finding the right medical device information in embase 11 2016Finding the right medical device information in embase 11 2016
Finding the right medical device information in embase 11 2016
 
Medical device reporting 27 sep2016
Medical device reporting 27 sep2016Medical device reporting 27 sep2016
Medical device reporting 27 sep2016
 
Eac webinar 09.21.2016
Eac webinar 09.21.2016Eac webinar 09.21.2016
Eac webinar 09.21.2016
 
Literature monitoring for pv what are we doing at galderma elsevier webinar
Literature monitoring for pv   what are we doing at galderma elsevier webinarLiterature monitoring for pv   what are we doing at galderma elsevier webinar
Literature monitoring for pv what are we doing at galderma elsevier webinar
 
Drug analytics based on triple linking v1.0
Drug analytics based on triple linking v1.0Drug analytics based on triple linking v1.0
Drug analytics based on triple linking v1.0
 
Knovel lss webinar
Knovel lss webinarKnovel lss webinar
Knovel lss webinar
 
Reaxys rmc unified platform_ webinar_
Reaxys rmc unified platform_ webinar_Reaxys rmc unified platform_ webinar_
Reaxys rmc unified platform_ webinar_
 
Pathway studiosymposium lorenzi
Pathway studiosymposium lorenziPathway studiosymposium lorenzi
Pathway studiosymposium lorenzi
 
Searching literature databases for post authorisation safety studies (pass)
Searching literature databases for post authorisation safety studies (pass)Searching literature databases for post authorisation safety studies (pass)
Searching literature databases for post authorisation safety studies (pass)
 
Julie glanville embase sunrise seminar may 2016
Julie glanville embase sunrise seminar may 2016Julie glanville embase sunrise seminar may 2016
Julie glanville embase sunrise seminar may 2016
 
Ian crowlesmith embase retrospective mla 2016
Ian crowlesmith embase retrospective mla 2016Ian crowlesmith embase retrospective mla 2016
Ian crowlesmith embase retrospective mla 2016
 
Ivan krstic embase update mla 2016
Ivan krstic embase update mla 2016Ivan krstic embase update mla 2016
Ivan krstic embase update mla 2016
 
Kp bloch psm preparedness final rev
Kp bloch psm preparedness final revKp bloch psm preparedness final rev
Kp bloch psm preparedness final rev
 

Recently uploaded

(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
Scintica Instrumentation
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
Leonel Morgado
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
Carl Bergstrom
 
11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf
PirithiRaju
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
PsychoTech Services
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
Leonel Morgado
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
Areesha Ahmad
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
Aditi Bajpai
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
LengamoLAppostilic
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
Shashank Shekhar Pandey
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
Sciences of Europe
 

Recently uploaded (20)

(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
 
Immersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths ForwardImmersive Learning That Works: Research Grounding and Paths Forward
Immersive Learning That Works: Research Grounding and Paths Forward
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
 
11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf11.1 Role of physical biological in deterioration of grains.pdf
11.1 Role of physical biological in deterioration of grains.pdf
 
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
Sexuality - Issues, Attitude and Behaviour - Applied Social Psychology - Psyc...
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
 

Text mining full text for molecular targets

  • 1. Country Long Distance Australia +61 3 8488 8993 Austria +43 (0) 7 2088 2171 Belgium +32 (0) 42 68 0164 Canada +1 (647) 497-9386 Denmark +45 (0) 89 88 04 43 Finland +358 (0) 931 58 4587 France +33 (0) 182 880 933 Germany +49 (0) 692 5736 7304 Ireland +353 (0) 19 036 186 Text Mining Full Text for Molecular Targets with George Jiang, Ph.D., M.B.A Our Webinar will begin in a few minutes Country Long Distance Italy +39 0 294 75 15 36 Netherlands +31 (0) 108 080 115 New Zealand +64 (0) 9 801 0293 Norway +47 21 03 72 89 Spain +34 911 23 4247 Sweden +46 (0) 852 500 292 Switzerland +41 (0) 435 0824 40 United Kingdom +44 (0) 330 221 9921 United States +1 (646) 307-1726 TO USE YOUR COMPUTER'S AUDIO: When the webinar begins, you will be connected to audio using your computer's microphone and speakers (VoIP). A headset is recommended. --OR-- TO USE YOUR TELEPHONE: If you prefer to use your phone, you must select "Use Telephone" after joining the webinar and call in using the numbers below. Dial your country’s number and then use Access Code: 655-028-479
  • 2. Text Mining Full Text for Molecular Targets George Jiang, PhD, MBA Product Manager, Text Mining g.jiang@elsevier.com March 31, 2015
  • 3. George Jiang Product Manager Text Mining Trained scientist with several years of experience in text analytics, data integration, and scientific software development • Currently, Product Manager with Elsevier working on text mining projects and semantic search products, based out of Rockville, MD • Previously, worked at US National Center for Biotechnology Information (NCBI) working on Discovery Initiative to understand users needs and crosslink data and expose it to make research information more discoverable Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 4. World Leader in Digital Information Solutions Published over 330,000 articles in 2013 Founded over 130 years ago Work with over 30 million scientists, students, health & information professionals Received over 1 million submissions in 2013 SOLUTIONS Over 53 million items indexed by Scopus Elsevier R+D Solutions Elsevier Clinical Solutions Helps corporate researchers, R+D professionals, and engineers improve how they interact with, share, and apply information to solve problems using our digital workflow tools, analytics, and data Provides universities, governments, and research institutions with the resources and insights to improve institutional research strategy, management, and performance. Elsevier Education Helps medical professionals apply trusted data and sophisticated tools to make better clinical decisions, deliver better care, and produce better healthcare outcomes. Helps educate highly-skilled, effective healthcare professionals, using the most advanced pedagogical tools and reference works. Elsevier Research Intelligence CONTENT CAPABILITIESPLATFORMS Publishes over 2,200 online journals & over 26,000 books (e + print) Elsevier eBooks, Online Journals, Databases Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 5. Working With Text is A Big Data Challenge Text is everywhere! We’ve already covered 100s of terms in this presentation.  Twitter - 58M tweets/day x 14.98 words/tweet => 868M words/day => 6B Average journal article = 10, 150, 6000 words in title, abstract , full text  abstracts – 2.4B words (24M abstracts @ PubMed x 100 words/abstract)  full text – 144B words ( if comparable set from PubMed, 25M x 6000 The information deluge of scientific content and how to manage and/or leverage this information is a big data challenge Information seeking challenges can be addressed with automation assistance and text mining for greater insight Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 6. Summary • Text mining can help to sift through large amounts of scientific literature and other textual content • Text mining can help to increase project team efficiency to find precise statements and relationships • Full text articles provide richer result sets that can be useful in finding additional insights that cannot be garnered just using abstracts • Several hurdles still exist to implement text mining but the value can outweigh costs Text mining full text can be used to help find molecular targets of interest quickly that may be missed if relying on abstracts and keyword searching Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 7. Agenda • Introduction to Text Mining • The Value of Full Text Articles • Illustration of Text Mining Full Text Articles • Recap • Q&A Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 8. What is Text Mining? Text Mining • Refers to the process of deriving high-quality structured A Does B X Inhibits Y G Stops D I Drink T documents facts Why Text Mining? • Text Mining can yield better results, and increase team efficiency • The application of text mining techniques can be used to solve business problems Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 9. Example of Getting Structured Information (Facts) Triple negative breast cancer (TNBC) cells lack receptor expression, are frequently more aggressive and are resistant to growth factor inhibition documents sentence fact(s) Tumour cells show greater dependency on glycolysis so providing a sufficient and rapid energy supply for fast growth. In many breast cancers, estrogen, progesterone and epidermal growth factor receptor-positive cells proliferate in response to growth factors and growth factor antagonists are a mainstay of treatment. However, triple negative breast cancer (TNBC) cells lack receptor expression, are frequently more aggressive and are resistant to growth factor inhibition. Downstream of growth factor receptors, signal transduction proceeds via phosphatidylinositol 3-kinase (PI3k), Akt and FOXO3a inhibition, the latter being partly responsible for coordinated increases in glycolysis and apoptosis resistance. FOXO3a may be an attractive therapeutic target for TNBC. Therefore we have undertaken a systematic review of FOXO3a as a target for breast cancer therapeutics. paragraph TNBC cells lack receptor expression TNBC cells are more aggressive TNBC cells resist growth factor inhibition Excerpt from Taylor et al. Evaluating the evidence for targeting FOXO3a in breast cancer: a systematic review. Wordcloud plotted with Wordle.net tokens Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang Text analytics and Visualizations
  • 10. What is Text Mining Being Used For? Use cases include: • Target identification and prioritization • Biomarker discovery • Drug repurposing • Drug safety and finding adverse events • Clinical study design and site selection • Competitive intelligence DISCOVERY PRE- CLINICAL CLINICAL POST- LAUNCH Text mining article submissions for curation assistance in publishing Basic Research Applied Research Text mining can be used to support several research and development areas Information retrieval and analysis of biomedical literature for target identification, systematic reviews, etc. Searching clinical trial data or electronic health records to find signals in patient populations Triage of news and papers for literature curation and regulatory reporting Identifying relevant items for meta-analysis of specific research results Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 11. How to Text Mine? • Content • Ontology • Software solution(s) • Expertise Several pieces and steps are often needed to get results from text mining Aggregate1 Structure2 Normalize3 Integrate4 • PDF -> XML • XML quality differs • XML uniformity e.g. dealing with sources, types, etc. Default or custom ontology • Text mining the corpus • Balancing expectations of precision and recall 1. Aggregate 2. Structure 3. Normalize 4. Integrate Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang Text Mining solutions & Professional Services
  • 12. Elsevier Offers Several Text Mining Solutions facts and data out support downstream applications and activities Aggregate Normalize Structure Integrate 1 2 3 4 Journals and Books Internal content Patents Other Software solution UI / API Public data sources User Questions Software solutions and Professional Services available for text mining and semantic searching Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 13. • Introduction to Text Mining • The Value of Full Text Articles • Illustration of Text Mining Full Text Articles • Recap • Q&A Agenda Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 14. Abstracts vs Full Text • Concise summaries • Readily accessible • Relatively uniform Summary of main differences • Complete documents • May not be as accessible • Information within can vary Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 15. Benefits of Using Full Text • Distribution of keywords, facts and relations – more keywords, facts and relations are found in full text • Concept under-representation in abstracts – specific entities may not be mentioned in abstracts but primarily in full text sections e.g. biological functions • Missing Negative data – often negative results or non-significant data are missing from abstracts • Citations per article – full text sections are more cited vs abstracts • Timeliness – Relevant facts and relationships can be found in full text first before any mentions in abstracts as researchers surmise in Full Text provide richer results sets Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 16. Additional Reading • Information extraction from full text scientific articles: where are the keywords? BMC Bioinformatics. 2003 May 29;4:20. Epub 2003 May 29. • Beyond genes, proteins, and abstracts: Identifying scientific claims from full-text biomedical articles. J Biomed Inform. 2010 Apr;43(2):173-89. doi: 10.1016/j.jbi.2009.11.001. Epub 2009 Nov 10. • Do Peers See More in a Paper Than Its Authors? Adv Bioinformatics. 2012;2012:750214. doi: 10.1155/2012/750214. Epub 2012 Nov 27. • Is searching full text more effective than searching abstracts? Bioinformatics. 2009 Feb 3;10:46. doi: 10.1186/1471-2105-10-46. • Challenges for automatically extracting molecular interactions from full-text articles. BMC Bioinformatics. 2009 Sep 24;10:311. doi: 10.1186/1471-2105-10-311. • Semi-Automatic Indexing of Full Text Biomedical Articles. AMIA Annu Symp Proc. 2005:271-5. • Discovering implicit associations between genes and hereditary diseases. Pac Symp Biocomput. 2007:316-27. • The structural and content aspects of abstracts versus bodies of full text journal articles are different. BMC Bioinformatics. 2010 Sep 29;11:492. doi: 10.1186/1471-2105-11-492. • Abstracts in high profile journals often fail to report harm. BMC Med Res Methodol. 2008 Mar 27;8:14. doi: 10.1186/1471-2288-8-14. • Quality of abstracts of original research articles in CMAJ in 1989. CMAJ. 1991 Feb 15;144(4):449-53. • Accuracy of data in abstracts of published research articles. JAMA. 1999 Mar 24-31;281(12):1110-1. Articles highlighting the differences between abstracts and full text Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 17. Abstract vs Full Text Example Challenges  Sifting through more information!  Finding the right results Concise abstracts cannot contain all details whereas full text will contain all the relevant information Significant advances have been made in the treatment of human immunodeficiency virus (HIV) infection over the past two decades. Improved therapy has prolonged survival and improved clinical outcome for HIV-infected children and adults. Sixteen antiretroviral (ART) medications have been approved for use in pediatric HIV infection. The Department of Health and Human Services (DHHS) has issued “Guidelines for the Use of Antiretroviral Agents in Pediatric HIV Infection”, which provide detailed information on currently recommended antiretroviral therapies (ART). However, consultation with an HIV specialist is recommended as the current therapy of pediatric HIV therapy is complex and rapidly evolving. Elvitegravir is a once daily integrase inhibitor being studied in adults. Children with treatment failure should be evaluated for medication adherence, drug intolerance, and possible drug interactions which may lessen the efficacy of the therapeutic regimen. Abstract Full Text Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 18. • Introduction to Text Mining • The Value of Full Text Articles • Illustration of Text Mining Full Text Articles • Recap • Q&A Agenda Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 19. • Use Elsevier Text Mining solution to search against corpus of biomedical literature • Abstracts – MEDLINE/PubMed (24M) • Full text – PubMed Central, Elsevier and partner publishers (4M) • Refine results corpus, redefine search / text mining output • Review and analyze data • Create visual data reports using other tools available Methods Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 20. Search against scientific literature corpus for sentences related to efficacy If looking for details, one really needs to look at the full text results Text Mining Abstracts vs Full Text Word clouds suggest insight differences between abstracts and full text Full textAbstracts Only Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 21. Full text provides insights into the specific mutations implicated in differential enzymatic efficacy of a particular drug class Finding Molecular Targets in Full Text Word clouds illustrating differences in point mutations mentioned Full TextAbstracts Only Gives insight into the mutations implicated for changes in efficacy. No mutations mentioned in abstracts of comparable document set. Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 22. Finding Molecular Targets in Full Text Example searching for cancer immunity checkpoint proteins Full text provides insights into additional protein targets that may be of interest for cancer immunology research in cancer checkpoints Full TextAbstracts Only Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 23. Text Mining Results Can Then Be Used For Analyses • Review results. Not just keyword matching anymore …  identifying more relevant documents for review  identifying relationships and precise statements  Identifying other targets/content of interest • Link data to other items of interest • Analytics, visualization and system/network analysis e.g. Pathway Studio, Cytoscape • Integrate text mining data and process into different workflows for project quality and efficiency Text mining results can be used to improve scientific research and can be used to address business problems Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 24. Text Mining Finds Answers Faster & Increases Efficiency An Example Project Comparison Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang Savings:  Text mining robustly identifies the relevant articles  Savings of 171 person-days per project  Allows more projects/higher quality with same staff Keyword searching: Text Mining: Finds 1,408 articles Many of them not relevant Identifies 142 relevant articles 176 person-days to review @ 20 min/article 5 person-days to review @ 20 min/article VS 24 Writing comprehensive state of the science review article on the chemical toxicity of a particular substance
  • 25. Relationship map using Elsevier Text Mining results into Cytoscape visualization NLP Example of Visual Insights of Text Mining Results Intersecting adverse events between two anti-TNF drugs Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 26. Summary • Text mining can help to sift through large amounts of scientific literature and other textual content • Text mining can help to increase project team efficiency to find precise statements and relationships • Full text articles provide richer result sets that can be useful in finding additional insights that cannot be garnered just using abstracts • Several hurdles still exist to implement text mining but the value can outweigh costs Text mining full text can be used to help find molecular targets of interest quickly that may be missed if relying on abstracts and keyword searching Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang
  • 27. Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang Thank you for joining our webinar today: Text Mining Full Text for Molecular Targets with George Jiang, Ph.D., M.B.A If you have any questions for our speaker, please type them into the CHAT window. If you would like more information you can contact: George Jiang g.jiang@elsevier.com Text Mining Full Text for Molecular Targets – March 31, 2015 – George Jiang

Editor's Notes

  1. Who are we? Elsevier is a digital information solutions company with roots in publishing world class, peer-reviewed scientific, medical, and technical literature, going back 130 years. What is our Mission? Elsevier’s abiding purpose—it’s brand essence—is to empower knowledge and to empower its clients through knowledge. It is to perpetuate knowledge as a vital, organic set of discoveries oriented toward truth and, ultimately, solutions to fundamental human challenges. This mission of empowerment is accomplished by the application of sophisticated digital technology and analytics to some of the world’s greatest scientific, technical, and medical content, which Elsevier has helped to produce, under the peer review system, for over 130 years. Empowered Knowledge. The Knowledge that Empowers. What do we do? We help professionals advance knowledge by expanding it as a body of confirmed facts and ideas, and getting it to yield positive, measurable—sometimes ground-breaking—outcomes in these disciplines (example: more Nobel Laureate authors published in the last half-century than by any other publisher) What is ‘the product?’ We continue to produce intellectual content—largely in the form of digitized books, journals, and proprietary databases—delivering access to them via the internet and other offline digital channels (examples: journals (Lancet, imprints (Cell, MK), ScienceDirect, Mendeley, Scopus) In addition to—and layered over—this content, are technology and analytics (tools and solutions), that allow clients and end-users to do ‘more with’ information: to produce it, interact with it, manipulate it, and share it with greater facility, efficiency, and creativity (examples: ClinicalKey, Reaxys, SciVal, SimChart) What are the Benefits of working with us? Elsevier empowers knowledge professionals to be more collaborative and competitive, efficient and effective; to perform better, and to create knowledge with impact.
  2. Today, focus is on scientific literature space and full text articles Sources of data for various statistics Oxford University Press Cognition article Manual copy and paste and word counting
  3. Circle back why text mining is used for use cases