November 2017
MyScienceWork
TDM / NLP Enhanced Platforms & Services
Virginie SIMON, CEO
Yann MAHE, VP Sales & Marketing
AGENDA
1. Introduction – Global platform for researchers, institutions and publishers – Virginie SIMON, CEO – MyScienceWork
2. Social Network & Publications – Patents Database – Yann MAHE, Sales & Marketing Director – MyScienceWork
3. Institutional Repository & Innovative Research Monitoring Infrastructure – Yann MAHE, Sales & Marketing Director – MyScienceWork
✓ Case studies – Marc Diederich, Head of Lab – LBMCC
4. Publishers Solutions – Yann MAHE, Sales & Marketing Director – MyScienceWork
✓ Testimony – Darrell W. Gunter, STM Director North America
5. Conclusion – Virginie SIMON, CEO – MyScienceWork
Global platform
for researchers, institutions
and publishers
Virginie SIMON
CEO
MyScienceWork: A company sustainably rooted in Luxembourg
• Created in 2010 by Virginie Simon and Tristan Davaille
• About $1,422,626 in share capital
• 5 M€ invested
• 3 offices in San Francisco, Luxembourg and Paris since 2012
  - IP portfolio, R&D team and IT team based in Luxembourg
  - Business team based in Paris
  - Executive team based in San Francisco
• Team of 15 people
• R&D team: Machine Learning, AI Semantics
MyScienceWork Overview: Key Products, Services & Clients
Sirius is a big data service provider (since Jan 2017)
• Analysis of scientific production, innovation drive and strategic research decisions
• Research mapping
• Online dashboards and science metrics
• Bibliometric studies and evaluation reports
CUSTOMERS: Institutions, Publishers & Corporations
Business Model: On quote
Polaris is an interconnected repository solution (since Jan 2015)
• Archive scientific publications
• Promote research output to raise international visibility
• Institutional dashboard
• Metrics on visibility and activity
• Statistics and analysis of performance
CUSTOMERS: Institutions & Publishers
Business Model:
> SaaS solution: annual license
> On-site solution: set-up fees
MyScienceWork is the 1st global open access platform for science
• >70M scientific publications
• >12M patents
• >1M visitors per month
• Top visitor country: US
• 1.8M publications indexed by Google Scholar (main partner)
• Individual profile / statistics
• Collaborative tools
CUSTOMERS: Publishers services providers
Business Model: $ per click
From Institutional Repositories To Global Big Data Service
Development of advanced technologies such as the Semantic Web, Natural Language Processing (NLP) and Machine Learning enabled the launch of Sirius.
Diagram labels:
• Sources: scientific articles from 38K journals (Open Access & non-OA); patents from USPTO (US) & EPO (EU); direct upload (traffic)
• Flows: harvesting, curation; peer review & registration
• Database: +70M publications, +12M patents, +38M authors & inventors, +35K institutions & corporations
• Analysis (keywords, concepts, multilingual, citations, etc.)
BIG DATA SCIENTIFIC SERVICES:
- Customized innovation studies: trend research, benchmarking
- Support for IP strategy decisions
- Expert targeting
SOCIAL NETWORK
PUBLICATIONS – PATENTS DATABASE
Yann Mahé
Sales & Marketing Director
70M publications
12M patents (USPTO, EPO)
38K journals
38M authors / inventors
500K members
35K affiliations
/ institutions
Semantics / NLP
Machine-Learning
Data mining / TDM
Free discovery & promotion platform
for scholarly content
ADDED VALUE FOR RESEARCHERS
Free discovery & promotion platform for researchers
• Discovery engine for worldwide science
• Promotion of individual profiles
• Sharing of scientific papers and networking with peers
• Collaboration tool
• Tips for science papers & profiles
• Individual research impact statistics
Semantic Discovery Search Engine
Publication details
• Main document types: published articles, preprints, theses, posters, patents
• All relevant metadata
• PDF full text of OA articles
• Article timeline
• Comments
• Statistics
• Similar articles suggested
Profile details
• Automated publication completion
• By completing your profile and uploading your publications, you will:
  - Increase your visibility
  - Get more collaboration proposals
  - Make MSW content suggestions more relevant
  - Increase your citation score
Researcher Profile
Collaboration Tool
Find and connect with researchers and experts to initiate collaboration and find their papers/articles
Dashboard and metrics
• Profile views
• Publication views
• Publication downloads
• Publication reads
A FEW LAST WORDS
With MyScienceWork, researchers can:
• Easily create and complete a profile
• Increase their citation impact
• Discover publications
• Find researchers/peers for collaboration
Institutional Repository
and
Innovative Research
Monitoring Infrastructure
Yann Mahé
Sales & Marketing Director
An embedded network of repositories
• 4 key functions:
  - Archive / Centralize
  - Collaborate
  - Analyze
  - Communicate
• Objectives:
  - Build a global network of repositories customized at the institution level
  - Increase the national and international visibility of institutions’ research activities
Main Benefits
• Manage
  - Centralize scientific publications (ideally in open access)
  - Manage the publication workflow and metadata quality/enrichment
• Promote
  - Present a clear, comprehensive image of research work
  - Disseminate publications and research results as widely as possible
  - Communicate research work to an international community
• Connect
  - Create a scientific profile for every member
  - Construct a professional network of experts
  - Facilitate access to the latest scientific advances
• Analyze
  - Track the progress of publications and evaluate their influence
  - Understand the readership and potential collaborators
  - Measure the impact of your scholarly outputs and the trends of research
Solution Integration
• The solution can be integrated through:
  - SaaS solution
  - On-site setup
• Interoperability with:
  - Institution information system
  - Existing institutional website
  - Existing archiving systems such as arXiv...
• Interoperability with metadata resources:
  - Author ID: ORCID
  - License: SHERPA/RoMEO
  - DOI provider: Crossref
  - Others: PubMed, arXiv, Caltech, USPTO…
  - Bibliometric: Scopus, Web of Science, Altmetric
A « Data Driven » Methodology
1. Preliminary Study
• Existing databases, resources, processes, subscriptions…
• Existing & expected metrics
2. Corpus Constitution
• Internal/external, existing/complementary databases extraction
3. Workflow Processing
• Publication/metadata process management workflow
4. Pre-Processing Data
• Data model
• Cleaning, disambiguation, deduplication, enrichment
5. Processing Data
• Data enrichment: citations, keywords, abstract, full text
6. Metrics & Visualization
• Existing metrics: number of publications, etc.
• New metrics: collaboration, citation, etc.
7. Final Dashboard
• Analysis, report production
• Dynamic dashboard
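As an illustration of step 4 of this methodology (cleaning and deduplication), here is a minimal Python sketch. It is not MyScienceWork's pipeline: the record layout (`title`, `doi` keys) and the matching rule (same DOI or same normalized title) are assumptions made for the example.

```python
import re
import unicodedata


def normalize_title(title: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", title)
    ascii_only = decomposed.encode("ascii", "ignore").decode("ascii")
    return re.sub(r"[^a-z0-9]+", " ", ascii_only.lower()).strip()


def deduplicate(records: list[dict]) -> list[dict]:
    """Keep the first record for each DOI or normalized title (assumed rule)."""
    seen: set[str] = set()
    unique: list[dict] = []
    for record in records:
        # Every record gets a title key; a DOI key is added when a DOI exists.
        keys = {"title:" + normalize_title(record.get("title", ""))}
        doi = (record.get("doi") or "").strip().lower()
        if doi:
            keys.add("doi:" + doi)
        # A record is new only if none of its keys has been seen before.
        if seen.isdisjoint(keys):
            unique.append(record)
        seen.update(keys)
    return unique


records = [
    {"title": "Étude des Réseaux: A Survey", "doi": "10.1000/ABC"},
    {"title": "Etude des reseaux - a survey", "doi": None},
    {"title": "Different title", "doi": "10.1000/abc"},
    {"title": "Another paper"},
]
unique_records = deduplicate(records)
```

Real author and affiliation disambiguation is considerably harder than this title/DOI matching; the sketch only shows the general shape of a cleaning pass.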
Data Stream
Diagram labels:
• Publication Management Solution, Dashboard
• Internal information system: employee repository, other content repositories, internal tools
• Websites: www.institutional.com, other websites
• Bibliometric database, citation database, altmetrics database
• Metadata database, others
• External repositories and search engines, others
• Hosting: external servers (option: hosted on your own servers)
• Users: authors, repository manager
Workflow for publication deposit
With auto-recovery of metadata:
1. Upload file(s)
2. Auto-recovery of metadata
  - Mandatory metadata have all been generated: confirm your deposit if all information has been filled in
  - Mandatory information is missing: click on the red field(s) and fill them in
and/or
With manual entry:
1. Upload file(s)
2. Fill in metadata
3. Fill in author’s data
4. Confirm deposit
→ Open institutional repositories
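The mandatory-metadata check in this deposit workflow can be sketched as follows. This is an illustrative Python example, not the platform's code: the list of mandatory fields and the `Deposit` class are hypothetical, since the slides do not name the required fields.

```python
from dataclasses import dataclass, field

# Hypothetical mandatory fields; the slides do not list the real ones.
MANDATORY_FIELDS = ("title", "authors", "publication_year", "document_type")


@dataclass
class Deposit:
    files: list[str]
    metadata: dict = field(default_factory=dict)


def missing_fields(deposit: Deposit) -> list[str]:
    """Return mandatory fields that are absent or empty (the 'red fields')."""
    return [name for name in MANDATORY_FIELDS if not deposit.metadata.get(name)]


def can_confirm(deposit: Deposit) -> bool:
    """A deposit can be confirmed once a file is uploaded and nothing is missing."""
    return bool(deposit.files) and not missing_fields(deposit)


# Metadata recovered automatically covers only part of the mandatory fields.
deposit = Deposit(files=["paper.pdf"], metadata={"title": "T", "authors": ["A. Author"]})
to_fill = missing_fields(deposit)

# The depositor fills in the remaining fields by hand.
deposit.metadata.update({"publication_year": 2017, "document_type": "article"})
ready = can_confirm(deposit)
```

Returning the list of missing fields, rather than a plain yes/no, is what lets an interface highlight exactly which fields still need input.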
DASHBOARD – Publications & Impact
➢ Number of publications per year and type
➢ Number of publications per year and research group, with affiliated PI
➢ List of publications by type
➢ h-index ranking
➢ Citations ranking
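For reference, the h-index behind such a ranking is the largest h such that an author has at least h publications with at least h citations each. A minimal Python sketch follows; the function names and the input layout (a list of citation counts per author) are assumptions for illustration, not the dashboard's implementation.

```python
def h_index(citations: list[int]) -> int:
    """Largest h such that at least h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def rank_by_h_index(citations_by_author: dict[str, list[int]]) -> list[tuple[str, int]]:
    """Sort authors by descending h-index, breaking ties alphabetically."""
    scores = [(author, h_index(counts)) for author, counts in citations_by_author.items()]
    return sorted(scores, key=lambda pair: (-pair[1], pair[0]))


ranking = rank_by_h_index({"B": [3, 3, 3], "A": [1], "C": [5, 5, 5, 5]})
```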
DASHBOARD - Collaboration
➢ Collaborators (e.g. co-authors' name/institution/country)
➢ Type of collaborator (e.g. academia, private company, public, etc.)
➢ External (e.g. SES, University of Torino, etc.) and internal (e.g. CSC, Law, etc.)
➢ Number of collaborations (e.g. SnT has a total of 76 collaborations)
➢ Intensity of collaboration (e.g. 5 publications with SES)
➢ Some results could be displayed on a map, for example
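The collaboration metrics listed here can be derived from co-author affiliations. The Python sketch below rests on stated assumptions: each publication is reduced to a list of institution names, "intensity" means the number of joint publications with a partner, and "number of collaborations" is read as the number of distinct partners. The slides do not define these terms precisely.

```python
from collections import Counter


def collaboration_intensity(publications: list[list[str]], home: str) -> Counter:
    """Count, for each partner institution, the publications co-signed with `home`."""
    intensity: Counter = Counter()
    for affiliations in publications:
        if home in affiliations:
            # Use a set so a partner counts once per publication.
            intensity.update(set(affiliations) - {home})
    return intensity


# Toy data reusing institution names from the slide; the counts are invented.
publications = [
    ["SnT", "SES"],
    ["SnT", "SES", "University of Torino"],
    ["SES", "University of Torino"],
    ["SnT"],
]
intensity = collaboration_intensity(publications, home="SnT")
number_of_partners = len(intensity)
```

With a country attached to each institution, the same counter could feed a map view.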
Top Sources, journals & conferences
by number of articles in all sources and in journals
Publications
Evolution over time by document type
Top institutions Collaboration
by number of publications & by number of citations
Top Countries Collaboration
by number of publications
CASE STUDIES
Marc Diederich
Head of Lab, LBMCC
QUESTIONS?
PUBLISHERS SOLUTIONS
Yann Mahé
Sales & Marketing Director
WHY SHOULD PUBLISHERS INDEX THEIR CONTENT IN THE MYSCIENCEWORK DATABASE?
Copyright Issue
Based on a random sample of English-language articles drawn from ResearchGate, a recent study showed that:
« The key finding was that 201 (51.3%) out of 392 non-OA articles infringed the copyright and were non-compliant with publishers’ policy. »
« Copyright compliance and infringement in ResearchGate full-text journal articles » - Scientometrics - Springer (July 2017)
MYSCIENCEWORK SOLUTION
MSW Copyright Control
USUAL SITUATION (social networks): Upload file(s) → no copyright control, no license check
MYSCIENCEWORK SOLUTION: Upload file(s) or publisher content → automated copyright control, license check
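To make the idea of an automated license check concrete, here is a hedged Python sketch. It does not describe MyScienceWork's actual control: the policy table, the ISSNs, the version labels and the rule "unknown journal means metadata only" are all hypothetical. In practice such policies would come from a resource like SHERPA/RoMEO.

```python
from dataclasses import dataclass

# Hypothetical policy table: ISSN -> article versions the publisher allows to be shared.
POLICIES: dict[str, set[str]] = {
    "1234-5678": {"preprint", "accepted"},
    "8765-4321": set(),
}


@dataclass(frozen=True)
class Upload:
    issn: str
    version: str  # "preprint", "accepted" or "published"
    open_access: bool = False


def may_display_full_text(upload: Upload, policies: dict[str, set[str]] = POLICIES) -> bool:
    """Decide whether the uploaded full text may be shown publicly."""
    if upload.open_access:
        return True
    allowed = policies.get(upload.issn)
    if allowed is None:
        # Unknown journal: conservative assumption, show metadata only.
        return False
    return upload.version in allowed


accepted_ok = may_display_full_text(Upload("1234-5678", "accepted"))
published_blocked = may_display_full_text(Upload("1234-5678", "published"))
```

When the check fails, the record can still be listed with its metadata and a link to the publisher's site, so the full text is withheld without hiding the publication.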
WEBSITE TRAFFIC INCREASE
Publisher content (metadata and/or full-text URL) → MyScienceWork (1M monthly visitors) → publisher website
Free traffic increase
Some of our Partners
SIRIUS DISSEMINATION
DASHBOARD
DISSEMINATION CONTENT DASHBOARD
PUBLISHER CONTENT VS INSTITUTIONAL REPOSITORIES
# Journals: 115
# Articles: 3520
# Authors: 153
# Full text: 2549
DISSEMINATION CONTENT DASHBOARD
PUBLISHER CONTENT
Journals: 115
Articles: 3520
Authors: 153
Full text: 2549
INSTITUTIONAL REPOSITORIES / COUNTRIES: # and list
Benefits for publishers
With MyScienceWork, publishers can:
• Control copyright on msw.com
• Increase the visibility of their content
• Increase the traffic on their website
• Analyze and qualify their content in institutional repositories (IR)
CASE STUDIES
Darrell W. Gunter
Director, North America, Director Memberships,
International Association of STM Publishers
Darrell’s Expertise
• 20+ years in the scholarly publishing industry
• Elsevier: launched the Americas Regional Sales Office
Why?
• Applications focused on the researcher!
• Profiles to showcase a researcher’s body of research
• Ability to collaborate with like-minded researchers
• Repositories to demonstrate an institution’s body of work
• Applications to analyze and unlock the value of the body of work
QUESTIONS?
A FEW LAST WORDS
Virginie SIMON
CEO
For more information
yann.mahe@mysciencework.com
www.mysciencework.com
Yann Mahé,
Sales & Marketing Director

MyScienceWork & NFAIS - Webinar 07 11 2017
