OpenAIRE-COAR conference 2014: Content Mining in Practice: Challenges and opportunities for publishers, by Cameron Neylon - PLOS

Presentation at the OpenAIRE-COAR Conference: "Open Access Movement to Reality: Putting the Pieces Together", Athens - May 21-22, 2014.
Session 3: Maximizing the exploitation of open research results through text mining.
Content Mining in Practice: Challenges and opportunities for publishers, by Cameron Neylon - Advocacy Director for PLOS

  1. Mining, Myths and the Parthenon: Searching for a narrative of value and values. Cameron Neylon – OpenAIRE/COAR Meeting – Athens, 22 May 2014 – @cameronneylon – cneylon@plos.org
  2. https://www.flickr.com/photos/lamnatos/4520967783 CC BY
  3. Myths…
  4. http://www.flickr.com/photos/10101046@N06/3484426248 CC BY
  5. http://www.flickr.com/photos/rosemania/86741023 CC BY
  6. http://www.flickr.com/photos/michiganmoves/3375583341 CC BY-SA
  7. Myth 1: Researchers don’t want TDM
  8. Researchers need TDM…
  9. …but ask for faster horses
  10. Myth 2: “TDM will crush our servers…”
  11. PLOS ONE traffic over six months [chart: page views per hour over time; y-axis 0 to 11,000]
  12. PLOS ONE traffic over six months [chart: page views per hour over time; y-axis 0 to 11,000]
  13. http://www.reddit.com/r/science/comments/1hv933/a_mere_60_minutes_of_aerobic_exercise_could/
  14. Page views per hour [bar chart: Daily average, Daily peak, Reddit, DDOS, Text mining; y-axis 0 to 30,000]
  15. Page views per hour [bar chart: Daily average, Daily peak, Reddit, DDOS, Text mining; y-axis 0 to 600,000]
  16. Page views per hour [bar chart: Daily average, Daily peak, Reddit, DDOS, Text mining; y-axis 0 to 600,000]
  17. Page views per hour [bar chart: Daily average, Daily peak, Reddit, DDOS, Text mining; y-axis 0 to 25,000]
  18. Page views per hour [bar chart: Daily average, Daily peak, Reddit, DDOS, Text mining; y-axis 0 to 10,000]
  19. Page views per hour [bar chart: Daily average, Daily peak, Reddit, DDOS, Text mining; y-axis 0 to 5,000]
  20. If you can’t manage TDM, you’re not competent to run a modern web service
  21. Myth 3: “This isn’t core business/revenue”
  22. http://www.flickr.com/photos/markwalker/3749673425 CC BY-SA
  23. “Publishing” is no longer a USP
  24. It’s not filtering… http://www.flickr.com/photos/spence_sir/2291938423 CC-BY
  25. …it’s annotation http://www.flickr.com/photos/58558794@N07/8682293085 CC BY
  26. It’s not distribution… US Defence Department: 090807-N-5749W-394.jpg
  27. …it’s dissemination http://www.flickr.com/photos/65208723@N07/6032293799 CC BY
  28. “If we allow TDM Peter Murray-Rust will distribute all of our content”
  29. Our core business is to get authors’ work in the hands of those who can use it http://www.flickr.com/photos/thebarrowboy/7646188700 CC BY
  30. How do we do this?
  31. (no text)
  32. Licensing is not sufficient…
  33. http://almreports.plos.org/reports/visualizations/9939
  34. http://www.flickr.com/photos/jwyg/4528443760/ CC-BY-SA
  35. http://www.flickr.com/photos/theincidental/3459777668 CC BY
  36. Other people do this for us…
  37. …so control is in direct tension with dissemination
  38. Enabling TDM is core business for scholarly publishers…
  39. …in serving all of our customers and users…
  40. …and will become a core market differentiator. Search: PLOS blogs content mining
  41. Impact
  42. Impact, Research, Economic, Cultural, Education, Health, Environment
  43. Impact, Research, Economic, Cultural, Education, Health, Environment
  44. Research, Economic, Cultural, Education, Health, Environment
  45. Research, Economic, Cultural, Education, Health, Environment, Research Outputs
  46. Research, Economic, Cultural, Education, Health, Environment, Research Outputs
  47. Research, Research Outputs
  48. Research, Research Outputs
  49. Research
  50. Research, Citation
  51. Research, Bookmark, Citation
  52. Research, Bookmark, Citation, Previous co-authorship
  53. Research, Bookmark, Citation, Previous co-authorship, Conference co-attendance, Social media conversation
  54. http://www.flickr.com/photos/verzo/8020565592/ CC BY
  55. http://www.flickr.com/photos/andrein/372192048/ CC BY-SA
  56. https://www.flickr.com/photos/see-through-the-eye-of-g/5392290809 CC BY
  57. https://www.flickr.com/photos/jenny-pics/3239638494 CC BY
  58. https://www.flickr.com/photos/garycycles2/2575676610 CC BY
  59. https://www.flickr.com/photos/53487196@N08/5439253136 CC BY
  60. Counting…? …isn’t much use https://www.flickr.com/photos/ivanwalsh/5082958705 CC BY
  61. Not the end of https://www.flickr.com/photos/oimax/4373114560 CC BY
  62. “Stories that persuade with data” Anita de Waard
  63. “Stories that persuade with data” Anita de Waard evidence
  64. Building models… https://www.flickr.com/photos/karen_roe/7616234498 CC BY
  65. …testing models https://www.flickr.com/photos/wwworks/4255117217/ CC BY
  66. https://www.flickr.com/photos/ell-r-brown/4655401891 CC BY
  67. https://www.flickr.com/photos/telemax/4734543265 CC BY-SA
  68. http://flickr.com/photos/virtualsugar/316200555/ CC-BY
  69. @cameronneylon cneylon@plos.org http://cameronneylon.net
