SlideShare a Scribd company logo
1 of 38
Directly	
  e-­‐mailing	
  authors	
  of	
  newly	
  
   published	
  papers	
  encourages	
  
        community	
  cura8on	
  
     Stephanie	
  Bunt,	
  Gary	
  Grumbling,	
  Helen	
  Field,	
  Steven	
  Marygold,	
  
    Thom	
  Kaufman,	
  Kathy	
  MaChews,	
  Nick	
  Brown	
  and	
  Gillian	
  Millburn	
  
Overview	
  

 •  Background	
  –	
  why	
  choose	
  triaging	
  ?	
  

 •  Community	
  cura8on	
  pipeline	
  

 •  Results	
  –	
  how	
  successful	
  were	
  we	
  ?	
  

 •  Future	
  plans	
  
•  Background	
  –	
  why	
  choose	
  triaging	
  ?	
  

•  Community	
  cura8on	
  pipeline	
  

•  Results	
  –	
  how	
  successful	
  were	
  we	
  ?	
  

•  Future	
  plans	
  
Background:	
  why	
  choose	
  triaging	
  of	
  papers	
  ?	
  

                Weekly literature search!
                   (semi-automated)!




                         Skim curation"
                  Flag data-types in paper"
                 Record main genes studied"



Use	
  flags	
  to	
  priori8se	
  



                          Full curation!
Background:	
  why	
  choose	
  triaging	
  of	
  papers	
  ?	
  

                Weekly literature search!
                   (semi-automated)!          Examples	
  of	
  data-­‐type	
  flags:	
  
                                              • 	
  new	
  allele	
  
                                              • 	
  new	
  transgenic	
  construct	
  
                                              • 	
  phenotype	
  
                                              • 	
  newly	
  characterised	
  gene	
  
                         Skim curation"       	
  
                  Flag data-types in paper"
                 Record main genes studied"   • 	
  expression	
  data	
  
                                              • 	
  gene	
  model	
  data	
  
Use	
  flags	
  to	
  priori8se	
  
                                              • 	
  physical	
  interac8on	
  data	
  

                          Full curation!
Background:	
  why	
  choose	
  triaging	
  of	
  papers	
  ?	
  

                Weekly literature search!
                   (semi-automated)!


                                              • 	
  skimming	
  takes	
  a	
  significant	
  amount	
  of	
  
                                              curator	
  effort	
  
                                              	
  
                                              • 	
  simple	
  

                         Skim curation"
                  Flag data-types in paper"
                 Record main genes studied"



Use	
  flags	
  to	
  priori8se	
  



                          Full curation!
•  Background	
  –	
  why	
  choose	
  triaging	
  ?	
  

•  Community	
  cura8on	
  pipeline	
  

•  Results	
  –	
  how	
  successful	
  were	
  we	
  ?	
  

•  Future	
  plans	
  
Pipeline:	
  the	
  community	
  cura8on	
  tool	
  
Pipeline:	
  the	
  community	
  cura8on	
  tool	
  
Pipeline:	
  the	
  community	
  cura8on	
  tool	
  
Pipeline:	
  the	
  community	
  cura8on	
  tool	
  
Pipeline:	
  integra8ng	
  community	
  cura8on	
  

                Weekly literature search!
                   (semi-automated)!




                         Skim curation"
                  Flag data-types in paper"   Community curation tool"
                 Record main genes studied"     Community skim curation"



Use	
  flags	
  to	
  priori8se	
  



                          Full curation!
Pipeline:	
  integra8ng	
  community	
  cura8on	
  

                Weekly literature search!
                   (semi-automated)!




                                            Community curation tool"
                                              Community skim curation"



Use	
  flags	
  to	
  priori8se	
  



                          Full curation!
Pipeline:	
  integra8ng	
  community	
  cura8on	
  

                Weekly literature search!
                   (semi-automated)!




               Community curation tool"
                   Community skim curation"



Use	
  flags	
  to	
  priori8se	
  



                          Full curation!
Pipeline:	
  integra8ng	
  community	
  cura8on	
  

                Weekly literature search!
                   (semi-automated)!


                    Download PDF files!
                     (semi-automated)!


             E-mail authors (automated)!


               Community curation tool"
                   Community skim curation"



Use	
  flags	
  to	
  priori8se	
  



                          Full curation!
Pipeline:	
  integra8ng	
  community	
  cura8on	
  

                Weekly literature search!
                   (semi-automated)!


                    Download PDF files!        • 	
  E-­‐mail	
  contains	
  personalised	
  hyperlink	
  
                     (semi-automated)!
                                              • 	
  Takes	
  author	
  to	
  part	
  filled-­‐in	
  tool	
  
             E-mail authors (automated)!


               Community curation tool"
                   Community skim curation"



Use	
  flags	
  to	
  priori8se	
  



                          Full curation!
•  Background	
  –	
  why	
  choose	
  triaging	
  ?	
  

•  Community	
  cura8on	
  pipeline	
  

•  Results	
  –	
  how	
  successful	
  were	
  we	
  ?	
  

•  Future	
  plans	
  
Results:	
  response	
  rate	
  

            First	
  year’s	
  results	
  (Oct	
  2010	
  –	
  Oct	
  2011):	
  
            	
  
            • 	
  1857	
  e-­‐mails	
  sent	
  

            • 	
  815	
  completed	
  responses	
  

            • 	
  =	
  44%	
  response	
  rate	
  

            • 	
  ~	
  68/month	
  =	
  7.5x	
  rate	
  prior	
  to	
  e-­‐mailing	
  
Results:	
  does	
  the	
  age	
  of	
  the	
  paper	
  maCer	
  ?	
  

Weekly	
  e-­‐mailing	
                               Author	
  skim	
  cura8on	
  
(paper	
  in	
  PubMed	
  for	
  <2	
     44%	
  
weeks)	
                                              No	
  response	
  
Results:	
  does	
  the	
  age	
  of	
  the	
  paper	
  maCer	
  ?	
  

Weekly	
  e-­‐mailing	
                                 Author	
  skim	
  cura8on	
  
(paper	
  in	
  PubMed	
  for	
  <2	
         44%	
  
weeks)	
                                                No	
  response	
  




One	
  off-­‐emailing	
  Dec	
  2010	
  
(paper	
  in	
  PubMed	
  for	
  2-­‐13	
  
months)	
  
Results:	
  does	
  the	
  age	
  of	
  the	
  paper	
  maCer	
  ?	
  

Weekly	
  e-­‐mailing	
                                 Author	
  skim	
  cura8on	
  
(paper	
  in	
  PubMed	
  for	
  <2	
         44%	
  
weeks)	
                                                No	
  response	
  




One	
  off-­‐emailing	
  Dec	
  2010	
         36%	
  
(paper	
  in	
  PubMed	
  for	
  2-­‐13	
  
months)	
  
Results:	
  has	
  e-­‐mailing	
  increased	
  volunteer	
  submissions	
  ?	
  


          	
  Before	
  e-­‐mailing	
  

                • 	
  ~	
  9	
  submissions/month	
  
Results:	
  has	
  e-­‐mailing	
  increased	
  volunteer	
  submissions	
  ?	
  


          	
  Before	
  e-­‐mailing	
  

                 • 	
  ~	
  9	
  submissions/month	
  
                 	
  
          	
  Since	
  started	
  e-­‐mailing	
  

                • 	
  ~	
  8	
  submissions/month	
  
Results:	
  targe8ng	
  authors	
  to	
  a	
  specific	
  paper	
  helps	
  
                          (!"



                          '!"



                          &!"
                                                                                                         ./,012"3456"+/27819"
            !""#$%&'()$




Tool	
  usage	
   %!"                                                                                    :7;<2"7=2<7>?"+/27,<>"

                          $!"



                          #!"



                           !"
                          ##)*+,)#!"   #$)*+,)#!"   #%)*+,)#!"    #&)*+,)#!"   #')*+,)#!"   #()*+,)#!"   #-)*+,)#!"

                                                                   *'+)$

                                           General	
  e-­‐mail	
  sent	
  
Results:	
  accuracy	
  

            Analysed	
  1134	
  author	
  skim-­‐curated	
  papers	
  that	
  
                 have	
  subsequently	
  been	
  fully	
  curated	
  
        	
  
        Gene	
  data	
  
        	
  
        • 	
  only	
  had	
  to	
  remove	
  gene(s)	
  from	
  4.8%	
  of	
  papers	
  
        	
  
        	
  
        	
  
        	
  
Results:	
  accuracy	
  of	
  author-­‐curated	
  flags	
  
                      ()*"+,,),)"-."+/)..+0-1"

                               ()*"2.+134)1)"

                        5160+,"78+.+72).69+0-1"

                        :).4)"-;"4)1)".)<-.23"

                                 =)1)".)1+>)"

          ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A"                                                            § 	
  Correct	
  
                                                                                                             G-..)72"
            ?@<.)336-1"61"+">E2+12"/+7D4.-E1A"

                           F8)1-2C<67"+1+,C363"
                                                                                                           § 	
  False	
  posi8ve	
  
                                                                                                             K+,3)"<-360L)"
                                                                                                             K+,3)"1)4+0L)"
                           F8C367+,"612).+70-1"                                                            § 	
  False	
  nega8ve	
  
                                                                                                             (-2"<.)3)12"
       G8+14)3"2-"HI">),+1-4+32)."4)1)">-A),"                                                              § 	
  Not	
  present	
  
   G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A),"

               :+<<614"-;";)+2E.)3"2-"4)1->)"

               G63B.)4E,+2-.C"),)>)123"A)J1)A"

                                                  !"   #!!"      $!!"    %!!"    &!!"      '!!!"   '#!!"
                                                                  !"#$%&'()'*+*%&,'
                                                              Number	
  of	
  papers	
  
Results:	
  over-­‐flagging	
  
                     ()*"+,,),)"-."+/)..+0-1"

                              ()*"2.+134)1)"

                       5160+,"78+.+72).69+0-1"

                       :).4)"-;"4)1)".)<-.23"

                                =)1)".)1+>)"

         ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A"                                                            § 	
  Correct	
  
                                                                                                            G-..)72"
           ?@<.)336-1"61"+">E2+12"/+7D4.-E1A"

                          F8)1-2C<67"+1+,C363"
                                                                                                          § 	
  False	
  posi8ve	
  
                                                                                                            K+,3)"<-360L)"
                                                                                                            K+,3)"1)4+0L)"
                          F8C367+,"612).+70-1"                                                            § 	
  False	
  nega8ve	
  
                                                                                                            (-2"<.)3)12"
      G8+14)3"2-"HI">),+1-4+32)."4)1)">-A),"                                                              § 	
  Not	
  present	
  
  G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A),"

              :+<<614"-;";)+2E.)3"2-"4)1->)"

              G63B.)4E,+2-.C"),)>)123"A)J1)A"

                                                 !"   #!!"      $!!"    %!!"    &!!"      '!!!"   '#!!"
                                                                 !"#$%&'()'*+*%&,'
                                                             Number	
  of	
  papers	
  
Results:	
  over-­‐flagging	
  
                     ()*"+,,),)"-."+/)..+0-1"

                              ()*"2.+134)1)"

                       5160+,"78+.+72).69+0-1"

                       :).4)"-;"4)1)".)<-.23"

                                =)1)".)1+>)"

         ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A"                                                            § 	
  Correct	
  
                                                                                                            G-..)72"
           ?@<.)336-1"61"+">E2+12"/+7D4.-E1A"

                          F8)1-2C<67"+1+,C363"
                                                                                                          § 	
  False	
  posi8ve	
  
                                                                                                            K+,3)"<-360L)"
                                                                                                            K+,3)"1)4+0L)"
                          F8C367+,"612).+70-1"                                                            § 	
  False	
  nega8ve	
  
                                                                                                            (-2"<.)3)12"
      G8+14)3"2-"HI">),+1-4+32)."4)1)">-A),"                                                              § 	
  Not	
  present	
  
  G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A),"

              :+<<614"-;";)+2E.)3"2-"4)1->)"

              G63B.)4E,+2-.C"),)>)123"A)J1)A"

                                                 !"   #!!"      $!!"    %!!"    &!!"      '!!!"   '#!!"
                                                                 !"#$%&'()'*+*%&,'
                                                             Number	
  of	
  papers	
  
Results:	
  under-­‐flagging	
  
                     ()*"+,,),)"-."+/)..+0-1"

                              ()*"2.+134)1)"

                       5160+,"78+.+72).69+0-1"

                       :).4)"-;"4)1)".)<-.23"

                                =)1)".)1+>)"

         ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A"                                                            § 	
  Correct	
  
                                                                                                            G-..)72"
           ?@<.)336-1"61"+">E2+12"/+7D4.-E1A"

                          F8)1-2C<67"+1+,C363"
                                                                                                          § 	
  False	
  posi8ve	
  
                                                                                                            K+,3)"<-360L)"
                                                                                                            K+,3)"1)4+0L)"
                          F8C367+,"612).+70-1"                                                            § 	
  False	
  nega8ve	
  
                                                                                                            (-2"<.)3)12"
      G8+14)3"2-"HI">),+1-4+32)."4)1)">-A),"                                                              § 	
  Not	
  present	
  
  G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A),"

              :+<<614"-;";)+2E.)3"2-"4)1->)"

              G63B.)4E,+2-.C"),)>)123"A)J1)A"

                                                 !"   #!!"      $!!"    %!!"    &!!"      '!!!"   '#!!"
                                                                 !"#$%&'()'*+*%&,'
                                                             Number	
  of	
  papers	
  
Results:	
  under-­‐flagging	
  
                     ()*"+,,),)"-."+/)..+0-1"

                              ()*"2.+134)1)"

                       5160+,"78+.+72).69+0-1"

                       :).4)"-;"4)1)".)<-.23"

                                =)1)".)1+>)"

         ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A"                                                            § 	
  Correct	
  
                                                                                                            G-..)72"
           ?@<.)336-1"61"+">E2+12"/+7D4.-E1A"

                          F8)1-2C<67"+1+,C363"
                                                                                                          § 	
  False	
  posi8ve	
  
                                                                                                            K+,3)"<-360L)"
                                                                                                            K+,3)"1)4+0L)"
                          F8C367+,"612).+70-1"                                                            § 	
  False	
  nega8ve	
  
                                                                                                            (-2"<.)3)12"
      G8+14)3"2-"HI">),+1-4+32)."4)1)">-A),"                                                              § 	
  Not	
  present	
  
  G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A),"

              :+<<614"-;";)+2E.)3"2-"4)1->)"

              G63B.)4E,+2-.C"),)>)123"A)J1)A"

                                                 !"   #!!"      $!!"    %!!"    &!!"      '!!!"   '#!!"
                                                                 !"#$%&'()'*+*%&,'
                                                             Number	
  of	
  papers	
  
Results:	
  under-­‐flagging	
  
                     ()*"+,,),)"-."+/)..+0-1"

                              ()*"2.+134)1)"

                       5160+,"78+.+72).69+0-1"

                       :).4)"-;"4)1)".)<-.23"

                                =)1)".)1+>)"

         ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A"                                                            § 	
  Correct	
  
                                                                                                            G-..)72"
           ?@<.)336-1"61"+">E2+12"/+7D4.-E1A"

                          F8)1-2C<67"+1+,C363"
                                                                                                          § 	
  False	
  posi8ve	
  
                                                                                                            K+,3)"<-360L)"
                                                                                                            K+,3)"1)4+0L)"
                          F8C367+,"612).+70-1"                                                            § 	
  False	
  nega8ve	
  
                                                                                                            (-2"<.)3)12"
      G8+14)3"2-"HI">),+1-4+32)."4)1)">-A),"                                                              § 	
  Not	
  present	
  
  G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A),"

              :+<<614"-;";)+2E.)3"2-"4)1->)"

              G63B.)4E,+2-.C"),)>)123"A)J1)A"

                                                 !"   #!!"      $!!"    %!!"    &!!"      '!!!"   '#!!"
                                                                 !"#$%&'()'*+*%&,'
                                                             Number	
  of	
  papers	
  
•  Background	
  –	
  why	
  choose	
  triaging	
  ?	
  

•  Community	
  cura8on	
  pipeline	
  

•  Results	
  –	
  how	
  successful	
  were	
  we	
  ?	
  

•  Future	
  plans	
  
Future	
  plans:	
  improving	
  the	
  response	
  rate	
  

First	
  year’s	
  results	
                Author	
  skim	
  cura8on	
  
                                  44%	
  
                                            No	
  response	
  
Future	
  plans:	
  improving	
  the	
  response	
  rate	
  

First	
  year’s	
  results	
                         Author	
  skim	
  cura8on	
  
                                           44%	
  
                                                     No	
  response	
  




Sending	
  a	
  reminder	
  e-­‐mail	
  
(since	
  mid-­‐Nov	
  2011)	
  
Future	
  plans:	
  improving	
  the	
  response	
  rate	
  

First	
  year’s	
  results	
                         Author	
  skim	
  cura8on	
  
                                           44%	
  
                                                     No	
  response	
  




Sending	
  a	
  reminder	
  e-­‐mail	
  
(since	
  mid-­‐Nov	
  2011)	
             55%	
  
Future	
  plans:	
  triaging	
  the	
  remaining	
  papers	
  

 •  Text	
  mining	
  to	
  assign	
  data-­‐type	
  flags	
  
     •  See	
  poster	
  #P.109	
  

     •  “Integra8on	
  of	
  an	
  automa8c	
  triaging	
  step	
  into	
  FlyBase	
  Literature	
  
            Cura8on	
  through	
  the	
  use	
  of	
  SVM	
  text-­‐mining	
  methods.”	
  



     	
  
Future	
  plans:	
  expanding	
  scope	
  of	
  community	
  cura8on	
  

 •  Exis8ng	
  pipeline	
  
     •  reviews	
  

 •  	
  Wiki	
  pages	
  
     •  See	
  poster	
  #P.12	
  

     •  “Expanding	
  community	
  cura8on	
  at	
  FlyBase	
  through	
  the	
  design	
  and	
  
        implementa8on	
  of	
  a	
  gene-­‐centric	
  seman8c	
  wiki.”	
  
Acknowledgements	
  

•  FB	
  community	
  cura8on	
  commiCee	
  -­‐	
  for	
  helping	
  improve	
  
    design	
  of	
  tool	
  

•  FB-­‐Cambridge	
  curators	
  -­‐	
  for	
  helping	
  to	
  fully	
  curate	
  the	
  papers	
  
    analysed	
  for	
  accuracy	
  

•  All	
  the	
  authors	
  who	
  have	
  filled	
  in	
  the	
  tool	
  !	
  

More Related Content

Similar to Millburn - Flybase community curation

Mendeley’s Research Catalogue: building it, opening it up and making it even ...
Mendeley’s Research Catalogue: building it, opening it up and making it even ...Mendeley’s Research Catalogue: building it, opening it up and making it even ...
Mendeley’s Research Catalogue: building it, opening it up and making it even ...Kris Jack
 
Measuring Impact: Towards a data citation metric
Measuring Impact: Towards a data citation metricMeasuring Impact: Towards a data citation metric
Measuring Impact: Towards a data citation metricEdward Baker
 
2013 siam-cse-big-data
2013 siam-cse-big-data2013 siam-cse-big-data
2013 siam-cse-big-datac.titus.brown
 
2013 py con awesome big data algorithms
2013 py con awesome big data algorithms2013 py con awesome big data algorithms
2013 py con awesome big data algorithmsc.titus.brown
 
2014 10-01-assembly summaryvariantsoverview
2014 10-01-assembly summaryvariantsoverview2014 10-01-assembly summaryvariantsoverview
2014 10-01-assembly summaryvariantsoverviewYannick Wurm
 
07-Classification.pptx
07-Classification.pptx07-Classification.pptx
07-Classification.pptxShree Shree
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisStuart Wrigley
 
Stamps.pptx
Stamps.pptxStamps.pptx
Stamps.pptxaaaa bbb
 
Capturing the Ineffable: Collecting, Analysing, and Automating Web Document ...
Capturing the Ineffable: Collecting, Analysing, and Automating Web Document ...Capturing the Ineffable: Collecting, Analysing, and Automating Web Document ...
Capturing the Ineffable: Collecting, Analysing, and Automating Web Document ...Davide Ceolin
 
Biocuration - Crowdsourcing Gene Annotation
Biocuration - Crowdsourcing Gene AnnotationBiocuration - Crowdsourcing Gene Annotation
Biocuration - Crowdsourcing Gene AnnotationAnurag Priyam
 
SplunkLive! Prelert Session - Extending Splunk with Machine Learning
SplunkLive! Prelert Session - Extending Splunk with Machine LearningSplunkLive! Prelert Session - Extending Splunk with Machine Learning
SplunkLive! Prelert Session - Extending Splunk with Machine LearningSplunk
 
Recommendations and User Understanding at StumbleUpon
Recommendations and User Understandingat StumbleUponRecommendations and User Understandingat StumbleUpon
Recommendations and User Understanding at StumbleUponDebora Donato
 
Jillian ms defense-4-14-14-ja
Jillian ms defense-4-14-14-jaJillian ms defense-4-14-14-ja
Jillian ms defense-4-14-14-jaJillian Aurisano
 
Why Electronic Data Capture?
Why Electronic Data Capture?Why Electronic Data Capture?
Why Electronic Data Capture?Somalee D.
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 

Similar to Millburn - Flybase community curation (20)

Mendeley’s Research Catalogue: building it, opening it up and making it even ...
Mendeley’s Research Catalogue: building it, opening it up and making it even ...Mendeley’s Research Catalogue: building it, opening it up and making it even ...
Mendeley’s Research Catalogue: building it, opening it up and making it even ...
 
Measuring Impact: Towards a data citation metric
Measuring Impact: Towards a data citation metricMeasuring Impact: Towards a data citation metric
Measuring Impact: Towards a data citation metric
 
2013 siam-cse-big-data
2013 siam-cse-big-data2013 siam-cse-big-data
2013 siam-cse-big-data
 
Ir1
Ir1Ir1
Ir1
 
2013 py con awesome big data algorithms
2013 py con awesome big data algorithms2013 py con awesome big data algorithms
2013 py con awesome big data algorithms
 
2014 10-01-assembly summaryvariantsoverview
2014 10-01-assembly summaryvariantsoverview2014 10-01-assembly summaryvariantsoverview
2014 10-01-assembly summaryvariantsoverview
 
07-Classification.pptx
07-Classification.pptx07-Classification.pptx
07-Classification.pptx
 
46 clarke
46 clarke46 clarke
46 clarke
 
Improving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log AnalysisImproving Semantic Search Using Query Log Analysis
Improving Semantic Search Using Query Log Analysis
 
Stamps.pptx
Stamps.pptxStamps.pptx
Stamps.pptx
 
Capturing the Ineffable: Collecting, Analysing, and Automating Web Document ...
Capturing the Ineffable: Collecting, Analysing, and Automating Web Document ...Capturing the Ineffable: Collecting, Analysing, and Automating Web Document ...
Capturing the Ineffable: Collecting, Analysing, and Automating Web Document ...
 
The Ebook, The Whole Ebook, and Nothing But The Ebook: A Holistic View of Ebo...
The Ebook, The Whole Ebook, and Nothing But The Ebook: A Holistic View of Ebo...The Ebook, The Whole Ebook, and Nothing But The Ebook: A Holistic View of Ebo...
The Ebook, The Whole Ebook, and Nothing But The Ebook: A Holistic View of Ebo...
 
The Ebook, The Whole Ebook, and Nothing But The Ebook: A Holistic View of Ebo...
The Ebook, The Whole Ebook, and Nothing But The Ebook: A Holistic View of Ebo...The Ebook, The Whole Ebook, and Nothing But The Ebook: A Holistic View of Ebo...
The Ebook, The Whole Ebook, and Nothing But The Ebook: A Holistic View of Ebo...
 
Biocuration - Crowdsourcing Gene Annotation
Biocuration - Crowdsourcing Gene AnnotationBiocuration - Crowdsourcing Gene Annotation
Biocuration - Crowdsourcing Gene Annotation
 
SplunkLive! Prelert Session - Extending Splunk with Machine Learning
SplunkLive! Prelert Session - Extending Splunk with Machine LearningSplunkLive! Prelert Session - Extending Splunk with Machine Learning
SplunkLive! Prelert Session - Extending Splunk with Machine Learning
 
Data Mining Lecture_2.pptx
Data Mining Lecture_2.pptxData Mining Lecture_2.pptx
Data Mining Lecture_2.pptx
 
Recommendations and User Understanding at StumbleUpon
Recommendations and User Understandingat StumbleUponRecommendations and User Understandingat StumbleUpon
Recommendations and User Understanding at StumbleUpon
 
Jillian ms defense-4-14-14-ja
Jillian ms defense-4-14-14-jaJillian ms defense-4-14-14-ja
Jillian ms defense-4-14-14-ja
 
Why Electronic Data Capture?
Why Electronic Data Capture?Why Electronic Data Capture?
Why Electronic Data Capture?
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 

Recently uploaded

Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Servicediscovermytutordmt
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in managementchhavia330
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.Aaiza Hassan
 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...lizamodels9
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024christinemoorman
 
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...lizamodels9
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth MarketingShawn Pang
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdfRenandantas16
 
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service JamshedpurVIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service JamshedpurSuhani Kapoor
 
Pitch Deck Teardown: NOQX's $200k Pre-seed deck
Pitch Deck Teardown: NOQX's $200k Pre-seed deckPitch Deck Teardown: NOQX's $200k Pre-seed deck
Pitch Deck Teardown: NOQX's $200k Pre-seed deckHajeJanKamps
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessAggregage
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service DewasVip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewasmakika9823
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation SlidesKeppelCorporation
 
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts ServiceVip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Serviceankitnayak356677
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...anilsa9823
 
Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdfOrient Homes
 
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Howrah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Roomdivyansh0kumar0
 

Recently uploaded (20)

Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Service
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in management
 
M.C Lodges -- Guest House in Jhang.
M.C Lodges --  Guest House in Jhang.M.C Lodges --  Guest House in Jhang.
M.C Lodges -- Guest House in Jhang.
 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024
 
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
Lowrate Call Girls In Laxmi Nagar Delhi ❤️8860477959 Escorts 100% Genuine Ser...
 
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
Tech Startup Growth Hacking 101  - Basics on Growth MarketingTech Startup Growth Hacking 101  - Basics on Growth Marketing
Tech Startup Growth Hacking 101 - Basics on Growth Marketing
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
 
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service JamshedpurVIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
VIP Call Girl Jamshedpur Aashi 8250192130 Independent Escort Service Jamshedpur
 
Pitch Deck Teardown: NOQX's $200k Pre-seed deck
Pitch Deck Teardown: NOQX's $200k Pre-seed deckPitch Deck Teardown: NOQX's $200k Pre-seed deck
Pitch Deck Teardown: NOQX's $200k Pre-seed deck
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Best Practices for Implementing an External Recruiting Partnership
Best Practices for Implementing an External Recruiting PartnershipBest Practices for Implementing an External Recruiting Partnership
Best Practices for Implementing an External Recruiting Partnership
 
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service DewasVip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
 
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts ServiceVip Female Escorts Noida 9711199171 Greater Noida Escorts Service
Vip Female Escorts Noida 9711199171 Greater Noida Escorts Service
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
 
Catalogue ONG NUOC PPR DE NHAT .pdf
Catalogue ONG NUOC PPR DE NHAT      .pdfCatalogue ONG NUOC PPR DE NHAT      .pdf
Catalogue ONG NUOC PPR DE NHAT .pdf
 
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Howrah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
 

Millburn - Flybase community curation

  • 1. Directly  e-­‐mailing  authors  of  newly   published  papers  encourages   community  cura8on   Stephanie  Bunt,  Gary  Grumbling,  Helen  Field,  Steven  Marygold,   Thom  Kaufman,  Kathy  MaChews,  Nick  Brown  and  Gillian  Millburn  
  • 2. Overview   •  Background  –  why  choose  triaging  ?   •  Community  cura8on  pipeline   •  Results  –  how  successful  were  we  ?   •  Future  plans  
  • 3. •  Background  –  why  choose  triaging  ?   •  Community  cura8on  pipeline   •  Results  –  how  successful  were  we  ?   •  Future  plans  
  • 4. Background:  why  choose  triaging  of  papers  ?   Weekly literature search! (semi-automated)! Skim curation" Flag data-types in paper" Record main genes studied" Use  flags  to  priori8se   Full curation!
  • 5. Background:  why  choose  triaging  of  papers  ?   Weekly literature search! (semi-automated)! Examples  of  data-­‐type  flags:   •   new  allele   •   new  transgenic  construct   •   phenotype   •   newly  characterised  gene   Skim curation"   Flag data-types in paper" Record main genes studied" •   expression  data   •   gene  model  data   Use  flags  to  priori8se   •   physical  interac8on  data   Full curation!
  • 6. Background:  why  choose  triaging  of  papers  ?   Weekly literature search! (semi-automated)! •   skimming  takes  a  significant  amount  of   curator  effort     •   simple   Skim curation" Flag data-types in paper" Record main genes studied" Use  flags  to  priori8se   Full curation!
  • 7. •  Background  –  why  choose  triaging  ?   •  Community  cura8on  pipeline   •  Results  –  how  successful  were  we  ?   •  Future  plans  
  • 8. Pipeline:  the  community  cura8on  tool  
  • 9. Pipeline:  the  community  cura8on  tool  
  • 10. Pipeline:  the  community  cura8on  tool  
  • 11. Pipeline:  the  community  cura8on  tool  
  • 12. Pipeline:  integra8ng  community  cura8on   Weekly literature search! (semi-automated)! Skim curation" Flag data-types in paper" Community curation tool" Record main genes studied" Community skim curation" Use  flags  to  priori8se   Full curation!
  • 13. Pipeline:  integra8ng  community  cura8on   Weekly literature search! (semi-automated)! Community curation tool" Community skim curation" Use  flags  to  priori8se   Full curation!
  • 14. Pipeline:  integra8ng  community  cura8on   Weekly literature search! (semi-automated)! Community curation tool" Community skim curation" Use  flags  to  priori8se   Full curation!
  • 15. Pipeline:  integra8ng  community  cura8on   Weekly literature search! (semi-automated)! Download PDF files! (semi-automated)! E-mail authors (automated)! Community curation tool" Community skim curation" Use  flags  to  priori8se   Full curation!
  • 16. Pipeline:  integra8ng  community  cura8on   Weekly literature search! (semi-automated)! Download PDF files! •   E-­‐mail  contains  personalised  hyperlink   (semi-automated)! •   Takes  author  to  part  filled-­‐in  tool   E-mail authors (automated)! Community curation tool" Community skim curation" Use  flags  to  priori8se   Full curation!
  • 17. •  Background  –  why  choose  triaging  ?   •  Community  cura8on  pipeline   •  Results  –  how  successful  were  we  ?   •  Future  plans  
  • 18. Results:  response  rate   First  year’s  results  (Oct  2010  –  Oct  2011):     •   1857  e-­‐mails  sent   •   815  completed  responses   •   =  44%  response  rate   •   ~  68/month  =  7.5x  rate  prior  to  e-­‐mailing  
  • 19. Results:  does  the  age  of  the  paper  maCer  ?   Weekly  e-­‐mailing   Author  skim  cura8on   (paper  in  PubMed  for  <2   44%   weeks)   No  response  
  • 20. Results:  does  the  age  of  the  paper  maCer  ?   Weekly  e-­‐mailing   Author  skim  cura8on   (paper  in  PubMed  for  <2   44%   weeks)   No  response   One  off-­‐emailing  Dec  2010   (paper  in  PubMed  for  2-­‐13   months)  
  • 21. Results:  does  the  age  of  the  paper  maCer  ?   Weekly  e-­‐mailing   Author  skim  cura8on   (paper  in  PubMed  for  <2   44%   weeks)   No  response   One  off-­‐emailing  Dec  2010   36%   (paper  in  PubMed  for  2-­‐13   months)  
  • 22. Results:  has  e-­‐mailing  increased  volunteer  submissions  ?    Before  e-­‐mailing   •   ~  9  submissions/month  
  • 23. Results:  has  e-­‐mailing  increased  volunteer  submissions  ?    Before  e-­‐mailing   •   ~  9  submissions/month      Since  started  e-­‐mailing   •   ~  8  submissions/month  
  • 24. Results:  targe8ng  authors  to  a  specific  paper  helps   (!" '!" &!" ./,012"3456"+/27819" !""#$%&'()$ Tool  usage   %!" :7;<2"7=2<7>?"+/27,<>" $!" #!" !" ##)*+,)#!" #$)*+,)#!" #%)*+,)#!" #&)*+,)#!" #')*+,)#!" #()*+,)#!" #-)*+,)#!" *'+)$ General  e-­‐mail  sent  
  • 25. Results:  accuracy   Analysed  1134  author  skim-­‐curated  papers  that   have  subsequently  been  fully  curated     Gene  data     •   only  had  to  remove  gene(s)  from  4.8%  of  papers          
  • 26. Results:  accuracy  of  author-­‐curated  flags   ()*"+,,),)"-."+/)..+0-1" ()*"2.+134)1)" 5160+,"78+.+72).69+0-1" :).4)"-;"4)1)".)<-.23" =)1)".)1+>)" ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A" §   Correct   G-..)72" ?@<.)336-1"61"+">E2+12"/+7D4.-E1A" F8)1-2C<67"+1+,C363" §   False  posi8ve   K+,3)"<-360L)" K+,3)"1)4+0L)" F8C367+,"612).+70-1" §   False  nega8ve   (-2"<.)3)12" G8+14)3"2-"HI">),+1-4+32)."4)1)">-A)," §   Not  present   G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A)," :+<<614"-;";)+2E.)3"2-"4)1->)" G63B.)4E,+2-.C"),)>)123"A)J1)A" !" #!!" $!!" %!!" &!!" '!!!" '#!!" !"#$%&'()'*+*%&,' Number  of  papers  
  • 27. Results:  over-­‐flagging   ()*"+,,),)"-."+/)..+0-1" ()*"2.+134)1)" 5160+,"78+.+72).69+0-1" :).4)"-;"4)1)".)<-.23" =)1)".)1+>)" ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A" §   Correct   G-..)72" ?@<.)336-1"61"+">E2+12"/+7D4.-E1A" F8)1-2C<67"+1+,C363" §   False  posi8ve   K+,3)"<-360L)" K+,3)"1)4+0L)" F8C367+,"612).+70-1" §   False  nega8ve   (-2"<.)3)12" G8+14)3"2-"HI">),+1-4+32)."4)1)">-A)," §   Not  present   G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A)," :+<<614"-;";)+2E.)3"2-"4)1->)" G63B.)4E,+2-.C"),)>)123"A)J1)A" !" #!!" $!!" %!!" &!!" '!!!" '#!!" !"#$%&'()'*+*%&,' Number  of  papers  
  • 28. Results:  over-­‐flagging   ()*"+,,),)"-."+/)..+0-1" ()*"2.+134)1)" 5160+,"78+.+72).69+0-1" :).4)"-;"4)1)".)<-.23" =)1)".)1+>)" ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A" §   Correct   G-..)72" ?@<.)336-1"61"+">E2+12"/+7D4.-E1A" F8)1-2C<67"+1+,C363" §   False  posi8ve   K+,3)"<-360L)" K+,3)"1)4+0L)" F8C367+,"612).+70-1" §   False  nega8ve   (-2"<.)3)12" G8+14)3"2-"HI">),+1-4+32)."4)1)">-A)," §   Not  present   G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A)," :+<<614"-;";)+2E.)3"2-"4)1->)" G63B.)4E,+2-.C"),)>)123"A)J1)A" !" #!!" $!!" %!!" &!!" '!!!" '#!!" !"#$%&'()'*+*%&,' Number  of  papers  
  • 29. Results:  under-­‐flagging   ()*"+,,),)"-."+/)..+0-1" ()*"2.+134)1)" 5160+,"78+.+72).69+0-1" :).4)"-;"4)1)".)<-.23" =)1)".)1+>)" ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A" §   Correct   G-..)72" ?@<.)336-1"61"+">E2+12"/+7D4.-E1A" F8)1-2C<67"+1+,C363" §   False  posi8ve   K+,3)"<-360L)" K+,3)"1)4+0L)" F8C367+,"612).+70-1" §   False  nega8ve   (-2"<.)3)12" G8+14)3"2-"HI">),+1-4+32)."4)1)">-A)," §   Not  present   G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A)," :+<<614"-;";)+2E.)3"2-"4)1->)" G63B.)4E,+2-.C"),)>)123"A)J1)A" !" #!!" $!!" %!!" &!!" '!!!" '#!!" !"#$%&'()'*+*%&,' Number  of  papers  
  • 30. Results:  under-­‐flagging   ()*"+,,),)"-."+/)..+0-1" ()*"2.+134)1)" 5160+,"78+.+72).69+0-1" :).4)"-;"4)1)".)<-.23" =)1)".)1+>)" ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A" §   Correct   G-..)72" ?@<.)336-1"61"+">E2+12"/+7D4.-E1A" F8)1-2C<67"+1+,C363" §   False  posi8ve   K+,3)"<-360L)" K+,3)"1)4+0L)" F8C367+,"612).+70-1" §   False  nega8ve   (-2"<.)3)12" G8+14)3"2-"HI">),+1-4+32)."4)1)">-A)," §   Not  present   G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A)," :+<<614"-;";)+2E.)3"2-"4)1->)" G63B.)4E,+2-.C"),)>)123"A)J1)A" !" #!!" $!!" %!!" &!!" '!!!" '#!!" !"#$%&'()'*+*%&,' Number  of  papers  
  • 31. Results:  under-­‐flagging   ()*"+,,),)"-."+/)..+0-1" ()*"2.+134)1)" 5160+,"78+.+72).69+0-1" :).4)"-;"4)1)".)<-.23" =)1)".)1+>)" ?@<.)336-1"61"+"*6,AB2C<)"/+7D4.-E1A" §   Correct   G-..)72" ?@<.)336-1"61"+">E2+12"/+7D4.-E1A" F8)1-2C<67"+1+,C363" §   False  posi8ve   K+,3)"<-360L)" K+,3)"1)4+0L)" F8C367+,"612).+70-1" §   False  nega8ve   (-2"<.)3)12" G8+14)3"2-"HI">),+1-4+32)."4)1)">-A)," §   Not  present   G8+14)3"2-"1-1BHI">),+1-4+32)."4)1)">-A)," :+<<614"-;";)+2E.)3"2-"4)1->)" G63B.)4E,+2-.C"),)>)123"A)J1)A" !" #!!" $!!" %!!" &!!" '!!!" '#!!" !"#$%&'()'*+*%&,' Number  of  papers  
  • 32. •  Background  –  why  choose  triaging  ?   •  Community  cura8on  pipeline   •  Results  –  how  successful  were  we  ?   •  Future  plans  
  • 33. Future  plans:  improving  the  response  rate   First  year’s  results   Author  skim  cura8on   44%   No  response  
  • 34. Future  plans:  improving  the  response  rate   First  year’s  results   Author  skim  cura8on   44%   No  response   Sending  a  reminder  e-­‐mail   (since  mid-­‐Nov  2011)  
  • 35. Future  plans:  improving  the  response  rate   First  year’s  results   Author  skim  cura8on   44%   No  response   Sending  a  reminder  e-­‐mail   (since  mid-­‐Nov  2011)   55%  
  • 36. Future  plans:  triaging  the  remaining  papers   •  Text  mining  to  assign  data-­‐type  flags   •  See  poster  #P.109   •  “Integra8on  of  an  automa8c  triaging  step  into  FlyBase  Literature   Cura8on  through  the  use  of  SVM  text-­‐mining  methods.”    
  • 37. Future  plans:  expanding  scope  of  community  cura8on   •  Exis8ng  pipeline   •  reviews   •   Wiki  pages   •  See  poster  #P.12   •  “Expanding  community  cura8on  at  FlyBase  through  the  design  and   implementa8on  of  a  gene-­‐centric  seman8c  wiki.”  
  • 38. Acknowledgements   •  FB  community  cura8on  commiCee  -­‐  for  helping  improve   design  of  tool   •  FB-­‐Cambridge  curators  -­‐  for  helping  to  fully  curate  the  papers   analysed  for  accuracy   •  All  the  authors  who  have  filled  in  the  tool  !