SlideShare a Scribd company logo
Ben Busby, Ph.D.
Genomics Outreach Coordinator, Bioinformatics Training Lead
NCBI
Founder, Department of Bioinformatics and Data Science
FAES
ben.busby@nih.gov
Making the Transition from Sharing Data to Sharing Knowledge
4
EUtils (Search API) Command Line
EDirect
Google for
EDirect Cookbook
5
EDirect Local Caching!
6
PubMed and PMC (Open) FTP
7
Instead of PubMed FTP…
An automated tool (alpha)
Pubrunner.org
©MartineZilversmit2013
COOL
THING
#1 !
https://trace.ncbi.nlm.nih.gov/Traces/study/
?stat_search= 1561
Please check out slideshare to grab the details.
https://www.slideshare.net/benbusby
32
33
34
35
36
37
38
Punch line: lsrr is not different in typically virulent vs
avirulent strains of E coli so now I need to look at
expression -- but how do I do that?
(Fast section over.)
39
40
41
43
tar -xvzf ncbi-magicblast...
makeblastdb -dbtype nucl -in <fasta> -parse_seqids
magicblast -db <fasta> -sra SRR… -splice F -no_unaligned
44
magicblast -db <fasta> -sra SRR… -splice F -no_unaligned
-num_threads X
48
49
51
52
Polygenic SNP Search Tool
https://github.com/NCBI-
Hackathons/PSST
54
https://github.com/NCBI-
Hackathons/GenomicRobots
https://github.com/NCBI-Hackathons/Complex_Phenogeno
phenvar.colorado.edu
Combined score is
the average of SVs,
mappability, GC..
NCBI region list
Encode blacklist
60
EUtils (Search API) Command Line
EDirect
Google for
EDirect Cookbook
61
EUtils (Search API) Command Line
EDirect
Google for
EDirect Cookbook
https://github.com/NCBI-
Hackathons/ViruSpy
https://github.com/NCBI-
Hackathons/EndoVir
https://github.com/NCBI-
Hackathons/VirusFriends
66
https://www.ncbi.nlm.nih.gov/core/assets/sra/files/Factsheet_SRA.pdf
Available to
anyone!
First 5 lectures
now available
on
72
75
https://ncbi-hackathons.github.io/GeneExpressionAging/ideogram
77
78
79
80
Other People’s Hackathons
@DCGenomics
Communication
Creating a Community
https://ncbi-hackathons.github.io
Creating a Community
Come work at NCBI for 4-6 weeks!
Email bioinformatics-training@ncbi.nlm.nih.gov
for more information!
Resources for Bioinformaticians!
Resources for Bioinformaticians!
Resources for Bioinformaticians!
Resources for Bioinformaticians!
Resources for Bioinformaticians!

More Related Content

Similar to Data science futures_v_lbirn (6)

Genomic futures 3_ngs_v003
Genomic futures 3_ngs_v003Genomic futures 3_ngs_v003
Genomic futures 3_ngs_v003
 
Leveraging large public_data_for_individualized_medicine
Leveraging large public_data_for_individualized_medicineLeveraging large public_data_for_individualized_medicine
Leveraging large public_data_for_individualized_medicine
 
Genomic futures mn_v002
Genomic futures mn_v002Genomic futures mn_v002
Genomic futures mn_v002
 
Hackathons lightning v_nbs
Hackathons lightning v_nbsHackathons lightning v_nbs
Hackathons lightning v_nbs
 
Bioinformatics_resources_SVAI_v2
Bioinformatics_resources_SVAI_v2Bioinformatics_resources_SVAI_v2
Bioinformatics_resources_SVAI_v2
 
Genomic futures v_pitt_kent_osu
Genomic futures v_pitt_kent_osuGenomic futures v_pitt_kent_osu
Genomic futures v_pitt_kent_osu
 

More from Ben Busby

More from Ben Busby (17)

Addressing privacy concerns_in_the_age_of_federated_data_access
Addressing privacy concerns_in_the_age_of_federated_data_accessAddressing privacy concerns_in_the_age_of_federated_data_access
Addressing privacy concerns_in_the_age_of_federated_data_access
 
Containerized attribute indexing and graph genomes for federated data access
Containerized attribute indexing and graph genomes for federated data accessContainerized attribute indexing and graph genomes for federated data access
Containerized attribute indexing and graph genomes for federated data access
 
Artificial_Intelligence_for_Data_Reuse_2019
Artificial_Intelligence_for_Data_Reuse_2019Artificial_Intelligence_for_Data_Reuse_2019
Artificial_Intelligence_for_Data_Reuse_2019
 
Dream.recomb.ncbi.hackathons v003
Dream.recomb.ncbi.hackathons v003Dream.recomb.ncbi.hackathons v003
Dream.recomb.ncbi.hackathons v003
 
Human_Pangenomics_Bio-IT_2019
Human_Pangenomics_Bio-IT_2019Human_Pangenomics_Bio-IT_2019
Human_Pangenomics_Bio-IT_2019
 
RNAML_Bio-IT_2019
RNAML_Bio-IT_2019RNAML_Bio-IT_2019
RNAML_Bio-IT_2019
 
Hackathon_Bio-IT_2019
Hackathon_Bio-IT_2019Hackathon_Bio-IT_2019
Hackathon_Bio-IT_2019
 
Sage 2 19_v5_busby
Sage 2 19_v5_busbySage 2 19_v5_busby
Sage 2 19_v5_busby
 
Bb health ai_jan26_v2
Bb health ai_jan26_v2Bb health ai_jan26_v2
Bb health ai_jan26_v2
 
BB_NCBI_PAG_2019_Workshop
BB_NCBI_PAG_2019_WorkshopBB_NCBI_PAG_2019_Workshop
BB_NCBI_PAG_2019_Workshop
 
Pag three ways_to_ngs_at_ncbi
Pag three ways_to_ngs_at_ncbiPag three ways_to_ngs_at_ncbi
Pag three ways_to_ngs_at_ncbi
 
Robots and hackathons_ga4_gh_v001
Robots and hackathons_ga4_gh_v001Robots and hackathons_ga4_gh_v001
Robots and hackathons_ga4_gh_v001
 
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
Datasets and tools_from_ncbi_and_elsewhere_for_microbiome_research_v_62817
 
Genomic futures v_ucsd
Genomic futures v_ucsdGenomic futures v_ucsd
Genomic futures v_ucsd
 
Isoforms glbio v1
Isoforms glbio v1Isoforms glbio v1
Isoforms glbio v1
 
Contamination Detection and Taxonomic confirmation with magicBLAST
Contamination Detection and Taxonomic confirmation with magicBLASTContamination Detection and Taxonomic confirmation with magicBLAST
Contamination Detection and Taxonomic confirmation with magicBLAST
 
Downloading human genome_sequence_and_annotations
Downloading human genome_sequence_and_annotationsDownloading human genome_sequence_and_annotations
Downloading human genome_sequence_and_annotations
 

Recently uploaded

ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
AADYARAJPANDEY1
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
muralinath2
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
muralinath2
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
Cherry
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 

Recently uploaded (20)

The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
INSIGHT Partner Profile: Tampere University
INSIGHT Partner Profile: Tampere UniversityINSIGHT Partner Profile: Tampere University
INSIGHT Partner Profile: Tampere University
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Anemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditionsAnemia_ different types_causes_ conditions
Anemia_ different types_causes_ conditions
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
Shuaib Y-basedComprehensive mahmudj.pptx
Shuaib Y-basedComprehensive mahmudj.pptxShuaib Y-basedComprehensive mahmudj.pptx
Shuaib Y-basedComprehensive mahmudj.pptx
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
 

Data science futures_v_lbirn

Editor's Notes

  1. Talk about relative sizes of databases, particularly SRA
  2. Now… with AMR data!
  3. Now… with AMR data!
  4. Now… with AMR data!
  5. Now… with AMR data!
  6. Now… with AMR data!
  7. Now… with AMR data!
  8. Now… with AMR data!
  9. Now… with AMR data!
  10. Now… with AMR data!
  11. Now… with AMR data!
  12. Now… with AMR data!
  13. Things you can build ON!
  14. Things you can build ON!
  15. Things you can build ON!
  16. Things you can build ON!
  17. Things you can build ON!
  18. Things you can build ON!
  19. Things you can build ON!
  20. Things you can build ON!
  21. Things you can build ON!
  22. Things you can build ON!
  23. Things you can build ON!
  24. Things you can build ON!
  25. Things you can build ON!
  26. Things you can build ON!
  27. Things you can build ON!
  28. Things you can build ON!
  29. Now… with AMR data!
  30. Show suggestive search..
  31. Show suggestive search..
  32. Show suggestive search..
  33. Now… with AMR data!
  34. Now… with AMR data!
  35. Now… with AMR data!
  36. Now… with AMR data!
  37. Now… with AMR data!
  38. Mention CUNY-SDSU competition!
  39. Mention CUNY-SDSU competition!
  40. Now… with AMR data!
  41. Now… with AMR data!
  42. Now… with AMR data!
  43. Mention Data Science Mentoring
  44. Now… with AMR data!
  45. Now… with AMR data!
  46. Now… with AMR data!
  47. Now… with AMR data!
  48. Now… with AMR data!
  49. Now… with AMR data!