SlideShare a Scribd company logo
1 of 18
<chemical_informatics_project>
<inspires>
<chemistry_majors>
Stuart J. Chalk
Department of Chemistry
University of North Florida
schalk@unf.edu
2014 Fall ACS Meeting
 Motivation
 Chemical Information Science: The Course
 Syllabus
 Final Project Outline
 Expected Student Activities
 Submitted Data Sets
 Example Data
 Data Modeling
 Compound Data
 Website
 Future Plans
 Conclusion
Outline
From http://www.embl.de/chemcore/chemcore_services/computational_chemistry/
Motivation
 Students are not exposed to informatics in the
regular chemistry curriculum
 There is so much information for the chemist to
access/use they need to know how to deal with it
 Giving students this exposure makes them more
competitive in graduate/professional school
 We need professionals at the interface of chemistry
and information science
Chemical Information Science:
A UNF Elective Class
 First taught as a Freshmen Honors course in 2003
“Chemical Informatics”
 Five iterations over the last 10 years
 Now an upper-level three credit elective class
 Fall 2013 cohort – 21 students (three credit lecture)
“Chemical Information Science”
Syllabus
 What is information? What is data?
What is metadata? What types of data are there?
 How and where does informatics fit in chemistry?
 How is information organized, stored, related,
formatted, typed?
 The objected oriented view of information
(objects, classes, methods)
 The Semantic Web – What is it why is it important?
 Defining relationships between data, Concept maps
 Controlled vocabularies, Thesauri, Ontologies
Syllabus
 The eXtensible Markup Language (XML) and
Scientific Markup Languages
 Understanding and using Web 2.0 technologies
for information retrieval
 Generating Information and Metadata
 Finding Chemical Information
 Tools for Finding, Organizing and Using Chemical Information
 Searching databases
 Internet/browser software for Chemistry
 Using Excel for searching and organizing scientific information
Final Project Outline
 The ChemData Database
 For your project you will gather chemical data from sources on
the Internet, organize/filter the data, added it too the Excel
spreadsheet provided, and then send your completed Excel
spreadsheet to Dr. Chalk by the deadline.
 Requirements
 600 pieces of metadata at minimum must be submitted
(excluding reference data)
 The data must be correctly entered in the spreadsheet
(no extra spaces, loss of accuracy, etc.)
 It must be referenced to its origin, and those reference
included in the spreadsheet
 For chemicals, the InChI must be part of the submitted
metadata for each chemical species
 A minimum of six hours of time for this activity is expected
 The Excel Spreadsheet to use is available on the course website.
 Find suitable data source (hand coded web page) on
‘reputable’ site with original reference
 Download webpage content to computer
 ‘Scrape’ data out of webpage
 Perform any data normalization (e.g. scientific notation)
 Get metadata about chemicals referenced
 Get metadata about original reference (DOI)
 Import data into Excel and organize
 Assign unique ids and add ids to link data
 Add units and other metadata
Expected Student Activities
Submitted Data Sets
 Students used an Excel spreadsheet to organize their data
Submitted Data Sets
 They choose to submit data about
 Organic compound properties
 Organic compound reactions
 Solvent properties
 Types of analytical instrumentation
 Analytical instrument operating conditions
 Mathematical equations used in PChem
 Physical constants
 Unit conversion factors
Example
Data
Data
Modeling
Compound Data Table
Website
 Very positive 
 “Course was an informative and enjoyable overview of the emerging field of
informatics as it relates to the sciences and Chemistry in particular.”
 “What I am taking away from this class is something that can be applied to
other courses and my career. Interesting peek behind the curtains of how
the sharing of scientific knowledge and discovery are evolving.”
 “Dr. Chalk was very enthusiastic about the subject of chemical informatics.
He exposed us to some very helpful chemistry resources that I plan on using
in the future.”
 “Very interesting class with a lot of hands on computer use and learning
experience. The homework was relative to the course information and
helped to prepare for exams. Would retake this class and recommend to a
friend interested in data or computer science.”
Feedback
Future Plans
 Finish curating, cleaning up data
 Make site publically available
 For students: provide detailed instructions on how to
find, curate, and submit their own data
 For faculty: provide detailed description of the project
and Excel spreadsheet
 Write up a paper about this for J. Chem. Ed.
 Use site as the basis for a question bank for online
study questions
Conclusion
 This was a fun project to run at the end of the class
 Bringing together all that we had talked about in an
activity made it much more tangible for students
 Students liked the idea that the Chem Data website
would be used by other students in chemistry
 I can’t wait to teach this again…
 schalk@unf.edu
 Phone: 904-620-5311
 Skype: stuartchalk
 LinkedIn/Slidehare: https://www.linkedin.com/in/stuchalk
 ORCID: http://orcid.org/0000-0002-0703-7776
 ResearcherID: http://www.researcherid.com/rid/D-8577-2013
Questions?

More Related Content

What's hot

6025 2 Research Ppt
6025 2 Research Ppt6025 2 Research Ppt
6025 2 Research Pptmyplace6025
 
How Much do Availability Studies Increase Full Text Success?
How Much do Availability Studies Increase Full Text Success?How Much do Availability Studies Increase Full Text Success?
How Much do Availability Studies Increase Full Text Success?Sanjeet Mann
 
6025 22 Research Ppt
6025 22 Research Ppt6025 22 Research Ppt
6025 22 Research Pptmyplace6025
 
The Value of Purchasing E-book Collections from a Large Publisher (Oxford Uni...
The Value of Purchasing E-book Collections from a Large Publisher (Oxford Uni...The Value of Purchasing E-book Collections from a Large Publisher (Oxford Uni...
The Value of Purchasing E-book Collections from a Large Publisher (Oxford Uni...Jennifer Bazeley
 
The Value of Purchasing E-books From A Large Publisher
The Value of Purchasing E-books From A Large PublisherThe Value of Purchasing E-books From A Large Publisher
The Value of Purchasing E-books From A Large PublisherAaron K. Shrimplin
 
Core Journals in Psychology: A Demonstration Project
Core Journals in Psychology: A Demonstration ProjectCore Journals in Psychology: A Demonstration Project
Core Journals in Psychology: A Demonstration ProjectRobin Paynter
 
Transition in Scientific Journals: towards a new Scientific Communication
Transition in Scientific Journals: towards a new Scientific CommunicationTransition in Scientific Journals: towards a new Scientific Communication
Transition in Scientific Journals: towards a new Scientific CommunicationUniversitat Oberta de Catalunya (UOC)
 
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.Charleston Conference
 
Correcting Accidentals - text from powerpoint
Correcting Accidentals - text from powerpointCorrecting Accidentals - text from powerpoint
Correcting Accidentals - text from powerpointNASIG
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesLudo Waltman
 
Reference Question Data Mining
Reference Question Data MiningReference Question Data Mining
Reference Question Data MiningJoshua Finnell
 
[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....
[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....
[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....3TU.Datacentrum
 
20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...
20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...
20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...OpenAIRE
 
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...Ludo Waltman
 
Rachel Hessey JIBS User Group Resource Discovery event February 2013
Rachel Hessey JIBS User Group Resource Discovery event February 2013Rachel Hessey JIBS User Group Resource Discovery event February 2013
Rachel Hessey JIBS User Group Resource Discovery event February 2013sherif user group
 
Comparing bibliographic data sources
Comparing bibliographic data sourcesComparing bibliographic data sources
Comparing bibliographic data sourcesLudo Waltman
 
Patron Problems...or Opportunities for Improvement? It’s all in how you look...
Patron Problems...or Opportunities for Improvement?  It’s all in how you look...Patron Problems...or Opportunities for Improvement?  It’s all in how you look...
Patron Problems...or Opportunities for Improvement? It’s all in how you look...Columbia University
 

What's hot (20)

6025 2 Research Ppt
6025 2 Research Ppt6025 2 Research Ppt
6025 2 Research Ppt
 
How Much do Availability Studies Increase Full Text Success?
How Much do Availability Studies Increase Full Text Success?How Much do Availability Studies Increase Full Text Success?
How Much do Availability Studies Increase Full Text Success?
 
6025 22 Research Ppt
6025 22 Research Ppt6025 22 Research Ppt
6025 22 Research Ppt
 
The Value of Purchasing E-book Collections from a Large Publisher (Oxford Uni...
The Value of Purchasing E-book Collections from a Large Publisher (Oxford Uni...The Value of Purchasing E-book Collections from a Large Publisher (Oxford Uni...
The Value of Purchasing E-book Collections from a Large Publisher (Oxford Uni...
 
The Value of Purchasing E-books From A Large Publisher
The Value of Purchasing E-books From A Large PublisherThe Value of Purchasing E-books From A Large Publisher
The Value of Purchasing E-books From A Large Publisher
 
LOD-SEM
LOD-SEMLOD-SEM
LOD-SEM
 
Core Journals in Psychology: A Demonstration Project
Core Journals in Psychology: A Demonstration ProjectCore Journals in Psychology: A Demonstration Project
Core Journals in Psychology: A Demonstration Project
 
Transition in Scientific Journals: towards a new Scientific Communication
Transition in Scientific Journals: towards a new Scientific CommunicationTransition in Scientific Journals: towards a new Scientific Communication
Transition in Scientific Journals: towards a new Scientific Communication
 
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
Awash in eJournal Data: What It Is, Where It Is, and What Can Be Done With It.
 
Correcting Accidentals - text from powerpoint
Correcting Accidentals - text from powerpointCorrecting Accidentals - text from powerpoint
Correcting Accidentals - text from powerpoint
 
Scientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunitiesScientific information retrieval: Challenges and opportunities
Scientific information retrieval: Challenges and opportunities
 
Eco ed techtalk
Eco ed techtalkEco ed techtalk
Eco ed techtalk
 
Reference Question Data Mining
Reference Question Data MiningReference Question Data Mining
Reference Question Data Mining
 
[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....
[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....
[3.4] Practical Benefits and Annoyences of Sharing Data - Daniël Lakens [3TU....
 
20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...
20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...
20190527_Marc Vanholsbeeck_Open Science monitoring and the notion of research...
 
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
Web of Science, Scopus, Dimensions, and beyond: The evolving landscape of bib...
 
Rachel Hessey JIBS User Group Resource Discovery event February 2013
Rachel Hessey JIBS User Group Resource Discovery event February 2013Rachel Hessey JIBS User Group Resource Discovery event February 2013
Rachel Hessey JIBS User Group Resource Discovery event February 2013
 
Comparing bibliographic data sources
Comparing bibliographic data sourcesComparing bibliographic data sources
Comparing bibliographic data sources
 
eBIRT Update
eBIRT UpdateeBIRT Update
eBIRT Update
 
Patron Problems...or Opportunities for Improvement? It’s all in how you look...
Patron Problems...or Opportunities for Improvement?  It’s all in how you look...Patron Problems...or Opportunities for Improvement?  It’s all in how you look...
Patron Problems...or Opportunities for Improvement? It’s all in how you look...
 

Similar to ACS 248th Paper 104 ChemData Project

Convenience or credibility
Convenience or credibilityConvenience or credibility
Convenience or credibilityHan Woo PARK
 
Summary of June 2014 Workshop Report: Building a Materials Accelerator Network
Summary of June 2014 Workshop Report: Building a Materials Accelerator NetworkSummary of June 2014 Workshop Report: Building a Materials Accelerator Network
Summary of June 2014 Workshop Report: Building a Materials Accelerator NetworkSusann Ely
 
2YC3 Conference - NSF Programs - March 2004
2YC3 Conference - NSF Programs - March 20042YC3 Conference - NSF Programs - March 2004
2YC3 Conference - NSF Programs - March 2004Liz Dorland
 
Trustees Presentation
Trustees PresentationTrustees Presentation
Trustees PresentationCable Green
 
EDUC 4762 Assignment 4.3
EDUC 4762 Assignment 4.3EDUC 4762 Assignment 4.3
EDUC 4762 Assignment 4.3wlavery
 
Experience from 10 months of University Linked Data
Experience from 10 months of University Linked Data Experience from 10 months of University Linked Data
Experience from 10 months of University Linked Data Mathieu d'Aquin
 
Oral Defense presentation
Oral Defense presentationOral Defense presentation
Oral Defense presentationDwayne Squires
 
Annual Community College Day at NSF HQ 4-12-04
Annual Community College Day at NSF HQ 4-12-04Annual Community College Day at NSF HQ 4-12-04
Annual Community College Day at NSF HQ 4-12-04Liz Dorland
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPISteven Miller
 
Engineering Student Engagement With Project Lead the Way
Engineering Student Engagement With Project Lead the WayEngineering Student Engagement With Project Lead the Way
Engineering Student Engagement With Project Lead the Waymtemples
 
Project 1Evaluation 31Biology 1 (SCIH 025 062)Be sure .docx
Project 1Evaluation 31Biology 1 (SCIH 025 062)Be sure .docxProject 1Evaluation 31Biology 1 (SCIH 025 062)Be sure .docx
Project 1Evaluation 31Biology 1 (SCIH 025 062)Be sure .docxstilliegeorgiana
 
Association Keynote (March, 2009)
Association Keynote (March, 2009)Association Keynote (March, 2009)
Association Keynote (March, 2009)Cable Green
 
Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...Erin Robinson
 
WebQuest for Biomes
WebQuest for BiomesWebQuest for Biomes
WebQuest for Biomessmtester2
 
Biomes WebQuest
Biomes WebQuestBiomes WebQuest
Biomes WebQuestsmtester2
 
Lecture_01.1.pptx
Lecture_01.1.pptxLecture_01.1.pptx
Lecture_01.1.pptxRockyIslam5
 
DATA ANALYTICS FOR HIGHER EDUCATION
 DATA ANALYTICS FOR HIGHER EDUCATION DATA ANALYTICS FOR HIGHER EDUCATION
DATA ANALYTICS FOR HIGHER EDUCATIONSamantha Suraweera
 
Collection Intelligence: Using data driven decision making in collection mana...
Collection Intelligence: Using data driven decision making in collection mana...Collection Intelligence: Using data driven decision making in collection mana...
Collection Intelligence: Using data driven decision making in collection mana...Annette Day
 

Similar to ACS 248th Paper 104 ChemData Project (20)

Convenience or credibility
Convenience or credibilityConvenience or credibility
Convenience or credibility
 
Summary of June 2014 Workshop Report: Building a Materials Accelerator Network
Summary of June 2014 Workshop Report: Building a Materials Accelerator NetworkSummary of June 2014 Workshop Report: Building a Materials Accelerator Network
Summary of June 2014 Workshop Report: Building a Materials Accelerator Network
 
2YC3 Conference - NSF Programs - March 2004
2YC3 Conference - NSF Programs - March 20042YC3 Conference - NSF Programs - March 2004
2YC3 Conference - NSF Programs - March 2004
 
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
Utilizing ChemSpider As A Platform For Education And Exposure Of Student Data...
 
Trustees Presentation
Trustees PresentationTrustees Presentation
Trustees Presentation
 
EDUC 4762 Assignment 4.3
EDUC 4762 Assignment 4.3EDUC 4762 Assignment 4.3
EDUC 4762 Assignment 4.3
 
Experience from 10 months of University Linked Data
Experience from 10 months of University Linked Data Experience from 10 months of University Linked Data
Experience from 10 months of University Linked Data
 
Oral Defense presentation
Oral Defense presentationOral Defense presentation
Oral Defense presentation
 
Annual Community College Day at NSF HQ 4-12-04
Annual Community College Day at NSF HQ 4-12-04Annual Community College Day at NSF HQ 4-12-04
Annual Community College Day at NSF HQ 4-12-04
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPI
 
Engineering Student Engagement With Project Lead the Way
Engineering Student Engagement With Project Lead the WayEngineering Student Engagement With Project Lead the Way
Engineering Student Engagement With Project Lead the Way
 
Ifla 2010
Ifla 2010Ifla 2010
Ifla 2010
 
Project 1Evaluation 31Biology 1 (SCIH 025 062)Be sure .docx
Project 1Evaluation 31Biology 1 (SCIH 025 062)Be sure .docxProject 1Evaluation 31Biology 1 (SCIH 025 062)Be sure .docx
Project 1Evaluation 31Biology 1 (SCIH 025 062)Be sure .docx
 
Association Keynote (March, 2009)
Association Keynote (March, 2009)Association Keynote (March, 2009)
Association Keynote (March, 2009)
 
Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...
 
WebQuest for Biomes
WebQuest for BiomesWebQuest for Biomes
WebQuest for Biomes
 
Biomes WebQuest
Biomes WebQuestBiomes WebQuest
Biomes WebQuest
 
Lecture_01.1.pptx
Lecture_01.1.pptxLecture_01.1.pptx
Lecture_01.1.pptx
 
DATA ANALYTICS FOR HIGHER EDUCATION
 DATA ANALYTICS FOR HIGHER EDUCATION DATA ANALYTICS FOR HIGHER EDUCATION
DATA ANALYTICS FOR HIGHER EDUCATION
 
Collection Intelligence: Using data driven decision making in collection mana...
Collection Intelligence: Using data driven decision making in collection mana...Collection Intelligence: Using data driven decision making in collection mana...
Collection Intelligence: Using data driven decision making in collection mana...
 

More from Stuart Chalk

Semantic properties and units
Semantic properties and unitsSemantic properties and units
Semantic properties and unitsStuart Chalk
 
Open semantic chemical structures
Open semantic chemical structuresOpen semantic chemical structures
Open semantic chemical structuresStuart Chalk
 
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...Stuart Chalk
 
AnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardAnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardStuart Chalk
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataStuart Chalk
 
Scientific Units in the Electronic Age
Scientific Units in the Electronic AgeScientific Units in the Electronic Age
Scientific Units in the Electronic AgeStuart Chalk
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Stuart Chalk
 
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...Stuart Chalk
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook OntologyStuart Chalk
 
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series DataSharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series DataStuart Chalk
 
Bringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic WebBringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic WebStuart Chalk
 
Reactions to the Open Spectral Database
Reactions to the Open Spectral DatabaseReactions to the Open Spectral Database
Reactions to the Open Spectral DatabaseStuart Chalk
 
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015Stuart Chalk
 
Building a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP ProjectBuilding a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP ProjectStuart Chalk
 
A Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXA Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXStuart Chalk
 
Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)Stuart Chalk
 
ACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into EurekaACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into EurekaStuart Chalk
 
ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka IntegrationACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka IntegrationStuart Chalk
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataStuart Chalk
 
ACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP ProjectACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP ProjectStuart Chalk
 

More from Stuart Chalk (20)

Semantic properties and units
Semantic properties and unitsSemantic properties and units
Semantic properties and units
 
Open semantic chemical structures
Open semantic chemical structuresOpen semantic chemical structures
Open semantic chemical structures
 
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
ChemExtractor: Enhanced Rule-Based Capture and Identification of PDF Based Pr...
 
AnIML: A New Analytical Data Standard
AnIML: A New Analytical Data StandardAnIML: A New Analytical Data Standard
AnIML: A New Analytical Data Standard
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
 
Scientific Units in the Electronic Age
Scientific Units in the Electronic AgeScientific Units in the Electronic Age
Scientific Units in the Electronic Age
 
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
Toward Semantic Representation of Science in Electronic Laboratory Notebooks ...
 
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
Rule-based Capture/Storage of Scientific Data from PDF Files and Export using...
 
The Electronic Notebook Ontology
The Electronic Notebook OntologyThe Electronic Notebook Ontology
The Electronic Notebook Ontology
 
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series DataSharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
Sharing Science Data: Semantically Reimagining the IUPAC Solubility Series Data
 
Bringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic WebBringing Flow injection Analysis to the Semantic Web
Bringing Flow injection Analysis to the Semantic Web
 
Reactions to the Open Spectral Database
Reactions to the Open Spectral DatabaseReactions to the Open Spectral Database
Reactions to the Open Spectral Database
 
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
Integrating AnIML Files in Electronic Laboratory Notebooks - PittCon 2015
 
Building a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP ProjectBuilding a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP Project
 
A Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSXA Standard Data Format for Computational Chemistry: CSX
A Standard Data Format for Computational Chemistry: CSX
 
Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)Overview of the Analytical Information Markup Language (AnIML)
Overview of the Analytical Information Markup Language (AnIML)
 
ACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into EurekaACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
ACS 248th Paper 146 VIVO/ScientistsDB Integration into Eureka
 
ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka IntegrationACS 248th Paper 136 JSmol/JSpecView Eureka Integration
ACS 248th Paper 136 JSmol/JSpecView Eureka Integration
 
ACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility DataACS 248th Paper 108 NIST-IUPAC Solubility Data
ACS 248th Paper 108 NIST-IUPAC Solubility Data
 
ACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP ProjectACS 248th Paper 71 ChAMP Project
ACS 248th Paper 71 ChAMP Project
 

Recently uploaded

GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body Areesha Ahmad
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxDiariAli
 
X-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center ChimneyX-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center ChimneySérgio Sacani
 
Concept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdfConcept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdfCherry
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCherry
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptxCherry
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
Pteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecyclePteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecycleCherry
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisAreesha Ahmad
 
Understanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution MethodsUnderstanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution Methodsimroshankoirala
 
Efficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence accelerationEfficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence accelerationSérgio Sacani
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cherry
 
ONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for voteONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for voteRaunakRastogi4
 
FS P2 COMBO MSTA LAST PUSH past exam papers.
FS P2 COMBO MSTA LAST PUSH past exam papers.FS P2 COMBO MSTA LAST PUSH past exam papers.
FS P2 COMBO MSTA LAST PUSH past exam papers.takadzanijustinmaime
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Cherry
 
Daily Lesson Log in Science 9 Fourth Quarter Physics
Daily Lesson Log in Science 9 Fourth Quarter PhysicsDaily Lesson Log in Science 9 Fourth Quarter Physics
Daily Lesson Log in Science 9 Fourth Quarter PhysicsWILSONROMA4
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 

Recently uploaded (20)

GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body GBSN - Microbiology (Unit 3)Defense Mechanism of the body
GBSN - Microbiology (Unit 3)Defense Mechanism of the body
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
X-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center ChimneyX-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
X-rays from a Central “Exhaust Vent” of the Galactic Center Chimney
 
Concept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdfConcept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdf
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
PODOCARPUS...........................pptx
PODOCARPUS...........................pptxPODOCARPUS...........................pptx
PODOCARPUS...........................pptx
 
Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
Pteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecyclePteris : features, anatomy, morphology and lifecycle
Pteris : features, anatomy, morphology and lifecycle
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of Asepsis
 
Understanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution MethodsUnderstanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution Methods
 
Efficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence accelerationEfficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence acceleration
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
ONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for voteONLINE VOTING SYSTEM SE Project for vote
ONLINE VOTING SYSTEM SE Project for vote
 
FS P2 COMBO MSTA LAST PUSH past exam papers.
FS P2 COMBO MSTA LAST PUSH past exam papers.FS P2 COMBO MSTA LAST PUSH past exam papers.
FS P2 COMBO MSTA LAST PUSH past exam papers.
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
 
Daily Lesson Log in Science 9 Fourth Quarter Physics
Daily Lesson Log in Science 9 Fourth Quarter PhysicsDaily Lesson Log in Science 9 Fourth Quarter Physics
Daily Lesson Log in Science 9 Fourth Quarter Physics
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 

ACS 248th Paper 104 ChemData Project

  • 1. <chemical_informatics_project> <inspires> <chemistry_majors> Stuart J. Chalk Department of Chemistry University of North Florida schalk@unf.edu 2014 Fall ACS Meeting
  • 2.  Motivation  Chemical Information Science: The Course  Syllabus  Final Project Outline  Expected Student Activities  Submitted Data Sets  Example Data  Data Modeling  Compound Data  Website  Future Plans  Conclusion Outline From http://www.embl.de/chemcore/chemcore_services/computational_chemistry/
  • 3. Motivation  Students are not exposed to informatics in the regular chemistry curriculum  There is so much information for the chemist to access/use they need to know how to deal with it  Giving students this exposure makes them more competitive in graduate/professional school  We need professionals at the interface of chemistry and information science
  • 4. Chemical Information Science: A UNF Elective Class  First taught as a Freshmen Honors course in 2003 “Chemical Informatics”  Five iterations over the last 10 years  Now an upper-level three credit elective class  Fall 2013 cohort – 21 students (three credit lecture) “Chemical Information Science”
  • 5. Syllabus  What is information? What is data? What is metadata? What types of data are there?  How and where does informatics fit in chemistry?  How is information organized, stored, related, formatted, typed?  The objected oriented view of information (objects, classes, methods)  The Semantic Web – What is it why is it important?  Defining relationships between data, Concept maps  Controlled vocabularies, Thesauri, Ontologies
  • 6. Syllabus  The eXtensible Markup Language (XML) and Scientific Markup Languages  Understanding and using Web 2.0 technologies for information retrieval  Generating Information and Metadata  Finding Chemical Information  Tools for Finding, Organizing and Using Chemical Information  Searching databases  Internet/browser software for Chemistry  Using Excel for searching and organizing scientific information
  • 7. Final Project Outline  The ChemData Database  For your project you will gather chemical data from sources on the Internet, organize/filter the data, added it too the Excel spreadsheet provided, and then send your completed Excel spreadsheet to Dr. Chalk by the deadline.  Requirements  600 pieces of metadata at minimum must be submitted (excluding reference data)  The data must be correctly entered in the spreadsheet (no extra spaces, loss of accuracy, etc.)  It must be referenced to its origin, and those reference included in the spreadsheet  For chemicals, the InChI must be part of the submitted metadata for each chemical species  A minimum of six hours of time for this activity is expected  The Excel Spreadsheet to use is available on the course website.
  • 8.  Find suitable data source (hand coded web page) on ‘reputable’ site with original reference  Download webpage content to computer  ‘Scrape’ data out of webpage  Perform any data normalization (e.g. scientific notation)  Get metadata about chemicals referenced  Get metadata about original reference (DOI)  Import data into Excel and organize  Assign unique ids and add ids to link data  Add units and other metadata Expected Student Activities
  • 9. Submitted Data Sets  Students used an Excel spreadsheet to organize their data
  • 10. Submitted Data Sets  They choose to submit data about  Organic compound properties  Organic compound reactions  Solvent properties  Types of analytical instrumentation  Analytical instrument operating conditions  Mathematical equations used in PChem  Physical constants  Unit conversion factors
  • 15.  Very positive   “Course was an informative and enjoyable overview of the emerging field of informatics as it relates to the sciences and Chemistry in particular.”  “What I am taking away from this class is something that can be applied to other courses and my career. Interesting peek behind the curtains of how the sharing of scientific knowledge and discovery are evolving.”  “Dr. Chalk was very enthusiastic about the subject of chemical informatics. He exposed us to some very helpful chemistry resources that I plan on using in the future.”  “Very interesting class with a lot of hands on computer use and learning experience. The homework was relative to the course information and helped to prepare for exams. Would retake this class and recommend to a friend interested in data or computer science.” Feedback
  • 16. Future Plans  Finish curating, cleaning up data  Make site publically available  For students: provide detailed instructions on how to find, curate, and submit their own data  For faculty: provide detailed description of the project and Excel spreadsheet  Write up a paper about this for J. Chem. Ed.  Use site as the basis for a question bank for online study questions
  • 17. Conclusion  This was a fun project to run at the end of the class  Bringing together all that we had talked about in an activity made it much more tangible for students  Students liked the idea that the Chem Data website would be used by other students in chemistry  I can’t wait to teach this again…
  • 18.  schalk@unf.edu  Phone: 904-620-5311  Skype: stuartchalk  LinkedIn/Slidehare: https://www.linkedin.com/in/stuchalk  ORCID: http://orcid.org/0000-0002-0703-7776  ResearcherID: http://www.researcherid.com/rid/D-8577-2013 Questions?