SlideShare a Scribd company logo
1 of 38
Data mining – databases
Tim Deprez
Outline
• Today:
– introduction to databases
– Introduction on how to work with MSAccess
• Next coming days: practical excercises with
MSAccess
Data mining
• Exploration of data
• Prerequisite: data should be available in a
minable format - database
• Database = electronic document storing data
– Non-relational: 1 bulk system with non-related
items (eg. Msexcel files, text-documents, non-
related-tables)
– Relational: all items (tables) are linked to each
other (see further)
Why using a database
• Relational database:
– All your data is stored in 1 file
• Easy to retrieve data
• Easy to backup
– Data and metadata stored together
• Data ...
• Metadata: data about the data (documentation)
– Many data-files contain undocumented values:
– Species A has an abundance of 17 ( meaning of value 17?)
Why using a database
• All data in a good relational designed database
is only stored once:
– Example: species list  typing errors
• Nudora thorakista
• Nudora thorrakista
• Nudora thorakhista
• Nudora thorakisa
– 1 species  species richness calculation: 4
– Solution: 1 table with each species 1 record and
use it as a reference
Why using a database
• Data is much more rigid ...
– More difficult to make errors
– E.g. Sorting in excell
Relational database - biology
Species
person
Places
Sample
Country
Density
Equipment
Species
person
Places
Sample
Country
Density
Equipment
Which person was present on samplings in sweden?
Species
person
Places
Sample
Country
Density
Equipment
Which species sampled with a core occur in densities higher than 40
Variable
Var_value
Taxonomy
Photo
Literature
...
...
...
...
Table designs ...
• A table consists of a series of Columns ...
• Each record as such:
– Different fields
– Design of table must be done
before data is entered
– Each field: name, data type
– Each field can also by formatted  layout
Record
ColumnField
Table designs ...
• Field types:
– Numeric – integer/double
– Text
– Date/Time
– Memo
– Autonumber  ID
– Yes/No
Task on field types:
• 12
• 15 jan 1988
• hallo
• 12,456
• 12:56
• Azdazdazd azdda zda azdd dad zd dadazdzd
azdazddazdd azdazd azdazd dzdzdzzd ada zzd
azdaz dda azd da az d z azdzadazd a zd a azd
azd z dd da a z a z zd d ddaa zd
• 09:89
Special field in a table: key
• A key = a unique identifier for a record
– Example: pasport number:
• Number in a database which is unique and relates to all data
about you
– Each record in a table gets also a key
– This key is used to link tables to each other
– Example:
• Nudora sp1 – id: 123776
• Nudora sp2 – id: 34688
– Advantage: species name changes: linked taxa remain
linked
Linking tables through id’s
• Storing numbers is most effecient way to store
data:
• Nudora sp1 is found in the north sea with a
density of 32
• Species 123776 is found in station 2 (North
sea) with a density of 32
• Record in table density becomes:
123776 | 2 | 32
Setting up relations between tables
• Relations: links between tables
• Connecting tables through certain fields in a
rigid way to each other
• Advantage: database becomes a strong unity
• Types of relations:
– 1 to many
– Many to many ( = 2 times 1 to many)
Examples of relations
• Table places: field country (numeric)
• Table countries – list of countries,
each country has unique id
• Relation is made between:
– Field country in places
– Field id in country
• One to many relation: 1 record in table
country linked to multiple records in places
• No deleting of countries possible
Places
Country
Examples of relations
• Many to many
• Id of sample
• Id of species
• Table density: unique combination of sample,
species ...
Species
Sample
Density
Queries
• All data in database:
– Next step: get it out again
– Selections on 1 table: by using filters
– Selections on multiple tables: using queries
– Queries can be saved and reused
– Queries can be the basis for new queries
Sorting on tables
• Sorting
Filtering on tables
Making a simple selection Query
• Create ... Query in design view
• Switching between views:
Making a simple selection Query
• Select the tables and/or queries needed
Making a simple selection Query
• Select the fields needed for output/selection/sorting
Making a simple selection Query
• Select the fields needed for output/selection/sorting
Making a simple selection Query
• Select the fields needed for output/selection/sorting
Making a simple selection Query
• Select the fields needed for output/selection/sorting
Making a simple selection Query
• Set the criteria
Making a simple selection Query
• Select the values to out put and add sorting
options
Output the results
• Go to datasheet view
Making a simple selection Query
• Special options ...
Exporting data
• From msaccess it is possible to export to
different formats!
• Tables, queries, ...
• Exports can be used to do further data mining:
– Through MSExcell  making graphs
– To do statistical analysis
Exporting data
Step by step demonstration
• Open a database
• Different items in database
• Open tables, sorting, filtering
• Table design
• Relationships
• Queries
Query operators
= equals
> Larger than
< Smaller than
>= larger than or equals
Between ... And ...
Is null
Like ...
Not like ...
Query operators
Query operators
and both true
or at least 1 true
< Smaller than
>= larger than or equals
Between ... And ...
Is null
Like ...
Not like ... >"q*" and <"u*" VOORNAAM René, Robbie, Stefan, Stijn, Tim, Tristam
="r*" or "s*" VOORNAAM Robbie, Stefan, Stijn

More Related Content

What's hot (10)

4 create database
4 create database4 create database
4 create database
 
Data structures
Data structuresData structures
Data structures
 
Introduction to mining massive datasets
Introduction to mining massive datasetsIntroduction to mining massive datasets
Introduction to mining massive datasets
 
Introduction to Databases
Introduction to DatabasesIntroduction to Databases
Introduction to Databases
 
23.database
23.database23.database
23.database
 
SQL Fundamentals (Oracle 11g) - Lecture 1
SQL Fundamentals (Oracle 11g) - Lecture 1SQL Fundamentals (Oracle 11g) - Lecture 1
SQL Fundamentals (Oracle 11g) - Lecture 1
 
Semi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific TablesSemi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific Tables
 
HPEC 2021 sparse binary format
HPEC 2021 sparse binary formatHPEC 2021 sparse binary format
HPEC 2021 sparse binary format
 
Dbms Basics
Dbms BasicsDbms Basics
Dbms Basics
 
Trees
TreesTrees
Trees
 

Similar to Data mining – introduction

Main MeMory Data Base
Main MeMory Data BaseMain MeMory Data Base
Main MeMory Data BaseSiva Rushi
 
[Www.pkbulk.blogspot.com]dbms12
[Www.pkbulk.blogspot.com]dbms12[Www.pkbulk.blogspot.com]dbms12
[Www.pkbulk.blogspot.com]dbms12AnusAhmad
 
1.1 introduction to Data Structures.ppt
1.1 introduction to Data Structures.ppt1.1 introduction to Data Structures.ppt
1.1 introduction to Data Structures.pptAshok280385
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftAmazon Web Services
 
Analyzing Extended and Scientific Metadata for Scalable Index Designs
Analyzing Extended and Scientific Metadata for Scalable Index DesignsAnalyzing Extended and Scientific Metadata for Scalable Index Designs
Analyzing Extended and Scientific Metadata for Scalable Index DesignsAleatha Parker-Wood
 
Database Indexes
Database IndexesDatabase Indexes
Database IndexesSperasoft
 
Data preprocessing ppt1
Data preprocessing ppt1Data preprocessing ppt1
Data preprocessing ppt1meenas06
 
chapter09 -Programming Data Structures.pdf
chapter09 -Programming Data Structures.pdfchapter09 -Programming Data Structures.pdf
chapter09 -Programming Data Structures.pdfsatonaka3
 
Unit I Database concepts - RDBMS & ORACLE
Unit I  Database concepts - RDBMS & ORACLEUnit I  Database concepts - RDBMS & ORACLE
Unit I Database concepts - RDBMS & ORACLEDrkhanchanaR
 
Unit-III_External Memory.ppt
Unit-III_External Memory.pptUnit-III_External Memory.ppt
Unit-III_External Memory.pptShantanuDharekar
 
Ch 2-introduction to dbms
Ch 2-introduction to dbmsCh 2-introduction to dbms
Ch 2-introduction to dbmsRupali Rana
 
Data extraction, cleanup &amp; transformation tools 29.1.16
Data extraction, cleanup &amp; transformation tools 29.1.16Data extraction, cleanup &amp; transformation tools 29.1.16
Data extraction, cleanup &amp; transformation tools 29.1.16Dhilsath Fathima
 

Similar to Data mining – introduction (20)

Relational databases
Relational databasesRelational databases
Relational databases
 
Main MeMory Data Base
Main MeMory Data BaseMain MeMory Data Base
Main MeMory Data Base
 
[Www.pkbulk.blogspot.com]dbms12
[Www.pkbulk.blogspot.com]dbms12[Www.pkbulk.blogspot.com]dbms12
[Www.pkbulk.blogspot.com]dbms12
 
Data processing and analysis
Data processing and analysisData processing and analysis
Data processing and analysis
 
Vaex pygrunn
Vaex pygrunnVaex pygrunn
Vaex pygrunn
 
1.1 introduction to Data Structures.ppt
1.1 introduction to Data Structures.ppt1.1 introduction to Data Structures.ppt
1.1 introduction to Data Structures.ppt
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
 
Analyzing Extended and Scientific Metadata for Scalable Index Designs
Analyzing Extended and Scientific Metadata for Scalable Index DesignsAnalyzing Extended and Scientific Metadata for Scalable Index Designs
Analyzing Extended and Scientific Metadata for Scalable Index Designs
 
Database Indexes
Database IndexesDatabase Indexes
Database Indexes
 
Data preprocessing ppt1
Data preprocessing ppt1Data preprocessing ppt1
Data preprocessing ppt1
 
chapter09 -Programming Data Structures.pdf
chapter09 -Programming Data Structures.pdfchapter09 -Programming Data Structures.pdf
chapter09 -Programming Data Structures.pdf
 
Data Structures 8
Data Structures 8Data Structures 8
Data Structures 8
 
Unit I Database concepts - RDBMS & ORACLE
Unit I  Database concepts - RDBMS & ORACLEUnit I  Database concepts - RDBMS & ORACLE
Unit I Database concepts - RDBMS & ORACLE
 
Pandas
PandasPandas
Pandas
 
Unit-III_External Memory.ppt
Unit-III_External Memory.pptUnit-III_External Memory.ppt
Unit-III_External Memory.ppt
 
Redshift 101
Redshift 101Redshift 101
Redshift 101
 
Database
DatabaseDatabase
Database
 
Ch 2-introduction to dbms
Ch 2-introduction to dbmsCh 2-introduction to dbms
Ch 2-introduction to dbms
 
Data extraction, cleanup &amp; transformation tools 29.1.16
Data extraction, cleanup &amp; transformation tools 29.1.16Data extraction, cleanup &amp; transformation tools 29.1.16
Data extraction, cleanup &amp; transformation tools 29.1.16
 
lecture 1.pdf
lecture 1.pdflecture 1.pdf
lecture 1.pdf
 

More from Fiddy Prasetiya

Water pollution in indonesia
Water pollution in indonesiaWater pollution in indonesia
Water pollution in indonesiaFiddy Prasetiya
 
Impact of aquaculture activity on phytoplankton diversity in djuanda reservoi...
Impact of aquaculture activity on phytoplankton diversity in djuanda reservoi...Impact of aquaculture activity on phytoplankton diversity in djuanda reservoi...
Impact of aquaculture activity on phytoplankton diversity in djuanda reservoi...Fiddy Prasetiya
 
Diversity copepods in deep sea coral
Diversity copepods in deep sea coralDiversity copepods in deep sea coral
Diversity copepods in deep sea coralFiddy Prasetiya
 
Assessment sg detection by remote sensing
Assessment sg detection by remote sensingAssessment sg detection by remote sensing
Assessment sg detection by remote sensingFiddy Prasetiya
 
Lecture toxicity testing
Lecture   toxicity testingLecture   toxicity testing
Lecture toxicity testingFiddy Prasetiya
 
Oceangraphic data formats
Oceangraphic data formatsOceangraphic data formats
Oceangraphic data formatsFiddy Prasetiya
 
Data management principles
Data management principlesData management principles
Data management principlesFiddy Prasetiya
 
Water quality degradation & cyanobacterial blooms
Water quality degradation & cyanobacterial bloomsWater quality degradation & cyanobacterial blooms
Water quality degradation & cyanobacterial bloomsFiddy Prasetiya
 
Sea bird mortality at cabo san luca: presentation_fiddy
Sea bird mortality at cabo san luca: presentation_fiddySea bird mortality at cabo san luca: presentation_fiddy
Sea bird mortality at cabo san luca: presentation_fiddyFiddy Prasetiya
 
Primary production in Spuikom lagoon, Belgium
Primary production in Spuikom lagoon, BelgiumPrimary production in Spuikom lagoon, Belgium
Primary production in Spuikom lagoon, BelgiumFiddy Prasetiya
 
Study on the behavior of the heavy metals
Study on the behavior of the heavy metalsStudy on the behavior of the heavy metals
Study on the behavior of the heavy metalsFiddy Prasetiya
 
Benthic fauna of the inner part of ariake
Benthic fauna of the inner part of ariakeBenthic fauna of the inner part of ariake
Benthic fauna of the inner part of ariakeFiddy Prasetiya
 
Allelopatic haslea ostrearia on different species of diatoms
Allelopatic haslea ostrearia on different species of diatomsAllelopatic haslea ostrearia on different species of diatoms
Allelopatic haslea ostrearia on different species of diatomsFiddy Prasetiya
 
2 presentasi pemulihan lahan borobudur 01 juni 2011 ya
2 presentasi pemulihan lahan borobudur 01 juni 2011 ya2 presentasi pemulihan lahan borobudur 01 juni 2011 ya
2 presentasi pemulihan lahan borobudur 01 juni 2011 yaFiddy Prasetiya
 
2 proper 01 juni 2011 rt ok
2 proper 01 juni 2011 rt ok2 proper 01 juni 2011 rt ok
2 proper 01 juni 2011 rt okFiddy Prasetiya
 
2 kriteria plb3 agro & hasil 01 juni 2011 hh ok.ppt
2 kriteria plb3 agro & hasil 01 juni 2011 hh ok.ppt2 kriteria plb3 agro & hasil 01 juni 2011 hh ok.ppt
2 kriteria plb3 agro & hasil 01 juni 2011 hh ok.pptFiddy Prasetiya
 

More from Fiddy Prasetiya (20)

Water pollution in indonesia
Water pollution in indonesiaWater pollution in indonesia
Water pollution in indonesia
 
Impact of aquaculture activity on phytoplankton diversity in djuanda reservoi...
Impact of aquaculture activity on phytoplankton diversity in djuanda reservoi...Impact of aquaculture activity on phytoplankton diversity in djuanda reservoi...
Impact of aquaculture activity on phytoplankton diversity in djuanda reservoi...
 
Diversity copepods in deep sea coral
Diversity copepods in deep sea coralDiversity copepods in deep sea coral
Diversity copepods in deep sea coral
 
Assessment sg detection by remote sensing
Assessment sg detection by remote sensingAssessment sg detection by remote sensing
Assessment sg detection by remote sensing
 
Rq evaluation
Rq evaluationRq evaluation
Rq evaluation
 
Lecture toxicity testing
Lecture   toxicity testingLecture   toxicity testing
Lecture toxicity testing
 
Era2010
Era2010Era2010
Era2010
 
Oceangraphic data formats
Oceangraphic data formatsOceangraphic data formats
Oceangraphic data formats
 
Data policies
Data policiesData policies
Data policies
 
Data management principles
Data management principlesData management principles
Data management principles
 
Vliz poster fiddy
Vliz poster fiddyVliz poster fiddy
Vliz poster fiddy
 
Water quality degradation & cyanobacterial blooms
Water quality degradation & cyanobacterial bloomsWater quality degradation & cyanobacterial blooms
Water quality degradation & cyanobacterial blooms
 
Sea bird mortality at cabo san luca: presentation_fiddy
Sea bird mortality at cabo san luca: presentation_fiddySea bird mortality at cabo san luca: presentation_fiddy
Sea bird mortality at cabo san luca: presentation_fiddy
 
Primary production in Spuikom lagoon, Belgium
Primary production in Spuikom lagoon, BelgiumPrimary production in Spuikom lagoon, Belgium
Primary production in Spuikom lagoon, Belgium
 
Study on the behavior of the heavy metals
Study on the behavior of the heavy metalsStudy on the behavior of the heavy metals
Study on the behavior of the heavy metals
 
Benthic fauna of the inner part of ariake
Benthic fauna of the inner part of ariakeBenthic fauna of the inner part of ariake
Benthic fauna of the inner part of ariake
 
Allelopatic haslea ostrearia on different species of diatoms
Allelopatic haslea ostrearia on different species of diatomsAllelopatic haslea ostrearia on different species of diatoms
Allelopatic haslea ostrearia on different species of diatoms
 
2 presentasi pemulihan lahan borobudur 01 juni 2011 ya
2 presentasi pemulihan lahan borobudur 01 juni 2011 ya2 presentasi pemulihan lahan borobudur 01 juni 2011 ya
2 presentasi pemulihan lahan borobudur 01 juni 2011 ya
 
2 proper 01 juni 2011 rt ok
2 proper 01 juni 2011 rt ok2 proper 01 juni 2011 rt ok
2 proper 01 juni 2011 rt ok
 
2 kriteria plb3 agro & hasil 01 juni 2011 hh ok.ppt
2 kriteria plb3 agro & hasil 01 juni 2011 hh ok.ppt2 kriteria plb3 agro & hasil 01 juni 2011 hh ok.ppt
2 kriteria plb3 agro & hasil 01 juni 2011 hh ok.ppt
 

Recently uploaded

How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 

Recently uploaded (20)

How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 

Data mining – introduction

  • 1. Data mining – databases Tim Deprez
  • 2. Outline • Today: – introduction to databases – Introduction on how to work with MSAccess • Next coming days: practical excercises with MSAccess
  • 3. Data mining • Exploration of data • Prerequisite: data should be available in a minable format - database • Database = electronic document storing data – Non-relational: 1 bulk system with non-related items (eg. Msexcel files, text-documents, non- related-tables) – Relational: all items (tables) are linked to each other (see further)
  • 4. Why using a database • Relational database: – All your data is stored in 1 file • Easy to retrieve data • Easy to backup – Data and metadata stored together • Data ... • Metadata: data about the data (documentation) – Many data-files contain undocumented values: – Species A has an abundance of 17 ( meaning of value 17?)
  • 5. Why using a database • All data in a good relational designed database is only stored once: – Example: species list  typing errors • Nudora thorakista • Nudora thorrakista • Nudora thorakhista • Nudora thorakisa – 1 species  species richness calculation: 4 – Solution: 1 table with each species 1 record and use it as a reference
  • 6. Why using a database • Data is much more rigid ... – More difficult to make errors – E.g. Sorting in excell
  • 7. Relational database - biology Species person Places Sample Country Density Equipment
  • 11. Table designs ... • A table consists of a series of Columns ... • Each record as such: – Different fields – Design of table must be done before data is entered – Each field: name, data type – Each field can also by formatted  layout Record ColumnField
  • 12. Table designs ... • Field types: – Numeric – integer/double – Text – Date/Time – Memo – Autonumber  ID – Yes/No
  • 13. Task on field types: • 12 • 15 jan 1988 • hallo • 12,456 • 12:56 • Azdazdazd azdda zda azdd dad zd dadazdzd azdazddazdd azdazd azdazd dzdzdzzd ada zzd azdaz dda azd da az d z azdzadazd a zd a azd azd z dd da a z a z zd d ddaa zd • 09:89
  • 14. Special field in a table: key • A key = a unique identifier for a record – Example: pasport number: • Number in a database which is unique and relates to all data about you – Each record in a table gets also a key – This key is used to link tables to each other – Example: • Nudora sp1 – id: 123776 • Nudora sp2 – id: 34688 – Advantage: species name changes: linked taxa remain linked
  • 15. Linking tables through id’s • Storing numbers is most effecient way to store data: • Nudora sp1 is found in the north sea with a density of 32 • Species 123776 is found in station 2 (North sea) with a density of 32 • Record in table density becomes: 123776 | 2 | 32
  • 16. Setting up relations between tables • Relations: links between tables • Connecting tables through certain fields in a rigid way to each other • Advantage: database becomes a strong unity • Types of relations: – 1 to many – Many to many ( = 2 times 1 to many)
  • 17. Examples of relations • Table places: field country (numeric) • Table countries – list of countries, each country has unique id • Relation is made between: – Field country in places – Field id in country • One to many relation: 1 record in table country linked to multiple records in places • No deleting of countries possible Places Country
  • 18. Examples of relations • Many to many • Id of sample • Id of species • Table density: unique combination of sample, species ... Species Sample Density
  • 19. Queries • All data in database: – Next step: get it out again – Selections on 1 table: by using filters – Selections on multiple tables: using queries – Queries can be saved and reused – Queries can be the basis for new queries
  • 22. Making a simple selection Query • Create ... Query in design view • Switching between views:
  • 23. Making a simple selection Query • Select the tables and/or queries needed
  • 24. Making a simple selection Query • Select the fields needed for output/selection/sorting
  • 25. Making a simple selection Query • Select the fields needed for output/selection/sorting
  • 26. Making a simple selection Query • Select the fields needed for output/selection/sorting
  • 27. Making a simple selection Query • Select the fields needed for output/selection/sorting
  • 28. Making a simple selection Query • Set the criteria
  • 29. Making a simple selection Query • Select the values to out put and add sorting options
  • 30. Output the results • Go to datasheet view
  • 31. Making a simple selection Query • Special options ...
  • 32. Exporting data • From msaccess it is possible to export to different formats! • Tables, queries, ... • Exports can be used to do further data mining: – Through MSExcell  making graphs – To do statistical analysis
  • 34.
  • 35. Step by step demonstration • Open a database • Different items in database • Open tables, sorting, filtering • Table design • Relationships • Queries
  • 36. Query operators = equals > Larger than < Smaller than >= larger than or equals Between ... And ... Is null Like ... Not like ...
  • 38. Query operators and both true or at least 1 true < Smaller than >= larger than or equals Between ... And ... Is null Like ... Not like ... >"q*" and <"u*" VOORNAAM René, Robbie, Stefan, Stijn, Tim, Tristam ="r*" or "s*" VOORNAAM Robbie, Stefan, Stijn