Data Mining Threat Identification of
the Australian Terrestrial Biodiversity
Database (NLWRA 2002)
Kurt Pudniks
24 Aug 2012
Overview
• Video clip (7 min)
• Presentation (15 min), 4-Mat system:
– Requirements (Why)
– Design (What)
– Implementation (How)
– Maintenance -> future requirements (If)
• Questions (5 min)
Why process data?
• Efficiency
– Smart use of limited resources
– Develop knowledge management (e.g. SOPs)
– Measurement of progress against goals
• Effectiveness
– Targeted goals that make a difference
– Workforce and policy focus on the right areas
– Awareness and flexibility to adapt tactics
What is data mining (arules)?
• Data mining is the use of automated algorithms
– Users are encouraged to understand the fundamentals (R, FOSS)
– e.g. recursive partitioning, random forests, classification, machine learning
• Association Rules (arules) is described as:
– Mining frequent itemsets and association rules is a popular and well
researched method for discovering interesting relations between
variables in large databases
What is data mining (arules)?
• Shopping basket:
• Support
– {milk,bread} has a support of 2/5 = 0.4
• Confidence
– conf({milk,bread} -> {butter}) = supp({milk,bread,butter}) / supp({milk,bread}) = 0.2 / 0.4 = 0.5
• Lift
– Deviation from independence of LHS and RHS, i.e. 0.2 / (0.4 * 0.6) = 0.5 / 0.6 ≈ 0.83
– Keep this assumption in mind for later...
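
The three measures above can be sketched in a few lines of Python. This is a minimal illustration, not the arules implementation: the five baskets are hypothetical (the slide's own table did not survive), chosen only so that the supports match the figures quoted on the slide, and the helper names `support`, `confidence` and `lift` are assumptions made for this example.

```python
from fractions import Fraction

# Hypothetical five-basket database (an assumption: the slide's table is
# not available). Chosen so that supp({milk,bread}) = 0.4,
# supp({milk,bread,butter}) = 0.2 and supp({butter}) = 0.6.
transactions = [
    {"milk", "bread"},
    {"butter"},
    {"beer"},
    {"milk", "bread", "butter"},
    {"bread", "butter"},
]

def support(itemset, db):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return Fraction(sum(itemset <= t for t in db), len(db))

def confidence(lhs, rhs, db):
    """supp(LHS and RHS together) divided by supp(LHS)."""
    return support(set(lhs) | set(rhs), db) / support(lhs, db)

def lift(lhs, rhs, db):
    """Rule support divided by the support expected under independence."""
    joint = support(set(lhs) | set(rhs), db)
    return joint / (support(lhs, db) * support(rhs, db))

lhs, rhs = {"milk", "bread"}, {"butter"}
print(support(lhs, transactions))          # 2/5
print(confidence(lhs, rhs, transactions))  # 1/2
print(lift(lhs, rhs, transactions))        # 5/6
```

With these assumed baskets the lift is 5/6, about 0.83. A value below 1 means the rule occurs slightly less often than independence of LHS and RHS would predict.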
How to use arules
http://adl.brs.gov.au/anrdl/metadata_files/pa_badesr9nn__02211a01.xml
How to use arules
How to use arules
Results (1)
Results (2)
Results (3)
If future work were possible...
• Buyer beware!
– Lies, damned lies, and statistics…
• Read the fine print:
– The lift of a rule is defined as lift(X -> Y) = supp(X ∪ Y) / (supp(X) * supp(Y)) and can
be interpreted as the deviation of the support of the whole rule from the support
expected under independence given the supports of the LHS and the RHS. Greater
lift values indicate stronger associations.
• Good example of the value of metadata
– Is metadata simply “data about data”?
• Highlights the need for diversity of views
– Sharing of skills and experience in context
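
As a worked note on the "fine print" above: the quoted lift definition can be rewritten in terms of confidence, which is why the shopping-basket slide can express the same lift as 0.5 / 0.6. The derivation below uses only the standard definitions of support and confidence. The numbers are the ones quoted on that slide, and the notation (conf, supp) is an assumption made here for readability.

```latex
\begin{align*}
\operatorname{conf}(X \to Y) &= \frac{\operatorname{supp}(X \cup Y)}{\operatorname{supp}(X)} \\[4pt]
\operatorname{lift}(X \to Y) &= \frac{\operatorname{supp}(X \cup Y)}{\operatorname{supp}(X)\,\operatorname{supp}(Y)}
  = \frac{\operatorname{conf}(X \to Y)}{\operatorname{supp}(Y)} \\[4pt]
\operatorname{lift}(\{\text{milk},\text{bread}\} \to \{\text{butter}\})
  &= \frac{0.2}{0.4 \times 0.6} = \frac{0.5}{0.6} \approx 0.83
\end{align*}
```

A lift of exactly 1 corresponds to the independence baseline, so a reported lift is only as meaningful as that baseline is for the data at hand.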
Questions
www.nature.org/australia www.australianwildlife.org www.wildaustralia.org.au
The Nature Conservancy’s Australia Program office ISBN: 978-0-646-53821-1
