SlideShare a Scribd company logo
1 of 12
Download to read offline
   
Geophone Data Mining
Ricardo Aguirre
   
Università degli Studi di 
Padova
   
What does Data Mining is?
● The process for:
– select
– explore
– model big data volumes.
● For discover periodicity and not knowing
relations.
● It search useful and clear results
● Is many useful for the data base proprietary.
   
Knowledge Data Discovering
   
The Data mining process
● 1. Defining analysis goals.
● 2. Select, organize and prepare data.
● 3. Data exploration analysis and eventual
transformation.
● 4. Establish statistical methods for
elaboration phase:
– Exploratory methods
– Descriptive methods
– Forecast methods
– Local methods
   
The Data mining process (will continue)
● 5. Data elaboration, using the previously selected
methods.
● 6. Evaluation and validation of statistical methods,
select a final analysis model.
● 7. Model interpretation and future appliances on
decision processes.
   
1. Defining analysis goals.
● shows citizen behaviors like:
– Who lives/works in certain area
– Which are their “working days”
– Who like certain things (such as assist to the
soccer stadium each two weeks)
– Who buy in well known supermarkets.
– Who has little childs, because go to kinder
garden each day in the open/close hours.
– Who is using train service, because he follow
the rail lines.
– Who use car daily, because follow the freeway
route.
   
2. Select, organize and prepare data.
● Create Metadata Database
● Populate it
– Delimitate problem just for a city
– Make an database extraction just
for considering that city.
– Research entire city services and its
addresses
– Transform each addresses in
geopositions
– Create Relations between “Service
Places” and “base stations”
   
3. Data exploration analysis and
eventual transformation
● Transaction Data with Missing and Incomplete
Fields
– CELL_TO_LOCATION_TRACE()
– lookUnlocalizedCells()
● Content changes along the time
   
4. Establish statistical methods for
elaboration phase
● We decide to use a Business Rule-Engine
– The underlying idea of a rule engine is
to externalize the business or application logic
   
Data Mining Differs from
Typical Operational Business
   
next steps?
● Finish Geophone Data Mining
– Continue working with the Rule-Engine
– Making Decision Trees
– Link analysis
– Cluster Analysis
● Create Real-time Embedded System,
– this software piece will replace Mobile Application
– will be installed on Base-Stations
– will avoid all cell management problems and many
of current data acquisition problems.
● Get ready for Anthropological Approach

More Related Content

Similar to Geophone -- Data Mining Presentation

Similar to Geophone -- Data Mining Presentation (20)

EFFICIENT DATA EXTRACTION USING ARTIFICIAL INTELLIGENCE
EFFICIENT DATA EXTRACTION USING  ARTIFICIAL INTELLIGENCEEFFICIENT DATA EXTRACTION USING  ARTIFICIAL INTELLIGENCE
EFFICIENT DATA EXTRACTION USING ARTIFICIAL INTELLIGENCE
 
Knowledge discovery process
Knowledge discovery process Knowledge discovery process
Knowledge discovery process
 
Methodology - Conceptual Database Design Transparencies .pptx
Methodology -  Conceptual Database Design Transparencies .pptxMethodology -  Conceptual Database Design Transparencies .pptx
Methodology - Conceptual Database Design Transparencies .pptx
 
Internship Presentation.pdf
Internship Presentation.pdfInternship Presentation.pdf
Internship Presentation.pdf
 
Machine learning and big data
Machine learning and big dataMachine learning and big data
Machine learning and big data
 
A Space X Industry Day Briefing 7 Jul08 Jgm R4
A Space X Industry Day Briefing 7 Jul08 Jgm R4A Space X Industry Day Briefing 7 Jul08 Jgm R4
A Space X Industry Day Briefing 7 Jul08 Jgm R4
 
Post office management system project ..pdf
Post office management system project ..pdfPost office management system project ..pdf
Post office management system project ..pdf
 
EMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniquesEMOS 2018 Big Data methods and techniques
EMOS 2018 Big Data methods and techniques
 
Advanced Use Cases for Analytics Breakout Session
Advanced Use Cases for Analytics Breakout SessionAdvanced Use Cases for Analytics Breakout Session
Advanced Use Cases for Analytics Breakout Session
 
Data management planning: the what, the why, the who, the how
Data management planning: the what, the why, the who, the howData management planning: the what, the why, the who, the how
Data management planning: the what, the why, the who, the how
 
Datagrinch product experience
Datagrinch product experienceDatagrinch product experience
Datagrinch product experience
 
Big Data Infrastructure and Analytics Solution on FITAT2013
Big Data Infrastructure and Analytics Solution on FITAT2013Big Data Infrastructure and Analytics Solution on FITAT2013
Big Data Infrastructure and Analytics Solution on FITAT2013
 
Business analytics and data visualisation
Business analytics and data visualisationBusiness analytics and data visualisation
Business analytics and data visualisation
 
Data mining slide for data mining process
Data mining slide for data mining processData mining slide for data mining process
Data mining slide for data mining process
 
Unit 1 DSS
Unit 1 DSSUnit 1 DSS
Unit 1 DSS
 
Data Analytics and Big Data on IoT
Data Analytics and Big Data on IoTData Analytics and Big Data on IoT
Data Analytics and Big Data on IoT
 
IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...
IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...
IT Architectures for Handling Big Data in Official Statistics: the Case of Sc...
 
Management Information system
Management Information systemManagement Information system
Management Information system
 
Strategies for the seamless deployment of travel diary collection systems to ...
Strategies for the seamless deployment of travel diary collection systems to ...Strategies for the seamless deployment of travel diary collection systems to ...
Strategies for the seamless deployment of travel diary collection systems to ...
 
Webinar: Realizing the Promise of Machine to Machine (M2M) with MongoDB
Webinar: Realizing the Promise of Machine to Machine (M2M) with MongoDBWebinar: Realizing the Promise of Machine to Machine (M2M) with MongoDB
Webinar: Realizing the Promise of Machine to Machine (M2M) with MongoDB
 

Recently uploaded

Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
FIDO Alliance
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial IntelligenceRevolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
Precisely
 

Recently uploaded (20)

Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdfFrisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
Frisco Automating Purchase Orders with MuleSoft IDP- May 10th, 2024.pptx.pdf
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & Ireland
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Generative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdfGenerative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdf
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial IntelligenceRevolutionizing SAP® Processes with Automation and Artificial Intelligence
Revolutionizing SAP® Processes with Automation and Artificial Intelligence
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
(Explainable) Data-Centric AI: what are you explaininhg, and to whom?
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 

Geophone -- Data Mining Presentation

  • 1.     Geophone Data Mining Ricardo Aguirre
  • 3.     What does Data Mining is? ● The process for: – select – explore – model big data volumes. ● For discover periodicity and not knowing relations. ● It search useful and clear results ● Is many useful for the data base proprietary.
  • 4.     Knowledge Data Discovering
  • 5.     The Data mining process ● 1. Defining analysis goals. ● 2. Select, organize and prepare data. ● 3. Data exploration analysis and eventual transformation. ● 4. Establish statistical methods for elaboration phase: – Exploratory methods – Descriptive methods – Forecast methods – Local methods
  • 6.     The Data mining process (will continue) ● 5. Data elaboration, using the previously selected methods. ● 6. Evaluation and validation of statistical methods, select a final analysis model. ● 7. Model interpretation and future appliances on decision processes.
  • 7.     1. Defining analysis goals. ● shows citizen behaviors like: – Who lives/works in certain area – Which are their “working days” – Who like certain things (such as assist to the soccer stadium each two weeks) – Who buy in well known supermarkets. – Who has little childs, because go to kinder garden each day in the open/close hours. – Who is using train service, because he follow the rail lines. – Who use car daily, because follow the freeway route.
  • 8.     2. Select, organize and prepare data. ● Create Metadata Database ● Populate it – Delimitate problem just for a city – Make an database extraction just for considering that city. – Research entire city services and its addresses – Transform each addresses in geopositions – Create Relations between “Service Places” and “base stations”
  • 9.     3. Data exploration analysis and eventual transformation ● Transaction Data with Missing and Incomplete Fields – CELL_TO_LOCATION_TRACE() – lookUnlocalizedCells() ● Content changes along the time
  • 10.     4. Establish statistical methods for elaboration phase ● We decide to use a Business Rule-Engine – The underlying idea of a rule engine is to externalize the business or application logic
  • 11.     Data Mining Differs from Typical Operational Business
  • 12.     next steps? ● Finish Geophone Data Mining – Continue working with the Rule-Engine – Making Decision Trees – Link analysis – Cluster Analysis ● Create Real-time Embedded System, – this software piece will replace Mobile Application – will be installed on Base-Stations – will avoid all cell management problems and many of current data acquisition problems. ● Get ready for Anthropological Approach