SlideShare a Scribd company logo
From Big Data to
Smart data
Jie (Jack) Yang | April 2016
—What is Big Data?
—Challenge of Big Data processing
—Smart Learning framework
—Applications
—Conclusions
Outline
—No single standard definition
—5-V information assets that require innovative
techniques, algorithms, and analytics that enable
decision making, and process automation
Big Data definition
1 – Scale (Volume)
12+ TBs
of tweet data
every day
25+ TBs of
log data every
day
?TBsof
dataeveryday
2+ billion people on
the Web by end
2011
30 billion RFID tags
today
(1.3B in 2005)
4.6 billion
camera
phones
world wide
100s of
millions of
GPS enabled
devices sold
annually
76 million smart meters in
2009…
200M by 2014
The ability to manage, analyse, summarise, visualise,
and discover knowledge from the collected data in a
timely and scalable manner
2 – Speed (Velocity)
Social media and networks
(millions of active users)
Mobile devices
(tracking objects all the time)
Infrastructure sensors and/or
instruments
(measuring all kinds of data)
Various formats, types and structures:
— Text
— Numerical
— Multi-dim arrays
— Images, audio, video, sequences
— Time series
— Graph (network)
— Streaming data
— etc
3 – Complexity (Varity)
4 – Uncertainty (Veracity)
5 – Benefit (Value)
Value ($, time, performance)
Beer & Diaper (Woolworths in Illawarra)
“A number of convenience store clerks noticed that men
often bought beer at the same time they bought diapers.
The store mined its receipts and proved the clerks'
observations correct. So, the store began stocking
diapers next to the beer coolers, and sales skyrocketed”
Asimple example
Hardware
—Choose machines
—System failure
Challenge of Big Data processing
16 Cores, 32G RAM, $AUD6000+
Software
—Different data sources
—Really slow
—Memory issue (out of memory, for 52 million
records)
Challenge of Big Data processing
Smart Learning Framework
Data harvesting
data partners
Data mining
Data storage
Data streaming
Data visualisation
Hardware
—Money wise
—Tolerance to hardware failure
Smart Learning Framework
16 Cores, 32G RAM, $AUD6000+ 4 Cores, 8G RAM, $AUD 600+
Main features
— Collection across different platforms and formats
• APIs
• Web crawling
— 1 master and 6 workers
• distributing–working–waiting–reactivating
process
— Data volume (per day)
• 20K+ records user activities
• 25K+ records from social platforms
• 200K+ tweets around AU and EU
Data harvesting
Main features
— save data into different formats
• Pure TXT / CSV
• (NO)SQL
— Query across all
— Fast respond
Data storage
SELECT * FROM
(SELECT * FROM /web/logs/CSV) t0
JOIN
( SELECT country, count(*)
FROM mysql.web.users
GROUP BY country) t1
JOIN
(SELECT timestamp
FROM s3.root.clicks.json
WHERE user_id = 'jdoe‘) t2
Main features
— Preprocessing (filtering, cleansing, feature
extraction)
— Event simulation
— Saving to DBs
— Running ML jobs on the fly
• Receiver throughput = 3kb /sec
• Consumer throughput = 2kb /sec
• Consumer latency = 0.23 sec
Data streaming
Main features (35 online training jobs per day)
— Supervised (with a human assisting in classification) /
unsupervised machine learning techniques, to assist with
classification, clustering and prediction;
— Geospatial analysis: K-pop cluster in geographical regions;
— Network analysis to understand social connections between
consumers and producers;
— Other analysis including:
• More sophisticated number crunching of comments, such as
time series analysis to examine trends;
• Natural language processing techniques to assist with
sentiment analysis.
Data mining
Student behaviour analysis (OLPC, until Feb 2016):
— 153+ schools
— 20K+ active laptops
— 4.2M+ activity records
Application 1
0
1000
2000
3000
1.2M 2.6M 4.2M
Most popular Apps (per school) App usage (per school)
0
1000
2000
3000
1.2M 2.6M 4.2M
Car parking
Application 2
Car parking
— Every 2 minutes
— 604800 records (May to Oct 2015)
— Temporal and spatial features
Application 2
Application 2
Average classification accuracy (%) as a function of the size of the selected samples.
Average computational time (second)
Social media analysis
— 70K+ films
— 228K+ users (2M + friendships)
— 1M+ reviews
— 13 features
Application 3
— User profile vs film preference
— User profile vs topics
Application 3
— Network analysis
— Opinion leadership
Application 3
4K nodes + 7K edges 76 nodes + 253 edges
Jie Yang; Jun Ma, A structure optimization algorithm of neural networks for large-scale data sets, Fuzz-IEEE,2014;
Jie Yang; Jun Ma, A Sparsity-Based Training Algorithm for Least Squares SVM, IEEE SSCI, 2014;
Jie Yang, Jun Ma, A big-data processing framework for uncertainties in Transportation data, Fuzz-IEEE, 2015
Jie Yang, Jun Ma, and Sarah K. Howard, A Structure Optimization Algorithm of Neural Networks for Pattern Learning from Educational Data, Springer
Studies in Computational Intelligence ANN Modelling, 2015
Jie Yang; Jun Ma, A hybrid gene expression programming algorithm based on orthogonal design, International Journal of Computational Intelligence
Systems, 2015
Jie Yang, Brian Yecies, Mining Chinese Social Media UGC A SmartLearning Framework For Analyzing Douban Movie Reviews, Journal of Big Data,
2016
Jie Yang; Jun Ma, A structure optimization framework for feed-forward neural networks using sparse representation, Knowledge-Based Systems, 2016;
Jie Yang; Jun Ma, Sarah K. Howard, Exploring Technology Integration in Education using Fuzzy Representation and Feature Selection, Fuzz-IEEE,
2016
Brian Yecies, Jie Yang, Matthew Berryman, Kai Soh, Marketing Bait: Using SMART Data to Identify E-guanxi Among China’s ‘Internet Aborigines,
Film Marketing in a Global Era, 2015
Brian Yecies, Jie Yang, Matthew Berryman, Aegyung Shim, and Kai Soh, Korean Female Writer-Directors and SMART Analysis of Douban
commentary Among China’s Digital Natives, Women Screenwriters: An International Guide, 2015
Brian Yecies, Jie Yang, Matthew Berryman, Aegyung Shim, and Kai Soh, Korean Female Writer–Directors and SMART Analysis of Douban
Commentary Among China’s Digital Natives, Participations: International Journal of Audience Research, 2016
Sarah K. Howard, Jun Ma, Jie Yang, Kate Thompson, The use of data mining to explore factors of technology integration in learning and teaching,
EARLI 2015
Sarah K. Howard, Ellie Rennie, Jun Ma, Jie Yang, Big Data, Big Theory: Moving Beyond New Empiricism to Generate Powerful Explanations, The
New Data “Revolution” in Sociology, 2016
Jun Ma, Jie Yang, Rohan W. Denagamage and Murad Safadi, A Conceptual Model for Clustering Local Government Areas using Complex Fuzzy Sets,
Fuzz-IEEE, 2016
Publications
— OLPC (ARC-Linkage)
— NSW-DER
— CAAR
— China-South Korean Foundation
— Healthcare (Pubmed, Seer)
— Tourism business project (UTS)
— MTR
Projects and grants
— Big Data processing:
• Data collection; streaming data; data storage; and Machine
learning
• Open source libraries
— Other domains:
• Public transportation
• Business Intelligence
• Health care
Conclusions
Thankyou

More Related Content

What's hot

Data Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceData Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data Science
DataMites
 
Spark
SparkSpark
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
Ioannis Kourouklides
 
Kostas Kastrantas | Business Opportunities with Linked Open Data
Kostas Kastrantas  | Business Opportunities with Linked Open DataKostas Kastrantas  | Business Opportunities with Linked Open Data
Kostas Kastrantas | Business Opportunities with Linked Open Data
semanticsconference
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research Activity
Andry Alamsyah
 
Big Data: Beyond the hype, Delivering value
Big Data: Beyond the hype, Delivering valueBig Data: Beyond the hype, Delivering value
Big Data: Beyond the hype, Delivering value
Edward Curry
 
Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introduction
hktripathy
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1
Parviz Vakili
 
Thomas Vavra | New Ways of Handling Old Data
Thomas Vavra | New Ways of Handling Old DataThomas Vavra | New Ways of Handling Old Data
Thomas Vavra | New Ways of Handling Old Data
semanticsconference
 
Social Big Data in Government
Social Big Data in GovernmentSocial Big Data in Government
Social Big Data in Government
Adegboyega Ojo
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data Analytics
Utkarsh Sharma
 
Introduction to Big Data & Analytics
Introduction to Big Data & AnalyticsIntroduction to Big Data & Analytics
Introduction to Big Data & Analytics
Prasad Chitta
 
Big Data Paper
Big Data PaperBig Data Paper
Big Data Paper
Andile Ngcaba
 
Metadata
MetadataMetadata
Metadata
Kuldeep Ghetiya
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
Akshata Humbe
 
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
IT Network marcus evans
 
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
Anna De Liddo
 
Data analytics course in bangalore
Data analytics course in bangaloreData analytics course in bangalore
Data analytics course in bangalore
Umeshchandra Reddy Tera
 

What's hot (20)

Data Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data ScienceData Science Courses - BigData VS Data Science
Data Science Courses - BigData VS Data Science
 
Spark
SparkSpark
Spark
 
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
 
Kostas Kastrantas | Business Opportunities with Linked Open Data
Kostas Kastrantas  | Business Opportunities with Linked Open DataKostas Kastrantas  | Business Opportunities with Linked Open Data
Kostas Kastrantas | Business Opportunities with Linked Open Data
 
Big Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research ActivityBig Data Analytics : Understanding for Research Activity
Big Data Analytics : Understanding for Research Activity
 
Data mining
Data miningData mining
Data mining
 
Big Data: Beyond the hype, Delivering value
Big Data: Beyond the hype, Delivering valueBig Data: Beyond the hype, Delivering value
Big Data: Beyond the hype, Delivering value
 
Data mining
Data miningData mining
Data mining
 
Lect 1 introduction
Lect 1 introductionLect 1 introduction
Lect 1 introduction
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1
 
Thomas Vavra | New Ways of Handling Old Data
Thomas Vavra | New Ways of Handling Old DataThomas Vavra | New Ways of Handling Old Data
Thomas Vavra | New Ways of Handling Old Data
 
Social Big Data in Government
Social Big Data in GovernmentSocial Big Data in Government
Social Big Data in Government
 
Introduction to Big Data Analytics
Introduction to Big Data AnalyticsIntroduction to Big Data Analytics
Introduction to Big Data Analytics
 
Introduction to Big Data & Analytics
Introduction to Big Data & AnalyticsIntroduction to Big Data & Analytics
Introduction to Big Data & Analytics
 
Big Data Paper
Big Data PaperBig Data Paper
Big Data Paper
 
Metadata
MetadataMetadata
Metadata
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...
 
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
The Evidence Hub: Harnessing the Collective Intelligence of Communities to Bu...
 
Data analytics course in bangalore
Data analytics course in bangaloreData analytics course in bangalore
Data analytics course in bangalore
 

Viewers also liked

The Market Research Software Survey
The Market Research Software SurveyThe Market Research Software Survey
The Market Research Software SurveyResearch Magazine
 
Frank sebastian-chaparro-zarta
Frank sebastian-chaparro-zartaFrank sebastian-chaparro-zarta
Frank sebastian-chaparro-zarta
kexal97
 
Wireshark tips
Wireshark tipsWireshark tips
Engineering Services Forum L&T & A123 Systems
Engineering Services Forum L&T & A123 SystemsEngineering Services Forum L&T & A123 Systems
Engineering Services Forum L&T & A123 Systems
Information Services Group (ISG)
 
Creative waystousepodcastsintheclassroom
Creative waystousepodcastsintheclassroomCreative waystousepodcastsintheclassroom
Creative waystousepodcastsintheclassroomkosovoireland
 
PetaJakarta.org Student Presentation
PetaJakarta.org Student PresentationPetaJakarta.org Student Presentation
PetaJakarta.org Student Presentation
SMART Infrastructure Facility
 
Plan de estudio curso básico
Plan de estudio curso básicoPlan de estudio curso básico
Plan de estudio curso básico
Iván Aguirre
 
Corso Fare Marketing sul Web_Varese - Slide introduttive
Corso Fare Marketing sul Web_Varese - Slide introduttiveCorso Fare Marketing sul Web_Varese - Slide introduttive
Corso Fare Marketing sul Web_Varese - Slide introduttive
sFormati Digitali
 
Don't Be a Settler
Don't Be a SettlerDon't Be a Settler
Scan0001
Scan0001Scan0001
Scan0001
CONGVANDEN_PKT
 
#DBS2016 Digital Labor Disruption
#DBS2016 Digital Labor Disruption#DBS2016 Digital Labor Disruption
#DBS2016 Digital Labor Disruption
Information Services Group (ISG)
 
Riesgos físicos
Riesgos físicosRiesgos físicos
Riesgos físicos
BTARequiem
 
Lewitt paragraphs on conceptual art
Lewitt paragraphs on conceptual artLewitt paragraphs on conceptual art
Lewitt paragraphs on conceptual artCCricket
 
Sur de Dublin
Sur de DublinSur de Dublin
Sur de Dublin
Carlos Colomer
 

Viewers also liked (20)

The Market Research Software Survey
The Market Research Software SurveyThe Market Research Software Survey
The Market Research Software Survey
 
Frank sebastian-chaparro-zarta
Frank sebastian-chaparro-zartaFrank sebastian-chaparro-zarta
Frank sebastian-chaparro-zarta
 
9707 s14 ms_23
9707 s14 ms_239707 s14 ms_23
9707 s14 ms_23
 
Wireshark tips
Wireshark tipsWireshark tips
Wireshark tips
 
Engineering Services Forum L&T & A123 Systems
Engineering Services Forum L&T & A123 SystemsEngineering Services Forum L&T & A123 Systems
Engineering Services Forum L&T & A123 Systems
 
Creative waystousepodcastsintheclassroom
Creative waystousepodcastsintheclassroomCreative waystousepodcastsintheclassroom
Creative waystousepodcastsintheclassroom
 
PetaJakarta.org Student Presentation
PetaJakarta.org Student PresentationPetaJakarta.org Student Presentation
PetaJakarta.org Student Presentation
 
Sistem informasi computer
Sistem informasi computerSistem informasi computer
Sistem informasi computer
 
Plan de estudio curso básico
Plan de estudio curso básicoPlan de estudio curso básico
Plan de estudio curso básico
 
Corso Fare Marketing sul Web_Varese - Slide introduttive
Corso Fare Marketing sul Web_Varese - Slide introduttiveCorso Fare Marketing sul Web_Varese - Slide introduttive
Corso Fare Marketing sul Web_Varese - Slide introduttive
 
Don't Be a Settler
Don't Be a SettlerDon't Be a Settler
Don't Be a Settler
 
UCDAAP9NOV16
UCDAAP9NOV16UCDAAP9NOV16
UCDAAP9NOV16
 
Scan0001
Scan0001Scan0001
Scan0001
 
#DBS2016 Digital Labor Disruption
#DBS2016 Digital Labor Disruption#DBS2016 Digital Labor Disruption
#DBS2016 Digital Labor Disruption
 
Riesgos físicos
Riesgos físicosRiesgos físicos
Riesgos físicos
 
Lewitt paragraphs on conceptual art
Lewitt paragraphs on conceptual artLewitt paragraphs on conceptual art
Lewitt paragraphs on conceptual art
 
9707 s14 ms_13
9707 s14 ms_139707 s14 ms_13
9707 s14 ms_13
 
Onet m6 52 thai
Onet m6 52  thaiOnet m6 52  thai
Onet m6 52 thai
 
Christmas Store Slideshow
Christmas Store SlideshowChristmas Store Slideshow
Christmas Store Slideshow
 
Sur de Dublin
Sur de DublinSur de Dublin
Sur de Dublin
 

Similar to SMART Seminar Series: "From Big Data to Smart data"

A New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdfA New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
ArmyTrilidiaDevegaSK
 
00-01 DSnDA.pdf
00-01 DSnDA.pdf00-01 DSnDA.pdf
00-01 DSnDA.pdf
SugumarSarDurai
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
hktripathy
 
Opportunities and methodological challenges of Big Data for official statist...
Opportunities and methodological challenges of  Big Data for official statist...Opportunities and methodological challenges of  Big Data for official statist...
Opportunities and methodological challenges of Big Data for official statist...
Piet J.H. Daas
 
FR.pptx
FR.pptxFR.pptx
Overview of the Research in Wimmics 2018
Overview of the Research in Wimmics 2018Overview of the Research in Wimmics 2018
Overview of the Research in Wimmics 2018
Fabien Gandon
 
Big data mining
Big data miningBig data mining
Classifying malicious websites using an ensemble weighted features
Classifying malicious websites using an ensemble weighted featuresClassifying malicious websites using an ensemble weighted features
Classifying malicious websites using an ensemble weighted features
Dharmendra Vishwakarma
 
General introduction to IoTCrawler
General introduction to IoTCrawlerGeneral introduction to IoTCrawler
General introduction to IoTCrawler
IoTCrawler
 
Data Science, Personalisation & Product management
Data Science, Personalisation & Product managementData Science, Personalisation & Product management
Data Science, Personalisation & Product management
Bhaskar Krishnan
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science Expertise
SoftServe
 
Computational intelligence for big data analytics bda 2013
Computational intelligence for big data analytics   bda 2013Computational intelligence for big data analytics   bda 2013
Computational intelligence for big data analytics bda 2013
oj08
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Yael Garten
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Shirshanka Das
 
Harness the Power of Big Data with Oracle
Harness the Power of Big Data with OracleHarness the Power of Big Data with Oracle
Harness the Power of Big Data with OracleSai Janakiram Penumuru
 
Data Mining Intro
Data Mining IntroData Mining Intro
Data Mining Intro
ShubhamSamrat5
 
01Intro.ppt
01Intro.ppt01Intro.ppt
01Intro.ppt
AidaMustapha6
 
01Introduction to data mining chapter 1.ppt
01Introduction to data mining chapter 1.ppt01Introduction to data mining chapter 1.ppt
01Introduction to data mining chapter 1.ppt
admsoyadm4
 
data mining
data miningdata mining
data mining
AMITKUMAR202236
 
01Intro.ppt
01Intro.ppt01Intro.ppt
01Intro.ppt
VaibhavGupta447155
 

Similar to SMART Seminar Series: "From Big Data to Smart data" (20)

A New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdfA New Paradigm on Analytic-Driven Information and Automation V2.pdf
A New Paradigm on Analytic-Driven Information and Automation V2.pdf
 
00-01 DSnDA.pdf
00-01 DSnDA.pdf00-01 DSnDA.pdf
00-01 DSnDA.pdf
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
 
Opportunities and methodological challenges of Big Data for official statist...
Opportunities and methodological challenges of  Big Data for official statist...Opportunities and methodological challenges of  Big Data for official statist...
Opportunities and methodological challenges of Big Data for official statist...
 
FR.pptx
FR.pptxFR.pptx
FR.pptx
 
Overview of the Research in Wimmics 2018
Overview of the Research in Wimmics 2018Overview of the Research in Wimmics 2018
Overview of the Research in Wimmics 2018
 
Big data mining
Big data miningBig data mining
Big data mining
 
Classifying malicious websites using an ensemble weighted features
Classifying malicious websites using an ensemble weighted featuresClassifying malicious websites using an ensemble weighted features
Classifying malicious websites using an ensemble weighted features
 
General introduction to IoTCrawler
General introduction to IoTCrawlerGeneral introduction to IoTCrawler
General introduction to IoTCrawler
 
Data Science, Personalisation & Product management
Data Science, Personalisation & Product managementData Science, Personalisation & Product management
Data Science, Personalisation & Product management
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science Expertise
 
Computational intelligence for big data analytics bda 2013
Computational intelligence for big data analytics   bda 2013Computational intelligence for big data analytics   bda 2013
Computational intelligence for big data analytics bda 2013
 
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
Building a healthy data ecosystem around Kafka and Hadoop: Lessons learned at...
 
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
Strata 2017 (San Jose): Building a healthy data ecosystem around Kafka and Ha...
 
Harness the Power of Big Data with Oracle
Harness the Power of Big Data with OracleHarness the Power of Big Data with Oracle
Harness the Power of Big Data with Oracle
 
Data Mining Intro
Data Mining IntroData Mining Intro
Data Mining Intro
 
01Intro.ppt
01Intro.ppt01Intro.ppt
01Intro.ppt
 
01Introduction to data mining chapter 1.ppt
01Introduction to data mining chapter 1.ppt01Introduction to data mining chapter 1.ppt
01Introduction to data mining chapter 1.ppt
 
data mining
data miningdata mining
data mining
 
01Intro.ppt
01Intro.ppt01Intro.ppt
01Intro.ppt
 

More from SMART Infrastructure Facility

SMART Seminar Series: "Cognitive Illusions in Virtual Reality: What do I mean...
SMART Seminar Series: "Cognitive Illusions in Virtual Reality: What do I mean...SMART Seminar Series: "Cognitive Illusions in Virtual Reality: What do I mean...
SMART Seminar Series: "Cognitive Illusions in Virtual Reality: What do I mean...
SMART Infrastructure Facility
 
SMART Seminar Series: "Trusted Autonomous Systems as System of Systems". Pres...
SMART Seminar Series: "Trusted Autonomous Systems as System of Systems". Pres...SMART Seminar Series: "Trusted Autonomous Systems as System of Systems". Pres...
SMART Seminar Series: "Trusted Autonomous Systems as System of Systems". Pres...
SMART Infrastructure Facility
 
SMART Seminar Series: "User-centric digital collaboration to build resilient ...
SMART Seminar Series: "User-centric digital collaboration to build resilient ...SMART Seminar Series: "User-centric digital collaboration to build resilient ...
SMART Seminar Series: "User-centric digital collaboration to build resilient ...
SMART Infrastructure Facility
 
SMART Seminar Series: "The Evolution of the Metric System: From Precious Lump...
SMART Seminar Series: "The Evolution of the Metric System: From Precious Lump...SMART Seminar Series: "The Evolution of the Metric System: From Precious Lump...
SMART Seminar Series: "The Evolution of the Metric System: From Precious Lump...
SMART Infrastructure Facility
 
SMART Seminar Series: "Using AI and edge computing devices for traffic flow m...
SMART Seminar Series: "Using AI and edge computing devices for traffic flow m...SMART Seminar Series: "Using AI and edge computing devices for traffic flow m...
SMART Seminar Series: "Using AI and edge computing devices for traffic flow m...
SMART Infrastructure Facility
 
SMART Seminar Series: "Blockchain and its Applications". Presented by Prof Wi...
SMART Seminar Series: "Blockchain and its Applications". Presented by Prof Wi...SMART Seminar Series: "Blockchain and its Applications". Presented by Prof Wi...
SMART Seminar Series: "Blockchain and its Applications". Presented by Prof Wi...
SMART Infrastructure Facility
 
SMART Seminar Series: "From an IoT cloud based architecture to Edge for dynam...
SMART Seminar Series: "From an IoT cloud based architecture to Edge for dynam...SMART Seminar Series: "From an IoT cloud based architecture to Edge for dynam...
SMART Seminar Series: "From an IoT cloud based architecture to Edge for dynam...
SMART Infrastructure Facility
 
SMART Seminar Series: "Is bus bunching serious in Sydney? Preliminary finding...
SMART Seminar Series: "Is bus bunching serious in Sydney? Preliminary finding...SMART Seminar Series: "Is bus bunching serious in Sydney? Preliminary finding...
SMART Seminar Series: "Is bus bunching serious in Sydney? Preliminary finding...
SMART Infrastructure Facility
 
SMART Seminar Series: "Keep it SMART, keep it simple! – Challenging complexit...
SMART Seminar Series: "Keep it SMART, keep it simple! – Challenging complexit...SMART Seminar Series: "Keep it SMART, keep it simple! – Challenging complexit...
SMART Seminar Series: "Keep it SMART, keep it simple! – Challenging complexit...
SMART Infrastructure Facility
 
SMART Seminar Series: "Risk-based bridge assessment under changing load-deman...
SMART Seminar Series: "Risk-based bridge assessment under changing load-deman...SMART Seminar Series: "Risk-based bridge assessment under changing load-deman...
SMART Seminar Series: "Risk-based bridge assessment under changing load-deman...
SMART Infrastructure Facility
 
SMART Seminar Series: "Deep Learning: Fundamentals and Practice". Presented b...
SMART Seminar Series: "Deep Learning: Fundamentals and Practice". Presented b...SMART Seminar Series: "Deep Learning: Fundamentals and Practice". Presented b...
SMART Seminar Series: "Deep Learning: Fundamentals and Practice". Presented b...
SMART Infrastructure Facility
 
SMART Seminar Series: "Infrastructure Resilience: Planning for Future Extreme...
SMART Seminar Series: "Infrastructure Resilience: Planning for Future Extreme...SMART Seminar Series: "Infrastructure Resilience: Planning for Future Extreme...
SMART Seminar Series: "Infrastructure Resilience: Planning for Future Extreme...
SMART Infrastructure Facility
 
SMART Seminar Series: "Potential use of drones for infrastructure inspection ...
SMART Seminar Series: "Potential use of drones for infrastructure inspection ...SMART Seminar Series: "Potential use of drones for infrastructure inspection ...
SMART Seminar Series: "Potential use of drones for infrastructure inspection ...
SMART Infrastructure Facility
 
SMART Seminar Series: "A journey in the zoo of Turing patterns: the topology ...
SMART Seminar Series: "A journey in the zoo of Turing patterns: the topology ...SMART Seminar Series: "A journey in the zoo of Turing patterns: the topology ...
SMART Seminar Series: "A journey in the zoo of Turing patterns: the topology ...
SMART Infrastructure Facility
 
SMART Seminar Series: "Human behaviour modelling and simulation for crisis ma...
SMART Seminar Series: "Human behaviour modelling and simulation for crisis ma...SMART Seminar Series: "Human behaviour modelling and simulation for crisis ma...
SMART Seminar Series: "Human behaviour modelling and simulation for crisis ma...
SMART Infrastructure Facility
 
SMART Seminar Series: "Dealing with uncertainty: With the observer in the loo...
SMART Seminar Series: "Dealing with uncertainty: With the observer in the loo...SMART Seminar Series: "Dealing with uncertainty: With the observer in the loo...
SMART Seminar Series: "Dealing with uncertainty: With the observer in the loo...
SMART Infrastructure Facility
 
SMART Seminar Series: "Smart Cities: The Good, The Bad & The Ugly"
SMART Seminar Series: "Smart Cities: The Good, The Bad & The Ugly"SMART Seminar Series: "Smart Cities: The Good, The Bad & The Ugly"
SMART Seminar Series: "Smart Cities: The Good, The Bad & The Ugly"
SMART Infrastructure Facility
 
SMART Seminar Series: "How to improve the order of evolutionary models in age...
SMART Seminar Series: "How to improve the order of evolutionary models in age...SMART Seminar Series: "How to improve the order of evolutionary models in age...
SMART Seminar Series: "How to improve the order of evolutionary models in age...
SMART Infrastructure Facility
 
SMART Seminar Series: "OneM2M – Towards end-to-end interoperability of the IoT"
SMART Seminar Series: "OneM2M – Towards end-to-end interoperability of the IoT"SMART Seminar Series: "OneM2M – Towards end-to-end interoperability of the IoT"
SMART Seminar Series: "OneM2M – Towards end-to-end interoperability of the IoT"
SMART Infrastructure Facility
 
SMART Seminar Series: "Blue-Green vs. Grey-Black infrastructure – which is be...
SMART Seminar Series: "Blue-Green vs. Grey-Black infrastructure – which is be...SMART Seminar Series: "Blue-Green vs. Grey-Black infrastructure – which is be...
SMART Seminar Series: "Blue-Green vs. Grey-Black infrastructure – which is be...
SMART Infrastructure Facility
 

More from SMART Infrastructure Facility (20)

SMART Seminar Series: "Cognitive Illusions in Virtual Reality: What do I mean...
SMART Seminar Series: "Cognitive Illusions in Virtual Reality: What do I mean...SMART Seminar Series: "Cognitive Illusions in Virtual Reality: What do I mean...
SMART Seminar Series: "Cognitive Illusions in Virtual Reality: What do I mean...
 
SMART Seminar Series: "Trusted Autonomous Systems as System of Systems". Pres...
SMART Seminar Series: "Trusted Autonomous Systems as System of Systems". Pres...SMART Seminar Series: "Trusted Autonomous Systems as System of Systems". Pres...
SMART Seminar Series: "Trusted Autonomous Systems as System of Systems". Pres...
 
SMART Seminar Series: "User-centric digital collaboration to build resilient ...
SMART Seminar Series: "User-centric digital collaboration to build resilient ...SMART Seminar Series: "User-centric digital collaboration to build resilient ...
SMART Seminar Series: "User-centric digital collaboration to build resilient ...
 
SMART Seminar Series: "The Evolution of the Metric System: From Precious Lump...
SMART Seminar Series: "The Evolution of the Metric System: From Precious Lump...SMART Seminar Series: "The Evolution of the Metric System: From Precious Lump...
SMART Seminar Series: "The Evolution of the Metric System: From Precious Lump...
 
SMART Seminar Series: "Using AI and edge computing devices for traffic flow m...
SMART Seminar Series: "Using AI and edge computing devices for traffic flow m...SMART Seminar Series: "Using AI and edge computing devices for traffic flow m...
SMART Seminar Series: "Using AI and edge computing devices for traffic flow m...
 
SMART Seminar Series: "Blockchain and its Applications". Presented by Prof Wi...
SMART Seminar Series: "Blockchain and its Applications". Presented by Prof Wi...SMART Seminar Series: "Blockchain and its Applications". Presented by Prof Wi...
SMART Seminar Series: "Blockchain and its Applications". Presented by Prof Wi...
 
SMART Seminar Series: "From an IoT cloud based architecture to Edge for dynam...
SMART Seminar Series: "From an IoT cloud based architecture to Edge for dynam...SMART Seminar Series: "From an IoT cloud based architecture to Edge for dynam...
SMART Seminar Series: "From an IoT cloud based architecture to Edge for dynam...
 
SMART Seminar Series: "Is bus bunching serious in Sydney? Preliminary finding...
SMART Seminar Series: "Is bus bunching serious in Sydney? Preliminary finding...SMART Seminar Series: "Is bus bunching serious in Sydney? Preliminary finding...
SMART Seminar Series: "Is bus bunching serious in Sydney? Preliminary finding...
 
SMART Seminar Series: "Keep it SMART, keep it simple! – Challenging complexit...
SMART Seminar Series: "Keep it SMART, keep it simple! – Challenging complexit...SMART Seminar Series: "Keep it SMART, keep it simple! – Challenging complexit...
SMART Seminar Series: "Keep it SMART, keep it simple! – Challenging complexit...
 
SMART Seminar Series: "Risk-based bridge assessment under changing load-deman...
SMART Seminar Series: "Risk-based bridge assessment under changing load-deman...SMART Seminar Series: "Risk-based bridge assessment under changing load-deman...
SMART Seminar Series: "Risk-based bridge assessment under changing load-deman...
 
SMART Seminar Series: "Deep Learning: Fundamentals and Practice". Presented b...
SMART Seminar Series: "Deep Learning: Fundamentals and Practice". Presented b...SMART Seminar Series: "Deep Learning: Fundamentals and Practice". Presented b...
SMART Seminar Series: "Deep Learning: Fundamentals and Practice". Presented b...
 
SMART Seminar Series: "Infrastructure Resilience: Planning for Future Extreme...
SMART Seminar Series: "Infrastructure Resilience: Planning for Future Extreme...SMART Seminar Series: "Infrastructure Resilience: Planning for Future Extreme...
SMART Seminar Series: "Infrastructure Resilience: Planning for Future Extreme...
 
SMART Seminar Series: "Potential use of drones for infrastructure inspection ...
SMART Seminar Series: "Potential use of drones for infrastructure inspection ...SMART Seminar Series: "Potential use of drones for infrastructure inspection ...
SMART Seminar Series: "Potential use of drones for infrastructure inspection ...
 
SMART Seminar Series: "A journey in the zoo of Turing patterns: the topology ...
SMART Seminar Series: "A journey in the zoo of Turing patterns: the topology ...SMART Seminar Series: "A journey in the zoo of Turing patterns: the topology ...
SMART Seminar Series: "A journey in the zoo of Turing patterns: the topology ...
 
SMART Seminar Series: "Human behaviour modelling and simulation for crisis ma...
SMART Seminar Series: "Human behaviour modelling and simulation for crisis ma...SMART Seminar Series: "Human behaviour modelling and simulation for crisis ma...
SMART Seminar Series: "Human behaviour modelling and simulation for crisis ma...
 
SMART Seminar Series: "Dealing with uncertainty: With the observer in the loo...
SMART Seminar Series: "Dealing with uncertainty: With the observer in the loo...SMART Seminar Series: "Dealing with uncertainty: With the observer in the loo...
SMART Seminar Series: "Dealing with uncertainty: With the observer in the loo...
 
SMART Seminar Series: "Smart Cities: The Good, The Bad & The Ugly"
SMART Seminar Series: "Smart Cities: The Good, The Bad & The Ugly"SMART Seminar Series: "Smart Cities: The Good, The Bad & The Ugly"
SMART Seminar Series: "Smart Cities: The Good, The Bad & The Ugly"
 
SMART Seminar Series: "How to improve the order of evolutionary models in age...
SMART Seminar Series: "How to improve the order of evolutionary models in age...SMART Seminar Series: "How to improve the order of evolutionary models in age...
SMART Seminar Series: "How to improve the order of evolutionary models in age...
 
SMART Seminar Series: "OneM2M – Towards end-to-end interoperability of the IoT"
SMART Seminar Series: "OneM2M – Towards end-to-end interoperability of the IoT"SMART Seminar Series: "OneM2M – Towards end-to-end interoperability of the IoT"
SMART Seminar Series: "OneM2M – Towards end-to-end interoperability of the IoT"
 
SMART Seminar Series: "Blue-Green vs. Grey-Black infrastructure – which is be...
SMART Seminar Series: "Blue-Green vs. Grey-Black infrastructure – which is be...SMART Seminar Series: "Blue-Green vs. Grey-Black infrastructure – which is be...
SMART Seminar Series: "Blue-Green vs. Grey-Black infrastructure – which is be...
 

Recently uploaded

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
Oppotus
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
pchutichetpong
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
benishzehra469
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
vcaxypu
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
Tiktokethiodaily
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 

Recently uploaded (20)

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
1.Seydhcuxhxyxhccuuxuxyxyxmisolids 2019.pptx
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
SOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape ReportSOCRadar Germany 2024 Threat Landscape Report
SOCRadar Germany 2024 Threat Landscape Report
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 

SMART Seminar Series: "From Big Data to Smart data"

  • 1. From Big Data to Smart data Jie (Jack) Yang | April 2016
  • 2. —What is Big Data? —Challenge of Big Data processing —Smart Learning framework —Applications —Conclusions Outline
  • 3. —No single standard definition —5-V information assets that require innovative techniques, algorithms, and analytics that enable decision making, and process automation Big Data definition
  • 4. 1 – Scale (Volume) 12+ TBs of tweet data every day 25+ TBs of log data every day ?TBsof dataeveryday 2+ billion people on the Web by end 2011 30 billion RFID tags today (1.3B in 2005) 4.6 billion camera phones world wide 100s of millions of GPS enabled devices sold annually 76 million smart meters in 2009… 200M by 2014
  • 5. The ability to manage, analyse, summarise, visualise, and discover knowledge from the collected data in a timely and scalable manner 2 – Speed (Velocity) Social media and networks (millions of active users) Mobile devices (tracking objects all the time) Infrastructure sensors and/or instruments (measuring all kinds of data)
  • 6. Various formats, types and structures: — Text — Numerical — Multi-dim arrays — Images, audio, video, sequences — Time series — Graph (network) — Streaming data — etc 3 – Complexity (Varity)
  • 7. 4 – Uncertainty (Veracity)
  • 8. 5 – Benefit (Value) Value ($, time, performance)
  • 9. Beer & Diaper (Woolworths in Illawarra) “A number of convenience store clerks noticed that men often bought beer at the same time they bought diapers. The store mined its receipts and proved the clerks' observations correct. So, the store began stocking diapers next to the beer coolers, and sales skyrocketed” Asimple example
  • 10. Hardware —Choose machines —System failure Challenge of Big Data processing 16 Cores, 32G RAM, $AUD6000+
  • 11. Software —Different data sources —Really slow —Memory issue (out of memory, for 52 million records) Challenge of Big Data processing
  • 12. Smart Learning Framework Data harvesting data partners Data mining Data storage Data streaming Data visualisation
  • 13. Hardware —Money wise —Tolerance to hardware failure Smart Learning Framework 16 Cores, 32G RAM, $AUD6000+ 4 Cores, 8G RAM, $AUD 600+
  • 14. Main features — Collection across different platforms and formats • APIs • Web crawling — 1 master and 6 workers • distributing–working–waiting–reactivating process — Data volume (per day) • 20K+ records user activities • 25K+ records from social platforms • 200K+ tweets around AU and EU Data harvesting
  • 15. Main features — save data into different formats • Pure TXT / CSV • (NO)SQL — Query across all — Fast respond Data storage SELECT * FROM (SELECT * FROM /web/logs/CSV) t0 JOIN ( SELECT country, count(*) FROM mysql.web.users GROUP BY country) t1 JOIN (SELECT timestamp FROM s3.root.clicks.json WHERE user_id = 'jdoe‘) t2
  • 16. Main features — Preprocessing (filtering, cleansing, feature extraction) — Event simulation — Saving to DBs — Running ML jobs on the fly • Receiver throughput = 3kb /sec • Consumer throughput = 2kb /sec • Consumer latency = 0.23 sec Data streaming
  • 17. Main features (35 online training jobs per day) — Supervised (with a human assisting in classification) / unsupervised machine learning techniques, to assist with classification, clustering and prediction; — Geospatial analysis: K-pop cluster in geographical regions; — Network analysis to understand social connections between consumers and producers; — Other analysis including: • More sophisticated number crunching of comments, such as time series analysis to examine trends; • Natural language processing techniques to assist with sentiment analysis. Data mining
  • 18. Student behaviour analysis (OLPC, until Feb 2016): — 153+ schools — 20K+ active laptops — 4.2M+ activity records Application 1 0 1000 2000 3000 1.2M 2.6M 4.2M Most popular Apps (per school) App usage (per school) 0 1000 2000 3000 1.2M 2.6M 4.2M
  • 20. Car parking — Every 2 minutes — 604800 records (May to Oct 2015) — Temporal and spatial features Application 2
  • 21. Application 2 Average classification accuracy (%) as a function of the size of the selected samples. Average computational time (second)
  • 22. Social media analysis — 70K+ films — 228K+ users (2M + friendships) — 1M+ reviews — 13 features Application 3
  • 23. — User profile vs film preference — User profile vs topics Application 3
  • 24. — Network analysis — Opinion leadership Application 3 4K nodes + 7K edges 76 nodes + 253 edges
  • 25. Jie Yang; Jun Ma, A structure optimization algorithm of neural networks for large-scale data sets, Fuzz-IEEE,2014; Jie Yang; Jun Ma, A Sparsity-Based Training Algorithm for Least Squares SVM, IEEE SSCI, 2014; Jie Yang, Jun Ma, A big-data processing framework for uncertainties in Transportation data, Fuzz-IEEE, 2015 Jie Yang, Jun Ma, and Sarah K. Howard, A Structure Optimization Algorithm of Neural Networks for Pattern Learning from Educational Data, Springer Studies in Computational Intelligence ANN Modelling, 2015 Jie Yang; Jun Ma, A hybrid gene expression programming algorithm based on orthogonal design, International Journal of Computational Intelligence Systems, 2015 Jie Yang, Brian Yecies, Mining Chinese Social Media UGC A SmartLearning Framework For Analyzing Douban Movie Reviews, Journal of Big Data, 2016 Jie Yang; Jun Ma, A structure optimization framework for feed-forward neural networks using sparse representation, Knowledge-Based Systems, 2016; Jie Yang; Jun Ma, Sarah K. Howard, Exploring Technology Integration in Education using Fuzzy Representation and Feature Selection, Fuzz-IEEE, 2016 Brian Yecies, Jie Yang, Matthew Berryman, Kai Soh, Marketing Bait: Using SMART Data to Identify E-guanxi Among China’s ‘Internet Aborigines, Film Marketing in a Global Era, 2015 Brian Yecies, Jie Yang, Matthew Berryman, Aegyung Shim, and Kai Soh, Korean Female Writer-Directors and SMART Analysis of Douban commentary Among China’s Digital Natives, Women Screenwriters: An International Guide, 2015 Brian Yecies, Jie Yang, Matthew Berryman, Aegyung Shim, and Kai Soh, Korean Female Writer–Directors and SMART Analysis of Douban Commentary Among China’s Digital Natives, Participations: International Journal of Audience Research, 2016 Sarah K. Howard, Jun Ma, Jie Yang, Kate Thompson, The use of data mining to explore factors of technology integration in learning and teaching, EARLI 2015 Sarah K. Howard, Ellie Rennie, Jun Ma, Jie Yang, Big Data, Big Theory: Moving Beyond New Empiricism to Generate Powerful Explanations, The New Data “Revolution” in Sociology, 2016 Jun Ma, Jie Yang, Rohan W. Denagamage and Murad Safadi, A Conceptual Model for Clustering Local Government Areas using Complex Fuzzy Sets, Fuzz-IEEE, 2016 Publications
  • 26. — OLPC (ARC-Linkage) — NSW-DER — CAAR — China-South Korean Foundation — Healthcare (Pubmed, Seer) — Tourism business project (UTS) — MTR Projects and grants
  • 27. — Big Data processing: • Data collection; streaming data; data storage; and Machine learning • Open source libraries — Other domains: • Public transportation • Business Intelligence • Health care Conclusions