Jongwook Woo
HiPIC
CSULA
CheckMate: Geolocation Analyzer for Safe Residence
College of Business & Economics
Faculty Research Colloquium
Priyanka Kale, Priyal Mistry, Hitesh Jagtap, Jongwook Woo
California State University Los Angeles
December 1st, 2016
High-Performance Information Computing Center (HiPIC)
California State University Los Angeles
High Performance Information Computing Center
Jongwook Woo
CSULA
Table of Contents
Introduction to Big Data
Implementation
Visualization
Conclusion
References
Data Issues
Large-scale data
Terabyte (10^12 bytes), Petabyte (10^15 bytes)
– Because of the web
– Sensor data (IoT), bioinformatics, social computing, streaming data, smartphones, online games…
Cannot be handled with the legacy approach
Too big
Non-/semi-structured data
Too expensive
Need new systems
Inexpensive
Big Data
Volume
Complexity
Variety
Variability
Two Cores in Big Data
How to store Big Data
How to compute Big Data
Google
How to store Big Data
– GFS (Google File System)
– Distributed systems on inexpensive commodity computers
How to compute Big Data
– MapReduce
– Parallel computing with inexpensive computers
Data-Intensive Supercomputers
Published papers in 2003 (GFS) and 2004 (MapReduce)
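The MapReduce idea named above can be illustrated with a minimal single-machine sketch in Python. It is not how Hadoop is implemented and it does not run in parallel; it only shows the three logical steps (map, shuffle, reduce) on a handful of records. The field name `crime_type` and the sample records are assumptions made for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(record):
    """Map step: emit one (key, 1) pair per input record."""
    # "crime_type" is a hypothetical field name used only for illustration.
    return [(record["crime_type"], 1)]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: collapse the values for one key into a total."""
    return key, sum(values)


def map_reduce(records):
    """Run map, shuffle, and reduce in sequence on a single machine."""
    mapped = chain.from_iterable(map_phase(r) for r in records)
    grouped = shuffle(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())


if __name__ == "__main__":
    # Made-up sample records, not taken from the real dataset.
    sample = [
        {"crime_type": "THEFT"},
        {"crime_type": "BATTERY"},
        {"crime_type": "THEFT"},
    ]
    print(map_reduce(sample))
```

In a real cluster the map and reduce calls would be spread across many inexpensive machines, which is the point of the approach; here they simply run one after another.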
Definition: Big Data
Inexpensive frameworks that can store large-scale data and process it faster in parallel [5, 6]
Hadoop
– Inexpensive supercomputer
– More accessible than traditional supercomputers
• You can store and process your applications
– In your university labs, small companies, research centers
NoSQL DB
– Cassandra, HBase, Couchbase, MongoDB
Motivation
Issues
A person looking for a place of residence may find it difficult to get real details about the place from a safety point of view.
A person may be unsure which location to select for residence.
Solutions
Analyze open data to find locations from a safety perspective.
Our Approach
System design with a flow chart
Big Data
– Hadoop Hive using MS Azure cloud computing
Flowchart
Download Public Dataset
Upload data into Big Data HDFS
Trigger Hadoop Hive Queries
Result Data Tables
Output Visualization
Specifications in Big Data cluster
Cloud Computing: Microsoft Azure with Hortonworks sandbox
1. OS: Linux
2. Number of nodes: 4
3. CPU: 8 cores
4. Memory Size: 14 GB
Implementation: Hadoop Hive
Hortonworks HDP
Hadoop Big Data Platform
Ambari: a web GUI for Hadoop
Explore HDFS file systems
Process data with Hive queries, executed as Tez or MapReduce jobs
Analysis Result
Detailed analysis makes it easy to see which parts of Chicago are safe places of residence and which areas are unsafe.
This analysis can play a vital role in any house-finding website if added as a module; it can help people learn more about the crime history of a place.
The subsequent slides show some of the important outcomes of the analysis.
Queries and Visualization
Rank of Crime type as per occurrences
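The slides do not show the query behind this ranking, so the following is only a sketch of the same group, count, and sort idea in plain Python. The field name `crime_type` and the sample rows are assumptions; in the project itself this step was run as a Hive query over the dataset.

```python
from collections import Counter


def rank_crime_types(rows, type_field="crime_type"):
    """Return (rank, crime type, occurrences), most frequent first."""
    # Count how many rows fall under each crime type.
    counts = Counter(row[type_field] for row in rows)
    # Sort by descending count; break ties by name so the order is stable.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(rank, name, n) for rank, (name, n) in enumerate(ordered, start=1)]


if __name__ == "__main__":
    # Made-up sample rows, not taken from the real dataset.
    rows = [
        {"crime_type": "THEFT"},
        {"crime_type": "BATTERY"},
        {"crime_type": "THEFT"},
        {"crime_type": "ASSAULT"},
        {"crime_type": "BATTERY"},
        {"crime_type": "THEFT"},
    ]
    for rank, name, n in rank_crime_types(rows):
        print(rank, name, n)
```

This is roughly what a grouped count with a descending sort computes in a SQL-style engine.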
Queries and Visualization
Table
Queries and Visualization
Final Outcome of Analysis
Using a similar approach, we find out which areas of Chicago are safer than others.
Geospatial Analysis
The outcome file is plotted against the map of Chicago.
Visualized using green and red marker points.
– Green indicates safe areas
– Red indicates unsafe areas
The top 50 results are highlighted.
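The slides do not say how an area is judged safe or unsafe, so the sketch below makes an assumption: areas with the fewest recorded incidents are marked green and those with the most are marked red, up to `top_n` of each. The function name, the `area` field, and the sample data are hypothetical.

```python
from collections import Counter


def classify_areas(incidents, top_n=50, area_field="area"):
    """Map area -> 'green' (fewest incidents) or 'red' (most incidents)."""
    # Count incidents per area; areas with no incidents never appear here.
    counts = Counter(i[area_field] for i in incidents)
    # Order areas from fewest to most incidents, ties broken by name.
    ordered = sorted(counts.items(), key=lambda kv: (kv[1], kv[0]))
    # Never let the green and red groups overlap on small inputs.
    n = min(top_n, len(ordered) // 2)
    safe = ordered[:n]
    unsafe = ordered[-n:] if n else []
    markers = {area: "green" for area, _ in safe}
    markers.update({area: "red" for area, _ in unsafe})
    return markers


if __name__ == "__main__":
    # Made-up incidents: area A has 1, B has 2, C has 3, D has 5.
    sample = [{"area": a} for a in "ABBCCCDDDDD"]
    print(classify_areas(sample, top_n=1))
```

The resulting dictionary could then drive the marker colors when the areas are plotted on a map.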
Final Outcome of Analysis
MAP
Conclusion
An exhaustive analysis of geolocation data for Chicago has been done.
A user searching for a place of residence can easily select a better neighborhood depending on its crime history.
Future work
Further analysis of individual areas can be done based on other factors affecting residence
Integrate this analysis with rental or leasing companies
Analyze more data at different locations
References
1. https://catalog.data.gov
2. https://cwiki.apache.org/confluence/display/Hive/Tutorial
3. https://hortonworks.com/tutorials
4. GitHub link: https://github.com/priya708/Project-520
5. Jongwook Woo and Yuhang Xu, "Market Basket Analysis Algorithm with Map/Reduce of Cloud Computing", The 2011 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA 2011), Las Vegas, July 18-21, 2011
6. Jongwook Woo, "Market Basket Analysis Algorithms with MapReduce", DMKD-00150, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, Oct 28, 2013, Volume 3, Issue 6, pp. 445-452
7. Jongwook Woo, "Big Data Trend and Open Data", UKC 2016, Dallas, TX, Aug 12, 2016
THANK YOU
Any Questions?

Geolocation Analyzer Finds Safe Chicago Neighborhoods

  • 1. Jongwook Woo, HiPIC, CSULA. CheckMate: Geolocation Analyzer for Safe Residence. College of Business & Economics Faculty Research Colloquium. Priyanka Kale, Priyal Mistry, Hitesh Jagtap, Jongwook Woo. California State University Los Angeles. December 1st, 2016. High-Performance Information Computing Center (HiPIC), California State University Los Angeles
  • 2. High Performance Information Computing Center Jongwook Woo CSULA Table of Contents Introduction to Big Data Implementation Visualization Conclusion References
  • 3. Data Issues. Large-scale data: terabytes (10^12 bytes), petabytes (10^15 bytes), driven by the web, sensor data (IoT), bioinformatics, social computing, streaming data, smartphones, online games… The legacy approach cannot handle it: too big, non-/semi-structured data, too expensive. New, inexpensive systems are needed.
  • 4. Big Data: Volume, Complexity, Variety, Variability
  • 5. Two Cores in Big Data: how to store Big Data and how to compute Big Data. Google: storage with GFS (distributed systems on inexpensive commodity computers); computation with MapReduce (parallel computing with inexpensive computers). Data-intensive supercomputers. Papers published in 2003 and 2004.
  • 6. Definition: Big Data. Inexpensive frameworks that can store large-scale data and process it faster in parallel [5, 6]. Hadoop: an inexpensive supercomputer, more publicly available than traditional supercomputers; you can store and process your applications in university labs, small companies, and research centers. NoSQL DB: Cassandra, HBase, Couchbase, MongoDB
  • 7. Table of Contents: Introduction to Big Data, Implementation, Visualization, Conclusion, References
  • 8. Motivation. Issues: a person looking for a place of residence may find it difficult to get reliable details about the place from a safety point of view, and may be unsure which location to select. Solution: analyze open data to find locations from a safety perspective. Our approach: system design with a flowchart; Big Data with Hadoop Hive on MS Azure cloud computing
  • 9. Flowchart: Download public dataset → Upload data into Big Data HDFS → Trigger Hadoop Hive queries → Result data tables → Output visualization
  • 10. Specifications of the Big Data cluster. Cloud computing: Microsoft Azure with Hortonworks Sandbox. 1. OS: Linux 2. Number of nodes: 4 3. CPU: 8 cores 4. Memory size: 14 GB
  • 11. Implementation: Hadoop Hive. Hortonworks HDP: Hadoop Big Data platform. Ambari: a web GUI for Hadoop. Explore HDFS file systems. Process data with Hive and Tez queries as MapReduce jobs
  • 12. Table of Contents: Introduction to Big Data, Implementation, Visualization, Conclusion, References
  • 13. Analysis Result. Detailed analysis made it easy to see which parts of Chicago are safe places of residence and which areas are unsafe. Added as a module to a house-finding website, this analysis could help people learn more about the crime history of a place. The subsequent slides show some of the important outcomes of the analysis.
  • 14. Queries and Visualization: rank of crime types by number of occurrences
  • 15. Queries and Visualization: Table
  • 16. Queries and Visualization
  • 17. Final Outcome of Analysis. Using a similar approach, we find which areas of Chicago are safer than others. The geospatial analysis output file is plotted against a map of Chicago and visualized using green and red markers: green indicates safe areas, red indicates unsafe areas. The top 50 results are highlighted.
  • 18. Final Outcome of Analysis: Map
  • 19. Table of Contents: Introduction to Big Data, Implementation, Visualization, Conclusion, References
  • 20. Conclusion. An exhaustive analysis of geolocation data for Chicago was performed, so a user searching for a place of residence can easily select a better neighborhood based on its crime history. Future work: further analysis of individual areas based on other factors affecting residence; integrating this analysis with rental or lease companies; analyzing more data at different locations
  • 21. Table of Contents: Introduction to Big Data, Implementation, Visualization, Conclusion, References
  • 22. References: 1. https://catalog.data.gov 2. https://cwiki.apache.org/confluence/display/Hive/Tutorial 3. https://hortonworks.com/tutorials 4. GitHub Link: https://github.com/priya708/Project-520 5. Jongwook Woo and Yuhang Xu, “Market Basket Analysis Algorithm with Map/Reduce of Cloud Computing”, The 2011 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA 2011), Las Vegas, July 18-21, 2011 6. Jongwook Woo, “Market Basket Analysis Algorithms with MapReduce”, DMKD-00150, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, Oct 28, 2013, Volume 3, Issue 6, pp. 445-452 7. Jongwook Woo, “Big Data Trend and Open Data”, UKC 2016, Dallas, TX, Aug 12, 2016
  • 23. THANK YOU. Any questions?
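
The slides describe two aggregations without showing the queries: ranking crime types by number of occurrences (slide 14) and marking the safest and least safe areas in green and red (slide 17). The following is only a minimal Python sketch of that logic, not the authors' Hive implementation. The sample rows, the area names, and the function names `rank_crime_types` and `label_areas` are all hypothetical, and "safe" is assumed here to mean "fewest recorded crimes", which the slides do not state explicitly.

```python
from collections import Counter

# Hypothetical sample rows: (crime_type, area). These values are invented
# for illustration and are not taken from the Chicago dataset.
records = [
    ("THEFT", "Area A"),
    ("THEFT", "Area A"),
    ("BATTERY", "Area A"),
    ("THEFT", "Area B"),
    ("BATTERY", "Area C"),
    ("NARCOTICS", "Area A"),
]


def rank_crime_types(rows):
    """Return (crime_type, count) pairs, most frequent first."""
    counts = Counter(crime for crime, _ in rows)
    # Sort by count descending, then by name so ties are deterministic.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


def label_areas(rows, top_n):
    """Mark the top_n areas with the fewest crimes green, the top_n with the most red.

    Assumption: safety is judged by raw crime count only. Areas with no
    recorded crime never appear in rows, so they are not labelled.
    """
    counts = Counter(area for _, area in rows)
    ordered = sorted(counts.items(), key=lambda kv: (kv[1], kv[0]))
    labels = {}
    for area, _ in ordered[:top_n]:
        labels[area] = "green"
    # Start index is computed explicitly so that top_n == 0 selects nothing.
    for area, _ in ordered[max(len(ordered) - top_n, 0):]:
        labels.setdefault(area, "red")  # an area already green stays green
    return labels


if __name__ == "__main__":
    for crime, count in rank_crime_types(records):
        print(crime, count)
    print(label_areas(records, top_n=1))
```

With the slides' setting, `top_n` would be 50; the labelled areas would then still need coordinates before they could be plotted on a map, a step this sketch leaves out.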