SlideShare a Scribd company logo
BIG + IOT FOR IDEAS
A Case Study Approach
Sharjeel Imtiaz | PhD Data Science – last stage | University of East London, UK
Big Data Definition
• Any piece of information can be considered as data.
• This data can be in various forms and in various sizes. It can vary from
small data to very big Data.
• Any data that can reside in RAM or memory is considered as small data.
Small data is less than 10s of GBs.
• Any data that can reside in Hard Disk is considered as medium data.
Medium data is in the range of 10s to 1000s of GBs.
• Any data which cannot reside in Hard disk or in a single system is
considered as Big Data. Its size is more than 1000s of GBs.
Trends?
How to process Big Data
To process and manage such huge volume
of data, different Big Data technologies
come into picture. It is a new data
challenge that requires leveraging
existing systems differently as some time
ago, data type and volume were not of the
type as it is today.
Big data reveals Shakespeare co-authored 17 of his
plays (A English Literature Domain- NLP)
• To process and manage such huge
volume of data, different Big Data
technologies come into picture.
• It is a new data challenge that
requires leveraging existing
systems differently as some time
ago, data type and volume were
not of the type as it is today.
• He said that it remains unclear
exactly how the authors worked
together. It could have been that
Marlowe wrote the texts and
Shakespeare later edited them.
We counted how often particular words and
phrases appeared in texts by Shakespeare and
other authors of his day. These patterns were
pretty unmistakable," researcher Gabriel Egan of
De Montfort University in Leicester told news
agency dpa.
Marlowe wrote the texts and Shakespeare later
edited them. (May be)
Sentiment Analysis with Topic Modeling –
(A COMPUTER SCIENCE DOMAIN)
• What are aspect of particular domain like Hotel have different aspect
Service, cleanliness , room, location, and 7 more ….
• A product in amazon is having many product features price, and other
• A stock market product is having many features and aspect basedOn
tweeter
• Disaster alerts data from tweeter is having aspects.
Sentiment analysis of
aspect in Review is
challenge and trend…..
IOT WITH BIG DATA INFRASTRUCTURE
.The ideal architecture connect devices to data
Analytics
. Security IOT based devices that monitor all
Type of network security attacks
. IOT device that monitor the network traffic and
Avoid congestion with auto monitoring and
Avoidance mechanism
• Disaster monitoring and alert mechanism
for big data analytics)
Engineering Good Topics (Good for Oman)
The solar panel produces the
Current due to the hitting of the
photons from the sun on the
silicon atoms.
Or Layman
Thus in simple words, SUNLIGHT
FALLS ON THE SOLAR PANEL
AND CURRENT is produced
Example for 100 watts pannel, Voc=20V or 21V and Isc=5 A thus
P=100 watts (approx) ---( How to manage it efficiently)
Why Cloud
• Cloud process big data volume and process in parallel nodes
framework and provide Machine Learning library.
• It is not possible to process data in Machine learning model like Neural
network on single machine within few second or a minute
More features more
processing
IOT TO BIG Data to Analytics
• Integrated IOT with big data is not the easy required domain knowledge
• Amazon Domain products data &
• category
• Tweets data
• Social media data
Tools SPSS is not for Big
data and not so good
WEKA, RMiner.
R + Python + RStudio +
Cloud based ML
What is your network architecture?
Architecture
IOT of network security or health or
tourism devices with Big data
BIG DATA WITH IOT IS TREND
OUR IOT RECOMMENDATION SYSTEM FOR
OMAN TOURISM (IOT + BIG DATA)
IOT Recommendation system for Oman Tourism
A Case study with process of Data Science and
Computations
Step 1&2&3: Data Collection + Preparation
+ storage
Data Collection
Data
Preparation
Storage &
Structure on
Hadoop (HDFS)
Data Exploration
Sentiment
Analysis
Model Analysis
with Hadoop
TRC funding Project – Trip advisor Web Site
scraping Tool
• TripAdvisor is the most well-known customer for reviewing
website for hotels and restaurants in Spain, although
Yelp.com is more popular on a global scale.
• TripAdvisor users can write reviews and post scores from 1
(“terrible”) to 5 (“excellent”) following a number of criteria.
• Webharvy
WebHarvy ---
• It scrap the reviews but required cleaning
and preparation of data together.
• Put on local disk and google cloud
• Step 4: Data Exploration Analysis
Data Collection
Data
Preparation
Storage &
Structure on
Hadoop (HDFS)
Data Exploration
Sentiment
Analysis
Model Analysis
with Hadoop
Explore the most Important terms
It is topic modeling:
• Which term is more frequently
coming in review
• Well, we finalize following
words/terms
• Room, good, downtown, stay,
service, clean, room, pool, and
more
• Remember we did stemming,
punctuation, white space, and
many ore preprocessing step
before that
Room, location, value,
service
• Step 5: sentiment analysis
Data Collection
Data
Preparation
Storage &
Structure on
Hadoop (HDFS)
Data Exploration
Sentiment
Analysis
Model Analysis
with Hadoop
IS THIS ENOGH ? Answer is NO
• Take a list of positive and negative words
Positive
Good
Great
Fantastic
Excellent
Friendly
Awesome
Enjoyed
Negative
Bad
Worse
Rubbish
Sucked
Awful
Terrible
Bogus
I had a fantastic time on
holiday at your resort. The
service was excellent and
friendly. My family all really
enjoyed themselves.
The pool was closed, which
kind of sucked though.
4 1- = 3
Overall sentiment:
Positive
Sentiment Analysis with Neural Network
indeed it is good choice rather dictionary based
appraoch!
Dashboard – Sentiment Analysis of Dubai vs UK
hotel
• It is sentiment analysis
• Which give score of positive or
negative
• We evaluate the score of aspects
• Location , service, value, room
• We conclude that UK luxury
customers are more focusing on
location and service on the other
hand Dubai people focus on value
and room. Which Is vital fact for
GULF to re-consider.
Analyze aspect terms accuracy 95.81%
• It is K-Mean analysis
• Created clusters or groups for
aspects Location , service, value
• We concluded that our top terms
are better candidate for sentiment
analysis otherwise go again and
Find out other terms
Hadoop processing ML Model (Big Data)
• Configure Hadoop 2.8.3 version
• After starting Hadoop services HDFS
• We analyze k-mean in MAP-REDUCE to
process efficiently.
Other ML Model and IOT Integration
• Plan to integrate with IOT.
• Next will explore lights , sound and lock,
music player services of room because
people in GULF prefer room ambiance as
we analyzed.
• IOT devices in summer with dashboard
reflect the usefulness and effectiveness of
big data in tourism with more than 50
recommendation in our final report.
•
Plan
Activities
Months ( Academic year 2017-2018)
From September to
December
From January to April
From May
to June
Sept
201
7
Oct
201
7
Nov
201
7
Dec
201
7
Jan
201
8
Feb
2018
March
2018
April
201
8
May
201
8
June
201
8
Literature review
Data Collection & Scraping  
Data Preparation    
Data Insight    
Data Exploration 
Data Model    
Software Building   
Reports    
• We left 2 months to explore more
• 50 plus Recommendation
• Visualizing reports
• Finally more data on cloud to
• Convince ministry of tourism about
effectiveness of our project in the domain
of IOT + BIG.
• Google cloud will show the future plan
• And large scale storage benefits for
Ministry Hotels data around the world.
Thank you! Question start what I will do.. Not what you
did.
Works Cited
• https://data.gov.uk/dataset/road-accidents-safety-data
• https://www.thebalance.com/what-is-crowdsourcing-marketing-and-how-is-it-
used-2295467
• http://www.shu.edu/technology/
• http://archive.ics.uci.edu/ml/datasets.html?sort=nameUp&view=list
• http://www.dw.com/en/big-data-reveals-shakespeare-co-authored-17-of-his-
plays/a-36145979

More Related Content

What's hot

Big Data Analytics - Introduction
Big Data Analytics - IntroductionBig Data Analytics - Introduction
Big Data Analytics - Introduction
Alex Meadows
 
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
Ashok Royal
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
Vivek Aanand Ganesan
 
Exploring Big Data Analytics Tools
Exploring Big Data Analytics ToolsExploring Big Data Analytics Tools
Exploring Big Data Analytics Tools
Multisoft Virtual Academy
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science
Venkata Reddy Konasani
 
Applications of R (DataWeek 2014)
Applications of R (DataWeek 2014)Applications of R (DataWeek 2014)
Applications of R (DataWeek 2014)
Revolution Analytics
 
Big Data - Analytics with R
Big Data - Analytics with RBig Data - Analytics with R
Big Data - Analytics with R
Techsparks
 
Software team linkedin
Software team linkedinSoftware team linkedin
Software team linkedin
Prysmian Group
 
American Century (Revolution Analytics Customer Day)
American Century (Revolution Analytics Customer Day)American Century (Revolution Analytics Customer Day)
American Century (Revolution Analytics Customer Day)
Revolution Analytics
 
Intro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWSIntro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWS
Sri Ambati
 
Strata Online_road_to_enterprise_data_2011
Strata Online_road_to_enterprise_data_2011Strata Online_road_to_enterprise_data_2011
Strata Online_road_to_enterprise_data_2011
Lynn Langit
 
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku   hadoop summit - semi-supervised learning with hadoop for understand...Dataiku   hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku
 
The Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceThe Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data Science
Revolution Analytics
 
Aginity Big Data Research Lab V3
Aginity Big Data Research Lab V3Aginity Big Data Research Lab V3
Aginity Big Data Research Lab V3mcacicio
 
YHORG Presentation 23 February 2016
YHORG Presentation 23 February 2016YHORG Presentation 23 February 2016
YHORG Presentation 23 February 2016
Richard Vidgen
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data Analytics
S P Sajjan
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learning
Giuseppe Manco
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science Tools
Domino Data Lab
 

What's hot (19)

Big Data Analytics - Introduction
Big Data Analytics - IntroductionBig Data Analytics - Introduction
Big Data Analytics - Introduction
 
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
Detailed presentation on big data hadoop +Hadoop Project Near Duplicate Detec...
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
Exploring Big Data Analytics Tools
Exploring Big Data Analytics ToolsExploring Big Data Analytics Tools
Exploring Big Data Analytics Tools
 
Big Data Analysis Starts with R
Big Data Analysis Starts with RBig Data Analysis Starts with R
Big Data Analysis Starts with R
 
Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science Machine Learning Deep Learning AI and Data Science
Machine Learning Deep Learning AI and Data Science
 
Applications of R (DataWeek 2014)
Applications of R (DataWeek 2014)Applications of R (DataWeek 2014)
Applications of R (DataWeek 2014)
 
Big Data - Analytics with R
Big Data - Analytics with RBig Data - Analytics with R
Big Data - Analytics with R
 
Software team linkedin
Software team linkedinSoftware team linkedin
Software team linkedin
 
American Century (Revolution Analytics Customer Day)
American Century (Revolution Analytics Customer Day)American Century (Revolution Analytics Customer Day)
American Century (Revolution Analytics Customer Day)
 
Intro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWSIntro to Machine Learning with H2O and AWS
Intro to Machine Learning with H2O and AWS
 
Strata Online_road_to_enterprise_data_2011
Strata Online_road_to_enterprise_data_2011Strata Online_road_to_enterprise_data_2011
Strata Online_road_to_enterprise_data_2011
 
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku   hadoop summit - semi-supervised learning with hadoop for understand...Dataiku   hadoop summit - semi-supervised learning with hadoop for understand...
Dataiku hadoop summit - semi-supervised learning with hadoop for understand...
 
The Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data ScienceThe Business Economics and Opportunity of Open Source Data Science
The Business Economics and Opportunity of Open Source Data Science
 
Aginity Big Data Research Lab V3
Aginity Big Data Research Lab V3Aginity Big Data Research Lab V3
Aginity Big Data Research Lab V3
 
YHORG Presentation 23 February 2016
YHORG Presentation 23 February 2016YHORG Presentation 23 February 2016
YHORG Presentation 23 February 2016
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data Analytics
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learning
 
Leveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science ToolsLeveraging Open Source Automated Data Science Tools
Leveraging Open Source Automated Data Science Tools
 

Similar to Big Data with IOT approach and trends with case study

Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its Trends
Dr.K.Sreenivas Rao
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
Bob Hardaway
 
SKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSISSKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSIS
Skillwise Consulting
 
Big Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and moreBig Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and more
Softweb Solutions
 
Hadoop HDFS.ppt
Hadoop HDFS.pptHadoop HDFS.ppt
Hadoop HDFS.ppt
6535ANURAGANURAG
 
Big Data
Big DataBig Data
Big Data
Priyanka Tuteja
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
Prakalp Agarwal
 
BigData.pptx
BigData.pptxBigData.pptx
BigData.pptx
vidhi171881
 
Hadoop Master Class : A concise overview
Hadoop Master Class : A concise overviewHadoop Master Class : A concise overview
Hadoop Master Class : A concise overview
Abhishek Roy
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
MaulikLakhani
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigManish Chopra
 
Big Data
Big DataBig Data
big-data-notes1.ppt
big-data-notes1.pptbig-data-notes1.ppt
big-data-notes1.ppt
SutanuGhosal1
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop Introduction
Jayant Mukherjee
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
Mithlesh Sadh
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapSrinath Perera
 
Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - Introduction
Tomy Rhymond
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
Nagarjuna D.N
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
SpringPeople
 
Présentation on radoop
Présentation on radoop   Présentation on radoop
Présentation on radoop
siliconsudipt
 

Similar to Big Data with IOT approach and trends with case study (20)

Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its Trends
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
 
SKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSISSKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSIS
 
Big Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and moreBig Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and more
 
Hadoop HDFS.ppt
Hadoop HDFS.pptHadoop HDFS.ppt
Hadoop HDFS.ppt
 
Big Data
Big DataBig Data
Big Data
 
Oh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG DataOh! Session on Introduction to BIG Data
Oh! Session on Introduction to BIG Data
 
BigData.pptx
BigData.pptxBigData.pptx
BigData.pptx
 
Hadoop Master Class : A concise overview
Hadoop Master Class : A concise overviewHadoop Master Class : A concise overview
Hadoop Master Class : A concise overview
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
 
Big Data
Big DataBig Data
Big Data
 
big-data-notes1.ppt
big-data-notes1.pptbig-data-notes1.ppt
big-data-notes1.ppt
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop Introduction
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - Introduction
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Présentation on radoop
Présentation on radoop   Présentation on radoop
Présentation on radoop
 

Recently uploaded

Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
kumarmathi863
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
ssuserbfdca9
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
sachin783648
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
Richard Gill
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
muralinath2
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 

Recently uploaded (20)

Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
Comparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebratesComparative structure of adrenal gland in vertebrates
Comparative structure of adrenal gland in vertebrates
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 

Big Data with IOT approach and trends with case study

  • 1. BIG + IOT FOR IDEAS A Case Study Approach Sharjeel Imtiaz | PhD Data Science – last stage | University of East London, UK
  • 2. Big Data Definition • Any piece of information can be considered as data. • This data can be in various forms and in various sizes. It can vary from small data to very big Data. • Any data that can reside in RAM or memory is considered as small data. Small data is less than 10s of GBs. • Any data that can reside in Hard Disk is considered as medium data. Medium data is in the range of 10s to 1000s of GBs. • Any data which cannot reside in Hard disk or in a single system is considered as Big Data. Its size is more than 1000s of GBs.
  • 4. How to process Big Data To process and manage such huge volume of data, different Big Data technologies come into picture. It is a new data challenge that requires leveraging existing systems differently as some time ago, data type and volume were not of the type as it is today.
  • 5. Big data reveals Shakespeare co-authored 17 of his plays (A English Literature Domain- NLP) • To process and manage such huge volume of data, different Big Data technologies come into picture. • It is a new data challenge that requires leveraging existing systems differently as some time ago, data type and volume were not of the type as it is today. • He said that it remains unclear exactly how the authors worked together. It could have been that Marlowe wrote the texts and Shakespeare later edited them. We counted how often particular words and phrases appeared in texts by Shakespeare and other authors of his day. These patterns were pretty unmistakable," researcher Gabriel Egan of De Montfort University in Leicester told news agency dpa. Marlowe wrote the texts and Shakespeare later edited them. (May be)
  • 6. Sentiment Analysis with Topic Modeling – (A COMPUTER SCIENCE DOMAIN) • What are aspect of particular domain like Hotel have different aspect Service, cleanliness , room, location, and 7 more …. • A product in amazon is having many product features price, and other • A stock market product is having many features and aspect basedOn tweeter • Disaster alerts data from tweeter is having aspects. Sentiment analysis of aspect in Review is challenge and trend…..
  • 7. IOT WITH BIG DATA INFRASTRUCTURE .The ideal architecture connect devices to data Analytics . Security IOT based devices that monitor all Type of network security attacks . IOT device that monitor the network traffic and Avoid congestion with auto monitoring and Avoidance mechanism • Disaster monitoring and alert mechanism for big data analytics)
  • 8. Engineering Good Topics (Good for Oman) The solar panel produces the Current due to the hitting of the photons from the sun on the silicon atoms. Or Layman Thus in simple words, SUNLIGHT FALLS ON THE SOLAR PANEL AND CURRENT is produced Example for 100 watts pannel, Voc=20V or 21V and Isc=5 A thus P=100 watts (approx) ---( How to manage it efficiently)
  • 9. Why Cloud • Cloud process big data volume and process in parallel nodes framework and provide Machine Learning library. • It is not possible to process data in Machine learning model like Neural network on single machine within few second or a minute More features more processing
  • 10. IOT TO BIG Data to Analytics • Integrated IOT with big data is not the easy required domain knowledge • Amazon Domain products data & • category • Tweets data • Social media data Tools SPSS is not for Big data and not so good WEKA, RMiner. R + Python + RStudio + Cloud based ML
  • 11. What is your network architecture?
  • 12. Architecture IOT of network security or health or tourism devices with Big data BIG DATA WITH IOT IS TREND
  • 13. OUR IOT RECOMMENDATION SYSTEM FOR OMAN TOURISM (IOT + BIG DATA)
  • 14. IOT Recommendation system for Oman Tourism A Case study with process of Data Science and Computations Step 1&2&3: Data Collection + Preparation + storage Data Collection Data Preparation Storage & Structure on Hadoop (HDFS) Data Exploration Sentiment Analysis Model Analysis with Hadoop
  • 15. TRC funding Project – Trip advisor Web Site scraping Tool • TripAdvisor is the most well-known customer for reviewing website for hotels and restaurants in Spain, although Yelp.com is more popular on a global scale. • TripAdvisor users can write reviews and post scores from 1 (“terrible”) to 5 (“excellent”) following a number of criteria. • Webharvy
  • 16. WebHarvy --- • It scrap the reviews but required cleaning and preparation of data together. • Put on local disk and google cloud
  • 17. • Step 4: Data Exploration Analysis Data Collection Data Preparation Storage & Structure on Hadoop (HDFS) Data Exploration Sentiment Analysis Model Analysis with Hadoop
  • 18. Explore the most Important terms It is topic modeling: • Which term is more frequently coming in review • Well, we finalize following words/terms • Room, good, downtown, stay, service, clean, room, pool, and more • Remember we did stemming, punctuation, white space, and many ore preprocessing step before that Room, location, value, service
  • 19. • Step 5: sentiment analysis Data Collection Data Preparation Storage & Structure on Hadoop (HDFS) Data Exploration Sentiment Analysis Model Analysis with Hadoop
  • 20. IS THIS ENOGH ? Answer is NO • Take a list of positive and negative words Positive Good Great Fantastic Excellent Friendly Awesome Enjoyed Negative Bad Worse Rubbish Sucked Awful Terrible Bogus I had a fantastic time on holiday at your resort. The service was excellent and friendly. My family all really enjoyed themselves. The pool was closed, which kind of sucked though. 4 1- = 3 Overall sentiment: Positive
  • 21. Sentiment Analysis with Neural Network indeed it is good choice rather dictionary based appraoch!
  • 22. Dashboard – Sentiment Analysis of Dubai vs UK hotel • It is sentiment analysis • Which give score of positive or negative • We evaluate the score of aspects • Location , service, value, room • We conclude that UK luxury customers are more focusing on location and service on the other hand Dubai people focus on value and room. Which Is vital fact for GULF to re-consider.
  • 23. Analyze aspect terms accuracy 95.81% • It is K-Mean analysis • Created clusters or groups for aspects Location , service, value • We concluded that our top terms are better candidate for sentiment analysis otherwise go again and Find out other terms
  • 24. Hadoop processing ML Model (Big Data) • Configure Hadoop 2.8.3 version • After starting Hadoop services HDFS • We analyze k-mean in MAP-REDUCE to process efficiently.
  • 25. Other ML Model and IOT Integration • Plan to integrate with IOT. • Next will explore lights , sound and lock, music player services of room because people in GULF prefer room ambiance as we analyzed. • IOT devices in summer with dashboard reflect the usefulness and effectiveness of big data in tourism with more than 50 recommendation in our final report. •
  • 26. Plan Activities Months ( Academic year 2017-2018) From September to December From January to April From May to June Sept 201 7 Oct 201 7 Nov 201 7 Dec 201 7 Jan 201 8 Feb 2018 March 2018 April 201 8 May 201 8 June 201 8 Literature review Data Collection & Scraping   Data Preparation     Data Insight     Data Exploration  Data Model     Software Building    Reports     • We left 2 months to explore more • 50 plus Recommendation • Visualizing reports • Finally more data on cloud to • Convince ministry of tourism about effectiveness of our project in the domain of IOT + BIG. • Google cloud will show the future plan • And large scale storage benefits for Ministry Hotels data around the world.
  • 27. Thank you! Question start what I will do.. Not what you did.
  • 28. Works Cited • https://data.gov.uk/dataset/road-accidents-safety-data • https://www.thebalance.com/what-is-crowdsourcing-marketing-and-how-is-it- used-2295467 • http://www.shu.edu/technology/ • http://archive.ics.uci.edu/ml/datasets.html?sort=nameUp&view=list • http://www.dw.com/en/big-data-reveals-shakespeare-co-authored-17-of-his- plays/a-36145979