SlideShare a Scribd company logo
A Framework for Real-Time
Analysis, Storage, and
Visualization of Big Data
Divy
Sensor Network
• Composed of
• Microcontroller units (MCU)
• Various sensors
• Temperature
• Humidity
• Photocells
• Web API
• Control MCUs
• Message Broker
• Route messages
• Manage communication channels
• Manage ques
The Ideal MCU
• Size
• Small form factor
• Power
• Low Power Consumption
• Battery Based
• Sleep Modes
• GPIO
• Digital
• Analog
• Communication
• Wifi 802.11 b/g/n
• Bluetooth
• Cellular 2G/3G
• Software
• Developed support
• Functionality Libraries
• Language Support
• C
• Javascript
Our Pick: ESP8266-12e
● Size
○ 24mm x16mm x 3mm
● Power
○ 3v3
○ Sleep mode
● Full IP Stack
○ WIFI 802.11 b/g/n
○ Can act as client or host
● Low Cost
○ $1.50
● Software
○ Several firmata available
○ Huge online community
https://acrobotic.com/media/wysiwyg/products/esp8266_esp12e_horizontal-01.png
Web API
• Web API
• Supervises MCUs
• MCUs initialization
• Connect to API Web Server
• Over Websockets
• Enter REPL
• Read Evaluate Print Loop (REPL)
• Inject Code directly into device
• Keep Status of Devices
• Modes
• Sleep
• On
• Off
Message Queue Telemetry Transport
(MQTT)
• MQTT: Lightweight Comm. Protocol
• Over TCP/IP
• Topic
• Data stream subject/identifier
• Publisher
• Transmits data on select topic
• Active connection
• Subscriber
• Listens for data on select topic
• Passive connection
• MQTT Broker
• Routes traffic based on subscriptions
Message Broker
• Intermediary
• Between sender and receiver
• Manages
• Delivery
• Routing
• Message Queue
• Protocol Conversion
• Message translation
Topic Subscriber Publisher
Temperature Server Temp. Sensor
Feedback Server Thermostat
mn … m4 m3 m2 m1 m0
m0 m1 m2 m3 m4 … mn
Temp. Sensor
Thermostat
Message Broker
Topic: Feedback
Topic: Temperature Server Side
Calculations
Message Queues
RabbitMQ
• Cross Language Support
• Java, Python, JavaScript, Ruby, & .NET
• Cross Protocol Support
• MQTT, AMQP, HTTP, & STOMP
• Asynchronous Messaging
• Many subs
• Many pubs
• QoS
• Data persists in queues until read by subscriber
• Low Latency
• Critical for real time apps.
Alternative Input Devices
• ROS Robots
• Drones
• Rovers
• Multi Input, Multi Output Systems
• Sensor + Actuator Pairs
Eclipse Kura- Gateway
Device for IOT
What is Kura?
● OSGI based framework for IOT Gateways
● Runs in a JVM
● Built-in MQTT cloud services
● Browser based GUI
Key Features-Network and Cloud Services
● Extensive MQTT configuration options
● Helps implement more complex interaction flows beyond publish/subscribe
● Remote management of M2M applications
Key Features-Configurable Services
● 3 ways to add service packages:
○ Eclipse Marketplace
○ URL
○ Uploaded files.
● Can be configured during runtime through the web GUI
● Can access device hardware: GPIO, GPS, etc
Key Features-Kura Wires
● Add new processes to Kura in a block based visual representation
● Runs automatically once changes are applied; no need to compile
● Implementations for MQTT, building databases, data filtering, and more
● Additional assets can be added to Kura Wires by adding packages
Distributed Stream
Processing Engine
Prototype Project with Temperature Sensor
Processing Tool:
● Handle streams of big data in real-time
● Low Latency
● Robust
● Simple / Flexible Implementation
What is Storm?
Storm = Distributed Stream Processing Engine (DSPE)
Topology :
What is Storm?
Parallel Processing with Apache - Zookeeper
Cluster Architecture :
Apache Storm - FEATURES
● Scalable infrastructure :
○ Zookeeper
● Simple API :
○ Free and open source
● Guarantee data processing :
○ Anchor tuples
○ At-least-once processing (Default)
○ Exactly-once processing (Apache -
Trident)
● Fault tolerant :
○ Daemons are stateless and fail-
fast
○ Zookeeper uses heartbeats
● Any language :
○ Apache - Thrift
● Data variability :
○ Kyro Serialization
FRAMEWORK
VISION
Data Storage
SQL & NoSQL Comparison
SQL
● Vertically Scalable
● Predetermined data structure
● ACID (Atomicity, Consistency, Isolation
and Durability)
● Uses the Universal SQL (Structured
Query Language) language which
provides a powerful tool to manipulate
and define data
● Allows for complex queries
● Examples:
○ MySQL, Sqlite, and Postgres
NoSQL
● Horizontally Scalable
● Unstructured data storage
● CAP theorem ( Consistency, Availability
and Partition tolerance )
● Typically NoSQL relies on a collection of
documents and the syntax varies per the
database
● No standard interface for complex
queries
● Examples:
○ MongoDB, Redis, and Hbase
MySQL
● Open source relational database management system
(RDBMS)
● Provides a password system that is very flexible and
secure
● MySQL supports large databases, up to 50 million rows
or more in a table
● MySQL is also used in the industry as well (Facebook,
Twitter, Flickr and YouTube)
Where does
MySQL fit?
● Data Storage portion of the
architecture
● Apache Storm writes data
to MySQL
● Spark is then able to query
the database and provide a
visualization tool
Database Structure
MySQL
Place_id(PK)
Name
Bldg_id(PK)
Name
Place_id(FK)
Room_id(PK)
Name
Floor_id(FK)
Device_id(PK)
Name
Device_Type(FK)
Service_Date
Floor_id(PK)
Name
Bldg_id(FK)
Sensor_id(PK)
Device_id(FK)
Name
Sensor_Type(FK)
Data_id(PK)
Sensor_id(FK)
Time
Data_Value
Room_id(FK)
Coordinates
Device_Type(PK)
Name
Sensor_Type(PK)
Name
Place Building Floor Room
Device Sensor Sensor Data
Device Type Sensor Type
Design Challenges (Kyle)
● Normalizing the database was challenging
because I never experienced high level data
modeling before. It took me a couple of office
hour sessions with my professor for me to
grasp the concept.
● Creating the connection between Apache
Storm was incredibly frustrating. It was a
gruelling debugging process that me and a
fellow engineer were stuck on for about a day
or two.
● The time spent to build up my knowledge of all
the technologies integrated into the framework.
Chris
Apache Spark
YARN
Stand Alone
Cluster
Mesos
HDFS(Hadoop, S3, or local)
Spark Streaming Spark MLib Spark GraphX Spark SQL
Apache Spark Architecture
Cloud Batch Processing
• Four components to Apache Spark:
• Spark SQL
• Introduces the concept of the “Resilient
Distributed Dataset”.
• Enables reading/writing and querying a
database using SQL.
• Spark MLib
• Machine Learning Algorithms
• Spark Streaming
• Real-time streaming using the process
of micro-batching.
• Spark GraphX
• Extends RDD’s for graphs and graph-
parallel computation.
Apache Zeppelin
Data Visualization
• Web-based book for data analysis
and visualization.
• Frames the question: “What is really
happening behind the hood?”.
• Versatile
• Interpreters for any language.
• Easy to share web-based notebook
• Able to set permissions per person.
• Integration with Apache Spark
Web Browser
Web
Server
Local
Interpreters
Zeppelin Daemon
Remote
Interpreters
Spark Master Node
Spark Worker
Node
Spark Worker
Node
Apache Spark Apache Zeppelin
✔ Python rather than Scala.
✔Plenty of Examples with a big community.
✔Troubleshooting was not that bad.
✔Basically Spark, but visualized.
✔Creating interpreters is useful.
✔Fun to explore and use.
🗙I used Python, but I’m new to Python
also.
🗙Understanding the program is one
thing, using it is another.
🗙New to Open-Source and Linux OS.
🗙 I learned Zeppelin, before Spark
🗙 Not as big of a community as Spark.
🗙 Troubleshooting was more difficult.
🗙More to Zeppelin than just Spark
interpreter..
Implementation
Parking on campus
• Time-to-park is an issue on UTSA campus
• UTSA claims to have enough parking to meet typical demand
• Too many people in the same parking lot / Random distribution of
newly opened parking spots
• Solution: Accurate and reliable system to track (and predict)
parking patterns, providing real-time information to drivers to
assist in parking decisions
High Level Design
Cloud
Processing
• Percentage availability
• Predicted time-to-park (real time)
• Potential time-to-park (past data)

More Related Content

What's hot

Bootstrap SaaS startup using Open Source Tools
Bootstrap SaaS startup using Open Source ToolsBootstrap SaaS startup using Open Source Tools
Bootstrap SaaS startup using Open Source Tools
botsplash.com
 
Membase Intro from Membase Meetup San Francisco
Membase Intro from Membase Meetup San FranciscoMembase Intro from Membase Meetup San Francisco
Membase Intro from Membase Meetup San Francisco
Membase
 
Automating using Ansible
Automating using AnsibleAutomating using Ansible
Automating using Ansible
Alok Patra
 
How does Riak compare to Cassandra? [Cassandra London User Group July 2011]
How does Riak compare to Cassandra? [Cassandra London User Group July 2011]How does Riak compare to Cassandra? [Cassandra London User Group July 2011]
How does Riak compare to Cassandra? [Cassandra London User Group July 2011]
Rainforest QA
 
Infinispan, transactional key value data grid and nosql database
Infinispan, transactional key value data grid and nosql databaseInfinispan, transactional key value data grid and nosql database
Infinispan, transactional key value data grid and nosql database
Alexander Petrov
 
Column and hadoop
Column and hadoopColumn and hadoop
Column and hadoop
Alex Jiang
 
Scaling with Riak at Showyou
Scaling with Riak at ShowyouScaling with Riak at Showyou
Scaling with Riak at Showyou
John Muellerleile
 
Stream processing at Hotstar
Stream processing at HotstarStream processing at Hotstar
Stream processing at Hotstar
KafkaZone
 
Infinispan - Galder Zamarreno - October 2010
Infinispan - Galder Zamarreno - October 2010Infinispan - Galder Zamarreno - October 2010
Infinispan - Galder Zamarreno - October 2010
JUG Lausanne
 
Scaling ELK Stack - DevOpsDays Singapore
Scaling ELK Stack - DevOpsDays SingaporeScaling ELK Stack - DevOpsDays Singapore
Scaling ELK Stack - DevOpsDays Singapore
Angad Singh
 
Membase East Coast Meetups
Membase East Coast MeetupsMembase East Coast Meetups
Membase East Coast Meetups
Membase
 
Membase Meetup - Silicon Valley
Membase Meetup - Silicon ValleyMembase Meetup - Silicon Valley
Membase Meetup - Silicon Valley
Membase
 
Solr cloud the 'search first' nosql database extended deep dive
Solr cloud the 'search first' nosql database   extended deep diveSolr cloud the 'search first' nosql database   extended deep dive
Solr cloud the 'search first' nosql database extended deep dive
lucenerevolution
 
Rolling With Riak
Rolling With RiakRolling With Riak
Rolling With Riak
John Lynch
 
Couchbase@live person meetup july 22nd
Couchbase@live person meetup   july 22ndCouchbase@live person meetup   july 22nd
Couchbase@live person meetup july 22nd
Ido Shilon
 
Kafka and Kafka Streams in the Global Schibsted Data Platform
Kafka and Kafka Streams in the Global Schibsted Data PlatformKafka and Kafka Streams in the Global Schibsted Data Platform
Kafka and Kafka Streams in the Global Schibsted Data Platform
Fredrik Vraalsen
 
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
confluent
 
MongoDB Administration 101
MongoDB Administration 101MongoDB Administration 101
MongoDB Administration 101
MongoDB
 
Biomatters and Amazon Web Services
Biomatters and Amazon Web Services Biomatters and Amazon Web Services
Biomatters and Amazon Web Services
Biomatters
 
RealTime Recommendations @Netflix - Spark
RealTime Recommendations @Netflix - SparkRealTime Recommendations @Netflix - Spark
RealTime Recommendations @Netflix - Spark
Nitin S
 

What's hot (20)

Bootstrap SaaS startup using Open Source Tools
Bootstrap SaaS startup using Open Source ToolsBootstrap SaaS startup using Open Source Tools
Bootstrap SaaS startup using Open Source Tools
 
Membase Intro from Membase Meetup San Francisco
Membase Intro from Membase Meetup San FranciscoMembase Intro from Membase Meetup San Francisco
Membase Intro from Membase Meetup San Francisco
 
Automating using Ansible
Automating using AnsibleAutomating using Ansible
Automating using Ansible
 
How does Riak compare to Cassandra? [Cassandra London User Group July 2011]
How does Riak compare to Cassandra? [Cassandra London User Group July 2011]How does Riak compare to Cassandra? [Cassandra London User Group July 2011]
How does Riak compare to Cassandra? [Cassandra London User Group July 2011]
 
Infinispan, transactional key value data grid and nosql database
Infinispan, transactional key value data grid and nosql databaseInfinispan, transactional key value data grid and nosql database
Infinispan, transactional key value data grid and nosql database
 
Column and hadoop
Column and hadoopColumn and hadoop
Column and hadoop
 
Scaling with Riak at Showyou
Scaling with Riak at ShowyouScaling with Riak at Showyou
Scaling with Riak at Showyou
 
Stream processing at Hotstar
Stream processing at HotstarStream processing at Hotstar
Stream processing at Hotstar
 
Infinispan - Galder Zamarreno - October 2010
Infinispan - Galder Zamarreno - October 2010Infinispan - Galder Zamarreno - October 2010
Infinispan - Galder Zamarreno - October 2010
 
Scaling ELK Stack - DevOpsDays Singapore
Scaling ELK Stack - DevOpsDays SingaporeScaling ELK Stack - DevOpsDays Singapore
Scaling ELK Stack - DevOpsDays Singapore
 
Membase East Coast Meetups
Membase East Coast MeetupsMembase East Coast Meetups
Membase East Coast Meetups
 
Membase Meetup - Silicon Valley
Membase Meetup - Silicon ValleyMembase Meetup - Silicon Valley
Membase Meetup - Silicon Valley
 
Solr cloud the 'search first' nosql database extended deep dive
Solr cloud the 'search first' nosql database   extended deep diveSolr cloud the 'search first' nosql database   extended deep dive
Solr cloud the 'search first' nosql database extended deep dive
 
Rolling With Riak
Rolling With RiakRolling With Riak
Rolling With Riak
 
Couchbase@live person meetup july 22nd
Couchbase@live person meetup   july 22ndCouchbase@live person meetup   july 22nd
Couchbase@live person meetup july 22nd
 
Kafka and Kafka Streams in the Global Schibsted Data Platform
Kafka and Kafka Streams in the Global Schibsted Data PlatformKafka and Kafka Streams in the Global Schibsted Data Platform
Kafka and Kafka Streams in the Global Schibsted Data Platform
 
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
 
MongoDB Administration 101
MongoDB Administration 101MongoDB Administration 101
MongoDB Administration 101
 
Biomatters and Amazon Web Services
Biomatters and Amazon Web Services Biomatters and Amazon Web Services
Biomatters and Amazon Web Services
 
RealTime Recommendations @Netflix - Spark
RealTime Recommendations @Netflix - SparkRealTime Recommendations @Netflix - Spark
RealTime Recommendations @Netflix - Spark
 

Similar to Summer 2017 undergraduate research powerpoint

Hpc lunch and learn
Hpc lunch and learnHpc lunch and learn
Hpc lunch and learn
John D Almon
 
Introduction to Google Cloud Platform
Introduction to Google Cloud PlatformIntroduction to Google Cloud Platform
Introduction to Google Cloud Platform
Sujai Prakasam
 
Public Cloud Workshop
Public Cloud WorkshopPublic Cloud Workshop
Public Cloud Workshop
Amer Ather
 
Netflix oss season 1 episode 3
Netflix oss season 1 episode 3 Netflix oss season 1 episode 3
Netflix oss season 1 episode 3
Ruslan Meshenberg
 
Big data conference europe real-time streaming in any and all clouds, hybri...
Big data conference europe   real-time streaming in any and all clouds, hybri...Big data conference europe   real-time streaming in any and all clouds, hybri...
Big data conference europe real-time streaming in any and all clouds, hybri...
Timothy Spann
 
NoSQL on the move
NoSQL on the moveNoSQL on the move
NoSQL on the move
Codemotion
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journey
Peter Clapham
 
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a MonthUSENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
Nicolas Brousse
 
BISSA: Empowering Web gadget Communication with Tuple Spaces
BISSA: Empowering Web gadget Communication with Tuple SpacesBISSA: Empowering Web gadget Communication with Tuple Spaces
BISSA: Empowering Web gadget Communication with Tuple Spaces
Srinath Perera
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
Steve Wong
 
Tokyo Azure Meetup #4 - Build 2016 Overview
Tokyo Azure Meetup #4 -  Build 2016 OverviewTokyo Azure Meetup #4 -  Build 2016 Overview
Tokyo Azure Meetup #4 - Build 2016 Overview
Tokyo Azure Meetup
 
BlackRay - The open Source Data Engine
BlackRay - The open Source Data EngineBlackRay - The open Source Data Engine
BlackRay - The open Source Data Engine
fschupp
 
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
Arun Gupta
 
NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1
Ruslan Meshenberg
 
Microservices - opportunities, dilemmas and problems
Microservices - opportunities, dilemmas and problemsMicroservices - opportunities, dilemmas and problems
Microservices - opportunities, dilemmas and problems
Łukasz Sowa
 
Getting started with postgresql
Getting started with postgresqlGetting started with postgresql
Getting started with postgresql
botsplash.com
 
Blackray @ SAPO CodeBits 2009
Blackray @ SAPO CodeBits 2009Blackray @ SAPO CodeBits 2009
Blackray @ SAPO CodeBits 2009
fschupp
 
Redpanda and ClickHouse
Redpanda and ClickHouseRedpanda and ClickHouse
Redpanda and ClickHouse
Altinity Ltd
 
Rami Sayar - Node microservices with Docker
Rami Sayar - Node microservices with DockerRami Sayar - Node microservices with Docker
Rami Sayar - Node microservices with Docker
Web à Québec
 
SpringPeople - Introduction to Cloud Computing
SpringPeople - Introduction to Cloud ComputingSpringPeople - Introduction to Cloud Computing
SpringPeople - Introduction to Cloud Computing
SpringPeople
 

Similar to Summer 2017 undergraduate research powerpoint (20)

Hpc lunch and learn
Hpc lunch and learnHpc lunch and learn
Hpc lunch and learn
 
Introduction to Google Cloud Platform
Introduction to Google Cloud PlatformIntroduction to Google Cloud Platform
Introduction to Google Cloud Platform
 
Public Cloud Workshop
Public Cloud WorkshopPublic Cloud Workshop
Public Cloud Workshop
 
Netflix oss season 1 episode 3
Netflix oss season 1 episode 3 Netflix oss season 1 episode 3
Netflix oss season 1 episode 3
 
Big data conference europe real-time streaming in any and all clouds, hybri...
Big data conference europe   real-time streaming in any and all clouds, hybri...Big data conference europe   real-time streaming in any and all clouds, hybri...
Big data conference europe real-time streaming in any and all clouds, hybri...
 
NoSQL on the move
NoSQL on the moveNoSQL on the move
NoSQL on the move
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journey
 
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a MonthUSENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
 
BISSA: Empowering Web gadget Communication with Tuple Spaces
BISSA: Empowering Web gadget Communication with Tuple SpacesBISSA: Empowering Web gadget Communication with Tuple Spaces
BISSA: Empowering Web gadget Communication with Tuple Spaces
 
Introduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OSIntroduction to Apache Mesos and DC/OS
Introduction to Apache Mesos and DC/OS
 
Tokyo Azure Meetup #4 - Build 2016 Overview
Tokyo Azure Meetup #4 -  Build 2016 OverviewTokyo Azure Meetup #4 -  Build 2016 Overview
Tokyo Azure Meetup #4 - Build 2016 Overview
 
BlackRay - The open Source Data Engine
BlackRay - The open Source Data EngineBlackRay - The open Source Data Engine
BlackRay - The open Source Data Engine
 
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
Lessons Learned from Real-World Deployments of Java EE 7 at JavaOne 2014
 
NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1NetflixOSS Meetup season 3 episode 1
NetflixOSS Meetup season 3 episode 1
 
Microservices - opportunities, dilemmas and problems
Microservices - opportunities, dilemmas and problemsMicroservices - opportunities, dilemmas and problems
Microservices - opportunities, dilemmas and problems
 
Getting started with postgresql
Getting started with postgresqlGetting started with postgresql
Getting started with postgresql
 
Blackray @ SAPO CodeBits 2009
Blackray @ SAPO CodeBits 2009Blackray @ SAPO CodeBits 2009
Blackray @ SAPO CodeBits 2009
 
Redpanda and ClickHouse
Redpanda and ClickHouseRedpanda and ClickHouse
Redpanda and ClickHouse
 
Rami Sayar - Node microservices with Docker
Rami Sayar - Node microservices with DockerRami Sayar - Node microservices with Docker
Rami Sayar - Node microservices with Docker
 
SpringPeople - Introduction to Cloud Computing
SpringPeople - Introduction to Cloud ComputingSpringPeople - Introduction to Cloud Computing
SpringPeople - Introduction to Cloud Computing
 

Recently uploaded

Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
gowrishankartb2005
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
kandramariana6
 
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
shadow0702a
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
Yasser Mahgoub
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
gerogepatton
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
SUTEJAS
 
Hematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood CountHematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood Count
shahdabdulbaset
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
Embedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoringEmbedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoring
IJECEIAES
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
Hitesh Mohapatra
 
Casting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdfCasting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdf
zubairahmad848137
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
IJECEIAES
 
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
amsjournal
 
gray level transformation unit 3(image processing))
gray level transformation unit 3(image processing))gray level transformation unit 3(image processing))
gray level transformation unit 3(image processing))
shivani5543
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
RamonNovais6
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
jpsjournal1
 

Recently uploaded (20)

Material for memory and display system h
Material for memory and display system hMaterial for memory and display system h
Material for memory and display system h
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
 
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...
 
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
2008 BUILDING CONSTRUCTION Illustrated - Ching Chapter 02 The Building.pdf
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
 
Understanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine LearningUnderstanding Inductive Bias in Machine Learning
Understanding Inductive Bias in Machine Learning
 
Hematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood CountHematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood Count
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
Embedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoringEmbedded machine learning-based road conditions and driving behavior monitoring
Embedded machine learning-based road conditions and driving behavior monitoring
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
Generative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of contentGenerative AI leverages algorithms to create various forms of content
Generative AI leverages algorithms to create various forms of content
 
Casting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdfCasting-Defect-inSlab continuous casting.pdf
Casting-Defect-inSlab continuous casting.pdf
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
 
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
 
gray level transformation unit 3(image processing))
gray level transformation unit 3(image processing))gray level transformation unit 3(image processing))
gray level transformation unit 3(image processing))
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
 
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECTCHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
CHINA’S GEO-ECONOMIC OUTREACH IN CENTRAL ASIAN COUNTRIES AND FUTURE PROSPECT
 

Summer 2017 undergraduate research powerpoint

  • 1. A Framework for Real-Time Analysis, Storage, and Visualization of Big Data
  • 3. Sensor Network • Composed of • Microcontroller units (MCU) • Various sensors • Temperature • Humidity • Photocells • Web API • Control MCUs • Message Broker • Route messages • Manage communication channels • Manage ques
  • 4. The Ideal MCU • Size • Small form factor • Power • Low Power Consumption • Battery Based • Sleep Modes • GPIO • Digital • Analog • Communication • Wifi 802.11 b/g/n • Bluetooth • Cellular 2G/3G • Software • Developed support • Functionality Libraries • Language Support • C • Javascript
  • 5. Our Pick: ESP8266-12e ● Size ○ 24mm x16mm x 3mm ● Power ○ 3v3 ○ Sleep mode ● Full IP Stack ○ WIFI 802.11 b/g/n ○ Can act as client or host ● Low Cost ○ $1.50 ● Software ○ Several firmata available ○ Huge online community https://acrobotic.com/media/wysiwyg/products/esp8266_esp12e_horizontal-01.png
  • 6. Web API • Web API • Supervises MCUs • MCUs initialization • Connect to API Web Server • Over Websockets • Enter REPL • Read Evaluate Print Loop (REPL) • Inject Code directly into device • Keep Status of Devices • Modes • Sleep • On • Off
  • 7. Message Queue Telemetry Transport (MQTT) • MQTT: Lightweight Comm. Protocol • Over TCP/IP • Topic • Data stream subject/identifier • Publisher • Transmits data on select topic • Active connection • Subscriber • Listens for data on select topic • Passive connection • MQTT Broker • Routes traffic based on subscriptions
  • 8. Message Broker • Intermediary • Between sender and receiver • Manages • Delivery • Routing • Message Queue • Protocol Conversion • Message translation Topic Subscriber Publisher Temperature Server Temp. Sensor Feedback Server Thermostat mn … m4 m3 m2 m1 m0 m0 m1 m2 m3 m4 … mn Temp. Sensor Thermostat Message Broker Topic: Feedback Topic: Temperature Server Side Calculations Message Queues
  • 9. RabbitMQ • Cross Language Support • Java, Python, JavaScript, Ruby, & .NET • Cross Protocol Support • MQTT, AMQP, HTTP, & STOMP • Asynchronous Messaging • Many subs • Many pubs • QoS • Data persists in queues until read by subscriber • Low Latency • Critical for real time apps.
  • 10. Alternative Input Devices • ROS Robots • Drones • Rovers • Multi Input, Multi Output Systems • Sensor + Actuator Pairs
  • 12. What is Kura? ● OSGI based framework for IOT Gateways ● Runs in a JVM ● Built-in MQTT cloud services ● Browser based GUI
  • 13. Key Features-Network and Cloud Services ● Extensive MQTT configuration options ● Helps implement more complex interaction flows beyond publish/subscribe ● Remote management of M2M applications
  • 14. Key Features-Configurable Services ● 3 ways to add service packages: ○ Eclipse Marketplace ○ URL ○ Uploaded files. ● Can be configured during runtime through the web GUI ● Can access device hardware: GPIO, GPS, etc
  • 15. Key Features-Kura Wires ● Add new processes to Kura in a block based visual representation ● Runs automatically once changes are applied; no need to compile ● Implementations for MQTT, building databases, data filtering, and more ● Additional assets can be added to Kura Wires by adding packages
  • 17. Prototype Project with Temperature Sensor Processing Tool: ● Handle streams of big data in real-time ● Low Latency ● Robust ● Simple / Flexible Implementation
  • 18. What is Storm? Storm = Distributed Stream Processing Engine (DSPE) Topology :
  • 19. What is Storm? Parallel Processing with Apache - Zookeeper Cluster Architecture :
  • 20. Apache Storm - FEATURES ● Scalable infrastructure : ○ Zookeeper ● Simple API : ○ Free and open source ● Guarantee data processing : ○ Anchor tuples ○ At-least-once processing (Default) ○ Exactly-once processing (Apache - Trident) ● Fault tolerant : ○ Daemons are stateless and fail- fast ○ Zookeeper uses heartbeats ● Any language : ○ Apache - Thrift ● Data variability : ○ Kyro Serialization
  • 23. SQL & NoSQL Comparison SQL ● Vertically Scalable ● Predetermined data structure ● ACID (Atomicity, Consistency, Isolation and Durability) ● Uses the Universal SQL (Structured Query Language) language which provides a powerful tool to manipulate and define data ● Allows for complex queries ● Examples: ○ MySQL, Sqlite, and Postgres NoSQL ● Horizontally Scalable ● Unstructured data storage ● CAP theorem ( Consistency, Availability and Partition tolerance ) ● Typically NoSQL relies on a collection of documents and the syntax varies per the database ● No standard interface for complex queries ● Examples: ○ MongoDB, Redis, and Hbase
  • 24. MySQL ● Open source relational database management system (RDBMS) ● Provides a password system that is very flexible and secure ● MySQL supports large databases, up to 50 million rows or more in a table ● MySQL is also used in the industry as well (Facebook, Twitter, Flickr and YouTube)
  • 25. Where does MySQL fit? ● Data Storage portion of the architecture ● Apache Storm writes data to MySQL ● Spark is then able to query the database and provide a visualization tool
  • 27. Design Challenges (Kyle) ● Normalizing the database was challenging because I never experienced high level data modeling before. It took me a couple of office hour sessions with my professor for me to grasp the concept. ● Creating the connection between Apache Storm was incredibly frustrating. It was a gruelling debugging process that me and a fellow engineer were stuck on for about a day or two. ● The time spent to build up my knowledge of all the technologies integrated into the framework.
  • 28. Chris
  • 30. YARN Stand Alone Cluster Mesos HDFS(Hadoop, S3, or local) Spark Streaming Spark MLib Spark GraphX Spark SQL Apache Spark Architecture
  • 31. Cloud Batch Processing • Four components to Apache Spark: • Spark SQL • Introduces the concept of the “Resilient Distributed Dataset”. • Enables reading/writing and querying a database using SQL. • Spark MLib • Machine Learning Algorithms • Spark Streaming • Real-time streaming using the process of micro-batching. • Spark GraphX • Extends RDD’s for graphs and graph- parallel computation.
  • 33. Data Visualization • Web-based book for data analysis and visualization. • Frames the question: “What is really happening behind the hood?”. • Versatile • Interpreters for any language. • Easy to share web-based notebook • Able to set permissions per person. • Integration with Apache Spark Web Browser Web Server Local Interpreters Zeppelin Daemon Remote Interpreters Spark Master Node Spark Worker Node Spark Worker Node
  • 34. Apache Spark Apache Zeppelin ✔ Python rather than Scala. ✔Plenty of Examples with a big community. ✔Troubleshooting was not that bad. ✔Basically Spark, but visualized. ✔Creating interpreters is useful. ✔Fun to explore and use. 🗙I used Python, but I’m new to Python also. 🗙Understanding the program is one thing, using it is another. 🗙New to Open-Source and Linux OS. 🗙 I learned Zeppelin, before Spark 🗙 Not as big of a community as Spark. 🗙 Troubleshooting was more difficult. 🗙More to Zeppelin than just Spark interpreter..
  • 36. Parking on campus • Time-to-park is an issue on UTSA campus • UTSA claims to have enough parking to meet typical demand • Too many people in the same parking lot / Random distribution of newly opened parking spots • Solution: Accurate and reliable system to track (and predict) parking patterns, providing real-time information to drivers to assist in parking decisions
  • 37. High Level Design Cloud Processing • Percentage availability • Predicted time-to-park (real time) • Potential time-to-park (past data)