SlideShare a Scribd company logo
Understanding City Traffic Dynamics Utilizing Sensor
and Textual Observations
Pramod Anantharam, Krishnaprasad Thirunarayan, Surendra Marupudi,
Amit Sheth, and Tanvi Banerjee
Ohio Center of Excellence in Knowledge-enabled Computing(Kno.e.sis),
Wright State University, USA
The Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16), February 12–17, Phoenix, Arizona, USA.
1
Multimodal Manifestation of Events: Traffic Scenario
2
• Why?
– Explain/Interpret average speed and link travel time
variations using events provided by city authorities and
traffic events shared on Twitter
– Past work: Predict congestion based on historical sensor
data
• What?
– Combine
• 511.org data about Bay Area Road Network Traffic
– E.g., Average speed and link travel time data stream (Sensor data)
– E.g., (Happened or planned) event reports (Textual data)
• Tweets that report events including ad hoc ones (Textual data)
Multimodal Data Integration: Traffic Scenario
3
• How?
– Step 1: Extract textual events from tweets stream
– Step 2: Build statistical models of normalcy, and
thereby anomaly, for sensor time series data
– Step 3: Correlate multimodal streams, using
spatio-temporal information, to explain
“anomalies” in sensor time series data with
textual events
Multimodal Data Integration: Traffic Scenario
4
• How?
– Step 1: Extract textual events from tweets stream
– Step 2: Build statistical models of normalcy, and
thereby anomaly, from numerical sensor data
streams
– Step 3: Correlate multimodal streams, using
spatio-temporal information, to explain
“anomalies” in sensor time series data with
textual events
Multimodal Data Integration: Traffic Scenario
5
Extracting Textual Events from Tweets: Annotation + Extraction
Pramod Anantharam, Payam Barnaghi, Krishnaprasad Thirunarayan, and Amit Sheth. 2015. Extracting City Traffic Events from Social Streams.
ACM Trans. Intell. Syst. Technol. 6, 4, Article 43 (July 2015), 27 pages. DOI=10.1145/2717317 http://doi.acm.org/10.1145/2717317
1Event Extraction Tool on Open Science Foundation: https://osf.io/b4q2t/wiki/home/
NER – Named Entity Recognition
OSM – Open Street Maps
39,208 traffic related incidents extracted from over 20 million tweets1
6
• How?
– Step 1: Extract textual events from tweets stream
– Step 2: Build statistical models of normalcy, and
thereby anomaly, from numerical sensor data
streams
– Step 3: Correlate multimodal streams, using
spatio-temporal information, to explain
“anomalies” in sensor time series data with
textual events
Multimodal Data Integration: Traffic Scenario
7
Image credit: http://traffic.511.org/index
Multiple events
Varying influence
Event interactions
Time of Day (approx. 1 observation/minute)Speed in km/h
Building Normalcy Models of Traffic Dynamics*: Challenges
*Traffic Dynamics here refers to speed and travel time variations observed in sensor data
8
• Temporal landmarks : peak hour vs. off-peak
traffic vs. weekend traffic
• Effect of location
• Scheduled events such as road construction,
baseball game, or music concert
• Unexpected events such as accidents, heavy
rains, fog
• Random variations (viz., stochasticity) such as
people visiting downtown by mere coincidence
Possible Causes of Nonlinearity in Traffic Dynamics
9
Modeling City Traffic Dynamics: A Closer Look
Image credits: http://bit.ly/1N1wu5g, http://bit.ly/1O8d9gn, http://bit.ly/1N8L5tf, http://bit.ly/1HLDYui
Speed of vehicles reported
by sensors on the road
Events
People
Influx
Vehicle
Influx
Vehicle
Speed
Hidden State
Observed Evidence
• Do we know all the events?
• Do we know all the event
interactions?
• Do we know the number of
event participants?
• How are the participants
traveling?
• Do we know the vehicle
volume on the road?
• Do we know the speed of
vehicles on the road?
• What is the nature of the
speed time series data?
10
Nature of the Problem to Linear Dynamical System (LDS)
1. There are both
hidden states and
observed evidence
2. Current observed
evidence depends on the
current hidden state
3. Current hidden
state depends on
the previous
hidden states
v1
s1
…
…
v2
s1
vT
sT
v1
s1
…
…
v2
s1
vT
sT
v1
s1
…
…
v2
s1
vT
sT
For simplicity of explanation, we
consider vehicle influx as a
hidden variable and the
observed speed as evidence
variable
Vehicle influx at a certain point in
time t would influence speed of
vehicles as the same time t
Vehicle influx at a certain point in
time t depends only on the
previous vehicle influx (first-
order Markov process)
11
Linear Dynamical Systems Model
v1
s1
…
…
v2
s1
vT
sT
Replacing discrete valued state and
observation nodes in the rain example
(previous slide) with continuous valued states
and observations, we get an LDS model the
has transition and observation models
The transition model is specified by At
and the observation model is specified by
Bt along with associated noise
The joint distribution over all the hidden
and observed variables is shown along
with the conditional distributions
Barber, David. Bayesian reasoning and machine learning. Cambridge University Press, 2012.
12
Hourly Traffic Dynamics Over all Mondays between Aug-14 to Jan-15
x-axis: observation number for each hour of day
y-axis: average speed of vehicles in km/h
13
14
Learning Context Specific LDS Models
7 × 24
LDS(1,1), LDS(1,2) ,….,
LDS(1,24)
LDS(7,1), LDS(7,2) ,….,
LDS(7,24)
.
.
.
di
hj
Mo
n. Tu
e. We
d. Th
u. Fr
i. Sa
t. Su
n.
Mo
n. Tue
. We
d. Thu.
Fri.
Sat.
Sun.
Speed/travel-time time
series data from a link
Time series data for
each hour of day (1-24)
for each day of week
(Monday – Sunday)
Mean time series
computed for each day
of week and hour of day
along with the medoid
168 LDS models for
each link; Total models
learned = 425,712 i.e.,
(2,534 links × 168
models per link)
Step 1: Index data for each
link for day of week and hour
of day utilizing the traffic
domain knowledge for piece-
wise linear approximation
Step 2: Find the “typical”
dynamics by computing the
mean and choosing the
medoid for each hour of day
and day of week
Step 3: Learn LDS parameters
for the medoid for each hour
of day (24 hours) and each day
of week (7 days) resulting in
24 × 7 = 168 models for each
link
Utilizing Context Specific LDS Models to Learn Normalcy
Log-
likelihood
score
15
16
Tagging Anomalies using Context Specific LDS Models
Compute Log Likelihood for
each hour of observed data
(di,hj) LDS(hj,di)
7 × 24
Lik(1,1), Lik(1,2) ,…., Lik(1,24)
Lik(7,1), Lik(7,2) ,…., Lik(7,24)
.
.
.
Train
?
Yes (Training phase)
Tag Anomalous hours using the
Log Likelihood Range
No
(di,hj) (min. likelihood)
Anomalies
L =
Partition based on (di,hj)
Speed and travel-time time
Observations from a link
Log likelihood min. and
max. values obtained from
five number summary
Partition based on (di,hj)
7 × 24
LDS(1,1), LDS(1,2) ,…., LDS(1,24)
LDS(7,1), LDS(7,2) ,…., LDS(7,24)
.
.
.
di
hj
(Input)
(Output)
• How?
– Step 1: Extract textual events from tweets stream
– Step 2: Build statistical models of normalcy, and
thereby anomaly, from numerical sensor data
streams
– Step 3: Correlate multimodal streams, using
spatio-temporal information, to explain
“anomalies” in sensor time series data with
textual events
Multimodal Data Integration: Traffic Scenario
17
• If an anomaly is detected on a link L and during time
period [tst, tet], then the anomaly is explained by an
event if the event occurred in the vicinity within 0.5km
radius and during [tst-1, tet+1].
• CAVEAT: An anomaly may not be explained because of
missing data.
Explaining Anomalies in Sensor Data using Textual Events
18
Anomalies
City Traffic Events from Twitter
⟨et, el, est, eet, ei⟩
Explained_by
Link sensor data
City tweets
(Anantharam 2014)
https://osf.io/b4q2t/wiki/home/
⟨ast, aet⟩
Δte = est ~ eet
Δta = ast - 1 hour ~ aet + 1 hour
Explains
(if there is an overlap
between Δte and Δta)
• Data collected from San Francisco Bay Area between May 2014 to
May 2015
– 511.org:
• 1,638 traffic incident reports
• 1.4 billion speed and travel time observations
– Twitter Data: 39,208 traffic related incidents extracted from over 20
million tweets1
• Naïve implementation for learning normalcy models for 2,534 links
resulted in 40 minutes per link (~ 2 months of processing time for
our data)
– 2.66 GHz, Intel Core 2 Duo with 8 GB main memory
• Scalable implementation by exploiting the nature of the problem
resulted in learning normalcy models within 24 hours
– The Apache Spark cluster used in our evaluation has 864 cores and
17TB main memory.
Real-World Dataset and Scalability Issues
1Extracting city traffic events from social streams. https://osf.io/b4q2t/wiki/home/
19
Evaluation Results
20
• Events extracted from social media can complement
sensor data
• Knowledge of the domain can be utilized for a
piecewise linear approximation of non-linear speed
dynamics
• Events reported by people most likely manifests in
sensor data relative to traffic events by formal
sources.
• Short-term events manifest as anomalies in sensor
data while the long-term events may not result in
anomaly manifestations
21
Conclusion
Acknowledgements
This material is based upon work supported by the National Science Foundation under
Grant No. EAR 1520870 titled “Hazards SEES: Social and Physical Sensing Enabled Decision
Support for Disaster Management and Response”. Any opinions, findings, and conclusions or
recommendations expressed in this material are those of the author(s) and do not
necessarily reflect the views of the National Science Foundation.
22
Thank you 
Thank you, and please visit us at http://knoesis.org
Link to the paper: http://www.knoesis.org/library/resource.php?id=2228
Hazards SEES: Social and Physical Sensing Enabled Decision Support for Disaster
Management and Response
23

More Related Content

What's hot

Smart IoT for Connected Manufacturing
Smart IoT for Connected ManufacturingSmart IoT for Connected Manufacturing
Smart IoT for Connected Manufacturing
Amit Sheth
 
Smart Data for you and me: Personalized and Actionable Physical Cyber Social ...
Smart Data for you and me: Personalized and Actionable Physical Cyber Social ...Smart Data for you and me: Personalized and Actionable Physical Cyber Social ...
Smart Data for you and me: Personalized and Actionable Physical Cyber Social ...
Amit Sheth
 
Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Soci...
Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Soci...Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Soci...
Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Soci...
Artificial Intelligence Institute at UofSC
 
Citizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsCitizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and Applications
Amit Sheth
 
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
Amit Sheth
 
Knowledge Will Propel Machine Understanding of Big Data
Knowledge Will Propel Machine Understanding of Big DataKnowledge Will Propel Machine Understanding of Big Data
Knowledge Will Propel Machine Understanding of Big Data
Amit Sheth
 
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
Amit Sheth
 
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Amit Sheth
 
Computing for Human Experience [v4]: Keynote @ OnTheMove Federated Conferences
Computing for Human Experience [v4]: Keynote @ OnTheMove Federated ConferencesComputing for Human Experience [v4]: Keynote @ OnTheMove Federated Conferences
Computing for Human Experience [v4]: Keynote @ OnTheMove Federated Conferences
Artificial Intelligence Institute at UofSC
 
Multipleregression covidmobility and Covid-19 policy recommendation
Multipleregression covidmobility and Covid-19 policy recommendationMultipleregression covidmobility and Covid-19 policy recommendation
Multipleregression covidmobility and Covid-19 policy recommendation
Kan Yuenyong
 
NG2S: A Study of Pro-Environmental Tipping Point via ABMs
NG2S: A Study of Pro-Environmental Tipping Point via ABMsNG2S: A Study of Pro-Environmental Tipping Point via ABMs
NG2S: A Study of Pro-Environmental Tipping Point via ABMs
Kan Yuenyong
 
Physical Cyber Social Computing
Physical Cyber Social ComputingPhysical Cyber Social Computing
Physical Cyber Social Computing
Amit Sheth
 
Network Science: Theory, Modeling and Applications
Network Science: Theory, Modeling and ApplicationsNetwork Science: Theory, Modeling and Applications
Network Science: Theory, Modeling and Applications
Biocomplexity Institute of Virginia Tech
 
Future of hpc
Future of hpcFuture of hpc
Future of hpc
Putchong Uthayopas
 
The human face of AI: how collective and augmented intelligence can help sol...
The human face of AI:  how collective and augmented intelligence can help sol...The human face of AI:  how collective and augmented intelligence can help sol...
The human face of AI: how collective and augmented intelligence can help sol...
Elena Simperl
 
Physical Cyber Social Computing: An early 21st century approach to Computing ...
Physical Cyber Social Computing: An early 21st century approach to Computing ...Physical Cyber Social Computing: An early 21st century approach to Computing ...
Physical Cyber Social Computing: An early 21st century approach to Computing ...
Amit Sheth
 
CINET: A Cyber-Infrastructure for Network Science Overview
CINET: A Cyber-Infrastructure for Network Science OverviewCINET: A Cyber-Infrastructure for Network Science Overview
CINET: A Cyber-Infrastructure for Network Science Overview
Biocomplexity Institute of Virginia Tech
 
CINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network ScienceCINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network Science
Biocomplexity Institute of Virginia Tech
 
Semantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge GraphsSemantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge Graphs
Artificial Intelligence Institute at UofSC
 
Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...
Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...
Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...
Big Data Spain
 

What's hot (20)

Smart IoT for Connected Manufacturing
Smart IoT for Connected ManufacturingSmart IoT for Connected Manufacturing
Smart IoT for Connected Manufacturing
 
Smart Data for you and me: Personalized and Actionable Physical Cyber Social ...
Smart Data for you and me: Personalized and Actionable Physical Cyber Social ...Smart Data for you and me: Personalized and Actionable Physical Cyber Social ...
Smart Data for you and me: Personalized and Actionable Physical Cyber Social ...
 
Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Soci...
Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Soci...Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Soci...
Semantics-empowered Approaches to Big Data Processing for Physical-Cyber-Soci...
 
Citizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and ApplicationsCitizen Sensor Data Mining, Social Media Analytics and Applications
Citizen Sensor Data Mining, Social Media Analytics and Applications
 
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
Semantic, Cognitive, and Perceptual Computing – three intertwined strands of ...
 
Knowledge Will Propel Machine Understanding of Big Data
Knowledge Will Propel Machine Understanding of Big DataKnowledge Will Propel Machine Understanding of Big Data
Knowledge Will Propel Machine Understanding of Big Data
 
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
TRANSFORMING BIG DATA INTO SMART DATA: Deriving Value via Harnessing Volume, ...
 
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
Transforming Big Data into Smart Data for Smart Energy: Deriving Value via ha...
 
Computing for Human Experience [v4]: Keynote @ OnTheMove Federated Conferences
Computing for Human Experience [v4]: Keynote @ OnTheMove Federated ConferencesComputing for Human Experience [v4]: Keynote @ OnTheMove Federated Conferences
Computing for Human Experience [v4]: Keynote @ OnTheMove Federated Conferences
 
Multipleregression covidmobility and Covid-19 policy recommendation
Multipleregression covidmobility and Covid-19 policy recommendationMultipleregression covidmobility and Covid-19 policy recommendation
Multipleregression covidmobility and Covid-19 policy recommendation
 
NG2S: A Study of Pro-Environmental Tipping Point via ABMs
NG2S: A Study of Pro-Environmental Tipping Point via ABMsNG2S: A Study of Pro-Environmental Tipping Point via ABMs
NG2S: A Study of Pro-Environmental Tipping Point via ABMs
 
Physical Cyber Social Computing
Physical Cyber Social ComputingPhysical Cyber Social Computing
Physical Cyber Social Computing
 
Network Science: Theory, Modeling and Applications
Network Science: Theory, Modeling and ApplicationsNetwork Science: Theory, Modeling and Applications
Network Science: Theory, Modeling and Applications
 
Future of hpc
Future of hpcFuture of hpc
Future of hpc
 
The human face of AI: how collective and augmented intelligence can help sol...
The human face of AI:  how collective and augmented intelligence can help sol...The human face of AI:  how collective and augmented intelligence can help sol...
The human face of AI: how collective and augmented intelligence can help sol...
 
Physical Cyber Social Computing: An early 21st century approach to Computing ...
Physical Cyber Social Computing: An early 21st century approach to Computing ...Physical Cyber Social Computing: An early 21st century approach to Computing ...
Physical Cyber Social Computing: An early 21st century approach to Computing ...
 
CINET: A Cyber-Infrastructure for Network Science Overview
CINET: A Cyber-Infrastructure for Network Science OverviewCINET: A Cyber-Infrastructure for Network Science Overview
CINET: A Cyber-Infrastructure for Network Science Overview
 
CINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network ScienceCINET: A CyberInfrastructure for Network Science
CINET: A CyberInfrastructure for Network Science
 
Semantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge GraphsSemantics based Summarization of Entities in Knowledge Graphs
Semantics based Summarization of Entities in Knowledge Graphs
 
Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...
Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...
Monitoring world geopolitics through Big Data by Tomasa Rodrigo and Álvaro Or...
 

Similar to Understanding City Traffic Dynamics Utilizing Sensor and Textual Observations

Integrating Sensor and Social Data for Understanding City Events
Integrating Sensor and Social Data for Understanding City EventsIntegrating Sensor and Social Data for Understanding City Events
Integrating Sensor and Social Data for Understanding City Events
Artificial Intelligence Institute at UofSC
 
A First Step Towards Stream Reasoning at FIS 2008
A First Step Towards Stream Reasoning at FIS 2008A First Step Towards Stream Reasoning at FIS 2008
A First Step Towards Stream Reasoning at FIS 2008
Emanuele Della Valle
 
Multimedia Mining
Multimedia Mining Multimedia Mining
Multimedia Mining
Biniam Asnake
 
SC4 Workshop 2: Josep Maria Salanova - Pilot in Thessaloniki
SC4 Workshop 2: Josep Maria Salanova - Pilot in ThessalonikiSC4 Workshop 2: Josep Maria Salanova - Pilot in Thessaloniki
SC4 Workshop 2: Josep Maria Salanova - Pilot in Thessaloniki
BigData_Europe
 
Streaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through Kafka
Leo Salemann
 
Streaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through Kafka
Wenfan Xu
 
Machine Learning Approach to Report Prioritization with an ...
Machine Learning Approach to Report Prioritization with an ...Machine Learning Approach to Report Prioritization with an ...
Machine Learning Approach to Report Prioritization with an ...
butest
 
KIT-601 Lecture Notes-UNIT-3.pdf Mining Data Stream
KIT-601 Lecture Notes-UNIT-3.pdf Mining Data StreamKIT-601 Lecture Notes-UNIT-3.pdf Mining Data Stream
KIT-601 Lecture Notes-UNIT-3.pdf Mining Data Stream
Dr. Radhey Shyam
 
IoT beneath your feet - building smart roads and networks
IoT beneath your feet - building smart roads and networksIoT beneath your feet - building smart roads and networks
IoT beneath your feet - building smart roads and networks
Alcatel-Lucent Enterprise
 
Transport for London: Using data to keep London moving
Transport for London: Using data to keep London movingTransport for London: Using data to keep London moving
Transport for London: Using data to keep London moving
WSO2
 
The data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architecturesThe data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architectures
Vincenzo Gulisano
 
SFScon22 - Gianluca Antonacci - Traffic management in a Smart City scenario.pdf
SFScon22 - Gianluca Antonacci - Traffic management in a Smart City scenario.pdfSFScon22 - Gianluca Antonacci - Traffic management in a Smart City scenario.pdf
SFScon22 - Gianluca Antonacci - Traffic management in a Smart City scenario.pdf
South Tyrol Free Software Conference
 
Standards Show
Standards ShowStandards Show
Standards Show
mcusack
 
The Design of a Simulation for the Modeling and Analysis of Public Transporta...
The Design of a Simulation for the Modeling and Analysis of Public Transporta...The Design of a Simulation for the Modeling and Analysis of Public Transporta...
The Design of a Simulation for the Modeling and Analysis of Public Transporta...
CSCJournals
 
Realtime Big Data Analytics for Event Detection in Highways
Realtime Big Data Analytics for Event Detection in HighwaysRealtime Big Data Analytics for Event Detection in Highways
Realtime Big Data Analytics for Event Detection in Highways
York University
 
Internet measurement (Presentation)
Internet measurement (Presentation)Internet measurement (Presentation)
Internet measurement (Presentation)
Amir Hossein Mandegar
 
BDE_SC4_WS3_7_Josep Maria Salanova - The Mobility Use Case in Thessaloniki
BDE_SC4_WS3_7_Josep Maria Salanova - The Mobility Use Case in ThessalonikiBDE_SC4_WS3_7_Josep Maria Salanova - The Mobility Use Case in Thessaloniki
BDE_SC4_WS3_7_Josep Maria Salanova - The Mobility Use Case in Thessaloniki
BigData_Europe
 
Prediction of traveller information and route choice
Prediction of traveller information and route choicePrediction of traveller information and route choice
Prediction of traveller information and route choice
ayishairshad
 
A Web of Things Based Eco-System for Urban Computing - Towards Smarter Cities
A Web of Things Based Eco-System for Urban Computing - Towards Smarter CitiesA Web of Things Based Eco-System for Urban Computing - Towards Smarter Cities
A Web of Things Based Eco-System for Urban Computing - Towards Smarter Cities
Andreas Kamilaris
 
Role of Big Data for Smart City Applications
Role of Big Data for Smart City ApplicationsRole of Big Data for Smart City Applications
Role of Big Data for Smart City Applications
Artificial Intelligence Institute at UofSC
 

Similar to Understanding City Traffic Dynamics Utilizing Sensor and Textual Observations (20)

Integrating Sensor and Social Data for Understanding City Events
Integrating Sensor and Social Data for Understanding City EventsIntegrating Sensor and Social Data for Understanding City Events
Integrating Sensor and Social Data for Understanding City Events
 
A First Step Towards Stream Reasoning at FIS 2008
A First Step Towards Stream Reasoning at FIS 2008A First Step Towards Stream Reasoning at FIS 2008
A First Step Towards Stream Reasoning at FIS 2008
 
Multimedia Mining
Multimedia Mining Multimedia Mining
Multimedia Mining
 
SC4 Workshop 2: Josep Maria Salanova - Pilot in Thessaloniki
SC4 Workshop 2: Josep Maria Salanova - Pilot in ThessalonikiSC4 Workshop 2: Josep Maria Salanova - Pilot in Thessaloniki
SC4 Workshop 2: Josep Maria Salanova - Pilot in Thessaloniki
 
Streaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through Kafka
 
Streaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through KafkaStreaming Weather Data from Web APIs to Jupyter through Kafka
Streaming Weather Data from Web APIs to Jupyter through Kafka
 
Machine Learning Approach to Report Prioritization with an ...
Machine Learning Approach to Report Prioritization with an ...Machine Learning Approach to Report Prioritization with an ...
Machine Learning Approach to Report Prioritization with an ...
 
KIT-601 Lecture Notes-UNIT-3.pdf Mining Data Stream
KIT-601 Lecture Notes-UNIT-3.pdf Mining Data StreamKIT-601 Lecture Notes-UNIT-3.pdf Mining Data Stream
KIT-601 Lecture Notes-UNIT-3.pdf Mining Data Stream
 
IoT beneath your feet - building smart roads and networks
IoT beneath your feet - building smart roads and networksIoT beneath your feet - building smart roads and networks
IoT beneath your feet - building smart roads and networks
 
Transport for London: Using data to keep London moving
Transport for London: Using data to keep London movingTransport for London: Using data to keep London moving
Transport for London: Using data to keep London moving
 
The data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architecturesThe data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architectures
 
SFScon22 - Gianluca Antonacci - Traffic management in a Smart City scenario.pdf
SFScon22 - Gianluca Antonacci - Traffic management in a Smart City scenario.pdfSFScon22 - Gianluca Antonacci - Traffic management in a Smart City scenario.pdf
SFScon22 - Gianluca Antonacci - Traffic management in a Smart City scenario.pdf
 
Standards Show
Standards ShowStandards Show
Standards Show
 
The Design of a Simulation for the Modeling and Analysis of Public Transporta...
The Design of a Simulation for the Modeling and Analysis of Public Transporta...The Design of a Simulation for the Modeling and Analysis of Public Transporta...
The Design of a Simulation for the Modeling and Analysis of Public Transporta...
 
Realtime Big Data Analytics for Event Detection in Highways
Realtime Big Data Analytics for Event Detection in HighwaysRealtime Big Data Analytics for Event Detection in Highways
Realtime Big Data Analytics for Event Detection in Highways
 
Internet measurement (Presentation)
Internet measurement (Presentation)Internet measurement (Presentation)
Internet measurement (Presentation)
 
BDE_SC4_WS3_7_Josep Maria Salanova - The Mobility Use Case in Thessaloniki
BDE_SC4_WS3_7_Josep Maria Salanova - The Mobility Use Case in ThessalonikiBDE_SC4_WS3_7_Josep Maria Salanova - The Mobility Use Case in Thessaloniki
BDE_SC4_WS3_7_Josep Maria Salanova - The Mobility Use Case in Thessaloniki
 
Prediction of traveller information and route choice
Prediction of traveller information and route choicePrediction of traveller information and route choice
Prediction of traveller information and route choice
 
A Web of Things Based Eco-System for Urban Computing - Towards Smarter Cities
A Web of Things Based Eco-System for Urban Computing - Towards Smarter CitiesA Web of Things Based Eco-System for Urban Computing - Towards Smarter Cities
A Web of Things Based Eco-System for Urban Computing - Towards Smarter Cities
 
Role of Big Data for Smart City Applications
Role of Big Data for Smart City ApplicationsRole of Big Data for Smart City Applications
Role of Big Data for Smart City Applications
 

Recently uploaded

一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理
zsafxbf
 
Sample Devops SRE Product Companies .pdf
Sample Devops SRE  Product Companies .pdfSample Devops SRE  Product Companies .pdf
Sample Devops SRE Product Companies .pdf
Vineet
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 
一比一原版(UofT毕业证)多伦多大学毕业证如何办理
一比一原版(UofT毕业证)多伦多大学毕业证如何办理一比一原版(UofT毕业证)多伦多大学毕业证如何办理
一比一原版(UofT毕业证)多伦多大学毕业证如何办理
exukyp
 
A gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented GenerationA gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented Generation
dataschool1
 
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
actyx
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
Drownings spike from May to August in children
Drownings spike from May to August in childrenDrownings spike from May to August in children
Drownings spike from May to August in children
Bisnar Chase Personal Injury Attorneys
 
Econ3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdfEcon3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdf
blueshagoo1
 
Module 1 ppt BIG DATA ANALYTICS NOTES FOR MCA
Module 1 ppt BIG DATA ANALYTICS NOTES FOR MCAModule 1 ppt BIG DATA ANALYTICS NOTES FOR MCA
Module 1 ppt BIG DATA ANALYTICS NOTES FOR MCA
yuvarajkumar334
 
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
1tyxnjpia
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptxREUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
KiriakiENikolaidou
 
一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理
ugydym
 
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
hqfek
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
TeukuEriSyahputra
 
ML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
ML-PPT-UNIT-2 Generative Classifiers Discriminative ClassifiersML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
ML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
MastanaihnaiduYasam
 
Digital Marketing Performance Marketing Sample .pdf
Digital Marketing Performance Marketing  Sample .pdfDigital Marketing Performance Marketing  Sample .pdf
Digital Marketing Performance Marketing Sample .pdf
Vineet
 
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
oaxefes
 

Recently uploaded (20)

一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理一比一原版莱斯大学毕业证(rice毕业证)如何办理
一比一原版莱斯大学毕业证(rice毕业证)如何办理
 
Sample Devops SRE Product Companies .pdf
Sample Devops SRE  Product Companies .pdfSample Devops SRE  Product Companies .pdf
Sample Devops SRE Product Companies .pdf
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 
一比一原版(UofT毕业证)多伦多大学毕业证如何办理
一比一原版(UofT毕业证)多伦多大学毕业证如何办理一比一原版(UofT毕业证)多伦多大学毕业证如何办理
一比一原版(UofT毕业证)多伦多大学毕业证如何办理
 
A gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented GenerationA gentle exploration of Retrieval Augmented Generation
A gentle exploration of Retrieval Augmented Generation
 
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
一比一原版斯威本理工大学毕业证(swinburne毕业证)如何办理
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
Drownings spike from May to August in children
Drownings spike from May to August in childrenDrownings spike from May to August in children
Drownings spike from May to August in children
 
Econ3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdfEcon3060_Screen Time and Success_ final_GroupProject.pdf
Econ3060_Screen Time and Success_ final_GroupProject.pdf
 
Module 1 ppt BIG DATA ANALYTICS NOTES FOR MCA
Module 1 ppt BIG DATA ANALYTICS NOTES FOR MCAModule 1 ppt BIG DATA ANALYTICS NOTES FOR MCA
Module 1 ppt BIG DATA ANALYTICS NOTES FOR MCA
 
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
一比一原版(Sheffield毕业证书)谢菲尔德大学毕业证如何办理
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptxREUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
REUSE-SCHOOL-DATA-INTEGRATED-SYSTEMS.pptx
 
一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理一比一原版南昆士兰大学毕业证如何办理
一比一原版南昆士兰大学毕业证如何办理
 
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
一比一原版爱尔兰都柏林大学毕业证(本硕)ucd学位证书如何办理
 
Template xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptxTemplate xxxxxxxx ssssssssssss Sertifikat.pptx
Template xxxxxxxx ssssssssssss Sertifikat.pptx
 
ML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
ML-PPT-UNIT-2 Generative Classifiers Discriminative ClassifiersML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
ML-PPT-UNIT-2 Generative Classifiers Discriminative Classifiers
 
Digital Marketing Performance Marketing Sample .pdf
Digital Marketing Performance Marketing  Sample .pdfDigital Marketing Performance Marketing  Sample .pdf
Digital Marketing Performance Marketing Sample .pdf
 
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
一比一原版卡尔加里大学毕业证(uc毕业证)如何办理
 

Understanding City Traffic Dynamics Utilizing Sensor and Textual Observations

  • 1. Understanding City Traffic Dynamics Utilizing Sensor and Textual Observations Pramod Anantharam, Krishnaprasad Thirunarayan, Surendra Marupudi, Amit Sheth, and Tanvi Banerjee Ohio Center of Excellence in Knowledge-enabled Computing(Kno.e.sis), Wright State University, USA The Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16), February 12–17, Phoenix, Arizona, USA. 1
  • 2. Multimodal Manifestation of Events: Traffic Scenario 2
  • 3. • Why? – Explain/Interpret average speed and link travel time variations using events provided by city authorities and traffic events shared on Twitter – Past work: Predict congestion based on historical sensor data • What? – Combine • 511.org data about Bay Area Road Network Traffic – E.g., Average speed and link travel time data stream (Sensor data) – E.g., (Happened or planned) event reports (Textual data) • Tweets that report events including ad hoc ones (Textual data) Multimodal Data Integration: Traffic Scenario 3
  • 4. • How? – Step 1: Extract textual events from tweets stream – Step 2: Build statistical models of normalcy, and thereby anomaly, for sensor time series data – Step 3: Correlate multimodal streams, using spatio-temporal information, to explain “anomalies” in sensor time series data with textual events Multimodal Data Integration: Traffic Scenario 4
  • 5. • How? – Step 1: Extract textual events from tweets stream – Step 2: Build statistical models of normalcy, and thereby anomaly, from numerical sensor data streams – Step 3: Correlate multimodal streams, using spatio-temporal information, to explain “anomalies” in sensor time series data with textual events Multimodal Data Integration: Traffic Scenario 5
  • 6. Extracting Textual Events from Tweets: Annotation + Extraction Pramod Anantharam, Payam Barnaghi, Krishnaprasad Thirunarayan, and Amit Sheth. 2015. Extracting City Traffic Events from Social Streams. ACM Trans. Intell. Syst. Technol. 6, 4, Article 43 (July 2015), 27 pages. DOI=10.1145/2717317 http://doi.acm.org/10.1145/2717317 1Event Extraction Tool on Open Science Foundation: https://osf.io/b4q2t/wiki/home/ NER – Named Entity Recognition OSM – Open Street Maps 39,208 traffic related incidents extracted from over 20 million tweets1 6
  • 7. • How? – Step 1: Extract textual events from tweets stream – Step 2: Build statistical models of normalcy, and thereby anomaly, from numerical sensor data streams – Step 3: Correlate multimodal streams, using spatio-temporal information, to explain “anomalies” in sensor time series data with textual events Multimodal Data Integration: Traffic Scenario 7
  • 8. Image credit: http://traffic.511.org/index Multiple events Varying influence Event interactions Time of Day (approx. 1 observation/minute)Speed in km/h Building Normalcy Models of Traffic Dynamics*: Challenges *Traffic Dynamics here refers to speed and travel time variations observed in sensor data 8
  • 9. • Temporal landmarks : peak hour vs. off-peak traffic vs. weekend traffic • Effect of location • Scheduled events such as road construction, baseball game, or music concert • Unexpected events such as accidents, heavy rains, fog • Random variations (viz., stochasticity) such as people visiting downtown by mere coincidence Possible Causes of Nonlinearity in Traffic Dynamics 9
  • 10. Modeling City Traffic Dynamics: A Closer Look Image credits: http://bit.ly/1N1wu5g, http://bit.ly/1O8d9gn, http://bit.ly/1N8L5tf, http://bit.ly/1HLDYui Speed of vehicles reported by sensors on the road Events People Influx Vehicle Influx Vehicle Speed Hidden State Observed Evidence • Do we know all the events? • Do we know all the event interactions? • Do we know the number of event participants? • How are the participants traveling? • Do we know the vehicle volume on the road? • Do we know the speed of vehicles on the road? • What is the nature of the speed time series data? 10
  • 11. Nature of the Problem to Linear Dynamical System (LDS) 1. There are both hidden states and observed evidence 2. Current observed evidence depends on the current hidden state 3. Current hidden state depends on the previous hidden states v1 s1 … … v2 s1 vT sT v1 s1 … … v2 s1 vT sT v1 s1 … … v2 s1 vT sT For simplicity of explanation, we consider vehicle influx as a hidden variable and the observed speed as evidence variable Vehicle influx at a certain point in time t would influence speed of vehicles as the same time t Vehicle influx at a certain point in time t depends only on the previous vehicle influx (first- order Markov process) 11
  • 12. Linear Dynamical Systems Model v1 s1 … … v2 s1 vT sT Replacing discrete valued state and observation nodes in the rain example (previous slide) with continuous valued states and observations, we get an LDS model the has transition and observation models The transition model is specified by At and the observation model is specified by Bt along with associated noise The joint distribution over all the hidden and observed variables is shown along with the conditional distributions Barber, David. Bayesian reasoning and machine learning. Cambridge University Press, 2012. 12
  • 13. Hourly Traffic Dynamics Over all Mondays between Aug-14 to Jan-15 x-axis: observation number for each hour of day y-axis: average speed of vehicles in km/h 13
  • 14. 14 Learning Context Specific LDS Models 7 × 24 LDS(1,1), LDS(1,2) ,…., LDS(1,24) LDS(7,1), LDS(7,2) ,…., LDS(7,24) . . . di hj Mo n. Tu e. We d. Th u. Fr i. Sa t. Su n. Mo n. Tue . We d. Thu. Fri. Sat. Sun. Speed/travel-time time series data from a link Time series data for each hour of day (1-24) for each day of week (Monday – Sunday) Mean time series computed for each day of week and hour of day along with the medoid 168 LDS models for each link; Total models learned = 425,712 i.e., (2,534 links × 168 models per link) Step 1: Index data for each link for day of week and hour of day utilizing the traffic domain knowledge for piece- wise linear approximation Step 2: Find the “typical” dynamics by computing the mean and choosing the medoid for each hour of day and day of week Step 3: Learn LDS parameters for the medoid for each hour of day (24 hours) and each day of week (7 days) resulting in 24 × 7 = 168 models for each link
  • 15. Utilizing Context Specific LDS Models to Learn Normalcy Log- likelihood score 15
  • 16. 16 Tagging Anomalies using Context Specific LDS Models Compute Log Likelihood for each hour of observed data (di,hj) LDS(hj,di) 7 × 24 Lik(1,1), Lik(1,2) ,…., Lik(1,24) Lik(7,1), Lik(7,2) ,…., Lik(7,24) . . . Train ? Yes (Training phase) Tag Anomalous hours using the Log Likelihood Range No (di,hj) (min. likelihood) Anomalies L = Partition based on (di,hj) Speed and travel-time time Observations from a link Log likelihood min. and max. values obtained from five number summary Partition based on (di,hj) 7 × 24 LDS(1,1), LDS(1,2) ,…., LDS(1,24) LDS(7,1), LDS(7,2) ,…., LDS(7,24) . . . di hj (Input) (Output)
  • 17. • How? – Step 1: Extract textual events from tweets stream – Step 2: Build statistical models of normalcy, and thereby anomaly, from numerical sensor data streams – Step 3: Correlate multimodal streams, using spatio-temporal information, to explain “anomalies” in sensor time series data with textual events Multimodal Data Integration: Traffic Scenario 17
  • 18. • If an anomaly is detected on a link L and during time period [tst, tet], then the anomaly is explained by an event if the event occurred in the vicinity within 0.5km radius and during [tst-1, tet+1]. • CAVEAT: An anomaly may not be explained because of missing data. Explaining Anomalies in Sensor Data using Textual Events 18 Anomalies City Traffic Events from Twitter ⟨et, el, est, eet, ei⟩ Explained_by Link sensor data City tweets (Anantharam 2014) https://osf.io/b4q2t/wiki/home/ ⟨ast, aet⟩ Δte = est ~ eet Δta = ast - 1 hour ~ aet + 1 hour Explains (if there is an overlap between Δte and Δta)
  • 19. • Data collected from San Francisco Bay Area between May 2014 to May 2015 – 511.org: • 1,638 traffic incident reports • 1.4 billion speed and travel time observations – Twitter Data: 39,208 traffic related incidents extracted from over 20 million tweets1 • Naïve implementation for learning normalcy models for 2,534 links resulted in 40 minutes per link (~ 2 months of processing time for our data) – 2.66 GHz, Intel Core 2 Duo with 8 GB main memory • Scalable implementation by exploiting the nature of the problem resulted in learning normalcy models within 24 hours – The Apache Spark cluster used in our evaluation has 864 cores and 17TB main memory. Real-World Dataset and Scalability Issues 1Extracting city traffic events from social streams. https://osf.io/b4q2t/wiki/home/ 19
  • 21. • Events extracted from social media can complement sensor data • Knowledge of the domain can be utilized for a piecewise linear approximation of non-linear speed dynamics • Events reported by people most likely manifests in sensor data relative to traffic events by formal sources. • Short-term events manifest as anomalies in sensor data while the long-term events may not result in anomaly manifestations 21 Conclusion
  • 22. Acknowledgements This material is based upon work supported by the National Science Foundation under Grant No. EAR 1520870 titled “Hazards SEES: Social and Physical Sensing Enabled Decision Support for Disaster Management and Response”. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation. 22
  • 23. Thank you  Thank you, and please visit us at http://knoesis.org Link to the paper: http://www.knoesis.org/library/resource.php?id=2228 Hazards SEES: Social and Physical Sensing Enabled Decision Support for Disaster Management and Response 23

Editor's Notes

  1. Time series observations are readily and naturally available in domains such as finance, health care, smart cities, and system health monitoring. Increasingly, time series observations include both sensor and textual data generated in the same spatio-temporal context creating both challenges for dealing with heterogeneous data and opportunities for obtaining comprehensive situational awareness. For example, in a city, there are machine sensors and citizen sensors observing the city infrastructure (e.g., bridges, power grids) and city dynamics (e.g., traffic flow, power consumption). In this research, we investigate extraction of city events from textual observations and utilize them explain variations in the sensor observations. This will improve our understanding of city events and their manifestations due to the complementary nature of observations provided by the machine sensors and citizen sensors.
  2. Multimodal manifestation of real-world events – sporting event manifesting in sensor and social data
  3. Events, Volume of vehicles (hidden) Average speed, travel time (observed) Why not AR, ARIMA? Choice of generative model vs. discriminative model? Why LDS and why not GMM?
  4. Examined the theoretical nature of the problem of modeling traffic dynamics to systematically recommend Linear Dynamical Systems (LDS) Formalized non-linear traffic dynamics using linear models as an approximation for segments with approximate linear behavior – derived from traffic domain knowledge Created normalcy models based on log-likelihood scores for spotting traffic anomalies in sensor data Evaluated our approach over a real-world dataset collected from 511.org and Twitter for over a year (May-2014 to May 2015) with promising results