• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Efficiently Maintaining Distributed Model-Based Views on Real-Time Data Streams
 

Efficiently Maintaining Distributed Model-Based Views on Real-Time Data Streams

on

  • 468 views

This topic was presented by Alexandru Arion at the 54th annual conference IEEE Global Communications Conference (GLOBECOM 2011) from 5 – 9 December 2011 in Houston, Texas. ...

This topic was presented by Alexandru Arion at the 54th annual conference IEEE Global Communications Conference (GLOBECOM 2011) from 5 – 9 December 2011 in Houston, Texas.

Publication: http://bit.ly/A3iKbv

Abstract:
The trend for more online linked data becomes stronger. Foreseeing a future where "everything" will be online and linked, we ask the critical question; what is next? We envision that managing, query- ing and storing large amounts of links and data is far from yet another query processing task. We highlight two distinct and promising research directions towards managing and making sense of linked data. We in- troduce linked views to help focusing on specific link and data instances and linked history to help observe how links and data change over time.

Statistics

Views

Total Views
468
Views on SlideShare
444
Embed Views
24

Actions

Likes
0
Downloads
0
Comments
0

3 Embeds 24

http://www.planet-data.eu 17
http://planet-data.eu 5
http://planet-data.org 2

Accessibility

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Efficiently Maintaining Distributed Model-Based Views on Real-Time Data Streams Efficiently Maintaining Distributed Model-Based Views on Real-Time Data Streams Presentation Transcript

    • Efficiently Maintaining Distributed Model- Based Views on Real-Time Data Streams Alexandru Arion, Hoyoung Jeung, Karl Aberer EPFL, 2011
    • Data networksLocal: low power connected devices transmit to base stations.Large scale: base stations transmit over large distances using existingcommunication infrastructure.
    • RelevanceLarge numbers of sensor networks are already beinginterconnected and share huge amount of streaming data.Example: SwissEx (http://www.swiss-experiment.ch)
    • Related workS. Shah, et all., “An efficient and resilient approach to filtering and disseminatingstreaming data,” in VLDB, 2003, pp. 57–68.Y. Zhou, et all., “Disseminating streaming data in a dynamic environment: anadaptive and cost-based approach,” The VLDB Journal, vol. 17, no. 6, pp. 1465–1483, 2008.D. J. Abadi, et all., “The design of the Borealis stream processing engine,” inCIDR, 2005, pp. 277–289.M. Balazinska, et all., “Load management and high availability in the medusadistributed stream processing system,” in SIGMOD, 2004, pp. 929–930.P. Pietzuch, et all., “Network-aware operator placement for stream-processing systems,” in ICDE, 2006, p. 49.
    • The framework
    • Key features (1)Feature 1: reduces communication costs (does notrequire any data transfer of actual streams)Feature 2: any type of queries can be processed (alldata required for query processing is available toconsumer nodes)
    • Key features (2)Feature 3: any type of model can be employed(serves any application)Feature 4: systematic solution that can guaranteeuser-specified accuracy requirements for model-based views.
    • Algorithms (1)Coded model update:● predetermines parameter values● encodes them with bitmaps● updates models efficiently sending only bitmaps
    • Algorithms (2)Coded inter-variable model:● uses correlation information● reduces data redundancy
    • Framework propertiesAccuracy requirements solution: ● The producer node generates a model-driven value when a new raw reading is streamed, and checks whether the difference between the raw value and the model-driven value stays within the error bound.
    •   ● If the difference does not exceed the error bound, no communication is required between the two nodes, and the consumer node generates values for their model-based views.
    •   ● Otherwise, the producer node reconstructs its model, so that the model-driven value generated from the reconstructed model does not exceed the error bound from the current raw reading. Next, the producer node updates the models at consumer nodes by sending new parameter values of the reconstructed model.
    • Coded Model Update
    • Coded Inter-variable Model
    • Coded Inter-variable Model (2)
    • Experiments (1)
    • Experiments (2)
    • Experiments (3)
    • Further related workA. Deshpande and S. Madden, “MauveDB: supporting model-based user views indatabase systems,” in SIGMOD, 2006Y. Ahmad, O. Papaemmanouil, U. C¸ etintemel, and J. Rogers, “Simultaneousequation systems for query processing on continuous-time data streams,” in ICDE,2008A. Thiagarajan and S. Madden, “Querying continuous functions in a databasesystem,” in SIGMOD, 2008A. Deligiannakis, Y. Kotidis, and N. Roussopoulos, "Compressing historicalinformation in sensor networks,” in SIGMOD, 2004H. Chen, J. Li, and P. Mohapatra, “RACE: time series compression withrate adaptivity and error bound for sensor networks,” 2004S. Gandhi, S. Nath, S. Suri, and J. Liu, “Gamps: Compressing multisensor data by grouping and amplitude scaling,” in SIGMOD, 2009
    • Conclusions● Generic framework● Arbitrary numerical models● Coded model update● Coded inter-variable model