The Troll under the Bridge: Data Management for Huge Web Science Mediabases

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Favorites, Groups & Events

    The Troll under the Bridge: Data Management for Huge Web Science Mediabases - Presentation Transcript

    1. Zinayida Petrushyna, Ralf Klamma RWTH Aachen University Workshop “Digital social networks”, Munich September 12, 2008 The Troll under the Bridge: Data Management for Huge Web Science Mediabases
    2. Agenda
      • Motivation & Problem definition
      • Data Management for Web Science
        • Crawling: Watchers
        • Analysis: Patterns
        • Visualization: Graphs
      • Conclusion
      • Outlook
    3. Data Management issues in Web Science
      • Interoperable formats
        • XML based – Wikis , RSS Feeds, Microformat
        • SQL based – Deep Web
        • Text based – Websites, Forums
      • Non-continuous analysis
        • Crawling vs. Dumps
        • Special purpose vs. General purpose
      • Aggregation level is not possible to achieve
        • Data warehouses
        • Theoretical considerations of agency – Actor network theory
    4. Data Model for the Web 2.0 Latour: On Recalling ANT , 1999
    5. Mediabase
      • A Mediabase is a six-tuple graph
    6. Actors in the Mediabase
    7. Crawling Technologies Mix of dumps (Wikis) and special purpose crawlers:
    8. Trolls under the Bridge
      • What is a disturbance, e.g. a troll?
        • Sensing an incompatibility between theories exposed and theories-in-use
      • Disturbances are starting points of learning processes
        • Disturbances disturb, prevent … but they are creating reflection
      • Disturbances are hard to detect or to forecast
    9. Complex Troll Pattern in Basic Notation
    10. Complex Troll Pattern in Basic Notation 10 )
    11. Pattern Language
      • Variables – simple variables ( troll, thread ), properties ( thread.author ) and set variables ( v 1 ,…,v n ).
      • Operations
        • Arithmetic (+, -, *, / )
        • Aggregate ( SUM , COUNT , AVERAGE )
        • Logical (&, |, ~, FORALL and EXISTS )
        • Comparison ( = , != , > , < ).
      • Rules for variable binding
        • Simple variables – pattern parameters, actors or set variables
        • Properties – actor properties or relations
        • Set variables – actors
      • Interpreted by a finite state automaton
    12. Pattern Language for PALADIN : Example Troll
      • Troll Pattern : This pattern tries to discover the cases when a troll exists in a digital social network. A troll in the network is considered a disturbance.
      • Disturbance :
      • (EXISTS [medium | medium.affordance = threadArtefact]) &
      • (EXISTS [troll |(EXISTS [thread | (thread.author = troll) &
      • (COUNT [message | (message.author = troll) &
      • (message.posted = thread)]) > minPosts]) &
      • (~EXISTS[ thread 1 , message 1 | (thread 1 .author 1 != troll) &
      • (message 1 .author = troll & message 1 .posted = thread 1 ]))])])
      • Forces : medium; troll; network; member; thread; message; url
      • Force Relations : neighbour(troll, member); own thread(troll, thread)
      • Solution : No attention must be paid to the discussions started by the troll .
      • Rationale : The troll needs attention to continue its activities. If no attention is paid, he/she will stop participating in the discussions.
      • Pattern Relations : Associates Spammer pattern.
    13. Pattern Discovery Process Digital Social Network 1. Set pattern parameters 2. Instantiate disturbances 3. Evaluate disturbances 4a. Change Pattern Parameters 4b. Apply Pattern Solution Pattern Disturbance Variables Pattern Template Disturbance Variables Pattern Parameters Pattern Template Instance Pattern Instance Disturbance Variables Pattern Parameters Forces Force Relations Rationale Dependencies Description Solution Pattern Relations Disturbance Instances Variables Pattern Parameters
    14. Visualization
    15. Conclusions and Outlook
      • Homogeneous data management
      • Pattern language for disturbance analysis
      • Graph-based visualization
      • Data uncertainty and inconsistent data
      • Goals and intentions of analysts
      • Dynamic Mediabase visualization

    + Ralf KlammaRalf Klamma, 2 years ago

    custom

    670 views, 0 favs, 1 embeds more stats

    Zinayida Petrushyna, Ralf Klamma
    RWTH Aachen Univer more

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 670
      • 668 on SlideShare
      • 2 from embeds
    • Comments 0
    • Favorites 0
    • Downloads 4
    Most viewed embeds
    • 2 views on http://beamtenherrschaft.blogspot.com

    more

    All embeds
    • 2 views on http://beamtenherrschaft.blogspot.com

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories