Online Behavior Analysis And Modeling Methodology

Published in: Technology, News & Politics

    1. Maj David Robinson: Online Behavior Analysis and Modeling Methodology (OBAMM)
    2. Overview
         • Goals
         • Methodology
             • Collect
             • Cluster/Categorize
             • Classify
             • Correlate
         • Future Work
    3. Goals
         • Creation of online user fingerprints based on usage patterns and interest areas
         • Insider threat detection, policy violation, social network analysis, business process analysis
    4. Collect
         • Active or passive
         • Need to see “unique” users (in general)
         • HTTP/GET Requests
             • Minimize chaff
             • Interested in src, dest, timestamp
         • Sample record: 08/24/07 17:30:06.203412,192.168.10.10, GET,www.ists.dartmouth.edu
           (fields labeled on the slide: Timestamp, Source, Destination)
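The sample record on the Collect slide can be split into the three fields of interest. The following is a minimal sketch, not the author's collector: the `Request` type and `parse_request` function are hypothetical names, and the month/day/two-digit-year ordering of the timestamp is an assumption, since the slide does not state it.

```python
from datetime import datetime
from typing import NamedTuple


class Request(NamedTuple):
    """One HTTP GET observation (hypothetical record type)."""
    timestamp: datetime
    source: str
    method: str
    destination: str


def parse_request(line: str) -> Request:
    # Fields are comma-separated; stray spaces around a field are tolerated.
    ts, src, method, dest = [field.strip() for field in line.split(",")]
    # Assumption: the date is month/day/two-digit year.
    when = datetime.strptime(ts, "%m/%d/%y %H:%M:%S.%f")
    return Request(when, src, method.upper(), dest.lower())


record = parse_request(
    "08/24/07 17:30:06.203412,192.168.10.10, GET,www.ists.dartmouth.edu"
)
print(record.source, record.destination, record.timestamp.isoformat())
```

Lower-casing the destination keeps later category lookups from missing a host because of letter case.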
    5. Cluster/Categorize
         • Initial cluster on categories
             • Utilize reverse category lookup
         • Then cluster on:
             • Content
             • Meta-data
    6. Categorization Data
         • ODP (Open Directory Project)
             • 4,830,584 sites categorized
             • 16 “core” categories
             • Multi-language
         • Blacklists
             • Approx 40 MB of URLs
             • Approx 37 categories
             • Capture “less common” URLs
                 • Porn, hacking, drugs, weapons, warez
                 • Shopping, webmail, finance, dating
    7. ODP Data
         <Topic r:id="Top/Arts/Movies/Titles/1/10_Rillington_Place">
           <catid>205108</catid>
           <link r:resource="http://www.britishhorrorfilms.co.uk/rillington.shtml"/>
           <link r:resource="http://www.shoestring.org/mmi_revs/10-rillington-place.html"/>
         </Topic>
         <ExternalPage about="http://www.britishhorrorfilms.co.uk/rillington.shtml">
           <d:Title>British Horror Films: 10 Rillington Place</d:Title>
           <d:Description>Review which looks at plot especially the shocking features of it.</d:Description>
           <topic>Top/Arts/Movies/Titles/1/10_Rillington_Place</topic>
         </ExternalPage>
         <ExternalPage about="http://www.shoestring.org/mmi_revs/10-rillington-place.html">
           <d:Title>MMI Movie Review: 10 Rillington Place</d:Title>
           <d:Description>Review includes plot, real life story behind the film and realism in the film.</d:Description>
           <topic>Top/Arts/Movies/Titles/1/10_Rillington_Place</topic>
         </ExternalPage>
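The "reverse category lookup" mentioned earlier can be illustrated with the ODP records shown on this slide: map each page URL to the topic path it is filed under. This is only a sketch under stated assumptions. It uses regular expressions because the excerpt carries no namespace declarations for the `r:` and `d:` prefixes; a full dump would more likely call for a streaming XML parser. `build_reverse_lookup` is a hypothetical name.

```python
import re

# Two ExternalPage records copied from the slide (descriptions omitted).
SAMPLE = """
<ExternalPage about="http://www.britishhorrorfilms.co.uk/rillington.shtml">
  <d:Title>British Horror Films: 10 Rillington Place</d:Title>
  <topic>Top/Arts/Movies/Titles/1/10_Rillington_Place</topic>
</ExternalPage>
<ExternalPage about="http://www.shoestring.org/mmi_revs/10-rillington-place.html">
  <d:Title>MMI Movie Review: 10 Rillington Place</d:Title>
  <topic>Top/Arts/Movies/Titles/1/10_Rillington_Place</topic>
</ExternalPage>
"""

PAGE = re.compile(r'<ExternalPage about="([^"]+)">(.*?)</ExternalPage>', re.DOTALL)
TOPIC = re.compile(r"<topic>([^<]+)</topic>")


def build_reverse_lookup(rdf_text):
    """Return {url: [topic path as a list of labels, ...]}."""
    lookup = {}
    for url, body in PAGE.findall(rdf_text):
        for topic in TOPIC.findall(body):
            # A URL may be filed under several topics, so keep a list.
            lookup.setdefault(url, []).append(topic.split("/"))
    return lookup


lookup = build_reverse_lookup(SAMPLE)
for url, paths in lookup.items():
    print(url, "->", [" > ".join(p) for p in paths])
```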
    8. Blacklist Data
         • Shopping Domains
             • bestbudsgarden.com
             • bestbudsgarden.pointshop.com
             • bestbuket.ru
             • bestbung.com
             • bestbuy.com
         • Shopping URLs
             • 00sake.com/htdocs
             • horseland.com.au/shop
             • balance.com/products/shop
             • theawningman.com.au/catalog/
             • wigglywigglers.co.uk/shop
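The blacklist slide lists two kinds of entries per category: whole domains and URL prefixes. One way to match a requested URL against both is sketched below. The matching rules (strip a leading `www.`, match parent domains, treat URL entries as path prefixes) are assumptions, not something the slides specify, and `blacklist_categories` is a hypothetical name.

```python
from urllib.parse import urlsplit

# A few entries copied from the slide, keyed by category.
DOMAINS = {"shopping": {"bestbudsgarden.com", "bestbuket.ru", "bestbuy.com"}}
URLS = {"shopping": {"horseland.com.au/shop", "theawningman.com.au/catalog/"}}


def blacklist_categories(url):
    """Return the set of blacklist categories that match the URL."""
    # urlsplit only recognises the host when the URL has a "//" prefix.
    parts = urlsplit(url if "//" in url else "//" + url)
    host = parts.hostname or ""
    if host.startswith("www."):
        host = host[4:]
    labels = host.split(".")
    # The host itself plus every parent domain, e.g. a.b.com -> b.com -> com.
    suffixes = {".".join(labels[i:]) for i in range(len(labels))}
    full = host + parts.path.rstrip("/")

    found = set()
    for category, domains in DOMAINS.items():
        if suffixes & domains:
            found.add(category)
    for category, prefixes in URLS.items():
        for prefix in prefixes:
            prefix = prefix.rstrip("/")
            # Match the entry itself or anything below it in the path.
            if full == prefix or full.startswith(prefix + "/"):
                found.add(category)
    return found


print(blacklist_categories("www.bestbuy.com"))
print(blacklist_categories("http://horseland.com.au/shop/saddles"))
```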
    9. Single URL
         • http://www.dartmouth.edu
         • Category labels shown on the slide (diagram): Dartmouth College, New Hampshire, United States, North America, Colleges & Universities, Education, Reference
    10. Multiple URLs
         • news.google.com/?topic=s
         • www.kicksology.com
         • www.beckett.com/userpages/Meibin.html
         • www.razbet.com
         • www.fhlsim.com
    11. Multiple URLs (diagram: one category tree per URL)
         • URLs: news.google.com/?topic=s, www.kicksology.com, beckett.com/userpages, www.razbet.com, www.fhlsim.com
         • Category labels, in extraction order: Portals News and Media Resources Sports Footwear Apparel Basketball Sports Shopping Basketball Cards Sports Collecting Recreation Basketball Tipping & Handicapping Sports Gambling Games Software Simulation Hockey Fantasy Sports
    12. Multiple URLs (diagram: the per-URL category trees merged)
         • Category labels, in extraction order: Portals News and Media Resources Footwear Apparel Basketball Shopping Basketball Cards Sports Collecting Recreation Basketball Tipping & Handicapping Gambling Games Software Simulation Hockey Fantasy
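The two "Multiple URLs" slides appear to show per-URL category trees being combined into one. A minimal sketch of that kind of merge follows. The category paths used here are illustrative only: the diagram is flattened in this transcript, so the exact hierarchy cannot be recovered, and `merge_paths` is a hypothetical name.

```python
def merge_paths(paths):
    """Merge slash-separated category paths into one nested dict (a trie)."""
    tree = {}
    for path in paths:
        node = tree
        for label in path.split("/"):
            # Reuse the existing branch when two paths share a prefix.
            node = node.setdefault(label, {})
    return tree


def show(tree, depth=0):
    """Print the merged tree with two spaces of indentation per level."""
    for label, children in tree.items():
        print("  " * depth + label)
        show(children, depth + 1)


# Illustrative paths, NOT the slide's exact hierarchy.
paths = [
    "Sports/Basketball/Apparel",
    "Sports/Basketball/Cards",
    "Sports/Hockey/Fantasy",
]
merged = merge_paths(paths)
show(merged)
```

Shared prefixes collapse into a single branch, which is what makes a common interest area (here, the hypothetical "Sports" root) stand out across otherwise unrelated sites.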
    13. Cluster/Categorize
         • Issues
             • Not all URLs categorized
             • URLs in multiple categories
         • Document Clustering Methods
             • Page content
             • Page meta-data
    14. User Representation
    15. User Details
    16. Correlate
         • Temporal
         • Similarity
         • Interactions
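The slides do not say how similarity between users is measured. As one plausible illustration, the sketch below represents each user as a count of visits per category and compares two users with cosine similarity. Both the representation and the metric are assumptions, and the counts are made-up example values.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two sparse category-count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty profile is similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical profiles: visits per category.
user_a = Counter({"Sports": 12, "Shopping": 3})
user_b = Counter({"Sports": 8, "Shopping": 2})
user_c = Counter({"Art": 5})

print(round(cosine_similarity(user_a, user_b), 3))
print(cosine_similarity(user_a, user_c))
```

Cosine similarity ignores how much a user browses overall and compares only the mix of categories, which suits fingerprints built from interest areas.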
    17. Temporal Relations (chart: frequency over time for the categories Shopping, Computer, Adult, Art)
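A frequency-over-time view like the one on this slide could be built by counting categorized requests per time bucket. The sketch below buckets by hour of day, which is an assumption (the slide does not give the bucket size), and the events are hypothetical.

```python
from collections import Counter
from datetime import datetime


def hourly_profile(events):
    """Return {category: Counter({hour_of_day: request_count})}."""
    profile = {}
    for when, category in events:
        profile.setdefault(category, Counter())[when.hour] += 1
    return profile


# Hypothetical (timestamp, category) pairs for a single user.
events = [
    (datetime(2007, 8, 24, 9, 5), "Computer"),
    (datetime(2007, 8, 24, 9, 40), "Computer"),
    (datetime(2007, 8, 24, 12, 15), "Shopping"),
    (datetime(2007, 8, 25, 9, 10), "Computer"),
]
profile = hourly_profile(events)
for category, hours in profile.items():
    print(category, dict(hours))
```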
    18. Correlate (diagram: Users 1 through 10)
    19. Correlate (diagram: Users 1 through 10)
    20. Correlate (diagram: Users 1 through 10 grouped as Engineers, Med Students, Business Students)
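The final Correlate slide shows users falling into groups, but the transcript does not say how the groups were formed. One simple possibility, offered purely as an assumption, is to link any two users whose similarity score meets a threshold and take the connected components. The scores, the threshold, and the name `group_users` are all hypothetical.

```python
def group_users(users, scores, threshold):
    """Group users into connected components of the 'similar enough' graph."""
    neighbours = {user: set() for user in users}
    for (a, b), score in scores.items():
        if score >= threshold:
            neighbours[a].add(b)
            neighbours[b].add(a)

    groups, seen = [], set()
    for user in users:
        if user in seen:
            continue
        # Depth-first walk collecting everyone reachable from this user.
        stack, group = [user], set()
        while stack:
            current = stack.pop()
            if current in group:
                continue
            group.add(current)
            stack.extend(neighbours[current] - group)
        seen |= group
        groups.append(group)
    return groups


users = ["User 1", "User 2", "User 3", "User 4"]
scores = {  # hypothetical pairwise similarity scores
    ("User 1", "User 3"): 0.9,
    ("User 2", "User 4"): 0.8,
    ("User 1", "User 2"): 0.1,
}
groups = group_users(users, scores, threshold=0.5)
print(groups)
```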
    21. Future Work
         • Search terms
         • Categorize functionalities
         • Temporal correlation
         • Social Network Analysis
         • Pruning
         • Data
    22. Questions
         • Contacts
             • [email_address]
             • [email_address]
             • [email_address]
