Integrating and publishing public safety data using semantic technologies

Published in: Technology, Education

  1. Integrating and Publishing Public Safety Data Using Semantic Technologies
      Alvaro Graves [email_address]
      Tetherless World Constellation
      Department of Cognitive Science
  2. Outline
      • Motivations
      • Implementation
      • Challenges and next steps
      • Conclusions
  3. <Motivation>
  4. <Research>
  5. Research Motivation
      • Lots of “human computational power”
      • An example:
          • 9 billion human-hours of solitaire were played in 2003 (in red)
          • Building the Panama Canal took 20 million human-hours (in blue)
  6. Interesting, but...
      • … How can we use this cognitive and computational processing power...
      • … to solve difficult problems...
      • … (ideally) without much effort from the users?
  7. Social Machines
      • Social Machines are mechanisms where:
          • Humans do the creative work
          • Machines do the administrative work
  8. What are we good at?
  9. Example 1: reCAPTCHA
  10. Example 2: GalaxyZoo
  11. Example 3: Threadless
  12. How to study Social Machines?
      • Limitations
          • Access to information (logs, database)
          • Privacy concerns
          • Lack of flexibility
      • Solution:
          • Create a framework for Social Machines
          • Incentives are important
  13. </Research>
  14. <Practical>
  15. Troy, NY
      • Segmented city
          • Downtown
          • RPI
          • North/South
      • High risk zones
      • Unlike big cities
          • Hard to have centralized data
          • Resources are scarce
  16. How can...
      • Citizens be aware of their environment?
          • How risky is it to park on this street between 1 AM and 3 AM?
      • Policy-makers make the right decisions?
          • Is a specific crime increasing over time?
      • Law enforcers be more transparent in their work?
          • How to show our work to the community?
  17. What can we do?
  18. Idea: Let's build...
      • … a platform for integrating public safety information
      • … a framework for studying Social Machines
          • Understand how users behave when presented with data
          • Run experiments (e.g., provenance => trust)
  19. </Practical> </Motivation>
  20. <Implementation>
  21. Architecture
  22. <SemanticWeb>
  23. Semantic Technologies
      • Based on the Web
          • No need to create a special platform
      • Domain agnostic
          • Can express different domains
      • Dynamically coupled
          • Easy to mix different data sources
      • (Also: interoperable, distributed, extensible, etc.)
  24. Semantic Web: Example
      • Most basic “language”: RDF (Resource Description Framework)
      • Set of assertions (triples)
      • My name is Alvaro Graves
      • http://graves.cl#me foaf:name “Alvaro Graves”
  25. Add more triples
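
The triple on slide 24 can be sketched in a few lines of Python. This is only an illustration using plain tuples and no RDF library: the `foaf:` prefix is expanded to the FOAF namespace URI, the helper names `uri`, `lit`, and `to_ntriples` are made up for this sketch, and the second triple (a `foaf:knows` link to an `example.org` URI) is a hypothetical addition that is not in the slides.

```python
# Minimal sketch: RDF triples as plain Python tuples (no RDF library).
FOAF = "http://xmlns.com/foaf/0.1/"
ME = "http://graves.cl#me"  # subject URI taken from the slide


def uri(value):
    """Mark an object as a URI so it serializes as <...>."""
    return ("uri", value)


def lit(value):
    """Mark an object as a plain string literal."""
    return ("lit", value)


def to_ntriples(triples):
    """Serialize (subject, predicate, object) triples as N-Triples lines.

    Only backslashes and double quotes are escaped in literals; this is
    enough for the example but not a complete N-Triples writer.
    """
    lines = []
    for s, p, o in sorted(triples):
        kind, value = o
        if kind == "uri":
            obj = f"<{value}>"
        else:
            escaped = value.replace("\\", "\\\\").replace('"', '\\"')
            obj = f'"{escaped}"'
        lines.append(f"<{s}> <{p}> {obj} .")
    return "\n".join(lines)


# The assertion from the slide: "My name is Alvaro Graves".
graph = {(ME, FOAF + "name", lit("Alvaro Graves"))}

# "Add more triples": the graph grows by adding assertions about the same URI.
# The object URI below is hypothetical.
graph.add((ME, FOAF + "knows", uri("http://example.org/people#alice")))

print(to_ntriples(graph))
```

Because every statement has the same three-part shape, triples from different sources can be merged by taking the union of the sets, which is the "easy to mix different data sources" property from slide 23.
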
  26. </SemanticWeb>
  27. <DataIntegration>
  28. Different sources...
      • RPI Public Safety
          • PDF files available on the Web
          • Different formats depending on the year
          • Not easy to extract data
      • Troy Police Department
          • Excel files upon request
          • Less information (date/time, event type, geographical references)
          • Only certain crime types
  29. ...Similar Problems
      • Unclear geographical references (“Dining Hall”, “15th and Peoples ave.”)
      • Not only crimes but other events (fires, medical emergencies, etc.)
  30. Our Taxonomy
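
One way to picture the integration step is a function that maps records from both sources onto a single event structure with a shared category. The sketch below is an assumption-laden illustration: the taxonomy on slide 30 is an image that the transcript does not reproduce, so the `TAXONOMY` labels, the field names, and the three sample records are all invented for the example.

```python
from datetime import datetime

# Hypothetical mapping from source-specific labels to a shared taxonomy.
# The real taxonomy from the slide is not available in the transcript.
TAXONOMY = {
    "larceny": "crime/theft",
    "theft": "crime/theft",
    "fire alarm": "emergency/fire",
    "medical assist": "emergency/medical",
}


def normalize(source, when, label, place):
    """Map one source record onto a common event structure."""
    return {
        "source": source,
        "time": datetime.strptime(when, "%Y-%m-%d %H:%M"),
        "category": TAXONOMY.get(label.strip().lower(), "unclassified"),
        "place": place.strip(),
    }


# Invented sample records, one per kind of source described on slide 28.
events = [
    normalize("RPI Public Safety", "2010-03-01 01:30", "Larceny", "Dining Hall"),
    normalize("Troy Police Department", "2010-03-02 14:05", "THEFT", "15th and Peoples ave."),
    normalize("RPI Public Safety", "2010-03-03 09:00", "Fire Alarm", "Dining Hall"),
]

for event in events:
    print(event["time"].isoformat(), event["category"], event["place"])
```

Labels that the mapping does not know fall back to `unclassified`, so they can be reviewed by hand instead of being dropped.
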
  31. </DataIntegration>
  32. <Curation+Persistance>
  33. Existing data is not enough
      • Lack of geolocation
          • Use Google Maps as a Web Service for obtaining latitude and longitude
      • Typos
          • Manual correction
      • Ambiguity
          • “Event occurred Off campus” (!)
          • Approximated as best we could
  34. An example
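
The curation steps on slide 33 could be combined roughly as follows: apply manual corrections, try a hand-made list of known places, then fall back to a geocoding service. This sketch does not call Google Maps or any real service. The `geocoder` argument is a stand-in callable, `fake_geocoder` exists only so the example runs offline, and every correction, place name, and coordinate below is made up for illustration.

```python
# Hypothetical manual corrections and gazetteer; all values are illustrative.
CORRECTIONS = {"peoples ave.": "Peoples Ave", "15 th": "15th"}
GAZETTEER = {"dining hall": (42.7300, -73.6800)}  # made-up coordinates


def clean(place):
    """Collapse whitespace and apply manual typo corrections."""
    text = " ".join(place.split())
    for wrong, right in CORRECTIONS.items():
        idx = text.lower().find(wrong)
        if idx != -1:
            text = text[:idx] + right + text[idx + len(wrong):]
    return text


def locate(place, geocoder):
    """Return (lat, lon, precision) for a place, or None if unresolved.

    `geocoder` is any callable that takes an address string and returns
    (lat, lon) or None; it stands in for an external geocoding service.
    """
    text = clean(place)
    if text.lower() in GAZETTEER:
        lat, lon = GAZETTEER[text.lower()]
        return lat, lon, "gazetteer"
    result = geocoder(text + ", Troy, NY")
    if result is None:
        return None  # ambiguous references still need manual handling
    return result[0], result[1], "geocoded"


def fake_geocoder(address):
    """Offline stand-in so the sketch runs without network access."""
    known = {"15th and Peoples Ave, Troy, NY": (42.7310, -73.6790)}
    return known.get(address)


print(locate("Dining Hall", fake_geocoder))
print(locate("15 th and Peoples ave.", fake_geocoder))
print(locate("Off campus", fake_geocoder))
```

Recording how each coordinate was obtained (the third element of the result) keeps approximated locations distinguishable from precise ones.
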
  35. Persistence
      • Use of semantic technologies for persistence
      • Pros:
          • Easy to set up
          • Works with MySQL+PHP
      • Cons:
          • Not very scalable
          • Queries cannot be too complex
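
The slides do not say which triple store was used, so the following is only a guess at the general shape of triples persisted in a relational database. SQLite from the Python standard library stands in for MySQL, and the table layout, the `ev:` and `ex:` identifiers, and the sample rows are assumptions made for this sketch.

```python
import sqlite3

# SQLite stands in for MySQL; the single-table layout is an assumption.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE triples (s TEXT, p TEXT, o TEXT)")
conn.executemany(
    "INSERT INTO triples VALUES (?, ?, ?)",
    [
        ("ev:1", "rdf:type", "ex:Theft"),
        ("ev:1", "ex:place", "Dining Hall"),
        ("ev:2", "rdf:type", "ex:Fire"),
        ("ev:2", "ex:place", "Dining Hall"),
    ],
)

# "Where did thefts happen?" needs two triple patterns, hence one self-join.
rows = conn.execute(
    """
    SELECT t.s, loc.o
    FROM triples AS t
    JOIN triples AS loc ON loc.s = t.s
    WHERE t.p = 'rdf:type' AND t.o = 'ex:Theft' AND loc.p = 'ex:place'
    """
).fetchall()
print(rows)
```

In a layout like this every additional triple pattern adds another self-join on the same table, which is one plausible reason for the scalability and query-complexity limits listed under "Cons".
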
  36. </Curation+Persistance>
  37. <Visualization+Publishing>
  38. Data in multiple ways
      • Citizens
          • At home/office
          • On the street
      • Decision makers
          • Statistics
          • Potential correlations?
      • Machines
          • Create notifications
          • Display on other platforms
          • Reuse data
  39. PublicSafetyMap.org
  40. PublicSafetyMap.org
  41. m.PublicSafetyMap.org
      • Only data around your location
      • Still in development!
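
Showing "only data around your location" amounts to a distance filter. The sketch below uses the haversine great-circle formula. The 0.5 km default radius, the field names, and the sample coordinates are assumptions chosen for illustration and do not come from the mobile site.

```python
import math

EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in kilometres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def nearby(events, lat, lon, radius_km=0.5):
    """Keep only the events within radius_km of the given position."""
    return [
        e for e in events
        if haversine_km(lat, lon, e["lat"], e["lon"]) <= radius_km
    ]


# Made-up events and coordinates for illustration.
sample_events = [
    {"category": "crime/theft", "lat": 42.7300, "lon": -73.6800},
    {"category": "emergency/fire", "lat": 42.7310, "lon": -73.6790},
    {"category": "crime/theft", "lat": 42.8000, "lon": -73.6800},
]

for event in nearby(sample_events, 42.7300, -73.6800):
    print(event["category"], event["lat"], event["lon"])
```
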
  42. PublicSafetyMap.org/feeds.php
      • Latest data
      • Current search
      RDF RSS KML
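
As an illustration of one of the three feed formats, the sketch below turns a list of events into a small KML document with the standard library's `xml.etree.ElementTree`. It is not the site's actual `feeds.php` output: the function name `to_kml`, the event field names, and the sample event are assumptions. Note that KML writes coordinates as longitude,latitude.

```python
import xml.etree.ElementTree as ET

KML_NS = "http://www.opengis.net/kml/2.2"


def to_kml(events):
    """Build a KML document with one Placemark per event."""
    ET.register_namespace("", KML_NS)  # emit KML as the default namespace
    kml = ET.Element(f"{{{KML_NS}}}kml")
    doc = ET.SubElement(kml, f"{{{KML_NS}}}Document")
    for e in events:
        placemark = ET.SubElement(doc, f"{{{KML_NS}}}Placemark")
        ET.SubElement(placemark, f"{{{KML_NS}}}name").text = e["category"]
        point = ET.SubElement(placemark, f"{{{KML_NS}}}Point")
        # KML coordinate order is longitude,latitude.
        coords = ET.SubElement(point, f"{{{KML_NS}}}coordinates")
        coords.text = f'{e["lon"]},{e["lat"]}'
    return ET.tostring(kml, encoding="unicode")


# Made-up event for illustration.
feed = to_kml([{"category": "crime/theft", "lat": 42.73, "lon": -73.68}])
print(feed)
```
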
  43. </Visualization+Publishing> </Implementation>
  44. <Challenges>
  45. More than technical problems
      • Hard to convince decision makers
      • Trust issues
      • “Social entry barriers”
  46. Next steps
      • Connect with well-known social networks (Facebook, Twitter, etc.)
      • Geolocate anything (bicycling route, pizza delivery area)
          • Geoinferencing
      • Add more datasets (weather, real estate, fire dept.)
      • Annotation on events (establishing trust measures, explanations, etc.)
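
The slides list geoinferencing only as a future step and do not define it. One simple reading is deciding whether an event falls inside a named area, such as a delivery zone. The sketch below uses the standard ray-casting point-in-polygon test under that assumption. The polygon and the test points are invented, and the approach treats latitude and longitude as flat coordinates, which is reasonable only for small areas.

```python
def point_in_polygon(lat, lon, polygon):
    """Ray-casting test; polygon is a list of (lat, lon) vertices."""
    inside = False
    n = len(polygon)
    for i in range(n):
        lat1, lon1 = polygon[i]
        lat2, lon2 = polygon[(i + 1) % n]
        # Consider only edges that straddle the point's longitude.
        if (lon1 > lon) != (lon2 > lon):
            # Latitude at which the edge crosses that longitude.
            crossing_lat = lat1 + (lon - lon1) * (lat2 - lat1) / (lon2 - lon1)
            if lat < crossing_lat:
                inside = not inside
    return inside


# Hypothetical rectangular "pizza delivery area".
delivery_area = [
    (42.72, -73.69),
    (42.72, -73.67),
    (42.74, -73.67),
    (42.74, -73.69),
]

print(point_in_polygon(42.73, -73.68, delivery_area))
print(point_in_polygon(42.80, -73.68, delivery_area))
```
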
  47. Next steps
  48. </Challenges>
  49. <Conclusions>
  50. Research: Studying Social Machines
      • Don't compete with Google
      • Opening data so others can use it for their own applications
      • But most important: a framework where we can study Social Machines
      • Being able to run experiments
  51. From Practical: Public Safety Information
      • Users can visualize their neighborhood.
      • Policy-makers can manage data in a useful way.
      • Law enforcers can show their efforts.
  52. </Conclusions> <mailto: [email_address] />
