
Hans Henseler - Intelligent data analysis for improving public security - Data Quality Summit 2008



Published in: Technology


  1. Intelligent data analysis for improving public security. November 14, 2008. Data Quality Summit '08, Evoluon, Eindhoven. Hans Henseler, Forensic Technology Solutions, PwC Advisory.
  2. Intelligent data analysis can help improve public security. Can you see the pattern?
  3. KECIDA: Knowledge and Expertise Centre for Intelligent Data Analysis. Sources; Marechaussee, IND, KLPD; Law enforcement; Advice and Research; Methods and Technology; Statistics; Data mining; Pattern recognition; Social network analysis.
  4. KECIDA is part of the Pattern Recognition project, which is financed by the National Coordinator for Counterterrorism (NCTb).
  5. Example of traditional information analysis: an Analyst's Notebook chart showing all known facts.
  6. Example: text mining and data quality. Extraction of names and places.
  7. Source data: collection of text files. First file: "Mickey Mouse works for Donald Duck. This message is online since 02/10/2007. Mickey Mouse turns out to work for Donald Duck since 2000. Donald was able to take his nephews to Disneyland thanks to Donald." Second file: "Donald Duck was apprehended in The Hague. This message is online since 29/09/2007. Yesterday Barak Obama and Madonna have instructed the police to arrest Donald Duck in The Hague just before his performance as a duck."
  8. Visualising the extracted entities as a network: Donald Duck, Madonna, Barak Obama, Mickey Mouse, The Hague, Disney Land.
  9. Automatic analysis of news flashes on terrorism. New developments can be detected faster by intelligent analysis of open sources.
  10. Structuring unstructured information.
  11. Text mining: structuring unstructured data and linking data to other data.
  12. Discovering relations between entities (1).
  13. Discovering relations between entities (2).
  14. Visualisation and data cleaning.
  15. Example: investigating money transfers. Intelligent search for money laundering activities. Every red dot represents a bank account.
  16. Pairs of bank accounts are normal; larger groups of linked accounts draw attention.
  17. A generic approach: CRISP-DM (Cross Industry Standard Process for Data Mining).
  18. Conclusions
      • Intelligent data analysis can help improve public security:
        • Discovery of new relations, so that risks can be recognized earlier.
        • Visual processing increases the capacity to analyse information, so that the same number of people can do more.
      • Good data quality is a precondition for intelligent analysis. Bad data quality makes analysis of information more difficult because:
        • Relations will be lost if names do not match.
        • Identities appearing under multiple names lead to more complex networks.
  19. Thank you for your attention! © 2008 PricewaterhouseCoopers. All rights reserved. “PricewaterhouseCoopers” refers to the network of member firms of PricewaterhouseCoopers International Limited, each of which is a separate and independent legal entity. *connectedthinking is a trademark of PricewaterhouseCoopers LLP (US).
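
Illustration for the money transfer example (pairs of accounts are normal, larger linked groups draw attention): the slides do not say how linked accounts were grouped, so the following is only a minimal sketch of one possible approach, using a union-find structure to collect accounts connected by transfers. The account identifiers, the transfer list, the function names and the `normal_size=2` threshold are all assumptions made for this illustration; they do not come from the presentation.

```python
from collections import defaultdict


def linked_groups(transfers):
    """Group accounts that are connected by at least one chain of transfers.

    `transfers` is an iterable of (sender, receiver) pairs. Direction is
    ignored here: two accounts are "linked" if money moved between them.
    """
    parent = {}

    def find(account):
        # Register unseen accounts as their own group, then walk to the root.
        parent.setdefault(account, account)
        while parent[account] != account:
            parent[account] = parent[parent[account]]  # path halving
            account = parent[account]
        return account

    for sender, receiver in transfers:
        root_a, root_b = find(sender), find(receiver)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two groups

    groups = defaultdict(set)
    for account in list(parent):
        groups[find(account)].add(account)

    # Largest groups first; ties broken alphabetically for stable output.
    return sorted((sorted(g) for g in groups.values()),
                  key=lambda g: (-len(g), g))


def groups_needing_attention(transfers, normal_size=2):
    """Return only groups larger than `normal_size` (assumed: pairs are normal)."""
    return [g for g in linked_groups(transfers) if len(g) > normal_size]


# Invented example data: two ordinary pairs and one ring of four accounts.
transfers = [
    ("A1", "A2"),
    ("B1", "B2"),
    ("C1", "C2"), ("C2", "C3"), ("C3", "C4"), ("C4", "C1"),
]

print(groups_needing_attention(transfers))
```

With this invented data only the ring of four accounts is reported. The data quality point from the conclusions applies here as well: if one account appeared under two different identifiers, its links would be spread over separate nodes, and a group could be missed or look more complex than it is.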