Successfully reported this slideshow.
Your SlideShare is downloading. ×

Policy Cloud Data Driven Policies against Radicalisation - Technical Overview

Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Ad
Upcoming SlideShare
Evolution of Data Spaces
Evolution of Data Spaces
Loading in …3
×

Check these out next

1 of 24 Ad

Policy Cloud Data Driven Policies against Radicalisation - Technical Overview

Download to read offline

The problem of radicalisation is very high on the European agenda as increasing numbers of young European radicals return from Syria and use the internet to disseminate propaganda. To enable policy makers to design policies to address radicalisation effectively, Policy Cloud consortium will collect data from social media and other sources including the open-source Global Terrorism Database (GTD), the Onion City search engine which accesses data over the TOR dark web sites, and Twitter ( through Firehose). The data will be analysed using sentiment analysis and opinion mining software.

The problem of radicalisation is very high on the European agenda as increasing numbers of young European radicals return from Syria and use the internet to disseminate propaganda. To enable policy makers to design policies to address radicalisation effectively, Policy Cloud consortium will collect data from social media and other sources including the open-source Global Terrorism Database (GTD), the Onion City search engine which accesses data over the TOR dark web sites, and Twitter ( through Firehose). The data will be analysed using sentiment analysis and opinion mining software.

Advertisement
Advertisement

More Related Content

Slideshows for you (20)

Similar to Policy Cloud Data Driven Policies against Radicalisation - Technical Overview (20)

Advertisement

More from Big Data Value Association (20)

Recently uploaded (20)

Advertisement

Policy Cloud Data Driven Policies against Radicalisation - Technical Overview

  1. 1. policycloud.eu 1 02/07/2020 Pavlos Kranas (LeanXcale S.L) PolicyCLOUD Technical Overview PolicyCloud has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 870675.
  2. 2. policycloud.eu 2 Objective PolicyCLOUD: Analytics-as-a -Service facilitating efficient data-driven public policy management 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  3. 3. policycloud.eu 3 Background Facts Increasing use of devices and networks leading to the generation of vast quantities of data Data linking is becoming the norm (e.g. linking new data sources with established data sources) Current approaches in policy making are not evidence-based Mature approaches to analyse and understand the “environment” Goal Creation of efficient and effective policies through data-driven policy management Decision support to authorities for policy modelling, implementation and simulation through identified populations, as well as for policy enforcement and adaptation 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  4. 4. policycloud.eu 4 Main challenges (1/5) A data-driven approach for effective policies management Across the complete data path, including data modelling, representation and interoperability, cleaning, heterogeneous datasets linking, analytics for knowledge extraction Exploit the collective knowledge out of policy “collections” combined with the data from several sources (e.g. sensor readings, online platforms, etc.) 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  5. 5. policycloud.eu 5 Main challenges (2/5) Compilation, assessment and optimization of multi-domain policies Holistic policy modelling, making and implementation in different sectors (e.g. environment, migration, goods and services, etc.), through the analysis and linking of KPIs of different policies that may be interdependent and inter- correlated (e.g. environment) Analysis of (unexpected) patterns and policies relationships Identification of effective KPIs to be re-used and non-effective ones (including the causes for not being effective) towards their improvement 02/07/2020 Kick-off meeting Madrid
  6. 6. policycloud.eu 6 Main challenges (3/5) Data management techniques across the complete data path Meta-interpretation layer for the semantic and syntactic capturing of data properties and their representation Data cleaning to ensure data quality and coherence including the adaptive selection of information sources based on evolving volatility levels (i.e. changing availability or engagement level of information sources) 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  7. 7. policycloud.eu 7 Main challenges (4/5) Analytics as a service reusable on top of different datasets Machine and deep learning techniques (e.g. classification, regression, clustering and frequent pattern mining) to infer new data and knowledge Opinion mining, sentiment analysis, social dynamics and behavioral data analytics Technologies that allow analytics tasks to be decoupled from specific datasets and thus be triggered as services and applies to various cases and datasets 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  8. 8. policycloud.eu 8 Main challenges (5/5) Unique endpoint to exploit analytics in different cases Execution of different models / analytical tools on data (e.g. to identify trends, to mine opinion artefacts, to explore situational and context awareness information, to identify sensitives, etc.) Modelled policies (through their KPIs) realized / implemented and monitored against these KPIs Adaptive and incremental visualization enabling the policy lifecycle to be visualized in different ways, while the visualization can be modified on the fly and can enable the specification of the assets to be visualized (e.g. data sources or meta-processed information) 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  9. 9. policycloud.eu 9 Conceptual architecture 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  10. 10. policycloud.eu 10 Seamless Analytical Framework Baseline Technology firstly introduced and implemented as Proof- Of-Concept in H2020 BigDataStack Collaborative work between IBM and LeanXcale First prototype already delivered! Its functionality is planned to be extended in PolicyCLOUD 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  11. 11. policycloud.eu 11 Seamless Analytical Framework Modern enterprises use Operational databases for OLTP load Key-Value for IoT data Data warehouses for data analytics Datalakes etc ... Need polyglot capabilities 02/07/2020 Kick-off meeting Madrid
  12. 12. policycloud.eu 12 Seamless Analytical Framework Nowadays: Data Federation using Spark 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  13. 13. policycloud.eu 13 Seamless Analytical Framework Nowadays: Data Federation using Spark BUT: Can be very resource consuming Cannot exploit the specific capabilities of each different datastore 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  14. 14. policycloud.eu 14 Seamless Analytical Framework – User Story Data ingestion in operational datastore (LeanXcale) Old data becomes historical, with no modifications Data Warehouse to perform analytics on big data volumes Distribution of datasets is problematic Data to be retrieved from both stores To be merged in the application level Data consistency considerations when moving datasets 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  15. 15. policycloud.eu 15 Seamless Analytical Framework – Solution Seamless Analytical Framework Federate data coming from two different datastores: HTAP Relational LXS Datastore IBM Object store sharing the SAME dataset Single (black box) component that consists of two datastores exploits unique characteristics of each one transparently from the user does not compromise some requirements for the benefits of others 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  16. 16. policycloud.eu 16 Query Federation 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  17. 17. policycloud.eu 17 Data Movement High Level 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  18. 18. policycloud.eu 18 Supported Operations Currently supports Full Scan Ordered Scan LIMIT Aggregations Group By Aggregations Ordered Group By Aggregations Does not yet support JOIN on fragmented data tables 02/07/2020 Policy Cloud Data Driven Policies against Radicalisation
  19. 19. policycloud.eu 19 `1 with Data Skipping CEP Gatewa y Seamless component LXS DB Remote IBM COS 2/7/2020 Policy Cloud Data Driven Policies against Radicalisation Data Quality Assessment accessingApplication Data Mov er Data Skipping CEP Machine Learning app. Danaos (BigDataStack) Data Center
  20. 20. policycloud.eu 20 16.07.2019 Periodic Review Meeting 20 Relevant for SQL queries Implemented for Apache Spark SQL Up to latest Apache Spark version 3.0 Standalone technology but also nicely complements the seamless component Determine which objects are NOT relevant to a SQL query using a data skipping index Stores and indexes tiny summary metadata for each object. Skipping over irrelevant objects reduces the bytes scanned Index All Objects Data Set Objects Query example: retrieve data of violent storms SELECT vessel_code, datetime, longitude, latitude, wind_speed FROM cos://us-south/…/danaos stored as parquet WHERE wind_speed > 30 Data Skipping WHERE Clause Data Skipping Indexing Candidate Objects SQL Dataset addressed by an SQL query
  21. 21. policycloud.eu 21 16.07.2019 Periodic Review Meeting 21 Relevant for SQL queries Implemented for Apache Spark SQL Standalone technology but also nicely complements the seamless component Determine which objects are NOT relevant to a SQL query using a data skipping index Stores and indexes tiny summary metadata for each object. Skipping over irrelevant objects reduces the bytes scanned Saves time and $ Index All Objects Data Set Objects Example: Look for data in violent storm conditions SELECT vessel_code, datetime, longitude, latitude, wind_speed FROM cos://us-south/…/danaos stored as parquet WHERE wind_speed > 30 Saves Time and $ WHERE Clause Data Skipping Indexing Candidate Objects SQL Data Skipping
  22. 22. policycloud.eu 22 Joint demo IBM/Danaos at the high visibility in IBM THINK ’19 conference When could we try Data Skipping ….
  23. 23. policycloud.eu 23 Joint demo IBM/Danaos at the high visibility in IBM THINK ’19 conference Data Skipping technology integrated as open beta into IBM Cloud SQL Query When could we try Data Skipping …
  24. 24. policycloud.eu 24 GET IN TOUCH @PolicyCloudEU PolicyCloud EU www.policycloud.eu PolicyCloud has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 870675.

×