Statbel is the national statistical institute of Belgium and a member of the European Statistical System. Big data presents new opportunities and challenges for official statistics. Statbel has established a Big Data Team and conducted projects using mobile phone data, web scraping, and satellite imagery. Key challenges include gaining access to proprietary data sources and developing statistical methodology for large, unstructured datasets. Smart statistics that integrate multiple data sources in real time could provide detailed monitoring systems for issues like air quality and population estimates. Statbel aims to further develop use cases for big data and may need to collaborate with data scientists or hire them internally to fully leverage big data.
SC6 Workshop 1: Big data (phenomenon) challenges and requirements in official...BigData_Europe
Presentation by Fernando Reis, Eurostat, European Commission, at the first workshop of Societal Challlenge 6 in the BigDataEurope project, taking place in Luxembourg on 18 November 2015.
http://www.big-data-europe.eu/social-sciences/
Paulo Canas Rodrigues - The role of Statistics in the Internet of Things - ...Mindtrek
Paulo Canas Rodrigues
Research Director
CAST (Centre for Applied Statistics and Data Analytics) University of Tampere
The role of Statistics in the Internet of Things
Mindtrek 2016
Estermann Panel on Authority Files, 3 June 2020Beat Estermann
Panel on Authority Files and Controlled Vocabularies: Welcome and Introduction; GLAM Inventory; Named Entities in the Context of the LOD Ecosystem for the Performing Arts. Side programme of the Swiss Open Cultural Data Hackathon 2020, Online Session, 3 June 2020.
Presentation done by Marta Sabou, Adrian M.P. Brașoveanu, & Irem Önder, during "Intelligence & analytics" workshop, of the ENTER2015 eTourism conference.
LIBER DH Working Group Workshop: Digital Humanities Activities at Göttingen S...LIBER Europe
This presentation was given as part of the Digital Humanities workshop at LIBER 2017, Patras. For more about LIBER and the Digital Humanities Working Group, please see: www.libereurope.eu
SC6 Workshop 1: Big data (phenomenon) challenges and requirements in official...BigData_Europe
Presentation by Fernando Reis, Eurostat, European Commission, at the first workshop of Societal Challlenge 6 in the BigDataEurope project, taking place in Luxembourg on 18 November 2015.
http://www.big-data-europe.eu/social-sciences/
Paulo Canas Rodrigues - The role of Statistics in the Internet of Things - ...Mindtrek
Paulo Canas Rodrigues
Research Director
CAST (Centre for Applied Statistics and Data Analytics) University of Tampere
The role of Statistics in the Internet of Things
Mindtrek 2016
Estermann Panel on Authority Files, 3 June 2020Beat Estermann
Panel on Authority Files and Controlled Vocabularies: Welcome and Introduction; GLAM Inventory; Named Entities in the Context of the LOD Ecosystem for the Performing Arts. Side programme of the Swiss Open Cultural Data Hackathon 2020, Online Session, 3 June 2020.
Presentation done by Marta Sabou, Adrian M.P. Brașoveanu, & Irem Önder, during "Intelligence & analytics" workshop, of the ENTER2015 eTourism conference.
LIBER DH Working Group Workshop: Digital Humanities Activities at Göttingen S...LIBER Europe
This presentation was given as part of the Digital Humanities workshop at LIBER 2017, Patras. For more about LIBER and the Digital Humanities Working Group, please see: www.libereurope.eu
LandCity Revolution - L'evoluzione del segmento di terra per sostenere l'era ...giovanni biallo
La diffusione di immagini satellitari fruibili gratuitamente e liberamente, come i dati del progetto Copernicus, apre nuovi orizzonti in termini di prodotti e servizi,
Overview of OpenGLAM in Switzerland and the latest activities of the Bern University of Applied Sciences in the area of open cultural data. Presentation held at the Conference on Conference on Open Data and Open Maps for Heritage Protection in Bellinzona, Switzerland, 21 Feb 2020.
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020Beat Estermann
Linked Open Data Ecosystem for Heritage Data. Presentation held at the 5th Anniversary of the Swiss Open Cultural Data Hackathon on 29 February 2020 at the National Library in Bern.
Presentation of the Performing Arts Project on Wikidata, Meeting of the Archival Working Group of the German Society for Theatre Research, Frankfurt a.M., 16 January 2018.
EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...European Data Forum
Talk of European Data Innovator Award Winner: Johann Mittheisz, former CIO of the City of Vienna, Austria, at the European Data Forum 2014, 19 March 2014 in Athens, Greece: European Data Innovator (EDI) Award 2014
Presentation given as discussant at the Session on "Migration flows: data and measurement" at the Conference of European Statistics Stakeholders in Budapest on 21 October 2016
LandCity Revolution - L'evoluzione del segmento di terra per sostenere l'era ...giovanni biallo
La diffusione di immagini satellitari fruibili gratuitamente e liberamente, come i dati del progetto Copernicus, apre nuovi orizzonti in termini di prodotti e servizi,
Overview of OpenGLAM in Switzerland and the latest activities of the Bern University of Applied Sciences in the area of open cultural data. Presentation held at the Conference on Conference on Open Data and Open Maps for Heritage Protection in Bellinzona, Switzerland, 21 Feb 2020.
Estermann Linked Data Ecosystem for Heritage Data - 29 Feb 2020Beat Estermann
Linked Open Data Ecosystem for Heritage Data. Presentation held at the 5th Anniversary of the Swiss Open Cultural Data Hackathon on 29 February 2020 at the National Library in Bern.
Presentation of the Performing Arts Project on Wikidata, Meeting of the Archival Working Group of the German Society for Theatre Research, Frankfurt a.M., 16 January 2018.
EDF2014: Talk of European Data Innovator Award Winner: Johann Mittheisz, form...European Data Forum
Talk of European Data Innovator Award Winner: Johann Mittheisz, former CIO of the City of Vienna, Austria, at the European Data Forum 2014, 19 March 2014 in Athens, Greece: European Data Innovator (EDI) Award 2014
Presentation given as discussant at the Session on "Migration flows: data and measurement" at the Conference of European Statistics Stakeholders in Budapest on 21 October 2016
20140902 LinDa Workshop Semantincs2014 - Bringing LOD to SMEsLinDa_FP7
Bringing LOD to SME – The case of the LinDA pilots business intelligence, environmental sector, media industry
Presentation at LinDA Workshop on 2nd September 2014 at Semantics2014 by Salvatore Virtuoso
Carlo Amati - Ex post evaluation of cohesion policyOpenCoesione
La riflessione di Carlo Amati, Evaluation Officer della Commissione europea – DG REGIO, ha guardato all’approccio della valutazione ex post dei fondi UE: l'efficacia, l'efficienza, l'impatto, la loro coerenza con altre politiche, la loro pertinenza e il valore aggiunto dell'Unione Europea in linea con gli orientamenti per legiferare meglio
"Towards Value-Centric Big Data" e-SIDES Workshop - "Responsible Research: An...e-SIDES.eu
The following presentation was given by Prof. Ansar Yasar from the University of Hasselt during the e-SIDES workshop "Towards Value-Centric Big Data" held on April 2, 2019 in Brussels.
Methodologies for Addressing Risks and Opportunities Engendered by Big Data T...BYTE Project
Authors: Rachel L. Finn, Hayley Watson, Kush Wadhwa, Arild Waaler, Ahmet Soylu, Guillermo Vega-Gorgojo, Hans Lammerant, Stephane Grumbach and Scott Cunningham;
Presented at: Critical Explorations of Data and Security Workshop, University of North Carolina, Chapel Hill, USA
15-16 Nov 2013
Using gamification to generate citizen input for public transport planningMarius Rohde Johannessen
Presentation at the 2016 ePart conference in Guimaraes, Portugal. Research in progress presenting a case study of a smart cities app, and discussing how the data can be used for increased citizen participation.
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
StarCompliance is a leading firm specializing in the recovery of stolen cryptocurrency. Our comprehensive services are designed to assist individuals and organizations in navigating the complex process of fraud reporting, investigation, and fund recovery. We combine cutting-edge technology with expert legal support to provide a robust solution for victims of crypto theft.
Our Services Include:
Reporting to Tracking Authorities:
We immediately notify all relevant centralized exchanges (CEX), decentralized exchanges (DEX), and wallet providers about the stolen cryptocurrency. This ensures that the stolen assets are flagged as scam transactions, making it impossible for the thief to use them.
Assistance with Filing Police Reports:
We guide you through the process of filing a valid police report. Our support team provides detailed instructions on which police department to contact and helps you complete the necessary paperwork within the critical 72-hour window.
Launching the Refund Process:
Our team of experienced lawyers can initiate lawsuits on your behalf and represent you in various jurisdictions around the world. They work diligently to recover your stolen funds and ensure that justice is served.
At StarCompliance, we understand the urgency and stress involved in dealing with cryptocurrency theft. Our dedicated team works quickly and efficiently to provide you with the support and expertise needed to recover your assets. Trust us to be your partner in navigating the complexities of the crypto world and safeguarding your investments.
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
2. http://statbel.fgov.be
Overview
❑ Context
Statbel, big data and the third data revolution in statistics
❑ Big data and Statbel
Projects, accomplishments and problems
❑ European collaboration
❑ Big data and official statistics
Provisional conclusions and way forward
❑ Big data, Statbel and data science
3. http://statbel.fgov.be
Statbel =
❑ Statistics Belgium
❑ The institute formerly known as Nationaal Instituut voor de
Statistiek (NIS) / Institut national de Statistique (INS)
❑ Administratively part of the FPS (‘ministry’) Economy
❑ Member of the European Statistical System (ESS) =
Eurostat + 32 EU & EFTA national statistical institutes + associated statistics producers
4. http://statbel.fgov.be
Big data
❑ = data impossible to process in a ‘normal’ way
‘normal’ is relative …
❑ 3 v’s: volume, velocity, variety
❑ Result of societal and technological changes
Satellites, cameras and sensors, internet and e-mail, social media, mobile
phones and tablets, e-business, e-government, machine-to-machine
(internet of things, IoT)
❑ Result: data explosion, data deluge
5. http://statbel.fgov.be
Big data and statistics
❑ Big data = ‘digital footprint’
❑ Containing valuable information, statistically exploitable
(but also commercially …)
❑ Resulting in the third data revolution in statistics
After surveys (>1846) and administrative data (>2000), now: big data!
❑ Possible data sources – list far from exhaustive!
❖ Scanner data, electronical payments, credit card data
❖ Webscraping for job vacancies, enterprise characteristics
❖ Traffic cameras and detection loops
❖ Smart meters (electricity, gas, water)
❖ Last but not least: mobile phone data!
6. http://statbel.fgov.be
The future of statistics …
… big data!
Instant statistics based on big data, complemented and/or
validated by administrative data and small and specific ad hoc
surveys.
Also known as: smart statistics …
7. http://statbel.fgov.be
Big data and Statbel: Big Data Team
❑ Start at the end of 2015
❑ Restricted group, operating informally and ad hoc
❑ Focus on mobile phone data, webscraping job vacancies
❑ Tasks:
❖Reflection on strategy, priorities
❖External contacts concerning big data, with data owners, potential users,
federal and regional authorities, academia, EU, international
organisations, …
❖Analysing big datasets and connecting them to statistical ones – see
below
8. http://statbel.fgov.be
Big data and Statbel: in production
❑ For consumer price index (CPI)
❖Scanner data supermarkets and retail chains
❖Webscraping prices (e.g. airplane tickets, webshops)
9. http://statbel.fgov.be
Big data and Statbel: not planned (at present)
❑ Social media, internet search results, text analytics, …
=‘high-hanging fruits’, access and interpretation very problematic!
❑ Smart meters
Political decision of regions (2012) not to deploy
=> no data (about to change in Flanders)
❑ Traffic cameras, traffic loops
Regional competence and data
10. http://statbel.fgov.be
Big data and Statbel: projects
❑ Mobile phone data
❖Project Statbel-Proximus-Eurostat
❖Border Region Data Collection (BRDC)
❖City data from LFS and Big Data
❑ Webscraping
❖Job vacancies
❑ Satellite data and aerial photography
❖Deep Solaris
11. http://statbel.fgov.be
Big data in production
❑ Scanner data for CPI
❖ Based on agreement with data owners, facilitated by political pressure
❖ Legal basis (HICP regulation) but cooperative model
❖ Being expanded gradually with new supermarket and retail chains
❖ Extremely smooth and cost-efficient after initial set-up
❑ Webscraping prices for CPI
❖ Collecting prices on webshops (e.g. airplane tickets)
❖ For efficiency but also out of necessity: e-commerce fast expanding!
❖ Legal issues possible
12. http://statbel.fgov.be
Big data almost in production
❑ Webscraping job vacancies: about to go in production …
❖ Methodological and practical issues
❖ Stand-alone results not sufficient, need to combine with existing Job
vacancy survey (JVS) and ‘administrative’ data from regional
employment agencies (VDAB, FOREM, Actiris)
❖ Linked to European project (ESSnet Big Data, see below): https://
webgate.ec.europa.eu/fpfis/mwikis/essnetbigdata/index.php/WP1_Webscraping_job_vacancies
13. http://statbel.fgov.be
Project Statbel, Proximus, Eurostat
❑ Start December 2015, first results April 2016
❑ Step by step approach:
❖ First: actual present population
❖ Basis for: resident population (via place of residence), workplace,
‘usual environment’, tourism, labour migration, migration, time use, …
❑ Innovative:
❖ First collaboration NSI/operator in EU => ‘real’ data
❖ No ‘call detail records’ but network signals: 10 x more frequent!
❖ Combining mobile phone data with statistical datasets
❑ Via geo-coupling of aggregates: no privacy issues!
14. http://statbel.fgov.be
Project Statbel, Proximus, Eurostat
Results and next steps
❑ Results
❖ 10 publications - see De Meersman e.a., Debusschere e.a., Demunter e.a., Reis e.a., Seynaeve e.a.,
Wirthmann e.a., all at https://webgate.ec.europa.eu/fpfis/mwikis/essnetbigdata/index.php/WP5_Documentation
❑ Our objectives:
❖ Further exploration for statistical ánd commercial applications
❖ Concrete use cases with a view to statistical production lines
❑ Unfortunately …
❖ March 2017: data blocked for everyone
❖ In the meantime: new initiatives via MIT/Univ. Newcastle, Eurostat
18. http://statbel.fgov.be
Other projects
❑ Border Region Data Collection (BRDC)
❖Grant EC DG Regio, 1 year, July 2017 - July 2018
❖Cross-border living place-workplace mobility through
combining Labour force survey (LFS), administrative data and
mobile phone data
❖With CBS Netherlands (lead), Destatis Germany, Insee
France, GUS Poland, SURS Slovenia
19. http://statbel.fgov.be
Other projects, continued
❑ Deep Solaris
❖Grant Eurostat, 1 year, kickoff 19 Febr. 2018
❖Detecting solar pannels on the basis of satellite data
and aerial photography
❖Via machine learning
❖With CBS Netherlands (lead), Destatis Germany,
IT.NRW (Düsseldorf, DE) and BISS (Heerlen, NL)
20. http://statbel.fgov.be
Other projects, continued
❑ City data from LFS and Big Data
❖Grant EC DG Regio, 1 year, Jan. – Dec. 2018
❖Mapping metropolitan areas on the basis of Labour force
survey (LFS) and mobile phone data
❖With CBS Netherlands (lead), Destatis Germany, Insee
France, Statistics Austria
21. http://statbel.fgov.be
From exploration to exploitation
Developing use cases
❑ Scanner data and webscraping for CPI
❑ Webscraping job vacancies
❑ Validation living place and workplace Census
❑ Matrix living place-workplace
❑ … (population, migration, tourism, mobility, transport, time use,
environment, agriculture, …)
23. http://statbel.fgov.be
Big data and official statistics
Provisional conclusions
❑ Size and complexity of datasets
❖ Less problematic than anticipated (because of pre-processing, at
least some structure)
❖ Focus consequently less on IT infrastructure and software
24. http://statbel.fgov.be
Big data and official statistics
Provisional conclusions, continued
❑ Biggest obstacle: access!
❖ Data owned by private enterprises: profit-oriented
❖ Fail to see any advantage in collaborating, on the contrary
(mistakenly!)
❖ Imposing legal obligation seems unavoidable …
❑ Link to privacy issues: fear of reputational damage
25. http://statbel.fgov.be
Big data and official statistics
Provisional conclusions, continued
❑ Additional challenge: methodology
❖ All ancient headaches are still there …
… with a lot of new ones added!
26. http://statbel.fgov.be
Big data and official statistics
The next stage: smart statistics
❑ Monitoring systems which are:
❖ integrated
❖ flexible
❖ multi-source
❖ real-time and highly detailed
❑ Some examples:
❖ continuous tracking of air quality
❖ highly granular actual present population (time, location, characteristics)
❖ smart farming statistics
27. http://statbel.fgov.be
For discussion:
big data, Statbel and data science
❑ Statbel owns numerous geocoded datasets (population,
employment, income, lodgings, …)
❑ and might gain access to big data sources …
❑ … but lacks data science, capability to analyse big data
❑ Two possible solutions:
❖ collaboration with academia, researchers
❖ hiring …