Discover the content and approach followed for the development of the European Data Portal. this portal, released in November 2015 aims at referencing open data made available in up to 39 different European countries.
Presentation with some background on Open Data publishing and reuse policies. Some examples to illustrate the benefits of this new paradigm in Europe.
RAW Open Data, Coimbra (16 October 2014)
Presentation describing the purpose of the European Data Portal project. The launching of the European Data Portal is one of the key steps the European Commission is taking in supporting the access to public data.
European Data Portal - ePSI platform webinar 8 February 2016EuropeanDataPortal
All presentations given during the ePSI platform webinar that was held on 8 February 2016.
The agenda of the webinar:
1) Opening by the European Commission.
2) Introduction to the EDP project
3) Demo of the Portal
4) Technical architecture
5) Focus on CKAN extensions developed
6) Focus on maps application
7) Next steps
8) Discussion and tips & tricks for open data implementation
Presentation with some background on Open Data publishing and reuse policies. Some examples to illustrate the benefits of this new paradigm in Europe.
RAW Open Data, Coimbra (16 October 2014)
Presentation describing the purpose of the European Data Portal project. The launching of the European Data Portal is one of the key steps the European Commission is taking in supporting the access to public data.
European Data Portal - ePSI platform webinar 8 February 2016EuropeanDataPortal
All presentations given during the ePSI platform webinar that was held on 8 February 2016.
The agenda of the webinar:
1) Opening by the European Commission.
2) Introduction to the EDP project
3) Demo of the Portal
4) Technical architecture
5) Focus on CKAN extensions developed
6) Focus on maps application
7) Next steps
8) Discussion and tips & tricks for open data implementation
The Presentation of Hans-Jörg Lieder, Staatsbibliothek zu Berlin – Preußischer Kulturbesitz, at the BnF Information Day for Europeana Newspapers (November 2014).
EDF2014: Nicolas Lemcke Horst, Ambassador of the Danish Basic Data Programme,...European Data Forum
Invited Talk of Nicolas Lemcke Horst, Ambassador of the Danish Basic Data Programme, Agency for Digitisation, Ministry of Finance of Denmark: at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Danish Basic Data
Refinement
Europeana Newspapers Workshop: A Gateway to European Newspapers Online. Research Information Infrastructures and the Future Role of Libraries.
LIBER 2013 Annual Conference, Bavarian State Library, 26-29 June 2013, Munich, Germany.
Open Data and Open Government at the local level: an example and thoughts fro...Marco Fioretti
My talk at the first National OGP Forum in Macedonia. More info and link to report at http://mfioretti.com/2014/11/skopje-first-national-open-government-partnership-forum/
The EUnetHTA perspective on the HTA databasePatrice Chalon
Presentation at HTAi Annual meeting 2017, panel session "Rescuing the HTA database – future options and challenges"
A central, international database for HTA reports and other HTA products is considered to be a vital source of information for healthcare researchers and stakeholders. The current HTA database, which contains over 15,000 documents submitted by numerous HTA agencies, was originally established by the International Network of Agencies for Health Technology Assessment (INAHTA) in 2007 and is available on the website of the UK Centre for Reviews and Dissemination (CRD). The database has so far been funded by the UK National Institute for Health Research (NIHR). Its existence is however endangered as future funding is unclear. If no alternative is found, it will no longer be maintained and only an archived version will be available. There would thus no longer be a single access point to HTA reports.
Previous research has indicated that more than 75% of HTA agencies use the HTA database and more than half adapt common HTA products from reports produced by other agencies.1 The lack of an HTA database would have a direct impact on these activities. Smaller HTA agencies would be particularly affected, as they often have insufficient resources to produce their own reports and rely on reports from larger agencies. The wider consequences should also be considered: for instance, the decreasing visibility of HTA reports would diminish their relevance. It may also become more difficult to establish collaborations between HTA agencies. The problem would thus affect the whole HTA community.
Often, however, a crisis also offers opportunities. The establishment of a new HTA database should include a re-evaluation of its structure and technical functions (e.g. inclusion of ongoing projects).
Structure of the session: Short presentations will be held to provide an overview of the different perspectives of the various HTA agencies and networks currently involved in the discussions on the future of the HTA database. The panel will focus on the initiatives to rescue the database and present the options for funding, hosting, structure and technical functions. There will also be a guided discussion on the possible solutions presented and the challenges faced.
Panel/Workshop outcome and objectives: At the end of the session, participants should be aware of the overall importance of the HTA database, the current status quo, and the potential features of a future HTA database.
OECD, 2nd Task Force Meeting on Charting Illicit Trade - Tamara SCHOTTEOECD Governance
This presentation by Tamara SCHOTTE was made at the 2nd Task Force Meeting on Charting Illicit Trade held on 5-7 March 2014. www.oecd.org/gov/risk/charting-illicit-trade-second-task-force-meeting.htm
The European Innovation Partnership on Water Online MarketplaceMartin Kaltenböck
Presentation about the 'The European Innovation Partnership (EIP) on Water Online Marketplace (http://www.eip-water.eu)' taking place on 09.02.2016 in the course of the EIP Water annual conference in Leeuwarden, The Nethetlands.
agINFRA vision after the end of the projectAndreas Drakos
The agINFRA project (http://www.aginfra.eu) lasted from the October 2011 to February 2015. This presentation shows the vision for after the end of the project
Presentation: BigDataEurope, by Martin Kaltenböck, Semantic Web Company (Austria), at the European Data Economy Workshop taking place back to back to SEMANTiCS2015 on 15 September 2015 in Vienna
Introduction: The Big Data Europe Project at the: CMG-AE Event: Big Data: Strategien, Technologien und Nutzen
19th of May 2015, Expat Center der Wirtschaftsagentur, Vienna, Austria
See: http://www.big-data-europe.eu
Ontology Engineering at Scale for Open City Data SharingOscar Corcho
Seminar at the School of Informatics, The University of Edinburgh.
In this talk we will present how we are applying ontology engineering principles and tools for the development of a set of shared vocabularies across municipalities in Spain, so that they can start homogenising the generation and publication of open data that may be useful for their own internal reuse as well as for third parties who want to develop applications reusing open data once and deploy them for all municipalities. We will discuss on the main challenges for ontology engineering that arise in this setting, as well as present the work that we have done to integrate ontology development tools into common software development infrastructure used by those who are not experts in Ontology Engineering.
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...European Data Forum
PPP on Data & Executive Panel on Big Data, Introduction by Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate General for Communications Networks, Content and Technology at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Towards a Data Value Chain Partership in Europe.
Presentation on ICT trends in developments and what this means for the agri-food business, focussing on the FIspace platform. The presentation was part of the mastercourse Hortibusiness in which about 20 entrepreneurs from the horticultural business are participating.
apidays LIVE Paris - APIs for Governments: why, what and how by Monica Posada...apidays
apidays LIVE Paris - Responding to the New Normal with APIs for Business, People and Society
December 8, 9 & 10, 2020
APIs for Governments: why, what and how
Monica Posada, Project Manager of the API Study, Senior Researcher & Lorenzino Vaccari, Senior Researcher, External Consultant at the European Commission - Joint Research Centre
The Presentation of Hans-Jörg Lieder, Staatsbibliothek zu Berlin – Preußischer Kulturbesitz, at the BnF Information Day for Europeana Newspapers (November 2014).
EDF2014: Nicolas Lemcke Horst, Ambassador of the Danish Basic Data Programme,...European Data Forum
Invited Talk of Nicolas Lemcke Horst, Ambassador of the Danish Basic Data Programme, Agency for Digitisation, Ministry of Finance of Denmark: at the European Data Forum 2014, 19 March 2014 in Athens, Greece: Danish Basic Data
Refinement
Europeana Newspapers Workshop: A Gateway to European Newspapers Online. Research Information Infrastructures and the Future Role of Libraries.
LIBER 2013 Annual Conference, Bavarian State Library, 26-29 June 2013, Munich, Germany.
Open Data and Open Government at the local level: an example and thoughts fro...Marco Fioretti
My talk at the first National OGP Forum in Macedonia. More info and link to report at http://mfioretti.com/2014/11/skopje-first-national-open-government-partnership-forum/
The EUnetHTA perspective on the HTA databasePatrice Chalon
Presentation at HTAi Annual meeting 2017, panel session "Rescuing the HTA database – future options and challenges"
A central, international database for HTA reports and other HTA products is considered to be a vital source of information for healthcare researchers and stakeholders. The current HTA database, which contains over 15,000 documents submitted by numerous HTA agencies, was originally established by the International Network of Agencies for Health Technology Assessment (INAHTA) in 2007 and is available on the website of the UK Centre for Reviews and Dissemination (CRD). The database has so far been funded by the UK National Institute for Health Research (NIHR). Its existence is however endangered as future funding is unclear. If no alternative is found, it will no longer be maintained and only an archived version will be available. There would thus no longer be a single access point to HTA reports.
Previous research has indicated that more than 75% of HTA agencies use the HTA database and more than half adapt common HTA products from reports produced by other agencies.1 The lack of an HTA database would have a direct impact on these activities. Smaller HTA agencies would be particularly affected, as they often have insufficient resources to produce their own reports and rely on reports from larger agencies. The wider consequences should also be considered: for instance, the decreasing visibility of HTA reports would diminish their relevance. It may also become more difficult to establish collaborations between HTA agencies. The problem would thus affect the whole HTA community.
Often, however, a crisis also offers opportunities. The establishment of a new HTA database should include a re-evaluation of its structure and technical functions (e.g. inclusion of ongoing projects).
Structure of the session: Short presentations will be held to provide an overview of the different perspectives of the various HTA agencies and networks currently involved in the discussions on the future of the HTA database. The panel will focus on the initiatives to rescue the database and present the options for funding, hosting, structure and technical functions. There will also be a guided discussion on the possible solutions presented and the challenges faced.
Panel/Workshop outcome and objectives: At the end of the session, participants should be aware of the overall importance of the HTA database, the current status quo, and the potential features of a future HTA database.
OECD, 2nd Task Force Meeting on Charting Illicit Trade - Tamara SCHOTTEOECD Governance
This presentation by Tamara SCHOTTE was made at the 2nd Task Force Meeting on Charting Illicit Trade held on 5-7 March 2014. www.oecd.org/gov/risk/charting-illicit-trade-second-task-force-meeting.htm
The European Innovation Partnership on Water Online MarketplaceMartin Kaltenböck
Presentation about the 'The European Innovation Partnership (EIP) on Water Online Marketplace (http://www.eip-water.eu)' taking place on 09.02.2016 in the course of the EIP Water annual conference in Leeuwarden, The Nethetlands.
agINFRA vision after the end of the projectAndreas Drakos
The agINFRA project (http://www.aginfra.eu) lasted from the October 2011 to February 2015. This presentation shows the vision for after the end of the project
Presentation: BigDataEurope, by Martin Kaltenböck, Semantic Web Company (Austria), at the European Data Economy Workshop taking place back to back to SEMANTiCS2015 on 15 September 2015 in Vienna
Introduction: The Big Data Europe Project at the: CMG-AE Event: Big Data: Strategien, Technologien und Nutzen
19th of May 2015, Expat Center der Wirtschaftsagentur, Vienna, Austria
See: http://www.big-data-europe.eu
Ontology Engineering at Scale for Open City Data SharingOscar Corcho
Seminar at the School of Informatics, The University of Edinburgh.
In this talk we will present how we are applying ontology engineering principles and tools for the development of a set of shared vocabularies across municipalities in Spain, so that they can start homogenising the generation and publication of open data that may be useful for their own internal reuse as well as for third parties who want to develop applications reusing open data once and deploy them for all municipalities. We will discuss on the main challenges for ontology engineering that arise in this setting, as well as present the work that we have done to integrate ontology development tools into common software development infrastructure used by those who are not experts in Ontology Engineering.
EDF2014: Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate Ge...European Data Forum
PPP on Data & Executive Panel on Big Data, Introduction by Marta Nagy-Rothengass, Head of Unit Data Value Chain, Directorate General for Communications Networks, Content and Technology at the European Data Forum 2014, 20 March 2014 in Athens, Greece: Towards a Data Value Chain Partership in Europe.
Presentation on ICT trends in developments and what this means for the agri-food business, focussing on the FIspace platform. The presentation was part of the mastercourse Hortibusiness in which about 20 entrepreneurs from the horticultural business are participating.
apidays LIVE Paris - APIs for Governments: why, what and how by Monica Posada...apidays
apidays LIVE Paris - Responding to the New Normal with APIs for Business, People and Society
December 8, 9 & 10, 2020
APIs for Governments: why, what and how
Monica Posada, Project Manager of the API Study, Senior Researcher & Lorenzino Vaccari, Senior Researcher, External Consultant at the European Commission - Joint Research Centre
Facilitation of Information Sharing on Agricultural R&D in the SADC Regioniaaldafrika
Presentation on "Facilitation of Information Sharing on Agricultural R&D in
the SADC Region: Experiences and Experiments" made at the 2nd IAALD Africa Chapter Conference, 15 - 17 July 2009, Accra, Ghana
Presentation for a Chinese delegation from the Fujian province that did a study tour in The Netherlands. I presented the work LEI Wageningen UR is doing on Information Management & ICT in Agri-Food by highlighting project work.
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfGetInData
Recently we have observed the rise of open-source Large Language Models (LLMs) that are community-driven or developed by the AI market leaders, such as Meta (Llama3), Databricks (DBRX) and Snowflake (Arctic). On the other hand, there is a growth in interest in specialized, carefully fine-tuned yet relatively small models that can efficiently assist programmers in day-to-day tasks. Finally, Retrieval-Augmented Generation (RAG) architectures have gained a lot of traction as the preferred approach for LLMs context and prompt augmentation for building conversational SQL data copilots, code copilots and chatbots.
In this presentation, we will show how we built upon these three concepts a robust Data Copilot that can help to democratize access to company data assets and boost performance of everyone working with data platforms.
Why do we need yet another (open-source ) Copilot?
How can we build one?
Architecture and evaluation
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
Adjusting primitives for graph : SHORT REPORT / NOTESSubhajit Sahu
Graph algorithms, like PageRank Compressed Sparse Row (CSR) is an adjacency-list based graph representation that is
Multiply with different modes (map)
1. Performance of sequential execution based vs OpenMP based vector multiply.
2. Comparing various launch configs for CUDA based vector multiply.
Sum with different storage types (reduce)
1. Performance of vector element sum using float vs bfloat16 as the storage type.
Sum with different modes (reduce)
1. Performance of sequential execution based vs OpenMP based vector element sum.
2. Performance of memcpy vs in-place based CUDA based vector element sum.
3. Comparing various launch configs for CUDA based vector element sum (memcpy).
4. Comparing various launch configs for CUDA based vector element sum (in-place).
Sum with in-place strategies of CUDA mode (reduce)
1. Comparing various launch configs for CUDA based vector element sum (in-place).
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeWalaa Eldin Moustafa
Dynamic policy enforcement is becoming an increasingly important topic in today’s world where data privacy and compliance is a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) They are auto-generated from declarative data annotations. (2) They respect user-level consent and preferences (3) They are context-aware, encoding a different set of transformations for different use cases (4) They are portable; while the SQL logic is only implemented in one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
Techniques to optimize the pagerank algorithm usually fall in two categories. One is to try reducing the work per iteration, and the other is to try reducing the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged has the potential to save iteration time. Skipping in-identical vertices, with the same in-links, helps reduce duplicate computations and thus could help reduce iteration time. Road networks often have chains which can be short-circuited before pagerank computation to improve performance. Final ranks of chain nodes can be easily calculated. This could reduce both the iteration time, and the number of iterations. If a graph has no dangling nodes, pagerank of each strongly connected component can be computed in topological order. This could help reduce the iteration time, no. of iterations, and also enable multi-iteration concurrency in pagerank computation. The combination of all of the above methods is the STICD algorithm. [sticd] For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
Unleashing the Power of Data_ Choosing a Trusted Analytics Platform.pdfEnterprise Wired
In this guide, we'll explore the key considerations and features to look for when choosing a Trusted analytics platform that meets your organization's needs and delivers actionable intelligence you can trust.