Presentation of HOBBIT Joint Event Post-EDF 2016. Eindhoven, Netherlands
(This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688227.)
Enabling Edge Analytics of IoT Data: The Case of LoRaWANHong-Linh Truong
The document discusses enabling edge analytics of IoT data using the LoRaWAN network technology. It proposes the IoTRACE framework which augments LoRaWAN's software architecture to support edge computing and analytics of IoT data. This is done by extracting application data at network servers near the edge to enable localized analytics and data services for multiple stakeholders like farmers. A prototype demonstrates initial concepts by emulating LoRaWAN devices and sensors, and performing analytics on the edge using Node-RED flows that access data queues. Future work aims to refine the architectures and data models to support distributed edge-cloud analytics across networks.
OSFair2017 Workshop | Survey results towards a common EOSC catalogueOpen Science Fair
Donatella Castelli presents the survey results: "Towards a common EOSC catalogue" | OSFair2017 Workshop
Workshop title: How FAIR friendly is your data catalogue?
Workshop overview:
This workshop will build upon the work planned by the EOSCpilot data interoperability task and the BlueBridge workshop held on April 3 at the RDA meeting. We will investigate common mechanisms for interoperation of data catalogues that preserve established community standards, norms and resources, while simplifying the process of being/becoming FAIR. Can we have a simple interoperability architecture based on a common set of metadata types? What are the minimum metadata requirements to expose FAIR data to EOSC services and EOSC users?
DAY 3 - PARALLEL SESSION 6 & 7
GI2016 ppt charvat senslog api as tools for collection of big vgi dataIGN Vorstand
SensLog is an integrated solution for collecting and managing sensor data, including volunteered geographic information (VGI). It consists of a data model and server-side application that stores, analyzes, and publishes sensor and VGI data through web services. SensLog's database model is based on standardized models for sensor observations, and it provides APIs for both data producers and consumers to facilitate the collection and use of big VGI data.
ECPPM2016 - SemCat: Publishing and Accessing Building Product Information as ...Pieter Pauwels
Presentation at the 11th European Conference on Product and Process Modelling (2016), in Limassol, Cyprus. Presentation and article are authored by Gudni Gundason and Pieter Pauwels.
1) ProteomeXchange is a global database containing proteomics data from several repositories including PRIDE, MassIVE, and jPOST.
2) A new member, iProX, joined in 2017 and contains over 60 terabytes of data from China.
3) Usage of ProteomeXchange data is increasing, with PRIDE downloads growing from 50 terabytes in 2013 to over 295 terabytes in 2017.
Enabling Edge Analytics of IoT Data: The Case of LoRaWANHong-Linh Truong
The document discusses enabling edge analytics of IoT data using the LoRaWAN network technology. It proposes the IoTRACE framework which augments LoRaWAN's software architecture to support edge computing and analytics of IoT data. This is done by extracting application data at network servers near the edge to enable localized analytics and data services for multiple stakeholders like farmers. A prototype demonstrates initial concepts by emulating LoRaWAN devices and sensors, and performing analytics on the edge using Node-RED flows that access data queues. Future work aims to refine the architectures and data models to support distributed edge-cloud analytics across networks.
OSFair2017 Workshop | Survey results towards a common EOSC catalogueOpen Science Fair
Donatella Castelli presents the survey results: "Towards a common EOSC catalogue" | OSFair2017 Workshop
Workshop title: How FAIR friendly is your data catalogue?
Workshop overview:
This workshop will build upon the work planned by the EOSCpilot data interoperability task and the BlueBridge workshop held on April 3 at the RDA meeting. We will investigate common mechanisms for interoperation of data catalogues that preserve established community standards, norms and resources, while simplifying the process of being/becoming FAIR. Can we have a simple interoperability architecture based on a common set of metadata types? What are the minimum metadata requirements to expose FAIR data to EOSC services and EOSC users?
DAY 3 - PARALLEL SESSION 6 & 7
GI2016 ppt charvat senslog api as tools for collection of big vgi dataIGN Vorstand
SensLog is an integrated solution for collecting and managing sensor data, including volunteered geographic information (VGI). It consists of a data model and server-side application that stores, analyzes, and publishes sensor and VGI data through web services. SensLog's database model is based on standardized models for sensor observations, and it provides APIs for both data producers and consumers to facilitate the collection and use of big VGI data.
ECPPM2016 - SemCat: Publishing and Accessing Building Product Information as ...Pieter Pauwels
Presentation at the 11th European Conference on Product and Process Modelling (2016), in Limassol, Cyprus. Presentation and article are authored by Gudni Gundason and Pieter Pauwels.
1) ProteomeXchange is a global database containing proteomics data from several repositories including PRIDE, MassIVE, and jPOST.
2) A new member, iProX, joined in 2017 and contains over 60 terabytes of data from China.
3) Usage of ProteomeXchange data is increasing, with PRIDE downloads growing from 50 terabytes in 2013 to over 295 terabytes in 2017.
Repository Power: How Repositories can support Open Access Mandates (OR2015 O...OpenAIRE
OpenAIRE presentation at the Open Repositories Conference (OR2015), in Indianapolis, 10/Jun/2015 - Session - P4B: Supporting Open Scholarship and Open Science. Presented by Wolfram Horstmann (Univ. Goettingen) on behalf of the paper authors: Najla Rettberg, Jochen Schirrwagen, Pedro Principe, Eloy Rodrigues, José Carvalho, Paolo Manghi, Natalia Manola.
Barbara Skerritt - IPPOSI Patient Reported Outcome Measures conference Oct 2018ipposi
ICHOM's global outcomes benchmarking program called GLOBE aims to benchmark outcomes data across providers globally for cataract and hip/knee replacement surgeries. The program has collected data from over 50 sites in 8 countries for cataracts and 25 sites in 5 countries for hip/knee replacements, involving over 90,000 cataract patients and 7,000 joint replacement patients since 2016. The objectives of the GLOBE pilots were to demonstrate the feasibility of aggregating and collecting harmonized outcomes data across borders and to determine if meaningful outcome variations could be identified to stimulate quality improvement. The pilots provided insights into building an international benchmarking program and identified areas for future work.
ALLDATA 2015 - RDF Based Linked Data Management as a DaaS PlatformSeonho Kim
suggesting a way to manage linked data platform to be used domain specific applications
Best parper awarded - http://www.iaria.org/conferences2015/AwardsALLDATA15.html
- About the importance of Linked Data technologies to make real/practical of the benefits of Open Data.
- Linked Data, it's all about open data quality and Implementation of the applications/services
The document discusses the Austrian open government data portal data.gv.at. As of May 2014, the portal contained 1,228 datasets from 26 publishers, with 236 applications developed using the data and an average of 350 unique visitors per day. The document raises questions about how to achieve sustainable commitment from participating organizations, what strategic partnerships could help increase awareness of open government data outside of the open government community, and how to gain political leadership support needed for sustainable development of open data initiatives in Austria.
The U.S. Department of Commerce collects, processes and disseminates data on a range of issues that impact our nation. Having a host of data and ensuring that this data is open and accessible to all are two separate issues. This session will cover the Commerce Data Usability Project (CDUP) - a community-driven public-private partnership to help data scientists, programmers and other users to access open knowledge from our open data.
"Dude, where's my graph?" RDF Data Cubes for Clinical Trials DataMarc Andersen
This document discusses using RDF and linked data principles to display clinical trial results as interactive graphs and summary tables. It describes how RDF triples can represent clinical data and be rendered as directed graphs using D3.js. It also presents an interface with actions like "Describe", "Dimensions", and "Data" that build and display SPARQL queries of an RDF data cube, allowing linked exploration and visualization of results. Ongoing work in the PhUSE Semantic Technology Project aims to further specify the RDF data cube model and develop supporting R packages and documentation.
Biased Information Retrieval in Pharmaceutical Drug DevelopmentDr. Haxel Consult
Pharmaceutical companies are highly dependent on access to high quality information retrieval. Insufficient gathering and selection of scientific information could potentially impact corporate decision-making in a wrong direction.
To assess the value of external information retrieval services a number of third party information providers were contacted with two information research requests (within inflammatory diseases). The providers were asked to return with search results and search methodologies used. In the first search the interaction with the providers were kept at a minimal level, whereas in the second search the contact, direction, and interaction were increased.
It is concluded that information research results from different providers are variable. The expected increase in inter-homogeneity of results from the different providers could not be confirmed after the second search. The overall overlap of results was 38% for the first search and 33% for the second search, and surprisingly none of the references were found by all providers.
To fully cover the area of interest and to avoid bias it is recommended to perform exhaustive scientific literature searches. Researchers and decision-makers should accept large amounts of results from literature searches and promote initiatives to analyse these results in detail.
Local government web sites in Finland: A geographic and webometric analysisKim Holmberg
A webometric study about the interlinking between local government web sites in Finland. Paper presented at the 11th conference of International Society of Scientometrics and Informetrics in 2007 Madrid, Spain.
20140902 LinDa Workshop Semantincs2014 - LinDA Project OverviewLinDa_FP7
LinDa Project presentation - Challenges, tools, workplan and objectives
Presentation at LinDA Workshop on 2nd September 2014 at Semantics2014 by Spiros Mouzakitis
Diane Webb, President of BizInt, presented on new features for the BizInt Smart Charts product family at ICIC 2015 in Nice, France. Key updates included support for the new STN platform, moving clinical trial support to a new product, and bundling pipeline and clinical trial support. Enhancements were also made to existing databases for patents, literature, clinical trials, and drug pipelines. A new launch timeline visualization was introduced to plot drugs by estimated launch date.
SC7 Webinar 4 04/05/2017 SatCen Presentation "The Secure Societies Community ...BigData_Europe
The BigDataEurope project aims to integrate big data, software, and communities to address societal challenges in Europe. The EU SatCen is building a Secure Societies community, eliciting big data requirements, and implementing a Space and Security pilot for the project. The Secure Societies pilot uses Sentinel-1 satellite imagery, Twitter data, and Reuters news articles to detect changes and events. The change detection workflow analyzes Sentinel-1 imagery to detect areas with changes, while the event detection workflow monitors news and social media to cluster events and verify locations. The pilot is being optimized to improve scalability, add cybersecurity mechanisms, and enhance visualization tools.
A substantial group of companies use data center proxies to extract data from various hotel websites. In this category, competition is rapidly increasing.
OpenAIRE-Advance: Advancing Open Scholarship (Presentation at RDA 11th Plenary)OpenAIRE
Presentation by Natalia Manola, OpenAIRE Director, at RDA 11th Plenary BoF meeting - EOSC-related European Projects getting Global: Engaging with the RDA
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVABigData_Europe
The document discusses BigDataEurope, a H2020 CSA project that aims to lower barriers for using big data technologies and demonstrate societal value through 7 pilot use cases. It describes the Integrator Platform, which provides a flexible, generic platform for deploying big data value chains using open source solutions. The platform has been instantiated 7 times for the pilot uses cases. It also discusses synergies between BigDataEurope and the Big Data Value Association (BDVA) in advancing big data technical priorities.
Business context of FAIR health data networks - The Hyve - MEDINFO Lyon 2019Kees van Bochove
MedInfo Lyon 2019: FAIR Health Data Sharing Initiatives in Europe: Opportunities and Challenges for international cooperation (Ulli Prokosch, Thomas Ganslandt, Ulrich Sax, Christian Lovis, Carlos Luis Parra Calderón, Peter Rijnbeek, Nigel Hughes, Wiro Niessen, Barend Mons, Kees Van Bochove)
In the data-driven age of medical research many initiatives and projects focus on data linkage and functional integration as well as data reuse. Providing “FAIR data” is a common challenge for all of them. For this workshop leaders of six nation-wide and European initiatives/projects have joined in order to identify the major concepts, challenges and hurdles.
This presentation was used to provide the business context of FAIR data in health data networks.
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...Evangelos Kalampokis
The document discusses the OpenCube approach for working with linked data cubes. OpenCube develops components to support the full lifecycle of linked statistical data, from publishing raw data cubes to consuming them through analytics, visualizations, and other applications. It presents several components that have been implemented, including tools for publishing statistical data in various formats, browsing and visualizing data cubes, and integrating with R for advanced analytics. Initial evaluations of the components have provided insights around publishing and working with large linked statistical datasets.
Introduction to the Orléans/OGC INSPIRE Hackathon 2018plan4all
The document discusses the INSPIRE Hackathon, which is described as an ongoing process rather than a single event. The hackathon aims to merge overlaps and fill gaps in open data integration efforts across Europe and internationally. It provides an environment for experts of all ages and organizations to work on implementing open data policies. The hackathon also focuses on ensuring the continuity of open data integrations from projects and on building capacity around open data policy. Metadata is a focus area for the 2018 hackathon in Orleans, with goals like improving metadata standards and linking geospatial and open data communities.
A presentation of the HOBBIT Survey Results
(This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688227.)
Repository Power: How Repositories can support Open Access Mandates (OR2015 O...OpenAIRE
OpenAIRE presentation at the Open Repositories Conference (OR2015), in Indianapolis, 10/Jun/2015 - Session - P4B: Supporting Open Scholarship and Open Science. Presented by Wolfram Horstmann (Univ. Goettingen) on behalf of the paper authors: Najla Rettberg, Jochen Schirrwagen, Pedro Principe, Eloy Rodrigues, José Carvalho, Paolo Manghi, Natalia Manola.
Barbara Skerritt - IPPOSI Patient Reported Outcome Measures conference Oct 2018ipposi
ICHOM's global outcomes benchmarking program called GLOBE aims to benchmark outcomes data across providers globally for cataract and hip/knee replacement surgeries. The program has collected data from over 50 sites in 8 countries for cataracts and 25 sites in 5 countries for hip/knee replacements, involving over 90,000 cataract patients and 7,000 joint replacement patients since 2016. The objectives of the GLOBE pilots were to demonstrate the feasibility of aggregating and collecting harmonized outcomes data across borders and to determine if meaningful outcome variations could be identified to stimulate quality improvement. The pilots provided insights into building an international benchmarking program and identified areas for future work.
ALLDATA 2015 - RDF Based Linked Data Management as a DaaS PlatformSeonho Kim
suggesting a way to manage linked data platform to be used domain specific applications
Best parper awarded - http://www.iaria.org/conferences2015/AwardsALLDATA15.html
- About the importance of Linked Data technologies to make real/practical of the benefits of Open Data.
- Linked Data, it's all about open data quality and Implementation of the applications/services
The document discusses the Austrian open government data portal data.gv.at. As of May 2014, the portal contained 1,228 datasets from 26 publishers, with 236 applications developed using the data and an average of 350 unique visitors per day. The document raises questions about how to achieve sustainable commitment from participating organizations, what strategic partnerships could help increase awareness of open government data outside of the open government community, and how to gain political leadership support needed for sustainable development of open data initiatives in Austria.
The U.S. Department of Commerce collects, processes and disseminates data on a range of issues that impact our nation. Having a host of data and ensuring that this data is open and accessible to all are two separate issues. This session will cover the Commerce Data Usability Project (CDUP) - a community-driven public-private partnership to help data scientists, programmers and other users to access open knowledge from our open data.
"Dude, where's my graph?" RDF Data Cubes for Clinical Trials DataMarc Andersen
This document discusses using RDF and linked data principles to display clinical trial results as interactive graphs and summary tables. It describes how RDF triples can represent clinical data and be rendered as directed graphs using D3.js. It also presents an interface with actions like "Describe", "Dimensions", and "Data" that build and display SPARQL queries of an RDF data cube, allowing linked exploration and visualization of results. Ongoing work in the PhUSE Semantic Technology Project aims to further specify the RDF data cube model and develop supporting R packages and documentation.
Biased Information Retrieval in Pharmaceutical Drug DevelopmentDr. Haxel Consult
Pharmaceutical companies are highly dependent on access to high quality information retrieval. Insufficient gathering and selection of scientific information could potentially impact corporate decision-making in a wrong direction.
To assess the value of external information retrieval services a number of third party information providers were contacted with two information research requests (within inflammatory diseases). The providers were asked to return with search results and search methodologies used. In the first search the interaction with the providers were kept at a minimal level, whereas in the second search the contact, direction, and interaction were increased.
It is concluded that information research results from different providers are variable. The expected increase in inter-homogeneity of results from the different providers could not be confirmed after the second search. The overall overlap of results was 38% for the first search and 33% for the second search, and surprisingly none of the references were found by all providers.
To fully cover the area of interest and to avoid bias it is recommended to perform exhaustive scientific literature searches. Researchers and decision-makers should accept large amounts of results from literature searches and promote initiatives to analyse these results in detail.
Local government web sites in Finland: A geographic and webometric analysisKim Holmberg
A webometric study about the interlinking between local government web sites in Finland. Paper presented at the 11th conference of International Society of Scientometrics and Informetrics in 2007 Madrid, Spain.
20140902 LinDa Workshop Semantincs2014 - LinDA Project OverviewLinDa_FP7
LinDa Project presentation - Challenges, tools, workplan and objectives
Presentation at LinDA Workshop on 2nd September 2014 at Semantics2014 by Spiros Mouzakitis
Diane Webb, President of BizInt, presented on new features for the BizInt Smart Charts product family at ICIC 2015 in Nice, France. Key updates included support for the new STN platform, moving clinical trial support to a new product, and bundling pipeline and clinical trial support. Enhancements were also made to existing databases for patents, literature, clinical trials, and drug pipelines. A new launch timeline visualization was introduced to plot drugs by estimated launch date.
SC7 Webinar 4 04/05/2017 SatCen Presentation "The Secure Societies Community ...BigData_Europe
The BigDataEurope project aims to integrate big data, software, and communities to address societal challenges in Europe. The EU SatCen is building a Secure Societies community, eliciting big data requirements, and implementing a Space and Security pilot for the project. The Secure Societies pilot uses Sentinel-1 satellite imagery, Twitter data, and Reuters news articles to detect changes and events. The change detection workflow analyzes Sentinel-1 imagery to detect areas with changes, while the event detection workflow monitors news and social media to cluster events and verify locations. The pilot is being optimized to improve scalability, add cybersecurity mechanisms, and enhance visualization tools.
A substantial group of companies use data center proxies to extract data from various hotel websites. In this category, competition is rapidly increasing.
OpenAIRE-Advance: Advancing Open Scholarship (Presentation at RDA 11th Plenary)OpenAIRE
Presentation by Natalia Manola, OpenAIRE Director, at RDA 11th Plenary BoF meeting - EOSC-related European Projects getting Global: Engaging with the RDA
BDE-BDVA Webinar: BigDataEurope Overview & Synergies with BDVABigData_Europe
The document discusses BigDataEurope, a H2020 CSA project that aims to lower barriers for using big data technologies and demonstrate societal value through 7 pilot use cases. It describes the Integrator Platform, which provides a flexible, generic platform for deploying big data value chains using open source solutions. The platform has been instantiated 7 times for the pilot uses cases. It also discusses synergies between BigDataEurope and the Big Data Value Association (BDVA) in advancing big data technical priorities.
Business context of FAIR health data networks - The Hyve - MEDINFO Lyon 2019Kees van Bochove
MedInfo Lyon 2019: FAIR Health Data Sharing Initiatives in Europe: Opportunities and Challenges for international cooperation (Ulli Prokosch, Thomas Ganslandt, Ulrich Sax, Christian Lovis, Carlos Luis Parra Calderón, Peter Rijnbeek, Nigel Hughes, Wiro Niessen, Barend Mons, Kees Van Bochove)
In the data-driven age of medical research many initiatives and projects focus on data linkage and functional integration as well as data reuse. Providing “FAIR data” is a common challenge for all of them. For this workshop leaders of six nation-wide and European initiatives/projects have joined in order to identify the major concepts, challenges and hurdles.
This presentation was used to provide the business context of FAIR data in health data networks.
Creating and Utilizing Linked Open Statistical Data for the Development of Ad...Evangelos Kalampokis
The document discusses the OpenCube approach for working with linked data cubes. OpenCube develops components to support the full lifecycle of linked statistical data, from publishing raw data cubes to consuming them through analytics, visualizations, and other applications. It presents several components that have been implemented, including tools for publishing statistical data in various formats, browsing and visualizing data cubes, and integrating with R for advanced analytics. Initial evaluations of the components have provided insights around publishing and working with large linked statistical datasets.
Introduction to the Orléans/OGC INSPIRE Hackathon 2018plan4all
The document discusses the INSPIRE Hackathon, which is described as an ongoing process rather than a single event. The hackathon aims to merge overlaps and fill gaps in open data integration efforts across Europe and internationally. It provides an environment for experts of all ages and organizations to work on implementing open data policies. The hackathon also focuses on ensuring the continuity of open data integrations from projects and on building capacity around open data policy. Metadata is a focus area for the 2018 hackathon in Orleans, with goals like improving metadata standards and linking geospatial and open data communities.
A presentation of the HOBBIT Survey Results
(This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688227.)
The document discusses the Albuquerque Apprenticeship Academy which provides an alternative path to career success. The academy offers apprenticeship programs and works closely with the local community. It aims to help students find fulfilling careers through hands-on learning opportunities.
Čempionu Brokastis #25 / Ulrika Plotniece / "Par ko runā Lauvas?"NORD DDB RIGA
The document discusses key messages for standing for something, being consistent, accepting the unknown, and thinking about the future. It provides numbers for 43,000, 26, 400, 15,000, and 300+ that are not clearly defined but are presented alongside these messages. It repeats the messages multiple times and concludes by stating that creativity does matter.
Presentation given by Dr. Dimitris Gavrilis
Digital Curation Unit - IMIS, Athena Research Center
LoCloud Conference
Sharing local cultural heritage online with LoCloud services Amersfoort, Netherlands
5 February 2016
Metadata Quality Assurance Framework at QQML2016 conference - full versionPéter Király
This document presents a Metadata Quality Assurance Framework to measure and improve metadata quality. It analyzes typical metadata issues like non-informative fields and proposes measuring structural elements like completeness, cardinality, uniqueness, and language specification to predict record quality. Metrics are defined using a problem catalog of known issues mapped to discovery scenarios. Visualizations of early measurement results are shown to identify outliers and inform metadata improvements. The framework is intended to be scalable, transparent, and collaborative.
Indifference as Visual Self Ideology- Danut ZbarceaDanut Zbarcea
Indifference as Visual Self Ideology is a photography project by Danut Zbarcea from 2013 that explores indifference. The project is inspired by a quote from Jidu Krishnamurti that states our love for humanity is fictional if we do not know how to love others. Zbarcea's photo fragments from 2013 visually represent different forms of indifference.
The document describes an efficient approach called Aegle for discovering links between events represented as time intervals using Allen's interval algebra. Aegle expresses the 13 Allen relations using 8 atomic temporal relations between interval start and end points. It then efficiently computes the atomic relations by sorting interval start and end points. Experimental results show Aegle significantly outperforms the state-of-the-art in runtime, scaling to large real-world datasets with hundreds of thousands of events.
The document discusses the internship of Jessica Liang at Upstate Cardiology. It covers several topics:
1) Access to care was highly valued at Upstate Cardiology, which provided care regardless of ability to pay. Disease prevention was also promoted.
2) The referral system was crucial to services, but more convoluted between different health systems due to varying forms and missing information.
3) Overbooking appointments was problematic and reducing it would benefit physicians and patients through better use of time.
Presentation of the HOBBIT Project @ ESWC 2016.
(This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688227.)
The document discusses the HOBBIT platform for benchmarking big data platforms. It aims to provide a unified benchmarking platform as a community-driven effort. The platform will include reference datasets and implementations of key performance indicators to standardize benchmarking and allow comparison of results. It will focus on benchmarking tasks related to big linked data and the entire data lifecycle.
The document discusses holistic benchmarking of big linked data through the HOBBIT platform. It outlines the benchmark creation process which involves collecting industry data and measures to develop benchmarks. The benchmarks evaluate key performance indicators and tasks across solutions. The platform then allows participants to deploy and benchmark solutions. The document also describes the types of benchmarks offered including streaming, static, and realistic benchmarks. It invites readers to get involved by participating in surveys, joining the community, providing key performance indicators and datasets, and assisting with platform development.
An overview of the workshop as presented at the 1st International Workshop on Benchmarking Linked Data (BLINK).
(HOBBIT project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688227.)
Business and IoT Economic Alchemy or Another Anticlimax - March 2016 - OSGi A...mfrancis
OSGi Alliance presentation at CeBIT IoT Summit from March 2016.
Presented by Dr Richard Nicholson.
From the Environment to Manufacturing, from the Consumer to Education, from Finance to Government, IoT has the potential to transform markets and businesses and almost everything we know today. However, to achieve this change, one must first be cognizant of, and then address, the business and engineering challenges posed by IoT. Business as usual in IT will not deliver the promises of IoT!
This talk will review the fundamental characteristics required for any pervasive IoT solution to achieve this transformation and discuss the central importance of an industry standard for software modularity. These will be compared and contrasted against some of the hot IT trends of the last decade. The presentation will conclude with an overview of the OSGi Alliance and its activities within the IoT, along with industry examples and some opportunities for you to take advantage or and get involved with OSGi.
http://www.cebit.de/event/business-iot-economic-alchemy-or-another-anti-climax/KEY/70812
Use of Open Data in Hong Kong (LegCo 2014)Sammy Fung
Presentation on use of open data in HK given to Legislative Council Secretariat. Content is mixed from my presentations at startmeup 2013 and opendatahk meetup.
Complex Made Simple @ LF Energy Conference in ParisShane Coughlan
According to the documents:
1) Nuclear power supplied around 30% of Japan's national power in 2011 but only 9 reactors were back online as of June 2019 after the Fukushima disaster.
2) The OpenChain Project aims to create a simple, effective industry standard for open source license compliance across organizations through defining common processes and reference materials.
3) OpenChain activities have expanded to various countries and industries in 2019, including additional meetings in Japan, Korea, India, China, and Taiwan as well as an automotive work group and reference tooling work group.
Holistic Benchmarking of Big Linked Data: HOBBITGraph-TA
The document discusses the HOBBIT project, which aims to provide holistic benchmarking of big linked data technologies. It notes the large volumes of linked data and number of available technologies. HOBBIT will identify key performance indicators and benchmarks for critical steps in the linked data lifecycle, including generation, analysis, storage, and visualization. It will gather industry performance needs and provide a platform for standardized benchmarking and challenges to compare technology performance. The project is coordinated by researchers in Europe and runs for three years under the Horizon 2020 program.
Presentation of HOBBIT at Graph-TA
(This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688227.)
EDINA is a national data center in the UK that delivers geospatial data and services using open standards and open source software. It provides access to collections like Digimap and OpenBoundaries through web mapping applications and data downloads. EDINA uses open standards like OGC and open source software from OSGeo projects to build interoperable and resilient systems while reducing costs. This hybrid approach provides flexible and innovative services to users while meeting the needs of funders.
EDINA is a national data center in the UK that delivers geospatial data and services using open standards and open source software. It provides access to collections like Digimap and OpenBoundaries through web mapping applications and data downloads. EDINA uses open standards like OGC and open source software from projects in OSGeo to build robust and interoperable systems while reducing costs and increasing flexibility.
Local Weather Information and GNOME Shell ExtensionSammy Fung
This document provides information about an upcoming presentation on local weather information and GNOME Shell extensions. The presentation will discuss obtaining weather data from local meteorological observatories and making it available as open data. It will also cover creating weather widgets for the GNOME Shell desktop environment. The presenter has 15+ years experience in open source communities and organizing conferences in Asia and the US.
A summary of the workshop as presented at the 1st International Workshop on Benchmarking Linked Data (BLINK).
(HOBBIT project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 688227.)
Interview Co-Founder ReportLinker and Findout, Benjamin CarpanoReportLinker.com
ReportLinker and Findout use data analytics technologies to aggregate and normalize data from a wide range of sources to help researchers and analysts find relevant information more efficiently. The companies' core competency lies in both search and data analytics capabilities. Their natural language processing platform analyzes millions of documents daily to extract and structure concepts, relationships, and sentiments to improve users' productivity. Open data initiatives have helped the businesses by providing more content to incorporate, though most value comes from backend processing rather than raw data access. The companies aim to continue enhancing data discovery and contextualization of results.
The document discusses big data and Hadoop. It provides statistics on the growth of the big data market from IDC and Deloitte. It then discusses Hadoop in more detail, describing it as an open source software platform for distributed storage and processing of large datasets across clusters of commodity servers. The core components of Hadoop including HDFS for storage and MapReduce for processing are explained. Examples of companies using big data technologies like Hadoop are provided.
Direct and Indirect Procurement Transformation at BASF - 56554SAP Ariba Live 2018
Learn how BASF is transforming all procurement, supplier management, and source-to-contract processes for direct, indirect, and packaging spend. Having onboarded more than 20,000 suppliers, BASF is qualifying and segmenting them by region, commodity, and plant. The company is also enforcing that qualification in its sourcing processes.
Emerging Trends in Data Visualization and Dissemination discusses providing statistical data through application programming interfaces (APIs) and as a service rather than goods. It describes how mashups combine data from multiple sources into new applications and services. The document outlines benefits of mashups, how they work by retrieving data through APIs from different websites, and factors to consider when planning a mashup like data sources and programming languages. It provides examples of the United Nations' UNData and Comtrade initiatives that make international statistical databases freely available through APIs and web services.
A web-based plateform for cleaner production and industrial symbiosisGuillaume Massard
- Cost benefit analysis of resource efficiency solutions
- Identification of industrial symbiosis
- Material and energy flows data management for company manager and industrial park, cluster manager
Fraunhofer – SINTEF: towards an initiative on Data Sovereignty in EuropeThorsten Huelsmann
Fraunhofer and SINTEF jointed Industrial Data Space Association in early 2016. Industrial Data Space stands for safer data exchange between companies where the producer of data remains the owner of the data and maintains sovereignty over the use of that data.
IDS Association aims to define the conditions and governance for a reference architecture and interfaces aiming at international standards.
This standard is actively developed and updated on the basis of use cases. It forms the basis for a number of certified software solutions and business models, the development of which is fostered by the association.
Thorsten Huelsmann and Ernst H. Kristiansen talked on this topic during the German-Norwegian Dialogue on Bilateral and
European Cooperation , September 29 2016 at Berlin.
"EARL: Joint Entity and Relation Linking for Question Answering over Knowledge Graphs" as presented in Sthe 17th International Semantic Web Conference ISWC, 9th of October 2018, held in Monterey, California, USA
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"Benchmarking Big Linked Data: The case of the HOBBIT Project" as presented in the First International Workshop on Semantic Web Technologies for Health Data Management (SWH 2018), co-located with ISWC 2018, 9th October, 2018 held in Monterey, California, USA
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning Benchmark" as presented in SSWS 2018 co-located with the 17th International Semantic Web Conference ISWC, 9th of October 2018, held in Monterey, California, USA
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"The DEBS Grand Challenge 2018" as presented in the 12th ACM International Conference on Distributed and Event-Based Systems (DEBS 2017), 25 - 29 June, 2017 held in Hammilton, New Zeland
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"Benchmarking of distributed linked data streaming systems" as presented in the Stream Reasoning Workshop 2018, January 16-17, 2018, held by Department of Informatics DDIS (University of Zurich) in Zurich, Suisse
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"SQCFramework: SPARQL Query Containment Benchmarks Generation Framework" as presented in the 9th International Conference on Knowledge Capture(K-Cap 2017), December 4th-6th, 2017, held in Austin, USA.
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation" as presented in the 17th International Semantic Web Conference ISWC ( ournal track), 8th - 12th of October 2018, held in Monterey, California, USA
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"The DEBS Grand Challenge 2017" as presented in the The 11th ACM International Conference on Distributed and Event-Based Systems, 19 - 23 June, 2017 held in Barcelona, Spain
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QALD-9 Question Answering over Linked Data Challenge" as presented in the 17th International Semantic Web Conference ISWC, 8th - 12th of October 2018, held in Monterey, California, USA
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"Scalable Link Discovery for Modern Data-Driven Applications" poster presented ECAI 2016, September 2016, held in the Hague, Netherlands.
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"An Evaluation of Models for Runtime Approximation in Link Discovery" as presented in the IEEE/WIC/ACM WI, August 25th, 2017, held in Leipzig, Germany.
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"Scalable Link Discovery for Modern Data-Driven Applications" as presented in the 15th International Semantic Web Conference ISWC, Doctoral Consortium, October 18th, 2016, held in Kobe, Japan
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint Federation" as presented in SSWS 2018 co-located with the 17th International Semantic Web Conference ISWC, 8th - 12th of October 2018, held in Monterey, California, USA
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"SPgen: A Benchmark Generator for Spatial Link Discovery Tools" as presented in Ontology Matching (OM) hosted by the 17th International Semantic Web Conference ISWC, 8th - 12th of October 2018, held in Monterey, California, USA
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
"Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign" was presented in Ontology Matching (OM) hosted by the 17th International Semantic Web Conference ISWC, 8th - 12th of October 2018, held in Monterey, California, USA
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
OKE Challenge was hosted by European Semantic Web Conference ESWC, 3-7 June 2018, held in Heraklion, Crete, Greece (Aldemar Knossos Royal & Royal Villa).
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
MOCHA Challenge was hosted by European Semantic Web Conference ESWC, 3-7 June 2018, held in Heraklion, Crete, Greece (Aldemar Knossos Royal & Royal Villa).
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
Paper presented at European Semantic Web Conference ESWC, 3-7 June 2018, held in Heraklion, Crete, Greece (Aldemar Knossos Royal & Royal Villa).
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
HOBBIT project overview presented at European Big Data Value Forum, 21-23 Nov 2017, held in Versailles, France (Palais des Congres).
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
Leopard ISWC Semantic Web Challenge 2017 at ISWC2017.
This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227).
More from Holistic Benchmarking of Big Linked Data (20)
Use PyCharm for remote debugging of WSL on a Windo cf5c162d672e4e58b4dde5d797...shadow0702a
This document serves as a comprehensive step-by-step guide on how to effectively use PyCharm for remote debugging of the Windows Subsystem for Linux (WSL) on a local Windows machine. It meticulously outlines several critical steps in the process, starting with the crucial task of enabling permissions, followed by the installation and configuration of WSL.
The guide then proceeds to explain how to set up the SSH service within the WSL environment, an integral part of the process. Alongside this, it also provides detailed instructions on how to modify the inbound rules of the Windows firewall to facilitate the process, ensuring that there are no connectivity issues that could potentially hinder the debugging process.
The document further emphasizes on the importance of checking the connection between the Windows and WSL environments, providing instructions on how to ensure that the connection is optimal and ready for remote debugging.
It also offers an in-depth guide on how to configure the WSL interpreter and files within the PyCharm environment. This is essential for ensuring that the debugging process is set up correctly and that the program can be run effectively within the WSL terminal.
Additionally, the document provides guidance on how to set up breakpoints for debugging, a fundamental aspect of the debugging process which allows the developer to stop the execution of their code at certain points and inspect their program at those stages.
Finally, the document concludes by providing a link to a reference blog. This blog offers additional information and guidance on configuring the remote Python interpreter in PyCharm, providing the reader with a well-rounded understanding of the process.
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...IJECEIAES
Medical image analysis has witnessed significant advancements with deep learning techniques. In the domain of brain tumor segmentation, the ability to
precisely delineate tumor boundaries from magnetic resonance imaging (MRI)
scans holds profound implications for diagnosis. This study presents an ensemble convolutional neural network (CNN) with transfer learning, integrating
the state-of-the-art Deeplabv3+ architecture with the ResNet18 backbone. The
model is rigorously trained and evaluated, exhibiting remarkable performance
metrics, including an impressive global accuracy of 99.286%, a high-class accuracy of 82.191%, a mean intersection over union (IoU) of 79.900%, a weighted
IoU of 98.620%, and a Boundary F1 (BF) score of 83.303%. Notably, a detailed comparative analysis with existing methods showcases the superiority of
our proposed model. These findings underscore the model’s competence in precise brain tumor localization, underscoring its potential to revolutionize medical
image analysis and enhance healthcare outcomes. This research paves the way
for future exploration and optimization of advanced CNN models in medical
imaging, emphasizing addressing false positives and resource efficiency.
Rainfall intensity duration frequency curve statistical analysis and modeling...bijceesjournal
Using data from 41 years in Patna’ India’ the study’s goal is to analyze the trends of how often it rains on a weekly, seasonal, and annual basis (1981−2020). First, utilizing the intensity-duration-frequency (IDF) curve and the relationship by statistically analyzing rainfall’ the historical rainfall data set for Patna’ India’ during a 41 year period (1981−2020), was evaluated for its quality. Changes in the hydrologic cycle as a result of increased greenhouse gas emissions are expected to induce variations in the intensity, length, and frequency of precipitation events. One strategy to lessen vulnerability is to quantify probable changes and adapt to them. Techniques such as log-normal, normal, and Gumbel are used (EV-I). Distributions were created with durations of 1, 2, 3, 6, and 24 h and return times of 2, 5, 10, 25, and 100 years. There were also mathematical correlations discovered between rainfall and recurrence interval.
Findings: Based on findings, the Gumbel approach produced the highest intensity values, whereas the other approaches produced values that were close to each other. The data indicates that 461.9 mm of rain fell during the monsoon season’s 301st week. However, it was found that the 29th week had the greatest average rainfall, 92.6 mm. With 952.6 mm on average, the monsoon season saw the highest rainfall. Calculations revealed that the yearly rainfall averaged 1171.1 mm. Using Weibull’s method, the study was subsequently expanded to examine rainfall distribution at different recurrence intervals of 2, 5, 10, and 25 years. Rainfall and recurrence interval mathematical correlations were also developed. Further regression analysis revealed that short wave irrigation, wind direction, wind speed, pressure, relative humidity, and temperature all had a substantial influence on rainfall.
Originality and value: The results of the rainfall IDF curves can provide useful information to policymakers in making appropriate decisions in managing and minimizing floods in the study area.
Discover the latest insights on Data Driven Maintenance with our comprehensive webinar presentation. Learn about traditional maintenance challenges, the right approach to utilizing data, and the benefits of adopting a Data Driven Maintenance strategy. Explore real-world examples, industry best practices, and innovative solutions like FMECA and the D3M model. This presentation, led by expert Jules Oudmans, is essential for asset owners looking to optimize their maintenance processes and leverage digital technologies for improved efficiency and performance. Download now to stay ahead in the evolving maintenance landscape.
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Sinan KOZAK
Sinan from the Delivery Hero mobile infrastructure engineering team shares a deep dive into performance acceleration with Gradle build cache optimizations. Sinan shares their journey into solving complex build-cache problems that affect Gradle builds. By understanding the challenges and solutions found in our journey, we aim to demonstrate the possibilities for faster builds. The case study reveals how overlapping outputs and cache misconfigurations led to significant increases in build times, especially as the project scaled up with numerous modules using Paparazzi tests. The journey from diagnosing to defeating cache issues offers invaluable lessons on maintaining cache integrity without sacrificing functionality.
Embedded machine learning-based road conditions and driving behavior monitoringIJECEIAES
Car accident rates have increased in recent years, resulting in losses in human lives, properties, and other financial costs. An embedded machine learning-based system is developed to address this critical issue. The system can monitor road conditions, detect driving patterns, and identify aggressive driving behaviors. The system is based on neural networks trained on a comprehensive dataset of driving events, driving styles, and road conditions. The system effectively detects potential risks and helps mitigate the frequency and impact of accidents. The primary goal is to ensure the safety of drivers and vehicles. Collecting data involved gathering information on three key road events: normal street and normal drive, speed bumps, circular yellow speed bumps, and three aggressive driving actions: sudden start, sudden stop, and sudden entry. The gathered data is processed and analyzed using a machine learning system designed for limited power and memory devices. The developed system resulted in 91.9% accuracy, 93.6% precision, and 92% recall. The achieved inference time on an Arduino Nano 33 BLE Sense with a 32-bit CPU running at 64 MHz is 34 ms and requires 2.6 kB peak RAM and 139.9 kB program flash memory, making it suitable for resource-constrained embedded systems.
An improved modulation technique suitable for a three level flying capacitor ...IJECEIAES
This research paper introduces an innovative modulation technique for controlling a 3-level flying capacitor multilevel inverter (FCMLI), aiming to streamline the modulation process in contrast to conventional methods. The proposed
simplified modulation technique paves the way for more straightforward and
efficient control of multilevel inverters, enabling their widespread adoption and
integration into modern power electronic systems. Through the amalgamation of
sinusoidal pulse width modulation (SPWM) with a high-frequency square wave
pulse, this controlling technique attains energy equilibrium across the coupling
capacitor. The modulation scheme incorporates a simplified switching pattern
and a decreased count of voltage references, thereby simplifying the control
algorithm.
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...IJECEIAES
Climate change's impact on the planet forced the United Nations and governments to promote green energies and electric transportation. The deployments of photovoltaic (PV) and electric vehicle (EV) systems gained stronger momentum due to their numerous advantages over fossil fuel types. The advantages go beyond sustainability to reach financial support and stability. The work in this paper introduces the hybrid system between PV and EV to support industrial and commercial plants. This paper covers the theoretical framework of the proposed hybrid system including the required equation to complete the cost analysis when PV and EV are present. In addition, the proposed design diagram which sets the priorities and requirements of the system is presented. The proposed approach allows setup to advance their power stability, especially during power outages. The presented information supports researchers and plant owners to complete the necessary analysis while promoting the deployment of clean energy. The result of a case study that represents a dairy milk farmer supports the theoretical works and highlights its advanced benefits to existing plants. The short return on investment of the proposed approach supports the paper's novelty approach for the sustainable electrical system. In addition, the proposed system allows for an isolated power setup without the need for a transmission line which enhances the safety of the electrical network
Comparative analysis between traditional aquaponics and reconstructed aquapon...bijceesjournal
The aquaponic system of planting is a method that does not require soil usage. It is a method that only needs water, fish, lava rocks (a substitute for soil), and plants. Aquaponic systems are sustainable and environmentally friendly. Its use not only helps to plant in small spaces but also helps reduce artificial chemical use and minimizes excess water use, as aquaponics consumes 90% less water than soil-based gardening. The study applied a descriptive and experimental design to assess and compare conventional and reconstructed aquaponic methods for reproducing tomatoes. The researchers created an observation checklist to determine the significant factors of the study. The study aims to determine the significant difference between traditional aquaponics and reconstructed aquaponics systems propagating tomatoes in terms of height, weight, girth, and number of fruits. The reconstructed aquaponics system’s higher growth yield results in a much more nourished crop than the traditional aquaponics system. It is superior in its number of fruits, height, weight, and girth measurement. Moreover, the reconstructed aquaponics system is proven to eliminate all the hindrances present in the traditional aquaponics system, which are overcrowding of fish, algae growth, pest problems, contaminated water, and dead fish.
1. HOBBIT
in a Nutshell
Axel Ngonga
Horizon 2020
GA No 688227
01/12/2016–30/11/2018
Joint Event Post-EDF 2016
Eindhoven, Netherlands
July 1st, 2016
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 1 / 17
2. A Lot of Data
1
1http://www.ibmbigdatahub.com/infographic/four-vs-big-data
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 2 / 17
3. A Lot of Tools
2
2https://cdn.datafloq.com/cms/os_big_data_open_source_tools-v2.png
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 3 / 17
4. Core Questions
Developers: How good is my tool?
Vendors: Who is my tool good for?
Users: Which tool(s) should I use for
my application?
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 4 / 17
5. Many Questions
Where are the current bottlenecks?
Which steps of the data lifecycle are
critical?
Which solutions are available?
Which key performance indicators
are relevant?
How well do or should tools
perform?
How do existing solutions perform
w.r.t. relevant indicators?
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 5 / 17
10. HOBBIT
Rationale
A community-driven benchmarking framework for the community
Focus on Big Linked Data
Cover all steps of the Linked Data lifecycle
Used by a growing number of companies
Mature and maturing technologies
Open benchmarks based on industrial data
and use cases
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 8 / 17
11. HOBBIT
36-month project
Project begin: Dec. 1st, 2015
Project volume: ca. 4 million Euros
10 partners
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 9 / 17
12. Aims
1 Gather real requirements
Performance indicators
Performance thresholds
2 Provide universal benchmarking platform
Standardized hardware
Comparable results
3 Develop benchmarks based on real data
4 Periodic benchmarking challenges
5 Periodic reporting
6 Found independent Hobbit association
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 10 / 17
13. Overview
Data Collection
Industry
data
Measure Collection
Benchmark Creation
Benchmark 1
KPIs
Tasks
KPIs
Tasks
KPIs
Tasks
KPIs
Tasks
KPIs
Tasks
KPIs
Tasks
Benchmark 2
Benchmark n
HOBBIT
Platform
Solution 1
Solution k
Solution 2
Challenges
Reports
Participants/Community
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 11 / 17
14. We offer a benchmarking platform
Controller
Data
Generator
Task
Generator
Data
Generator
Data
Generator
Task
Generator
Task
Generator
FrontendSystem Adapter
System
data flow
creates component
Store
SPARQL
Endpoint
Analysis
Benchmark
Evaluator
Module
Eval. Store
Message Bus
Node
Observer
Logging
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 12 / 17
15. We offer a benchmarking platform
Addresses all steps of the Linked
Data Lifecycle
Benchmarks derived from industry
use cases
Real data under the benchmarks
Scalable size of benchmarks
Open-source implementation
Local instance on server cluster
Uses established deployment
technologies
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 13 / 17
16. We offer benchmarks
Streaming and static deterministic benchmarks
Realistic benchmarks
Controlled volume and velocity
Generation and Acquisition
Conversion of XML into RDF
Entity recognition and linking
Relation extraction
Analysis and Processing
Link Discovery
Machine Learning
Supervised and unsupervised
Storage and Curation
Triple stores
Versioning
Incl. updates
Visualization and Services
Question Answering
Faceted Browsing
Usage-based benchmarks
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 14 / 17
17. We offer datasets
Twitter7 dataset
ca. 476 million tweets
ca. 17 million users
ClueWeb12
ca. 733 million websites
1+ billion annotations
Printing Machinery
ca. 6.5 trillion events
1500 printing machines
LIVED
ca. 2.5 billion measurements
6 households, two years
Injection molding industry
ca. 120 million measurements
Traffic data archive
ca. 15 trillion speed measurements
100+ million road segments
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 15 / 17
18. We need ...
Your use cases
Participate in the survey
Join the HOBBIT community
Provide KPIs
Provide datasets
Join the platform development
Ngonga (InfAI) HOBBIT in a nutshell July 1st, 2016 16 / 17