SlideShare a Scribd company logo
THE DEBS 2018 Grand
Challenge
Vincenzo Gulisano, Zbigniew Jerzak, Pavel Smirnov,
Martin Strohbach, Holger Ziekow, Dimitris Zissis
Agenda
•This challenge problem 2018
•Evaluation process
•How this finale works
The DEBS Grand Challenge
 Started in 2011 (8th Grand Challenge in 2018)
 Provides a common ground and uniform evaluation
criteria for a competition aimed at both research
and industrial event-based systems.
New Endeavors in 2018
• Predictions about ship
routes based on AIS data
(movement etc.)
• Open solution: no
predefined algorithm
• Machine learning
combined with stream
processing
• Performance Award not
only about speed but also
prediction quality
The Problem
The Problem:
Predicting Arrival of Ships
• It is not always evident where a ship is going and
when it will be arriving
• AIS data include information about destination and
arrival time
• However
• It is entered by the crew so prone to human error and
inaccuracies (e.g. naming conventions)
• Often does not include intermediate stops on the way to
the destination
The Data
• Data provided by
MarineTraffic hosted by
the BIG Data Ocean, EU
Horizon 2020 project
• Data contains AIS
records (location,
movement etc.) over
approx. 3 month
The Data Details
• SHIP_ID is the anonymized id of the ship
• SHIPTYPE is defined according to the reference
• SPEED is measured in knots (divide value by 10)
• LON is the longitude of the current ship position
• LAT is the latitude of the current ship position
• COURSE is the direction in which ship moves
(see: https://en.wikipedia.org/wiki/Course_(navigation))
• HEADING (see: https://en.wikipedia.org/wiki/Course_(navigation))
• TIMESTAMP is the time at which the message was sent (UTC)
• DEPARTURE_PORT_NAME is the name of the last port visited by the vessel
• REPORTED_DRAUGHT of a ship's hull is the vertical distance between the
waterline and the bottom of the hull
(keel) https://en.wikipedia.org/wiki/Draft_(hull)
The AIS is a collaborative, self-reporting system
for marine vessels to broadcast their
Problem Details
• We consider a list of ports
that ships leave and enter
• Leaving and entering is
defined trough a zone
around the port center
• Predictions are to be made
while ships move between
ports
• Where is the ship going?
• When will the ship arrive?
• Training data is provided to
build the solution
Query 1: Predicting Destination of
Vessels
• For each incoming tuple you must provide a
prediction for the destination
• A good solution sticks to the correct prediction as
early as possible
consecutive correct predictions
prior to arrival
𝐴 =
𝑘=0
𝑁𝑡𝑟𝑖𝑝𝑠 𝑙𝑒𝑛(𝑙𝑎𝑠𝑡 𝑐𝑜𝑟𝑟𝑒𝑐𝑡 𝑠𝑒𝑞𝑢𝑒𝑛𝑐𝑒)
𝑙𝑒𝑛(𝑎𝑙𝑙 𝑡𝑢𝑝𝑙𝑒𝑠 𝑜𝑓 𝑎 𝑡𝑟𝑖𝑝)
𝑁𝑡𝑟𝑖𝑝𝑠
Query 2: Predicting Arrival Times
of Vessels
• For each incoming tuple you must predict when
this ship will arrive at a listed destination
• A good solution has a small average error
𝐴 = 𝑖=0
𝑁𝑡𝑢𝑝𝑙𝑒𝑠
𝑎𝑏𝑠(𝑒𝑟𝑟𝑜𝑟 𝑝𝑒𝑟 𝑡𝑢𝑝𝑙𝑒)
𝑁𝑢𝑝𝑙𝑒𝑠
Sample Tracks
The overall Score
• For each query solutions are ranked by
• (A) prediction quality
• (B) by speed (total runtime)
• Prediction quality is weighted 75% and speed 25%
 avoiding fast meaningless solutions
• Final rank is the average of Query 1 and 2
25.10.2018 14
The HOBBIT platform
23
.
1
5
6Customer
Requires ranking of alternative
solutions by some KPI
Solution provider (vendor)
(e.g. DB, Streaming Platforms, ML
frameworks, etc…)
Customer
Requires ranking of alternative
solutions by some KPI
Benchmark customer
Requires ranking of alternative
solutions by some KPI
Provides:
1. Manual benchmark executions
2. Bulk executions (challenge)
3. GUI, Leaderboards
Advantages:
1. Batch/streaming benchmarks
2. Docker virtualization
3. RDF-enabled
Submit
benchmarks
Submit
systems
The HOBBIT platform
(online or local instance)
25.10.2018 15
The HOBBIT platform. Architecture
(Simplified for Grand Challenge)
Benchmark components Platform components
1
2
3.1
3.2
4
5
6
The online platform:
http://master.project-hobbit.eu/
Cluster: 6 nodes, each is
2×64 bit Intel Xeon E5-2630v3
(8-Cores, 2.4 GHz, HT, 20MB
Cache, each proc.), 256 GB RAM,
1Gb Ethernet
Nodes (benchmark/system): 3/3
https://github.com/hobbit-project/platform/wiki/Overview
7
http://github.com/hobbit-project/platform
System
components
25.10.2018 16
The DEBS GC 2018 Benchmark
Sequential loop for each ship,
but parallel loops for ships
(> 500 ships)
Benchmark
Controller Task Generator
Evaluation storage /
Evaluation module
Benchmarked
System
Acknowledgement (encrypted task ID)
Deploys
Data tuple with task ID
Expected responses
Result with task ID
Result model (KPIs)
Deploys
Pairs evaluation
Start signal
Termination signal
System finished signal
Generation finished signal
http://github.com/hobbit-project/sml-benchmark-v2
Dataset Statistics
• 2 Input sets (a and b) available for GC:
○ about 500.000 tuples each
○ timespan: approx. 3 month
○ about 500 ships in each set (100 overlapping) about 900 in total
• Initial released data set
○ First release: data set a: 80% released, 5% online training with
HOBBIT, kept 15% for final testing
• Second release (after you have running system):
○ data set b: 70% released, 30% kept for final testing
• Final Benchmark
○ 15% from set a + 30% from set be for final benchmark
25.10.2018 18
Challenges
25.10.2018 19
Training phase
2018 Statistics
17 Teams
registered in
EasyChair
9 Teams passed the
checks for inclusion
in DEBS
4 finalists
in this session
(5 others as posters)
(paper review)
Top 2 determined
by Bechmark
Top 2 determined
by reviews
Scores
Winner
2 of 8 working systems did not finish processing within
time limit
Awards
• $1000 “Grand Challenge Award”
• for the best performing submission
Make your
vote count!
Audience Awards (Voted for by you!)
• Most interesting/appealing solution
• Voting boxes and valleys at the exits
Awards will be announced during the banquett
ACKNOWLEDGMENTS
This year’s Grand Challenge is co-organized by the HOBBIT project
(https://project-hobbit.eu/) represented by AGT International (http:
//www.agtinternational.com/). We would like to thank MarineTraffc
(https://marinetrafc.com) and the BigDataOcean project (EU
Horizon 2020, grant agreement number 732310) for providing the
dataset for this challenge. The automated evaluation platform was
provided by the HOBBIT project, while the data will be hosted
on the BigDataOcean project. The HOBBIT project has received
funding from the European Union’s H2020 research and innovation action
program under grant agreement number 688227, while
the BigDataOcean project has received funding from the European
Union’s H2020 research and innovation action program under grant
agreement number 732310.

More Related Content

Similar to The DEBS Grand Challenge 2018

Introduction to Civil Infrastructure Platform
Introduction to Civil Infrastructure PlatformIntroduction to Civil Infrastructure Platform
Introduction to Civil Infrastructure Platform
SZ Lin
 
The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017
Holistic Benchmarking of Big Linked Data
 
Harpster, J. - Open data on buildings with satellite imagery processing
Harpster, J. - Open data on buildings with satellite imagery processingHarpster, J. - Open data on buildings with satellite imagery processing
Harpster, J. - Open data on buildings with satellite imagery processing
OECDregions
 
Value Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the TrenchesValue Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the Trenches
DevOps.com
 
Marine Air Ground Task Force Command & Control Systems Software Deployment an...
Marine Air Ground Task Force Command & Control Systems Software Deployment an...Marine Air Ground Task Force Command & Control Systems Software Deployment an...
Marine Air Ground Task Force Command & Control Systems Software Deployment an...
LaurenWendler
 
Seven Degrees Presentation for 2015 ICEAA
Seven Degrees Presentation for 2015 ICEAASeven Degrees Presentation for 2015 ICEAA
Seven Degrees Presentation for 2015 ICEAA
James Lawlor
 
Automated traffic control system
Automated traffic control systemAutomated traffic control system
Automated traffic control system
Likithe
 
OpenCms Days 2015 Arkema, a leading chemicals company
OpenCms Days 2015 Arkema, a leading chemicals companyOpenCms Days 2015 Arkema, a leading chemicals company
OpenCms Days 2015 Arkema, a leading chemicals company
Alkacon Software GmbH & Co. KG
 
Scottish Urban Air Qualtiy Steering Group - Modelling & Monitoring Workshop -...
Scottish Urban Air Qualtiy Steering Group - Modelling & Monitoring Workshop -...Scottish Urban Air Qualtiy Steering Group - Modelling & Monitoring Workshop -...
Scottish Urban Air Qualtiy Steering Group - Modelling & Monitoring Workshop -...
STEP_scotland
 
Advanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITIAdvanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITI
Innovation Enterprise
 
Kickoff Presentation
Kickoff PresentationKickoff Presentation
Kickoff Presentation
CodeTN
 
"Click here" to build your UAV
"Click here" to build your UAV"Click here" to build your UAV
"Click here" to build your UAV
Dirk Gorissen
 
Sta by usha_mehta
Sta by usha_mehtaSta by usha_mehta
Sta by usha_mehta
Usha Mehta
 
Truck planning: how to certify the right route
Truck planning: how to certify the right routeTruck planning: how to certify the right route
Truck planning: how to certify the right route
Speck&Tech
 
Sbst2018 contest2018
Sbst2018 contest2018Sbst2018 contest2018
Sbst2018 contest2018
Annibale Panichella
 
Short.course.introduction.to.vhdl for beginners
Short.course.introduction.to.vhdl for beginners Short.course.introduction.to.vhdl for beginners
Short.course.introduction.to.vhdl for beginners
Ravi Sony
 
Benchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systemsBenchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systems
Holistic Benchmarking of Big Linked Data
 
#FiaComit - Digital Processing at the Workface
#FiaComit - Digital Processing at the Workface#FiaComit - Digital Processing at the Workface
#FiaComit - Digital Processing at the Workface
Comit Projects Ltd
 
The data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architecturesThe data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architectures
Vincenzo Gulisano
 
SIMUL8 User Group - Visual8 Case Study - Plywood Manufacturing.
SIMUL8 User Group - Visual8 Case Study - Plywood Manufacturing. SIMUL8 User Group - Visual8 Case Study - Plywood Manufacturing.
SIMUL8 User Group - Visual8 Case Study - Plywood Manufacturing.
SIMUL8 Corporation
 

Similar to The DEBS Grand Challenge 2018 (20)

Introduction to Civil Infrastructure Platform
Introduction to Civil Infrastructure PlatformIntroduction to Civil Infrastructure Platform
Introduction to Civil Infrastructure Platform
 
The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017
 
Harpster, J. - Open data on buildings with satellite imagery processing
Harpster, J. - Open data on buildings with satellite imagery processingHarpster, J. - Open data on buildings with satellite imagery processing
Harpster, J. - Open data on buildings with satellite imagery processing
 
Value Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the TrenchesValue Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the Trenches
 
Marine Air Ground Task Force Command & Control Systems Software Deployment an...
Marine Air Ground Task Force Command & Control Systems Software Deployment an...Marine Air Ground Task Force Command & Control Systems Software Deployment an...
Marine Air Ground Task Force Command & Control Systems Software Deployment an...
 
Seven Degrees Presentation for 2015 ICEAA
Seven Degrees Presentation for 2015 ICEAASeven Degrees Presentation for 2015 ICEAA
Seven Degrees Presentation for 2015 ICEAA
 
Automated traffic control system
Automated traffic control systemAutomated traffic control system
Automated traffic control system
 
OpenCms Days 2015 Arkema, a leading chemicals company
OpenCms Days 2015 Arkema, a leading chemicals companyOpenCms Days 2015 Arkema, a leading chemicals company
OpenCms Days 2015 Arkema, a leading chemicals company
 
Scottish Urban Air Qualtiy Steering Group - Modelling & Monitoring Workshop -...
Scottish Urban Air Qualtiy Steering Group - Modelling & Monitoring Workshop -...Scottish Urban Air Qualtiy Steering Group - Modelling & Monitoring Workshop -...
Scottish Urban Air Qualtiy Steering Group - Modelling & Monitoring Workshop -...
 
Advanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITIAdvanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITI
 
Kickoff Presentation
Kickoff PresentationKickoff Presentation
Kickoff Presentation
 
"Click here" to build your UAV
"Click here" to build your UAV"Click here" to build your UAV
"Click here" to build your UAV
 
Sta by usha_mehta
Sta by usha_mehtaSta by usha_mehta
Sta by usha_mehta
 
Truck planning: how to certify the right route
Truck planning: how to certify the right routeTruck planning: how to certify the right route
Truck planning: how to certify the right route
 
Sbst2018 contest2018
Sbst2018 contest2018Sbst2018 contest2018
Sbst2018 contest2018
 
Short.course.introduction.to.vhdl for beginners
Short.course.introduction.to.vhdl for beginners Short.course.introduction.to.vhdl for beginners
Short.course.introduction.to.vhdl for beginners
 
Benchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systemsBenchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systems
 
#FiaComit - Digital Processing at the Workface
#FiaComit - Digital Processing at the Workface#FiaComit - Digital Processing at the Workface
#FiaComit - Digital Processing at the Workface
 
The data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architecturesThe data streaming processing paradigm and its use in modern fog architectures
The data streaming processing paradigm and its use in modern fog architectures
 
SIMUL8 User Group - Visual8 Case Study - Plywood Manufacturing.
SIMUL8 User Group - Visual8 Case Study - Plywood Manufacturing. SIMUL8 User Group - Visual8 Case Study - Plywood Manufacturing.
SIMUL8 User Group - Visual8 Case Study - Plywood Manufacturing.
 

More from Holistic Benchmarking of Big Linked Data

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
Holistic Benchmarking of Big Linked Data
 
Benchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT ProjectBenchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT Project
Holistic Benchmarking of Big Linked Data
 
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Holistic Benchmarking of Big Linked Data
 
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation FrameworkSQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
Holistic Benchmarking of Big Linked Data
 
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federationLargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
Holistic Benchmarking of Big Linked Data
 
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
Holistic Benchmarking of Big Linked Data
 
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Holistic Benchmarking of Big Linked Data
 
An Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link DiscoveryAn Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link Discovery
Holistic Benchmarking of Big Linked Data
 
Scalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven ApplicationsScalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven Applications
Holistic Benchmarking of Big Linked Data
 
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F... Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Holistic Benchmarking of Big Linked Data
 
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery ToolsSPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
Holistic Benchmarking of Big Linked Data
 
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignIntroducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Holistic Benchmarking of Big Linked Data
 
OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018
Holistic Benchmarking of Big Linked Data
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
Holistic Benchmarking of Big Linked Data
 
Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018
Holistic Benchmarking of Big Linked Data
 
Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017
Holistic Benchmarking of Big Linked Data
 
Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)
Holistic Benchmarking of Big Linked Data
 
Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017
Holistic Benchmarking of Big Linked Data
 
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017. Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
Holistic Benchmarking of Big Linked Data
 
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
Holistic Benchmarking of Big Linked Data
 

More from Holistic Benchmarking of Big Linked Data (20)

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
 
Benchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT ProjectBenchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT Project
 
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
 
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation FrameworkSQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
 
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federationLargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
 
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
 
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)
 
An Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link DiscoveryAn Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link Discovery
 
Scalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven ApplicationsScalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven Applications
 
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F... Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery ToolsSPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
 
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignIntroducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
 
OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
 
Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018
 
Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017
 
Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)
 
Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017Leopard ISWC Semantic Web Challenge 2017
Leopard ISWC Semantic Web Challenge 2017
 
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017. Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK  ISWC2017.
Benchmarking Link Discovery Systems for Geo-Spatial Data - BLINK ISWC2017.
 
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
Instance Matching Benchmarks in the ERA of Linked Data - ISWC2017
 

Recently uploaded

Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
saastr
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
Dinusha Kumarasiri
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Operating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptxOperating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptx
Pravash Chandra Das
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
LucaBarbaro3
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
HarisZaheer8
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
SitimaJohn
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
alexjohnson7307
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
DanBrown980551
 

Recently uploaded (20)

Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStrDeep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
Deep Dive: Getting Funded with Jason Jason Lemkin Founder & CEO @ SaaStr
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Azure API Management to expose backend services securely
Azure API Management to expose backend services securelyAzure API Management to expose backend services securely
Azure API Management to expose backend services securely
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Operating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptxOperating System Used by Users in day-to-day life.pptx
Operating System Used by Users in day-to-day life.pptx
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
Trusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process MiningTrusted Execution Environment for Decentralized Process Mining
Trusted Execution Environment for Decentralized Process Mining
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
AWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptxAWS Cloud Cost Optimization Presentation.pptx
AWS Cloud Cost Optimization Presentation.pptx
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptxOcean lotus Threat actors project by John Sitima 2024 (1).pptx
Ocean lotus Threat actors project by John Sitima 2024 (1).pptx
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
leewayhertz.com-AI in predictive maintenance Use cases technologies benefits ...
 
5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides5th LF Energy Power Grid Model Meet-up Slides
5th LF Energy Power Grid Model Meet-up Slides
 

The DEBS Grand Challenge 2018

  • 1. THE DEBS 2018 Grand Challenge Vincenzo Gulisano, Zbigniew Jerzak, Pavel Smirnov, Martin Strohbach, Holger Ziekow, Dimitris Zissis
  • 2. Agenda •This challenge problem 2018 •Evaluation process •How this finale works
  • 3. The DEBS Grand Challenge  Started in 2011 (8th Grand Challenge in 2018)  Provides a common ground and uniform evaluation criteria for a competition aimed at both research and industrial event-based systems.
  • 4. New Endeavors in 2018 • Predictions about ship routes based on AIS data (movement etc.) • Open solution: no predefined algorithm • Machine learning combined with stream processing • Performance Award not only about speed but also prediction quality
  • 6. The Problem: Predicting Arrival of Ships • It is not always evident where a ship is going and when it will be arriving • AIS data include information about destination and arrival time • However • It is entered by the crew so prone to human error and inaccuracies (e.g. naming conventions) • Often does not include intermediate stops on the way to the destination
  • 7. The Data • Data provided by MarineTraffic hosted by the BIG Data Ocean, EU Horizon 2020 project • Data contains AIS records (location, movement etc.) over approx. 3 month
  • 8. The Data Details • SHIP_ID is the anonymized id of the ship • SHIPTYPE is defined according to the reference • SPEED is measured in knots (divide value by 10) • LON is the longitude of the current ship position • LAT is the latitude of the current ship position • COURSE is the direction in which ship moves (see: https://en.wikipedia.org/wiki/Course_(navigation)) • HEADING (see: https://en.wikipedia.org/wiki/Course_(navigation)) • TIMESTAMP is the time at which the message was sent (UTC) • DEPARTURE_PORT_NAME is the name of the last port visited by the vessel • REPORTED_DRAUGHT of a ship's hull is the vertical distance between the waterline and the bottom of the hull (keel) https://en.wikipedia.org/wiki/Draft_(hull) The AIS is a collaborative, self-reporting system for marine vessels to broadcast their
  • 9. Problem Details • We consider a list of ports that ships leave and enter • Leaving and entering is defined trough a zone around the port center • Predictions are to be made while ships move between ports • Where is the ship going? • When will the ship arrive? • Training data is provided to build the solution
  • 10. Query 1: Predicting Destination of Vessels • For each incoming tuple you must provide a prediction for the destination • A good solution sticks to the correct prediction as early as possible consecutive correct predictions prior to arrival 𝐴 = 𝑘=0 𝑁𝑡𝑟𝑖𝑝𝑠 𝑙𝑒𝑛(𝑙𝑎𝑠𝑡 𝑐𝑜𝑟𝑟𝑒𝑐𝑡 𝑠𝑒𝑞𝑢𝑒𝑛𝑐𝑒) 𝑙𝑒𝑛(𝑎𝑙𝑙 𝑡𝑢𝑝𝑙𝑒𝑠 𝑜𝑓 𝑎 𝑡𝑟𝑖𝑝) 𝑁𝑡𝑟𝑖𝑝𝑠
  • 11. Query 2: Predicting Arrival Times of Vessels • For each incoming tuple you must predict when this ship will arrive at a listed destination • A good solution has a small average error 𝐴 = 𝑖=0 𝑁𝑡𝑢𝑝𝑙𝑒𝑠 𝑎𝑏𝑠(𝑒𝑟𝑟𝑜𝑟 𝑝𝑒𝑟 𝑡𝑢𝑝𝑙𝑒) 𝑁𝑢𝑝𝑙𝑒𝑠
  • 13. The overall Score • For each query solutions are ranked by • (A) prediction quality • (B) by speed (total runtime) • Prediction quality is weighted 75% and speed 25%  avoiding fast meaningless solutions • Final rank is the average of Query 1 and 2
  • 14. 25.10.2018 14 The HOBBIT platform 23 . 1 5 6Customer Requires ranking of alternative solutions by some KPI Solution provider (vendor) (e.g. DB, Streaming Platforms, ML frameworks, etc…) Customer Requires ranking of alternative solutions by some KPI Benchmark customer Requires ranking of alternative solutions by some KPI Provides: 1. Manual benchmark executions 2. Bulk executions (challenge) 3. GUI, Leaderboards Advantages: 1. Batch/streaming benchmarks 2. Docker virtualization 3. RDF-enabled Submit benchmarks Submit systems The HOBBIT platform (online or local instance)
  • 15. 25.10.2018 15 The HOBBIT platform. Architecture (Simplified for Grand Challenge) Benchmark components Platform components 1 2 3.1 3.2 4 5 6 The online platform: http://master.project-hobbit.eu/ Cluster: 6 nodes, each is 2×64 bit Intel Xeon E5-2630v3 (8-Cores, 2.4 GHz, HT, 20MB Cache, each proc.), 256 GB RAM, 1Gb Ethernet Nodes (benchmark/system): 3/3 https://github.com/hobbit-project/platform/wiki/Overview 7 http://github.com/hobbit-project/platform System components
  • 16. 25.10.2018 16 The DEBS GC 2018 Benchmark Sequential loop for each ship, but parallel loops for ships (> 500 ships) Benchmark Controller Task Generator Evaluation storage / Evaluation module Benchmarked System Acknowledgement (encrypted task ID) Deploys Data tuple with task ID Expected responses Result with task ID Result model (KPIs) Deploys Pairs evaluation Start signal Termination signal System finished signal Generation finished signal http://github.com/hobbit-project/sml-benchmark-v2
  • 17. Dataset Statistics • 2 Input sets (a and b) available for GC: ○ about 500.000 tuples each ○ timespan: approx. 3 month ○ about 500 ships in each set (100 overlapping) about 900 in total • Initial released data set ○ First release: data set a: 80% released, 5% online training with HOBBIT, kept 15% for final testing • Second release (after you have running system): ○ data set b: 70% released, 30% kept for final testing • Final Benchmark ○ 15% from set a + 30% from set be for final benchmark
  • 20. 2018 Statistics 17 Teams registered in EasyChair 9 Teams passed the checks for inclusion in DEBS 4 finalists in this session (5 others as posters) (paper review) Top 2 determined by Bechmark Top 2 determined by reviews
  • 21. Scores Winner 2 of 8 working systems did not finish processing within time limit
  • 22. Awards • $1000 “Grand Challenge Award” • for the best performing submission Make your vote count! Audience Awards (Voted for by you!) • Most interesting/appealing solution • Voting boxes and valleys at the exits Awards will be announced during the banquett
  • 23. ACKNOWLEDGMENTS This year’s Grand Challenge is co-organized by the HOBBIT project (https://project-hobbit.eu/) represented by AGT International (http: //www.agtinternational.com/). We would like to thank MarineTraffc (https://marinetrafc.com) and the BigDataOcean project (EU Horizon 2020, grant agreement number 732310) for providing the dataset for this challenge. The automated evaluation platform was provided by the HOBBIT project, while the data will be hosted on the BigDataOcean project. The HOBBIT project has received funding from the European Union’s H2020 research and innovation action program under grant agreement number 688227, while the BigDataOcean project has received funding from the European Union’s H2020 research and innovation action program under grant agreement number 732310.