SlideShare a Scribd company logo
1 of 51
Download to read offline
Unified Benchmarking of Big Data Platforms
The HOBBIT Platform
Axel-Cyrille Ngonga Ngomo
Horizon 2020
GA No 688227
01/12/2016–30/11/2018
Apache Big Data
Sevilla, Spain
November 15, 2016
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 1 / 44
Summary
Rationale
A community-driven unified benchmarking platform for the community
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 2 / 44
Summary
Rationale
A community-driven unified benchmarking platform for the community
Focus on Big (Linked) Data
Provide benchmarks and baselines
Provide reference implementation of KPIs
Extensible and referenceable
Result analysis
Open-Source
http://project-hobbit.eu
@hobbit_project
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 2 / 44
A Lot of Data
1
1http://www.ibmbigdatahub.com/infographic/four-vs-big-data
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 3 / 44
A Lot of Tools
2
2https://cloudramblings.me/
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 4 / 44
A Lot ... of Tools
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 5 / 44
Questions
Developers: How good is my tool?
Vendors: Who is my tool good for?
Users: Which tool(s) should I use for
my application?
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 6 / 44
Many Questions
Where are the current bottlenecks?
Which steps of the data lifecycle are
critical?
Which solutions are available?
Which key performance indicators
are relevant?
How well do or should tools
perform?
How do existing solutions perform
w.r.t. relevant indicators?
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 7 / 44
A Lot of Views
4
4https://steemit.com/philosophy/@l0k1/
subjectivity-and-truth-how-blockchains-model-consensus-buildingNgonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 8 / 44
Solution
Benchmark
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 9 / 44
Solution
Benchmark
Components
Dataset(s), e.g., Twitter stream, sensor data
Task(s), e.g., entity recognition, storage
Performance indicators, e.g., precision, recall, queries per second
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 9 / 44
Solution
Benchmark
TPCH-H (3,000 GB Results): −5.6 × 106
QphH between 2014 and 20165
QALD: ≈ 5% increase in Micro F-Measure
ACE2004: ≈ 6% increase in Micro F-measure
5http://www.tpc.org/tpch/results/tpch_perf_results.asp
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 10 / 44
Challenges
Dataset and KPI Mismatch
Year
ACE
Wiki
AQUAINT
MSNBC
IITB
Meij
AIDA/CoNLL
N3
collection
KORE50
Wiki-Disamb30
Wiki-Annot30
SpotlightCorpus
SemEval-2013task12
SemEval-2007task7
SemEval-2007task17
Senseval-3
NIF-basedcorpus
Microposts2014
Softwareavailable?
Webserviceavailable?
Cucerzan 2007 
Wikipedia
2008 * 
Miner
Illinois Wikifier 2011   *  
Spotlight 2011   
AIDA 2011   **
TagMe 2 2012    
Dexter 2013  
KEA 2013 
WAT 2013  
AGDISTIS 2014          
Babelfy 2014       
NERD-ML 2014    
BAT-
2013       * 
Framework
NERD
2014     
Framework
GERBIL 2014       *       
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 11 / 44
Challenges
Unclear KPI Semantics
Example
Federated queries in distributed storage solutions
Which time do we measure?
First or last result?
With or without network delay?
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 12 / 44
Challenges
Unclear KPI Semantics
Example
Entity recognition and linking
When is an annotation correct?
Weak or strong annotation?
Semantically equivalent or exact URI?
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 13 / 44
Solution!
Unified Benchmarking Framework
Rationale
Provide all benchmark components in one package
Include reference datasets and baselines
Devise standardized tasks and reference KPI implementations
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 14 / 44
Solution!
Unified Benchmarking Framework
Rationale
Provide all benchmark components in one package
Include reference datasets and baselines
Devise standardized tasks and reference KPI implementations
Benchmark Core
Web service calls
Dataset
Wrapper
Web service calls
Interface View
Annotator
Wrapper
Interface View
Open Datasets
Configuration
(Model)
...
Benchmark Core
Your
Annotator
Your Dataset
DataHub.io
GERBIL
Core
Controller
Persistent
Experiment Database
(Model)
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 14 / 44
GERBIL
HOBBIT v0.1
Features
Unified benchmarking platform
for NER/NEL
18 reference annotation systems
32 reference datasets
Reference implementations of
KPIs
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 15 / 44
GERBIL
HOBBIT v0.1
Features
Unified benchmarking platform
for NER/NEL
18 reference annotation systems
32 reference datasets
Reference implementations of
KPIs
Advantages
Benchmarking ≈ 30× faster
Archiving of results
Citeable URIs
Additional analysis
http://gerbil.aksw.org
http://github.org/aksw/gerbil
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 15 / 44
GERBIL
HOBBIT v0.1
Features
Unified benchmarking platform
for NER/NEL
18 reference annotation systems
32 reference datasets
Reference implementations of
KPIs
Advantages
Benchmarking ≈ 30× faster
Archiving of results
Citeable URIs
Additional analysis
Availability
Open-source project
Local deployment
Online instance
Feedback for developers and users
http://gerbil.aksw.org
http://github.org/aksw/gerbil
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 15 / 44
GERBIL
HOBBIT v0.1
Annotator Tasks
NIF-based Annotators 2519
Babelfy 958
DBpedia Spotlight 922
TagMe 2 811
WAT 787
Kea 763
Wikipedia Miner 714
NERD-ML 639
Dexter 587
AGDISTIS 443
Entityclassifier.eu NER 410
FOX 352
Cetus 1
Overall 24.3K exps
50+ papers
http://gerbil.aksw.org
http://github.org/aksw/gerbil
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 16 / 44
HOBBIT
Rationale
Rationale
A community-driven unified benchmarking platform for the community
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 17 / 44
HOBBIT
Rationale
Rationale
A community-driven unified benchmarking platform for the community
Build upon 24.3K GERBIL experiments
Experiments focus on Big Linked Data
Designed to accomodate all Big Data
Cover all steps of the Big (Linked) Data
lifecycle
Open benchmarks based on industrial data
and use cases
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 17 / 44
HOBBIT
Aims
1 Gather real requirements
Performance indicators
Performance thresholds
2 Develop benchmarks based on real data
3 Provide universal benchmarking platform
Standardized hardware
Comparable results
4 Periodic benchmarking challenges
5 Periodic reporting
6 Found independent Hobbit association
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 18 / 44
HOBBIT
Overview
Data Collection
Industry
data
Measure Collection
Benchmark Creation
Benchmark 1
KPIs
Tasks
KPIs
Tasks
KPIs
Tasks
KPIs
Tasks
KPIs
Tasks
KPIs
Tasks
Benchmark 2
Benchmark n
HOBBIT
Platform
Solution 1
Solution k
Solution 2
Challenges
Reports
Participants/Community
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 19 / 44
Survey
Questions
Questions
In what areas are organizations active?
What do people expect from benchmarks?
How are benchmarks being used?
Profile Count
Solution providers 56
Technology users 67
Scientific community 65
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 20 / 44
Survey
Can your solution be benchmarked?
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 21 / 44
Survey
Do you benchmark your solution?
Own datasets and settings in many cases
Own implementations of measures
Results not comparable
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 22 / 44
Survey
Application Areas
http://big-data-europe.eu
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 23 / 44
HOBBIT Platform
Features
Uses established deployment
technologies (Docker)
Decoupled components
Benchmark and systems can be
written in different languages
Uses scalable message queues for
communication
Open-source implementation
Supports distributed benchmarks
and systems
Online instance on server cluster
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 24 / 44
HOBBIT Platform
Features
Features
Unified benchmarking platform for Big
Data
20+ reference annotation systems
40+ reference datasets
Reference implementations of KPIs
Advantages
Benchmarks derived from real industrial
data and use cases
Scalable size of benchmarks
Archiving of results
Citeable URIs
Result analysis
Availability
Open-source project
Local deployment
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 25 / 44
HOBBIT Platform
Architecture
Platform
Controller
Data
Generator
Task
Generator
Data
Generator
Data
Generator
Task
Generator
Task
Generator
Front End
Benchmarked System
data flow
creates component
Storage
Analysis
Benchmark
Controller
Evaluation
Module
Eval. Storage
Logging
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 26 / 44
HOBBIT Platform
Benchmark Initialization
Platform
Controller
Data
Generator
Task
Generator
Data
Generator
Data
Generator
Task
Generator
Task
Generator
Benchmarked System
data flow
creates component
Storage
Benchmark
Controller
Eval. Storage
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 27 / 44
HOBBIT Platform
Benchmark Execution
Platform
Controller
data flow
creates component
Storage
Data
Generator
Task
Generator
Data
Generator
Data
Generator
Task
Generator
Task
Generator
Benchmarked System
Benchmark
Controller
Eval. Storage
ex:Entity rdf:type ex:Class
...
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 28 / 44
HOBBIT Platform
Benchmark Execution
Platform
Controller
data flow
creates component
Storage
Data
Generator
Task
Generator
Data
Generator
Data
Generator
Task
Generator
Task
Generator
Benchmarked System
Benchmark
Controller
Eval. Storage
v
ex:Entity
...
SELECT ?v
WHERE { ?v a ex:Class }
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 29 / 44
HOBBIT Platform
Benchmark Execution
Platform
Controller
data flow
creates component
Storage
Data
Generator
Task
Generator
Data
Generator
Data
Generator
Task
Generator
Task
Generator
Benchmarked System
Benchmark
Controller
Eval. Storage
X
v
ex:Entity
...
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 30 / 44
HOBBIT Platform
Benchmark Evaluation
data flow
creates component
Platform
Controller
Storage
Benchmark
Controller
Evaluation
Module
Eval. Storage
precision=...
recall=...
F1-score=... precision=...
recall=...
F1-score=...
benchmark
parameters:
...
v
ex:Entity
...
v
ex:Entity
...
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 31 / 44
HOBBIT Platform
Benchmarks
Streaming and static deterministic benchmarks
Realistic benchmarks
Controlled volume and velocity
Generation and Acquisition
Conversion of XML into RDF
Entity recognition and linking
Relation extraction
Analysis and Processing
Link Discovery
Machine Learning
Supervised and unsupervised
Storage and Curation
Triple stores
Versioning
Incl. updates
Visualization and Services
Question Answering
Faceted Browsing
Usage-based benchmarks
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 32 / 44
Datasets
TWIG
Goal: Simulate real Twitter Firehose
Relies on 476 million tweets as training data
Mimicking algorithm based on
Distribution of character frequencies
Distribution of transportation frequency
Network topology
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 33 / 44
Datasets
LinkedConnections
Goal: Simulate real transport network
Real transportation data from Belgium for training
Mimicking algorithm based on
Observed correlation between population density and transportation
Distribution of transportation frequency
Network topology
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 34 / 44
Datasets
Printing Machinery
Goal: Simulate events from printing machinery
Mimicking algorithm using event correlations and distributions
Changing plate
Double sheet
Early sheet
Finish job
Misaligned sheet
Missing sheet
Operation partially completed
Performance
Printing interval
Produktion Good Sheet
Side guide warning
Start job
Washing blanket
Washing impression cylinder
Washing ink rollers
with washing ink fountain roller
with washing plates
Mai 01 00:00 Mai 01 06:00 Mai 01 12:00 Mai 01 18:00 Mai 02 00:00
Time
Events
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 35 / 44
Datasets
Weidmüller
Goal: Simulate events from injection molding machinery
Mimicking algorithm using event correlations and distributions
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 36 / 44
Datasets
Semantic Publishing
Goal: Simulate data from the BBC
Generator based on manually configurable set of correlations
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 37 / 44
HOBBIT Runs
Triple Stores
1 4 16
1
10
100
1000
QmpH Updates
virtuoso
blazegraph
fuseki
SPARQL worker
updates
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 38 / 44
HOBBIT Runs
Runtimes
10× more effort for reduction of error rate by 30%
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 39 / 44
HOBBIT Runs
A2KB
System
AIDA/CoNLL-Comp.
IITB
KORE50
MSNBC
Microp.2014-Train
N3-Reuters-128
AIDA 0.668 0.141 0.625 0.622 0.363 0.391
Babelfy 0.448 0.129 0.564 0.423 0.311 0.289
DBpedia Spotlight 0.545 0.262 0.341 0.457 0.448 0.320
FOX 0.512 0.100 0.268 0.127 0.309 0.518
FREME NER 0.358 0.074 0.160 0.208 0.254 0.263
WAT 0.673 0.137 0.543 0.631 0.403 0.480
xLisa 0.363 0.233 0.352 0.365 0.322 0.274
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 40 / 44
Summary
Rationale
A community-driven unified benchmarking platform for the community
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 41 / 44
Summary
Rationale
A community-driven unified benchmarking platform for the community
Provide benchmarks and baselines
Provide reference implementation of KPIs
Extensible and referenceable
Open-Source
@hobbit_project
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 41 / 44
Join HOBBIT
α completed
Join the HOBBIT community
Provide KPIs
Provide datasets
Join the platform development
Follow us on Twitter
https://project-hobbit.eu
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 42 / 44
Thank You
Axel Ngonga
AKSW Research Group
Institute for Applied Informatics
ngonga@informatik.uni-leipzig.de
Michael Röder
AKSW Research Group
Institute for Applied Informatics
roeder@informatik.uni-leipzig.de
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 43 / 44
Acknowledgment
This work was supported by grants from the EU H2020 Framework Programme
provided for the project HOBBIT (GA no. 688227).
Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 44 / 44

More Related Content

Viewers also liked

An RDF Dataset Generator for the Social Network Benchmark with Real-World Coh...
An RDF Dataset Generator for the Social Network Benchmark with Real-World Coh...An RDF Dataset Generator for the Social Network Benchmark with Real-World Coh...
An RDF Dataset Generator for the Social Network Benchmark with Real-World Coh...Holistic Benchmarking of Big Linked Data
 
How well does your Instance Matching system perform? Experimental evaluation ...
How well does your Instance Matching system perform? Experimental evaluation ...How well does your Instance Matching system perform? Experimental evaluation ...
How well does your Instance Matching system perform? Experimental evaluation ...Holistic Benchmarking of Big Linked Data
 
SPARQL Querying Benchmarks ISWC2016
SPARQL Querying Benchmarks ISWC2016SPARQL Querying Benchmarks ISWC2016
SPARQL Querying Benchmarks ISWC2016Muhammad Saleem
 
Link Discovery Tutorial Part III: Benchmarking for Instance Matching Systems
Link Discovery Tutorial Part III: Benchmarking for Instance Matching SystemsLink Discovery Tutorial Part III: Benchmarking for Instance Matching Systems
Link Discovery Tutorial Part III: Benchmarking for Instance Matching SystemsHolistic Benchmarking of Big Linked Data
 
Towards digitizing scholarly communication
Towards digitizing scholarly communicationTowards digitizing scholarly communication
Towards digitizing scholarly communicationSören Auer
 
DBtrends Semantics 2016
DBtrends Semantics 2016DBtrends Semantics 2016
DBtrends Semantics 2016Edgard Marx
 
openQA Hoverboard - Open-source Question Answering Framework
openQA Hoverboard - Open-source Question Answering FrameworkopenQA Hoverboard - Open-source Question Answering Framework
openQA Hoverboard - Open-source Question Answering FrameworkEdgard Marx
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data TutorialSören Auer
 
Do MORe with your data
Do MORe with your dataDo MORe with your data
Do MORe with your datalocloud
 
SPARQL and RDF query optimization
SPARQL and RDF query optimizationSPARQL and RDF query optimization
SPARQL and RDF query optimizationKisung Kim
 
Enterprise knowledge graphs
Enterprise knowledge graphsEnterprise knowledge graphs
Enterprise knowledge graphsSören Auer
 
Suporte técnico em redes sociais
Suporte técnico em redes sociaisSuporte técnico em redes sociais
Suporte técnico em redes sociaisRoberto Cohen
 

Viewers also liked (20)

Link Discovery Tutorial Part I: Efficiency
Link Discovery Tutorial Part I: EfficiencyLink Discovery Tutorial Part I: Efficiency
Link Discovery Tutorial Part I: Efficiency
 
An RDF Dataset Generator for the Social Network Benchmark with Real-World Coh...
An RDF Dataset Generator for the Social Network Benchmark with Real-World Coh...An RDF Dataset Generator for the Social Network Benchmark with Real-World Coh...
An RDF Dataset Generator for the Social Network Benchmark with Real-World Coh...
 
Link Discovery Tutorial Introduction
Link Discovery Tutorial IntroductionLink Discovery Tutorial Introduction
Link Discovery Tutorial Introduction
 
How well does your Instance Matching system perform? Experimental evaluation ...
How well does your Instance Matching system perform? Experimental evaluation ...How well does your Instance Matching system perform? Experimental evaluation ...
How well does your Instance Matching system perform? Experimental evaluation ...
 
Link Discovery Tutorial Part II: Accuracy
Link Discovery Tutorial Part II: AccuracyLink Discovery Tutorial Part II: Accuracy
Link Discovery Tutorial Part II: Accuracy
 
SPARQL Querying Benchmarks ISWC2016
SPARQL Querying Benchmarks ISWC2016SPARQL Querying Benchmarks ISWC2016
SPARQL Querying Benchmarks ISWC2016
 
Versioning for Linked Data: Archiving Systems and Benchmarks
Versioning for Linked Data: Archiving Systems and BenchmarksVersioning for Linked Data: Archiving Systems and Benchmarks
Versioning for Linked Data: Archiving Systems and Benchmarks
 
Link Discovery Tutorial Part III: Benchmarking for Instance Matching Systems
Link Discovery Tutorial Part III: Benchmarking for Instance Matching SystemsLink Discovery Tutorial Part III: Benchmarking for Instance Matching Systems
Link Discovery Tutorial Part III: Benchmarking for Instance Matching Systems
 
Link Discovery Tutorial Part V: Hands-On
Link Discovery Tutorial Part V: Hands-OnLink Discovery Tutorial Part V: Hands-On
Link Discovery Tutorial Part V: Hands-On
 
Towards digitizing scholarly communication
Towards digitizing scholarly communicationTowards digitizing scholarly communication
Towards digitizing scholarly communication
 
DBtrends Semantics 2016
DBtrends Semantics 2016DBtrends Semantics 2016
DBtrends Semantics 2016
 
openQA Hoverboard - Open-source Question Answering Framework
openQA Hoverboard - Open-source Question Answering FrameworkopenQA Hoverboard - Open-source Question Answering Framework
openQA Hoverboard - Open-source Question Answering Framework
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 
Do MORe with your data
Do MORe with your dataDo MORe with your data
Do MORe with your data
 
Lord of the Rings- Review
Lord of the Rings- ReviewLord of the Rings- Review
Lord of the Rings- Review
 
SPARQL and RDF query optimization
SPARQL and RDF query optimizationSPARQL and RDF query optimization
SPARQL and RDF query optimization
 
Enterprise knowledge graphs
Enterprise knowledge graphsEnterprise knowledge graphs
Enterprise knowledge graphs
 
Suporte técnico em redes sociais
Suporte técnico em redes sociaisSuporte técnico em redes sociais
Suporte técnico em redes sociais
 
№ 4 (36), май 2013
№ 4 (36), май 2013№ 4 (36), май 2013
№ 4 (36), май 2013
 
Nationalization
NationalizationNationalization
Nationalization
 

Similar to Benchmarking Big Data Platforms for Optimal Performance

Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...DataBench
 
IoT Developer Survey 2016
IoT Developer Survey 2016IoT Developer Survey 2016
IoT Developer Survey 2016Ian Skerrett
 
BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...
BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...
BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...Big Data Value Association
 
Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018
Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018 Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018
Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018 DataBench
 
Analysing search engine data on socially relevant topics
Analysing search engine data on socially relevant topicsAnalysing search engine data on socially relevant topics
Analysing search engine data on socially relevant topicsDirk Lewandowski
 
Relating Big Data Business and Technical Performance Indicators, Barbara Pern...
Relating Big Data Business and Technical Performance Indicators, Barbara Pern...Relating Big Data Business and Technical Performance Indicators, Barbara Pern...
Relating Big Data Business and Technical Performance Indicators, Barbara Pern...DataBench
 
Interview Co-Founder ReportLinker and Findout, Benjamin Carpano
Interview Co-Founder ReportLinker and Findout, Benjamin CarpanoInterview Co-Founder ReportLinker and Findout, Benjamin Carpano
Interview Co-Founder ReportLinker and Findout, Benjamin CarpanoReportLinker.com
 
10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community CallOpenAIRE
 
DA 592 - Term Project Report - Berker Kozan Can Koklu
DA 592 - Term Project Report - Berker Kozan Can KokluDA 592 - Term Project Report - Berker Kozan Can Koklu
DA 592 - Term Project Report - Berker Kozan Can KokluCan Köklü
 
SAP Systems Integration by SAP PI (XI)
SAP Systems Integration by SAP PI (XI)SAP Systems Integration by SAP PI (XI)
SAP Systems Integration by SAP PI (XI)alpercelk
 
Building the DataBench Workflow and Architecture
Building the DataBench Workflow and ArchitectureBuilding the DataBench Workflow and Architecture
Building the DataBench Workflow and Architecturet_ivanov
 
Building the DataBench Workflow and Architecture, Todor Ivanov, Bench 2019 - ...
Building the DataBench Workflow and Architecture, Todor Ivanov, Bench 2019 - ...Building the DataBench Workflow and Architecture, Todor Ivanov, Bench 2019 - ...
Building the DataBench Workflow and Architecture, Todor Ivanov, Bench 2019 - ...DataBench
 
ODPi (Open Data Platform Initiative) - Standardizing Hadoop Ecosystem: Linaro...
ODPi (Open Data Platform Initiative) - Standardizing Hadoop Ecosystem: Linaro...ODPi (Open Data Platform Initiative) - Standardizing Hadoop Ecosystem: Linaro...
ODPi (Open Data Platform Initiative) - Standardizing Hadoop Ecosystem: Linaro...Ganesh Raju
 
Local Weather Information and GNOME Shell Extension
Local Weather Information and GNOME Shell ExtensionLocal Weather Information and GNOME Shell Extension
Local Weather Information and GNOME Shell ExtensionSammy Fung
 
Estimation and measuring of software size within the atos gobal delivery plat...
Estimation and measuring of software size within the atos gobal delivery plat...Estimation and measuring of software size within the atos gobal delivery plat...
Estimation and measuring of software size within the atos gobal delivery plat...IWSM Mensura
 
8795: Big Data Infrastructure
8795: Big Data Infrastructure8795: Big Data Infrastructure
8795: Big Data InfrastructureAbhinav Bisht
 
IoT Developer Survey 2016
IoT Developer Survey 2016IoT Developer Survey 2016
IoT Developer Survey 2016Eclipse IoT
 
Complex Made Simple @ LF Energy Conference in Paris
Complex Made Simple @ LF Energy Conference in ParisComplex Made Simple @ LF Energy Conference in Paris
Complex Made Simple @ LF Energy Conference in ParisShane Coughlan
 

Similar to Benchmarking Big Data Platforms for Optimal Performance (20)

Hobbit in a Nutshell - EDF2016
Hobbit in a Nutshell - EDF2016Hobbit in a Nutshell - EDF2016
Hobbit in a Nutshell - EDF2016
 
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
Benchmarking for Big Data Applications with the DataBench Framework, Arne Ber...
 
HOBBIT at ESWC EU Networking Session
HOBBIT at ESWC EU Networking SessionHOBBIT at ESWC EU Networking Session
HOBBIT at ESWC EU Networking Session
 
IoT Developer Survey 2016
IoT Developer Survey 2016IoT Developer Survey 2016
IoT Developer Survey 2016
 
BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...
BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...
BDVe Webinar Series: DataBench – Benchmarking Big Data. Arne Berre. Tue, Oct ...
 
Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018
Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018 Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018
Big Data Technical Benchmarking, Arne Berre, BDVe Webinar series, 09/10/2018
 
Analysing search engine data on socially relevant topics
Analysing search engine data on socially relevant topicsAnalysing search engine data on socially relevant topics
Analysing search engine data on socially relevant topics
 
Relating Big Data Business and Technical Performance Indicators, Barbara Pern...
Relating Big Data Business and Technical Performance Indicators, Barbara Pern...Relating Big Data Business and Technical Performance Indicators, Barbara Pern...
Relating Big Data Business and Technical Performance Indicators, Barbara Pern...
 
Interview Co-Founder ReportLinker and Findout, Benjamin Carpano
Interview Co-Founder ReportLinker and Findout, Benjamin CarpanoInterview Co-Founder ReportLinker and Findout, Benjamin Carpano
Interview Co-Founder ReportLinker and Findout, Benjamin Carpano
 
10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call
 
DA 592 - Term Project Report - Berker Kozan Can Koklu
DA 592 - Term Project Report - Berker Kozan Can KokluDA 592 - Term Project Report - Berker Kozan Can Koklu
DA 592 - Term Project Report - Berker Kozan Can Koklu
 
SAP Systems Integration by SAP PI (XI)
SAP Systems Integration by SAP PI (XI)SAP Systems Integration by SAP PI (XI)
SAP Systems Integration by SAP PI (XI)
 
Building the DataBench Workflow and Architecture
Building the DataBench Workflow and ArchitectureBuilding the DataBench Workflow and Architecture
Building the DataBench Workflow and Architecture
 
Building the DataBench Workflow and Architecture, Todor Ivanov, Bench 2019 - ...
Building the DataBench Workflow and Architecture, Todor Ivanov, Bench 2019 - ...Building the DataBench Workflow and Architecture, Todor Ivanov, Bench 2019 - ...
Building the DataBench Workflow and Architecture, Todor Ivanov, Bench 2019 - ...
 
ODPi (Open Data Platform Initiative) - Standardizing Hadoop Ecosystem: Linaro...
ODPi (Open Data Platform Initiative) - Standardizing Hadoop Ecosystem: Linaro...ODPi (Open Data Platform Initiative) - Standardizing Hadoop Ecosystem: Linaro...
ODPi (Open Data Platform Initiative) - Standardizing Hadoop Ecosystem: Linaro...
 
Local Weather Information and GNOME Shell Extension
Local Weather Information and GNOME Shell ExtensionLocal Weather Information and GNOME Shell Extension
Local Weather Information and GNOME Shell Extension
 
Estimation and measuring of software size within the atos gobal delivery plat...
Estimation and measuring of software size within the atos gobal delivery plat...Estimation and measuring of software size within the atos gobal delivery plat...
Estimation and measuring of software size within the atos gobal delivery plat...
 
8795: Big Data Infrastructure
8795: Big Data Infrastructure8795: Big Data Infrastructure
8795: Big Data Infrastructure
 
IoT Developer Survey 2016
IoT Developer Survey 2016IoT Developer Survey 2016
IoT Developer Survey 2016
 
Complex Made Simple @ LF Energy Conference in Paris
Complex Made Simple @ LF Energy Conference in ParisComplex Made Simple @ LF Energy Conference in Paris
Complex Made Simple @ LF Energy Conference in Paris
 

More from Holistic Benchmarking of Big Linked Data

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...Holistic Benchmarking of Big Linked Data
 
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Holistic Benchmarking of Big Linked Data
 
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...Holistic Benchmarking of Big Linked Data
 
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F... Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...Holistic Benchmarking of Big Linked Data
 
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignIntroducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignHolistic Benchmarking of Big Linked Data
 

More from Holistic Benchmarking of Big Linked Data (20)

EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
EARL: Joint Entity and Relation Linking for Question Answering over Knowledge...
 
Benchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT ProjectBenchmarking Big Linked Data: The case of the HOBBIT Project
Benchmarking Big Linked Data: The case of the HOBBIT Project
 
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
Assessing Linked Data Versioning Systems: The Semantic Publishing Versioning ...
 
The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018The DEBS Grand Challenge 2018
The DEBS Grand Challenge 2018
 
Benchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systemsBenchmarking of distributed linked data streaming systems
Benchmarking of distributed linked data streaming systems
 
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation FrameworkSQCFramework: SPARQL Query Containment Benchmarks Generation Framework
SQCFramework: SPARQL Query Containment Benchmarks Generation Framework
 
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federationLargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
LargeRDFBench: A billion triples benchmark for SPARQL endpoint federation
 
The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017The DEBS Grand Challenge 2017
The DEBS Grand Challenge 2017
 
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
4th Natural Language Interface over the Web of Data (NLIWoD) workshop and QAL...
 
Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)Scalable Link Discovery for Modern Data-Driven Applications (poster)
Scalable Link Discovery for Modern Data-Driven Applications (poster)
 
An Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link DiscoveryAn Evaluation of Models for Runtime Approximation in Link Discovery
An Evaluation of Models for Runtime Approximation in Link Discovery
 
Scalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven ApplicationsScalable Link Discovery for Modern Data-Driven Applications
Scalable Link Discovery for Modern Data-Driven Applications
 
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F... Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
Extending LargeRDFBench for Multi-Source Data at Scale for SPARQL Endpoint F...
 
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery ToolsSPgen: A Benchmark Generator for Spatial Link Discovery Tools
SPgen: A Benchmark Generator for Spatial Link Discovery Tools
 
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation CampaignIntroducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
Introducing the HOBBIT platform into the Ontology Alignment Evaluation Campaign
 
OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018OKE2018 Challenge @ ESWC2018
OKE2018 Challenge @ ESWC2018
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
 
Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018Dynamic planning for link discovery - ESWC 2018
Dynamic planning for link discovery - ESWC 2018
 
Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017Hobbit project overview presented at EBDVF 2017
Hobbit project overview presented at EBDVF 2017
 
Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)Leopard ISWC Semantic Web Challenge 2017 (poster)
Leopard ISWC Semantic Web Challenge 2017 (poster)
 

Recently uploaded

Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 

Recently uploaded (20)

Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptx
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 

Benchmarking Big Data Platforms for Optimal Performance

  • 1. Unified Benchmarking of Big Data Platforms The HOBBIT Platform Axel-Cyrille Ngonga Ngomo Horizon 2020 GA No 688227 01/12/2016–30/11/2018 Apache Big Data Sevilla, Spain November 15, 2016 Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 1 / 44
  • 2. Summary Rationale A community-driven unified benchmarking platform for the community Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 2 / 44
  • 3. Summary Rationale A community-driven unified benchmarking platform for the community Focus on Big (Linked) Data Provide benchmarks and baselines Provide reference implementation of KPIs Extensible and referenceable Result analysis Open-Source http://project-hobbit.eu @hobbit_project Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 2 / 44
  • 4. A Lot of Data 1 1http://www.ibmbigdatahub.com/infographic/four-vs-big-data Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 3 / 44
  • 5. A Lot of Tools 2 2https://cloudramblings.me/ Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 4 / 44
  • 6. A Lot ... of Tools Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 5 / 44
  • 7. Questions Developers: How good is my tool? Vendors: Who is my tool good for? Users: Which tool(s) should I use for my application? Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 6 / 44
  • 8. Many Questions Where are the current bottlenecks? Which steps of the data lifecycle are critical? Which solutions are available? Which key performance indicators are relevant? How well do or should tools perform? How do existing solutions perform w.r.t. relevant indicators? Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 7 / 44
  • 9. A Lot of Views 4 4https://steemit.com/philosophy/@l0k1/ subjectivity-and-truth-how-blockchains-model-consensus-buildingNgonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 8 / 44
  • 10. Solution Benchmark Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 9 / 44
  • 11. Solution Benchmark Components Dataset(s), e.g., Twitter stream, sensor data Task(s), e.g., entity recognition, storage Performance indicators, e.g., precision, recall, queries per second Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 9 / 44
  • 12. Solution Benchmark TPCH-H (3,000 GB Results): −5.6 × 106 QphH between 2014 and 20165 QALD: ≈ 5% increase in Micro F-Measure ACE2004: ≈ 6% increase in Micro F-measure 5http://www.tpc.org/tpch/results/tpch_perf_results.asp Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 10 / 44
  • 13. Challenges Dataset and KPI Mismatch Year ACE Wiki AQUAINT MSNBC IITB Meij AIDA/CoNLL N3 collection KORE50 Wiki-Disamb30 Wiki-Annot30 SpotlightCorpus SemEval-2013task12 SemEval-2007task7 SemEval-2007task17 Senseval-3 NIF-basedcorpus Microposts2014 Softwareavailable? Webserviceavailable? Cucerzan 2007 Wikipedia 2008 * Miner Illinois Wikifier 2011 * Spotlight 2011 AIDA 2011 ** TagMe 2 2012 Dexter 2013 KEA 2013 WAT 2013 AGDISTIS 2014 Babelfy 2014 NERD-ML 2014 BAT- 2013 * Framework NERD 2014 Framework GERBIL 2014 * Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 11 / 44
  • 14. Challenges Unclear KPI Semantics Example Federated queries in distributed storage solutions Which time do we measure? First or last result? With or without network delay? Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 12 / 44
  • 15. Challenges Unclear KPI Semantics Example Entity recognition and linking When is an annotation correct? Weak or strong annotation? Semantically equivalent or exact URI? Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 13 / 44
  • 16. Solution! Unified Benchmarking Framework Rationale Provide all benchmark components in one package Include reference datasets and baselines Devise standardized tasks and reference KPI implementations Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 14 / 44
  • 17. Solution! Unified Benchmarking Framework Rationale Provide all benchmark components in one package Include reference datasets and baselines Devise standardized tasks and reference KPI implementations Benchmark Core Web service calls Dataset Wrapper Web service calls Interface View Annotator Wrapper Interface View Open Datasets Configuration (Model) ... Benchmark Core Your Annotator Your Dataset DataHub.io GERBIL Core Controller Persistent Experiment Database (Model) Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 14 / 44
  • 18. GERBIL HOBBIT v0.1 Features Unified benchmarking platform for NER/NEL 18 reference annotation systems 32 reference datasets Reference implementations of KPIs Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 15 / 44
  • 19. GERBIL HOBBIT v0.1 Features Unified benchmarking platform for NER/NEL 18 reference annotation systems 32 reference datasets Reference implementations of KPIs Advantages Benchmarking ≈ 30× faster Archiving of results Citeable URIs Additional analysis http://gerbil.aksw.org http://github.org/aksw/gerbil Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 15 / 44
  • 20. GERBIL HOBBIT v0.1 Features Unified benchmarking platform for NER/NEL 18 reference annotation systems 32 reference datasets Reference implementations of KPIs Advantages Benchmarking ≈ 30× faster Archiving of results Citeable URIs Additional analysis Availability Open-source project Local deployment Online instance Feedback for developers and users http://gerbil.aksw.org http://github.org/aksw/gerbil Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 15 / 44
  • 21. GERBIL HOBBIT v0.1 Annotator Tasks NIF-based Annotators 2519 Babelfy 958 DBpedia Spotlight 922 TagMe 2 811 WAT 787 Kea 763 Wikipedia Miner 714 NERD-ML 639 Dexter 587 AGDISTIS 443 Entityclassifier.eu NER 410 FOX 352 Cetus 1 Overall 24.3K exps 50+ papers http://gerbil.aksw.org http://github.org/aksw/gerbil Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 16 / 44
  • 22. HOBBIT Rationale Rationale A community-driven unified benchmarking platform for the community Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 17 / 44
  • 23. HOBBIT Rationale Rationale A community-driven unified benchmarking platform for the community Build upon 24.3K GERBIL experiments Experiments focus on Big Linked Data Designed to accomodate all Big Data Cover all steps of the Big (Linked) Data lifecycle Open benchmarks based on industrial data and use cases Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 17 / 44
  • 24. HOBBIT Aims 1 Gather real requirements Performance indicators Performance thresholds 2 Develop benchmarks based on real data 3 Provide universal benchmarking platform Standardized hardware Comparable results 4 Periodic benchmarking challenges 5 Periodic reporting 6 Found independent Hobbit association Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 18 / 44
  • 25. HOBBIT Overview Data Collection Industry data Measure Collection Benchmark Creation Benchmark 1 KPIs Tasks KPIs Tasks KPIs Tasks KPIs Tasks KPIs Tasks KPIs Tasks Benchmark 2 Benchmark n HOBBIT Platform Solution 1 Solution k Solution 2 Challenges Reports Participants/Community Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 19 / 44
  • 26. Survey Questions Questions In what areas are organizations active? What do people expect from benchmarks? How are benchmarks being used? Profile Count Solution providers 56 Technology users 67 Scientific community 65 Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 20 / 44
  • 27. Survey Can your solution be benchmarked? Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 21 / 44
  • 28. Survey Do you benchmark your solution? Own datasets and settings in many cases Own implementations of measures Results not comparable Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 22 / 44
  • 29. Survey Application Areas http://big-data-europe.eu Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 23 / 44
  • 30. HOBBIT Platform Features Uses established deployment technologies (Docker) Decoupled components Benchmark and systems can be written in different languages Uses scalable message queues for communication Open-source implementation Supports distributed benchmarks and systems Online instance on server cluster Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 24 / 44
  • 31. HOBBIT Platform Features Features Unified benchmarking platform for Big Data 20+ reference annotation systems 40+ reference datasets Reference implementations of KPIs Advantages Benchmarks derived from real industrial data and use cases Scalable size of benchmarks Archiving of results Citeable URIs Result analysis Availability Open-source project Local deployment Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 25 / 44
  • 32. HOBBIT Platform Architecture Platform Controller Data Generator Task Generator Data Generator Data Generator Task Generator Task Generator Front End Benchmarked System data flow creates component Storage Analysis Benchmark Controller Evaluation Module Eval. Storage Logging Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 26 / 44
  • 33. HOBBIT Platform Benchmark Initialization Platform Controller Data Generator Task Generator Data Generator Data Generator Task Generator Task Generator Benchmarked System data flow creates component Storage Benchmark Controller Eval. Storage Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 27 / 44
  • 34. HOBBIT Platform Benchmark Execution Platform Controller data flow creates component Storage Data Generator Task Generator Data Generator Data Generator Task Generator Task Generator Benchmarked System Benchmark Controller Eval. Storage ex:Entity rdf:type ex:Class ... Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 28 / 44
  • 35. HOBBIT Platform Benchmark Execution Platform Controller data flow creates component Storage Data Generator Task Generator Data Generator Data Generator Task Generator Task Generator Benchmarked System Benchmark Controller Eval. Storage v ex:Entity ... SELECT ?v WHERE { ?v a ex:Class } Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 29 / 44
  • 36. HOBBIT Platform Benchmark Execution Platform Controller data flow creates component Storage Data Generator Task Generator Data Generator Data Generator Task Generator Task Generator Benchmarked System Benchmark Controller Eval. Storage X v ex:Entity ... Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 30 / 44
  • 37. HOBBIT Platform Benchmark Evaluation data flow creates component Platform Controller Storage Benchmark Controller Evaluation Module Eval. Storage precision=... recall=... F1-score=... precision=... recall=... F1-score=... benchmark parameters: ... v ex:Entity ... v ex:Entity ... Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 31 / 44
  • 38. HOBBIT Platform Benchmarks Streaming and static deterministic benchmarks Realistic benchmarks Controlled volume and velocity Generation and Acquisition Conversion of XML into RDF Entity recognition and linking Relation extraction Analysis and Processing Link Discovery Machine Learning Supervised and unsupervised Storage and Curation Triple stores Versioning Incl. updates Visualization and Services Question Answering Faceted Browsing Usage-based benchmarks Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 32 / 44
  • 39. Datasets TWIG Goal: Simulate real Twitter Firehose Relies on 476 million tweets as training data Mimicking algorithm based on Distribution of character frequencies Distribution of transportation frequency Network topology Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 33 / 44
  • 40. Datasets LinkedConnections Goal: Simulate real transport network Real transportation data from Belgium for training Mimicking algorithm based on Observed correlation between population density and transportation Distribution of transportation frequency Network topology Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 34 / 44
  • 41. Datasets Printing Machinery Goal: Simulate events from printing machinery Mimicking algorithm using event correlations and distributions Changing plate Double sheet Early sheet Finish job Misaligned sheet Missing sheet Operation partially completed Performance Printing interval Produktion Good Sheet Side guide warning Start job Washing blanket Washing impression cylinder Washing ink rollers with washing ink fountain roller with washing plates Mai 01 00:00 Mai 01 06:00 Mai 01 12:00 Mai 01 18:00 Mai 02 00:00 Time Events Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 35 / 44
  • 42. Datasets Weidmüller Goal: Simulate events from injection molding machinery Mimicking algorithm using event correlations and distributions Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 36 / 44
  • 43. Datasets Semantic Publishing Goal: Simulate data from the BBC Generator based on manually configurable set of correlations Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 37 / 44
  • 44. HOBBIT Runs Triple Stores 1 4 16 1 10 100 1000 QmpH Updates virtuoso blazegraph fuseki SPARQL worker updates Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 38 / 44
  • 45. HOBBIT Runs Runtimes 10× more effort for reduction of error rate by 30% Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 39 / 44
  • 46. HOBBIT Runs A2KB System AIDA/CoNLL-Comp. IITB KORE50 MSNBC Microp.2014-Train N3-Reuters-128 AIDA 0.668 0.141 0.625 0.622 0.363 0.391 Babelfy 0.448 0.129 0.564 0.423 0.311 0.289 DBpedia Spotlight 0.545 0.262 0.341 0.457 0.448 0.320 FOX 0.512 0.100 0.268 0.127 0.309 0.518 FREME NER 0.358 0.074 0.160 0.208 0.254 0.263 WAT 0.673 0.137 0.543 0.631 0.403 0.480 xLisa 0.363 0.233 0.352 0.365 0.322 0.274 Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 40 / 44
  • 47. Summary Rationale A community-driven unified benchmarking platform for the community Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 41 / 44
  • 48. Summary Rationale A community-driven unified benchmarking platform for the community Provide benchmarks and baselines Provide reference implementation of KPIs Extensible and referenceable Open-Source @hobbit_project Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 41 / 44
  • 49. Join HOBBIT α completed Join the HOBBIT community Provide KPIs Provide datasets Join the platform development Follow us on Twitter https://project-hobbit.eu Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 42 / 44
  • 50. Thank You Axel Ngonga AKSW Research Group Institute for Applied Informatics ngonga@informatik.uni-leipzig.de Michael Röder AKSW Research Group Institute for Applied Informatics roeder@informatik.uni-leipzig.de Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 43 / 44
  • 51. Acknowledgment This work was supported by grants from the EU H2020 Framework Programme provided for the project HOBBIT (GA no. 688227). Ngonga Ngomo (InfAI) Benchmarking Big Data November 15th, 2016 44 / 44