http://www.bigdata.uni-frankfurt.de/
Welcome to the Frankfurt Big
Data Lab!
Mission
The objective of the Big Data Laboratory is to carry out research in the
domains of big data and data analytics from the perspective of information
systems and computer science.
Our approach is based on the interdisciplinary binding between data management technologies
and analytics.
The lab is located in Frankfurt, the financial
metropolis of central Europe, and aims to be a source of
knowledge and expertise for both research and industry
applications.
Frankfurt Big Data Lab
The DATA REFUGEES Project
Prof. Dott. Ing.
Roberto V. Zicari
Dr. Karsten Tolle
Lab Director
Hee Eun Kim
PhD Student
Todor Ivanov
PhD Student
Marten Rosselli
PhD Student
Affiliations: Goethe
University / Accenture
Sven Rill
PhD Student
Affiliation: Hof
University of Applied
Sciences
Rahul Soni
PhD Student
Affiliations: Goethe
University / Accenture
Concha Sanchez-Ocaña
Project Manager
DBIS, Goethe University
Frankfurt
Raik Niemann
PhD Student
Affiliation: Hof
University of Applied
Sciences
Team
Teaching
Our lab is currently active in the following research areas:
1. Big Data Management Technologies
2. Data Analytics / Data Science
3. Graph Databases / Linked Open Data (LOD)
4. Big Data for Common Good
Research Areas
Our work is concentrated on the evaluation and
optimization of
Operational data stores that allow
flexible schemas
Big Data management and analytical
platforms (Hadoop, Spark, etc.)
Complex distributed storage and
processing architectures
Big Data Benchmarks
1. Big Data Management Technologies
1. Big Data Management Technologies (cont.)
Benchmarking Big Data platforms for performance, scalability, elasticity, fault-tolerance …
Yahoo! Cloud Serving
Benchmark (YCSB)
Evaluating the performance (read/write
workloads) of NoSQL stores like Cassandra.
HiBench
10 workloads for evaluating the Hadoop platform in terms of
speed, throughput, HDFS bandwidth, system resource
utilization and machine learning algorithms.
BigBench
Application level benchmark consisting of 30 queries
implemented in Hive based on the TPC-DS
benchmark.
TPCx-HS The first standard Big Data Benchmark for Hadoop,
based on the TeraSort workload.
Benchmarks used
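To illustrate what a read/write workload evaluation measures, the following is a minimal sketch in Python. It is not YCSB and not the lab's setup: the plain dict standing in for a key-value store, the function name run_workload, and the chosen read fractions are all assumptions made for illustration.

```python
import random
import time


def run_workload(store, keys, operations, read_fraction, seed=0):
    """Run a mixed read/write workload and return counters and timing."""
    rng = random.Random(seed)
    reads = writes = 0
    start = time.perf_counter()
    for i in range(operations):
        key = rng.choice(keys)
        if rng.random() < read_fraction:
            store.get(key)             # read path
            reads += 1
        else:
            store[key] = f"value-{i}"  # write (update) path
            writes += 1
    elapsed = time.perf_counter() - start
    return {
        "reads": reads,
        "writes": writes,
        "elapsed_s": elapsed,
        "ops_per_s": operations / elapsed if elapsed > 0 else float("inf"),
    }


# Hypothetical stand-in for a key-value store: a plain dict.
keys = [f"user{i}" for i in range(1000)]
store = {k: "initial" for k in keys}

# A read-heavy mix (95% reads) and a balanced mix (50% reads).
read_heavy = run_workload(store, keys, operations=10_000, read_fraction=0.95)
balanced = run_workload(store, keys, operations=10_000, read_fraction=0.50)
print(read_heavy["reads"], read_heavy["writes"], read_heavy["ops_per_s"])
print(balanced["reads"], balanced["writes"], balanced["ops_per_s"])
```

A real evaluation would replace the dict with a client for the store under test and would also record latency distributions, not only throughput.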
1. Big Data Management Technologies (cont.)
NoSQL
Evaluated Cassandra / DataStax Enterprise (9-node
Cassandra cluster) with HiBench and the Yahoo! Cloud Serving
Benchmark.
Hadoop Ecosystem
• Evaluated the performance of different virtualized Hadoop cluster
configurations on top of VMware vSphere using the Big Data
Extension (Project Serengeti).
• Benchmarking the Cloudera Hadoop Distribution 5.2 (4 Nodes
Hadoop Cluster) with the TPCx-HS benchmarks.
• Experimenting with the BigBench benchmark using Hive and Spark
SQL.
In-Memory Databases
Evaluation of a Big Data Architecture based on SAP HANA and
Cloudera Hadoop for different use cases and analytical
workloads
Platforms used
1. Big Data Management Technologies (cont.)
Relevant Publications
• Performance Evaluation of Enterprise Big Data Platforms with HiBench (In 9th IEEE International
Conference on Big Data Science and Engineering (IEEE BigDataSE 2015), August 20-22, Helsinki, Finland)
• Benchmarking the Availability and Fault Tolerance of Cassandra (In 6th Workshop on Big Data
Benchmarking (6th WBDB), June 16-17, 2015, Toronto, Canada)
• Performance Evaluation of Spark SQL using BigBench (In 6th Workshop on Big Data Benchmarking (6th
WBDB), June 16-17, 2015, Toronto, Canada)
• Benchmarking DataStax Enterprise/Cassandra with HiBench (Technical Report No. 2014-2 )
• Performance Evaluation of Virtualized Hadoop Clusters (Technical Report No. 2014-1)
• Benchmarking Virtualized Hadoop Clusters (In proceedings of the Big Data Benchmarking - 5th International
Workshop, WBDB 2014, Potsdam, Germany, August 5-6, 2014, Revised Selected Papers)
Full list of publications is available online: http://www.bigdata.uni-frankfurt.de/publications/
1. Big Data Management Technologies (cont.)
Member of the Standard
Performance
Evaluation Corporation
(SPEC)
SPEC is a non-profit corporation formed to
establish, maintain and endorse a
standardized set of relevant benchmarks that
can be applied to the newest generation of
high-performance computers.
The RG Big Data Working Group is a
forum for individuals and organizations
interested in the big data benchmarking
topic.
List of all 52 Member Organizations
Advanced Strategic Technology LLC
ARM
bankmark UG
Barcelona Supercomputing Center
Charles University
Cisco Systems
Cloudera, Inc
Compilaflows
Delft University of Technology
Dell
fortiss GmbH
Friedrich-Alexander-University Erlangen-Nuremberg
Goethe University Frankfurt
Hewlett-Packard
Huawei
IBM
Imperial College London
Indian Institute of Technology, Bombay
Institute for Information Industry, Taiwan
Institute of Communication and Computer Systems/NTUA
Intel
Karlsruhe Institute of Technology
Kiel University
Microsoft
MIOsoft Corporation
NICTA
NovaTec GmbH
Oracle
Purdue University
Red Hat
RWTH Aachen University
Salesforce.com
San Diego Supercomputing Center
San Francisco State University
SAP AG
Siemens Corporation
Technische Universität Darmstadt
The MITRE Corporation
Umea University
University of Alberta
University of Coimbra
University of Florence
University of Lugano
University of Minnesota
University of North Florida
University of Paderborn
VMware
University of Wuerzburg
University of Texas at Austin
University of Stuttgart *
University of Pavia
Benchmarking (Berlin SPARQL Benchmark - BSBM):
Linked Open Data (LOD) is structured data that is published online so that it can be accessed automatically by computers. By
combining different sources, huge amounts of similar or related structured data are brought together to be queried and
analyzed.
This research area is closely related to the Semantic Web and its standards stack, such as RDF* and OWL. We are interested in analyzing
and benchmarking existing storage solutions and in applying the idea of LOD to selected applications.
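The idea of combining structured sources and querying them together can be sketched in a few lines of Python. This is only an illustration under stated assumptions: statements are modeled as plain (subject, predicate, object) tuples in the spirit of RDF, the "ex:" identifiers and the match helper are hypothetical, and no real triple store or SPARQL engine is involved.

```python
# Each statement is a (subject, predicate, object) triple, as in RDF.
# All identifiers below are hypothetical placeholders, not real URIs.
SOURCE_1 = [
    ("ex:coin1", "ex:foundAt", "ex:siteA"),
    ("ex:coin1", "ex:material", "silver"),
]
SOURCE_2 = [
    ("ex:coin2", "ex:foundAt", "ex:siteA"),
    ("ex:siteA", "ex:locatedIn", "ex:regionX"),
]


def match(triples, s=None, p=None, o=None):
    """Return triples matching a pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# Combining sources is a union of their triples.
graph = set(SOURCE_1) | set(SOURCE_2)

# Which coins were found at a site located in ex:regionX?
sites = {t[0] for t in match(graph, p="ex:locatedIn", o="ex:regionX")}
coins = sorted(t[0] for t in match(graph, p="ex:foundAt") if t[2] in sites)
print(coins)
```

The query only succeeds because the two sources share the identifier ex:siteA, which is the point of publishing data with common, linkable identifiers.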
Our current activities are:
• AFE-Web: cooperation project with the Römisch-Germanische Kommission (RGK). Antike Fundmünzen in Europa (AFE-WEB)
is a web-based database for recording and publishing coin finds
• European Coin Find Network
• Nomisma.org (Karsten Tolle is a member of the steering committee)
2. Graph Databases / Linked Open Data (LOD)
• Twenty students from UC Berkeley, Stanford University and Goethe University Frankfurt
committed to working on the challenges of obesity, heart/lung failure and mood disorders.
• The Frankfurt Big Data Lab uses data acquisition and data blending to improve the quality of analytics.
3. Data analytics/Data science
<visualization of the given patients’ data: # of patients in a postal area>
<Twitter data retrieved by the keyword "obesity">
<Twitter data retrieved from the state of Pennsylvania>
4. Big Data for Social Good
What can the international research community do to make sure that some
of the most brilliant big data use cases also have an impact on social issues?
Our motivation is to encourage the
international research community to
work on Big Data problems that have
a potential positive social impact for
mankind
<World map of scientific collaborations, 2014>
4. Big Data for Social Good Projects
The DATA REFUGEES PROJECT
1.1 million refugees and migrants registered in Germany in 2015
Number of refugees to arrive in Frankfurt increases from 170 to 250 per week
We will explore whether and how data can be used to create:
• Data products that help inclusion in the city of Frankfurt
• Insights that can be escalated to the decision makers in the city of Frankfurt
4. Big Data for Social Good Projects
The DATA REFUGEES PROJECT - METHODOLOGY
We will gather data from various sources available in Frankfurt. The challenge is that the flow of
information is not, by nature, well organized.
Techniques
Data integration: Collect data from multiple sources, including changes of format and cleanup of redundant or useless
entries. The outcome is a standardized, unified table.
Data fusion: Integrate imperfect data sources overlapping over a small group of objects.
Data blending: Allow sources to be imperfect, incomplete, and overlapping over a few objects or none at all, requiring
inspired guesses and generalizations.
Design Thinking: Create and evaluate new ideas through a human-centric approach to problem solving.
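The data integration step described above (format changes, cleanup of redundant entries, one unified table) could look roughly like the following Python sketch. Everything in it is an assumption for illustration: the two sample sources, their field names, the unified schema and the duplicate rule are hypothetical and do not come from the project.

```python
import csv
import io
import json

# Hypothetical source A: CSV with its own column names.
SOURCE_A = """name,district,places
Language course,District A,20
Sports club,District B,15
"""

# Hypothetical source B: JSON with different field names and one duplicate.
SOURCE_B = json.dumps([
    {"title": "sports club", "area": "District B", "capacity": "15"},
    {"title": "Homework help", "area": "District C", "capacity": None},
])


def normalize(name, district, places):
    """Map one raw record onto the unified schema."""
    return {
        "name": name.strip(),
        "district": district.strip(),
        "places": int(places) if places not in (None, "") else None,
    }


def integrate(csv_text, json_text):
    """Convert both sources to one schema and drop redundant entries."""
    records = [
        normalize(r["name"], r["district"], r["places"])
        for r in csv.DictReader(io.StringIO(csv_text))
    ]
    records += [
        normalize(r["title"], r["area"], r["capacity"])
        for r in json.loads(json_text)
    ]
    unified, seen = [], set()
    for rec in records:
        # Assumed duplicate rule: same name and district, ignoring case.
        key = (rec["name"].lower(), rec["district"].lower())
        if key not in seen:  # keep the first occurrence only
            seen.add(key)
            unified.append(rec)
    return unified


table = integrate(SOURCE_A, SOURCE_B)
for row in table:
    print(row)
```

Data fusion and data blending go further than this sketch: they have to cope with sources that barely overlap or disagree, where a simple key match like the one above is no longer enough.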
4. Big Data for Social Good Projects
<Process diagram with the stages: Data Understanding, Inclusion Process Understanding, Data Preparation, Data Integration, Modelling, Evaluation, Deployment, Data Product or Knowledge Delivery. Annotations: "We are here!" and "Not too much data available!">
We aim to demonstrate that it is feasible to use
available data to help, support and possibly guide the
process of inclusion for refugees in the city of Frankfurt.
4. Big Data for Social Good Projects
• LOOKING FOR DATA!
• Define an open source tool and methodology for managing the volunteers and activities
and propose it to the AWO organization. THIS DATA IS NOT COLLECTED
http://lale.help https://volunteer-planner.org
• Retrieve Twitter data about refugees in Frankfurt.
• Coach two refugees to develop a mobile app.
• We aim in particular to help refugee children. We hope to be able to help them in the
inclusion process in our society.
4. Big Data for Social Good Projects
The DATA REFUGEES PROJECT NEEDS
We encourage developers to contact us if they wish to contribute to
this project!
Contact Person: Concha Sanchez-Ocaña, Project Manager. concha@dbis.cs.uni-frankfurt.de
Organizations involved
Frankfurt Big Data Lab, Goethe University Frankfurt.
School of Business, University of Applied Sciences Mainz
Research Center SAFE (funded by the State of Hessen initiative for research, LOEWE)
Betriebliche Kommunikationssysteme und IT-Security, University of Applied Sciences Offenburg
THANK YOU!!

 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...Sequential and reinforcement learning for demand side management by Margaux B...
Sequential and reinforcement learning for demand side management by Margaux B...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Purnia [ 7014168258 ] Call Me For Genuine Models We...
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Dubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls DubaiDubai Call Girls Peeing O525547819 Call Girls Dubai
Dubai Call Girls Peeing O525547819 Call Girls Dubai
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
 

Frankfurt Big Data Lab & Refugee Project

  • 1. Welcome to the Frankfurt Big Data Lab! http://www.bigdata.uni-frankfurt.de/
  • 2. Mission: The objective of the Big Data Laboratory is to carry out research in the domains of big data and data analytics from the perspective of information systems and computer science. Our approach is based on the interdisciplinary binding between data management technologies and analytics. The lab is located in Frankfurt, the financial metropolis of central Europe, and aims to be a source of knowledge and expertise for both research and industry applications. Frankfurt Big Data Lab. The DATA REFUGEES Project.
  • 3. Team: Prof. Dott. Ing. Roberto V. Zicari, Dr. Karsten Tolle, Lab Director; Hee Eun Kim, PhD Student; Todor Ivanov, PhD Student; Marten Rosselli, PhD Student (affiliations: Goethe University / Accenture); Sven Rill, PhD Student (affiliation: Hof University of Applied Sciences); Rahul Soni, PhD Student (affiliations: Goethe University / Accenture); Concha Sanchez-Ocaña, Project Manager, DBIS, Goethe University Frankfurt; Raik Niemann, PhD Student (affiliation: Hof University of Applied Sciences).
  • 5. Research Areas: Our lab is currently active in the following research areas: 1. Big Data Management Technologies; 2. Data Analytics / Data Science; 3. Graph Databases / Linked Open Data (LOD); 4. Big Data for Common Good.
  • 6. 1. Big Data Management Technologies: Our work is concentrated on the evaluation and optimization of: operational data stores that allow flexible schemas; Big Data management and analytical platforms (Hadoop, Spark, etc.); complex distributed storage and processing architectures; Big Data benchmarks.
  • 7. 1. Big Data Management Technologies (cont.): Benchmarking Big Data platforms for performance, scalability, elasticity, fault tolerance … Benchmarks used: Yahoo! Cloud Serving Benchmark (YCSB): evaluating the performance (read/write workloads) of NoSQL stores like Cassandra. HiBench: 10 workloads for evaluating the Hadoop platform in terms of speed, throughput, HDFS bandwidth, system resource utilization and machine learning algorithms. BigBench: application-level benchmark consisting of 30 queries implemented in Hive, based on the TPC-DS benchmark. TPCx-HS: the first standard Big Data benchmark for Hadoop, based on the TeraSort workload.
  • 8. 1. Big Data Management Technologies (cont.): Platforms used. NoSQL: Evaluated Cassandra / DataStax Enterprise (9-node Cassandra cluster) with HiBench and the Yahoo! Cloud Serving Benchmark. Hadoop ecosystem: • Evaluated the performance of different virtualized Hadoop cluster configurations on top of VMware vSphere using the Big Data Extension (Project Serengeti). • Benchmarked the Cloudera Hadoop Distribution 5.2 (4-node Hadoop cluster) with the TPCx-HS benchmark. • Experimenting with the BigBench benchmark using Hive and Spark SQL. In-memory databases: Evaluation of a Big Data architecture based on SAP HANA and Cloudera Hadoop for different use cases and analytical workloads.
  • 9. 1. Big Data Management Technologies (cont.): Relevant Publications • Performance Evaluation of Enterprise Big Data Platforms with HiBench (In 9th IEEE International Conference on Big Data Science and Engineering (IEEE BigDataSE 2015), August 20-22, Helsinki, Finland) • Benchmarking the Availability and Fault Tolerance of Cassandra (In 6th Workshop on Big Data Benchmarking (6th WBDB), June 16-17, 2015, Toronto, Canada) • Performance Evaluation of Spark SQL using BigBench (In 6th Workshop on Big Data Benchmarking (6th WBDB), June 16-17, 2015, Toronto, Canada) • Benchmarking DataStax Enterprise/Cassandra with HiBench (Technical Report No. 2014-2) • Performance Evaluation of Virtualized Hadoop Clusters (Technical Report No. 2014-1) • Benchmarking Virtualized Hadoop Clusters (In Proceedings of Big Data Benchmarking, 5th International Workshop, WBDB 2014, Potsdam, Germany, August 5-6, 2014, Revised Selected Papers) Full list of publications is available online: http://www.bigdata.uni-frankfurt.de/publications/
  • 10. 1. Big Data Management Technologies (cont.): Member of the Standard Performance Evaluation Corporation (SPEC). SPEC is a non-profit corporation formed to establish, maintain and endorse a standardized set of relevant benchmarks that can be applied to the newest generation of high-performance computers. The RG Big Data Working Group is a forum for individuals and organizations interested in the big data benchmarking topic. List of all 52 Member Organizations: Advanced Strategic Technology LLC; ARM; bankmark UG; Barcelona Supercomputing Center; Charles University; Cisco Systems; Cloudera, Inc; Compilaflows; Delft University of Technology; Dell; fortiss GmbH; Friedrich-Alexander-University Erlangen-Nuremberg; Goethe University Frankfurt; Hewlett-Packard; Huawei; IBM; Imperial College London; Indian Institute of Technology, Bombay; Institute for Information Industry, Taiwan; Institute of Communication and Computer Systems/NTUA; Intel; Karlsruhe Institute of Technology; Kiel University; Microsoft; MIOsoft Corporation; NICTA; NovaTec GmbH; Oracle; Purdue University; Red Hat; RWTH Aachen University; Salesforce.com; San Diego Supercomputing Center; San Francisco State University; SAP AG; Siemens Corporation; Technische Universität Darmstadt; The MITRE Corporation; Umea University; University of Alberta; University of Coimbra; University of Florence; University of Lugano; University of Minnesota; University of North Florida; University of Paderborn; VMware; University of Wuerzburg; University of Texas at Austin; University of Stuttgart *; University of Pavia
  • 11. 2. Graph Databases / Linked Open Data (LOD): Benchmarking (Berlin SPARQL Benchmark, BSBM). Linked Open Data are structured data that are published online so that they can be accessed automatically by computers. By combining different sources, huge amounts of similar or related structured data are brought together in order to be queried and analyzed. This research area is closely related to the Semantic Web and its standards stack, such as RDF* and OWL. We are interested in analyzing and benchmarking existing storage solutions and in applying the idea of LOD to selected applications. Our current activities are: • AFE-Web: cooperation project with the Römisch-Germanische Kommission (RGK). Antike Fundmünzen in Europa (AFE-WEB) is a web-based database for recording and publishing coin finds • European Coin Find Network • Nomisma.org (Karsten Tolle is a member of the steering committee)
  • 12. 3. Data Analytics / Data Science: • Twenty students from UC Berkeley, Stanford University and Goethe University Frankfurt committed to participating in the challenges of obesity, heart/lung failure and mood disorder. • Frankfurt Big Data Lab uses data acquisition and data blending to improve the quality of analytics. Figures: number of patients in a postal area (visualization of the given patients' data); Twitter data retrieved by the keyword "obesity"; Twitter data retrieved from the state of Pennsylvania.
  • 13. 4. Big Data for Social Good: What can be done in the international research community to make sure that some of the most brilliant big data use cases also have an impact on social issues? Our motivation is to encourage the international research community to work on Big Data problems that have a potential positive social impact for mankind. Figure: World map of scientific collaborations, 2014.
  • 14. 4. Big Data for Social Good Projects: The DATA REFUGEES PROJECT. 1.1 million refugees and migrants registered in Germany in 2015. The number of refugees arriving in Frankfurt is increasing from 170 to 250 per week. We will explore the question of whether and how data can be used to create: • Data products to help inclusion in the city of Frankfurt • Insights that can be escalated to the decision makers in the city of Frankfurt.
  • 15. 4. Big Data for Social Good Projects: The DATA REFUGEES PROJECT - METHODOLOGY. We will gather data from various sources available in Frankfurt. The challenge is that the flow of information is not, by nature, well organized. Data integration: Collect data from multiple sources, including changes of format and cleanup of redundant or useless entries. The outcome is a standardized, unified table. Data fusion: Integrate imperfect data sources overlapping over a small group of objects. Data blending: Allow sources to be imperfect, incomplete, and overlapping over a few objects or none at all, requiring inspired guesses and generalizations. Design Thinking techniques: Create and evaluate new ideas through a human-centric approach to problem solving.
  • 16. 4. Big Data for Social Good Projects: Process diagram stages: DATA UNDERSTANDING; INCLUSION PROCESS UNDERSTANDING; DATA PREPARATION; DATA INTEGRATION; MODELLING; EVALUATION; DEPLOYMENT; DATA PRODUCT OR KNOWLEDGE DELIVERY. We are here! Not too much data available! We aim to demonstrate that it is feasible to use available data to help, support and possibly guide the process of inclusion for refugees in the city of Frankfurt.
  • 17. 4. Big Data for Social Good Projects: • LOOKING FOR DATA! • Define an open source tool and methodology for managing the volunteers and activities and propose it to the AWO organization. THIS DATA IS NOT COLLECTED. http://lale.help https://volunteer-planner.org • Retrieve the Twitter data about refugees in Frankfurt. • Coach the two refugees to develop a mobile app. • We aim in particular to help refugee children. We hope to be able to help them in the process of inclusion in our society.
  • 18. 4. Big Data for Social Good Projects: The DATA REFUGEES PROJECT NEEDS. We encourage developers who wish to contribute to this project to contact us! Contact person: Concha Sanchez-Ocaña, Project Manager, concha@dbis.cs.uni-frankfurt.de. Organizations involved: Frankfurt Big Data Lab, Goethe University Frankfurt; School of Business, University of Applied Sciences Mainz; Research Center SAFE (funded by the State of Hessen initiative for research, LOEWE); Betriebliche Kommunikationssysteme und IT-Security, University of Applied Sciences Offenburg. THANK YOU!!
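
Slide 15 describes data integration as collecting data from multiple sources, changing formats, cleaning up redundant or useless entries, and ending with a standardized, unified table. A minimal sketch of that step might look as follows. The two sources, their field names, the sample records, and the helper names `normalize` and `integrate` are all hypothetical and chosen only for illustration; they do not come from the project.

```python
from datetime import datetime

# Hypothetical records from two made-up sources that use different
# field names and different date formats.
SOURCE_A = [
    {"name": "Language course", "district": "Bockenheim", "start": "2016-03-01"},
    {"name": "Legal advice", "district": "Gallus", "start": "2016-03-15"},
]
SOURCE_B = [
    {"title": "language course ", "area": "bockenheim", "begin": "01.03.2016"},
    {"title": "Sports afternoon", "area": "Nordend", "begin": "20.03.2016"},
    {"title": "", "area": "", "begin": ""},  # useless entry, dropped below
]


def normalize(name, district, start, date_format):
    """Map one raw record onto the unified schema, or return None if unusable."""
    name, district = name.strip(), district.strip()
    if not name or not start:
        return None
    return {
        "name": name.capitalize(),
        "district": district.capitalize(),
        "start": datetime.strptime(start, date_format).date(),
    }


def integrate(source_a, source_b):
    """Convert both sources to one schema and drop useless or redundant rows."""
    rows = [normalize(r["name"], r["district"], r["start"], "%Y-%m-%d") for r in source_a]
    rows += [normalize(r["title"], r["area"], r["begin"], "%d.%m.%Y") for r in source_b]
    unified, seen = [], set()
    for row in rows:
        if row is None:
            continue  # useless entry
        key = (row["name"].lower(), row["district"].lower(), row["start"])
        if key in seen:
            continue  # redundant entry already present in the table
        seen.add(key)
        unified.append(row)
    return unified


table = integrate(SOURCE_A, SOURCE_B)
for row in table:
    print(row["name"], row["district"], row["start"].isoformat())
```

Exact-match deduplication on a normalized key is the simplest possible choice here. Data fusion and data blending, as described on the same slide, would need fuzzier matching, which this sketch does not attempt.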

Editor's Notes

  1. We are interested in benchmarking new software platforms for storing and processing massive amounts of data and for analytics beyond what conventional relational systems can do. We are interested in testing such systems against domain-specific workloads to perform data clustering, predictive modeling, and complex statistics. In addition, we are investigating graph-based DBMSes for social-network-style analysis.
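
As a rough illustration of what a read/write workload benchmark measures, the sketch below runs a configurable read/write mix against a plain Python dict and reports throughput. It is not YCSB or any other benchmark named in the slides. The function name `run_workload`, its parameters, the key naming scheme, and the in-memory dict standing in for a data store are all assumptions made for illustration.

```python
import random
import time


def run_workload(store, operations, read_fraction, key_space, seed=0):
    """Issue a random mix of reads and writes and measure throughput.

    `store` is any dict-like object; a real benchmark would talk to a
    database client here instead (assumption for this sketch).
    """
    rng = random.Random(seed)  # fixed seed keeps the operation mix repeatable
    reads = writes = 0
    started = time.perf_counter()
    for _ in range(operations):
        key = f"user{rng.randrange(key_space)}"
        if rng.random() < read_fraction:
            store.get(key)  # read; the key may not exist yet
            reads += 1
        else:
            store[key] = rng.random()  # write (insert or update)
            writes += 1
    elapsed = time.perf_counter() - started
    return {
        "reads": reads,
        "writes": writes,
        "seconds": elapsed,
        "ops_per_second": operations / elapsed if elapsed > 0 else float("inf"),
    }


# A read-heavy mix: 95% reads, 5% writes over 1,000 distinct keys.
result = run_workload({}, operations=10_000, read_fraction=0.95, key_space=1_000)
print(result["reads"], result["writes"], round(result["ops_per_second"]))
```

Changing `read_fraction` gives write-heavy or balanced mixes. Measuring a real system would additionally require warm-up runs, multiple client threads, and latency percentiles, none of which are covered here.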