SlideShare a Scribd company logo
INTRO INTO
DATA SCIENCE CLUB
16th OCTOBER
2017
TABLE OF CONTENT
INTRO
WHY DATA SCIENCE CLUB AND WHAT ARE THE GOALS?
INTRO INTO DATA SCIENCE
AI research isn't just in Silicon Valley
- Alan Turing, born in London
- DeepMind, founded in London
- Amazon AI labs, Berlin
- Yandex, Russia
- Baido, China
- ...
Why Data Science Club?
- Network business and academia for mutual good
- Business
- Dedicated to adding a value
- Needs an access to research, talent and innovation
- Academia
- Dedicated to research and teaching
- Can't do all the research and teaching alone
Exponea is a company with ...
- Team of Software Engineers that work on applied AI
- 80% from FIIT and Matfyz
- Data and interesting clients from all around the world
- Lots of interesting problems to solve
- Lots of relevant knowledge to share
STU FIIT is an education institution with ...
- A study programme Intelligent Software Systems
- Software Engineering
- Artificial Intelligence
- A relevant research and team of experienced researchers
- Lots of graduates with careers related to data science
- Great facilities where all of us can meet
What is Data Science Club?
- Community of like minded people with interest in
data, analytics, software engineering and AI
- Regular meetups where we share knowledge and
work on our challenges together
When?
- 6 times per semester in both summer and winter
- Monday starting at 16:00 with agenda for 3 hours:
- Expert talk(s)
- Practical workshop
- Networking, consulting, discussions
Draft of Agenda
Winter
1. Intro into Data Science
2. Data storage
3. SQL for data analytics
4. Map-Reduce
5. Stream data processing
6. Data visualisation
Summer
1. Applications of machine learning
2. Process of machine learning
3. Classification
4. Recommendation engines
5. Bandit algorithms
6. Reinforcement learning
Final agenda depends on your interest and availability of expert speakers
Goals
- Build an active community
- 3+ universities, 10+ companies, 100+ members
- Create and share content
- Reach 2000+ viewers
- Contribute to research and applications
- 5+ publications by B&A authors in 2017/18
- 5+ new features in software products
People
- Jozo Kovac, Cofounder, CTO
- Matus Cimerman, Head of AI
- Peter Kovacs, AISW Engineer
- Ondrej Brichta, AISW Engineer
- Jakub Macina, AISW Engineer
- Robert Lacok, AISW Engineer
- Dalibor Meszaros, AISW Engineer
- Lucia Siebestichova, Event Manager
- Martina Kolibasova, QA Engineer
Contacts
- FB Page: https://www.facebook.com/ExponeaSocietyBratislava/
- Slack: https://community.exponea.com/
- Wiki: https://github.com/exponea/data-science-club/wiki
- Git: https://github.com/exponea/data-science-club
- Email: matus.cimerman@exponea.com
MEET
OUR
TEAM
WRITE HERE SOMETHING
MY SEARCH FOR
A FAMILY VACATION
LETS START WITH A REAL WORLD PROBLEM
Challenges
- We pay for traffic, how to increase customer value?
- Can customer history improve the quality of search?
- Search must be fast, render everything under 100ms
A quick analysis
- A learn to rank problem
https://www.slideshare.net/MrChrisJohnson/interactive-
recommender-systems-with-netflix-and-spotify
Designing Data-Intensive Applications
WORKSHOP
16th OCTOBER
2017
Challenges - step by step
• Collect data
• Preprocess
• Train a model
• Evaluate
• Deploy
• Improve the model
Data Model
Data* InsightModel
Collect data
• If you have it, you’re done
• If not, find a way to collect
Preprocess
• What:
• Select features
• Normalize
• Fill
• Aggregate...
• How:
• Store
• Scale out processing
Create a model
• Learn about your domain
• Go really simple first
• Don’t reinvent the wheel
Evaluate
• What metrics make sense?
• Test sets vs live data
Deploy
• Online API vs batch
• Latency matters
• Reliable
• Monitor & evaluate
• Retrain on new data
Improve the model
• Read more papers and blogs. Is anyone doing it better?
• Lead, innovate and be the best
Workshop
1. Set up your environment: dependencies, template & data
2. Preprocess: Clean, normalize, feature select
(pandas/scikit-learn)
3. Train: Choose a model of your liking (scikit-learn)
4. Deploy: Create a REST API (flask)

More Related Content

What's hot

Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
Spotle.ai
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Srishti44
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
VijayMohan Vasu
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
Jason Geng
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
Edureka!
 
Data Science Introduction
Data Science IntroductionData Science Introduction
Data Science Introduction
Gang Tao
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Edureka!
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
Simplilearn
 
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Edureka!
 
Data Science
Data ScienceData Science
Data Science
Rabin BK
 
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Edureka!
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Edureka!
 
Data science
Data scienceData science
Data science
Purna Chander
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Edureka!
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...
Simplilearn
 
Data science presentation
Data science presentationData science presentation
Data science presentation
MSDEVMTL
 
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
Edureka!
 
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
Ioannis Kourouklides
 
Data science
Data scienceData science
Data science
Sreejith c
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
Utkarsh Sharma
 

What's hot (20)

Introduction To Data Science
Introduction To Data ScienceIntroduction To Data Science
Introduction To Data Science
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
 
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
What Is Data Science? Data Science Course - Data Science Tutorial For Beginne...
 
Data Science Introduction
Data Science IntroductionData Science Introduction
Data Science Introduction
 
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
Data Science For Beginners | Who Is A Data Scientist? | Data Science Tutorial...
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...What Is Data Science? | Introduction to Data Science | Data Science For Begin...
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
 
Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...Data Science Training | Data Science Tutorial | Data Science Certification | ...
Data Science Training | Data Science Tutorial | Data Science Certification | ...
 
Data Science
Data ScienceData Science
Data Science
 
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
 
Data science
Data scienceData science
Data science
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...Data Science Training | Data Science For Beginners | Data Science With Python...
Data Science Training | Data Science For Beginners | Data Science With Python...
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
How to Become a Data Scientist | Data Scientist Skills | Data Science Trainin...
 
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
 
Data science
Data scienceData science
Data science
 
Introduction to Data Analytics
Introduction to Data AnalyticsIntroduction to Data Analytics
Introduction to Data Analytics
 

Similar to Introduction to data science club

How to become the best datascientist in Europe
How to become the best datascientist in EuropeHow to become the best datascientist in Europe
How to become the best datascientist in Europe
DigitYser
 
OCS_Andrija_Bednarik
OCS_Andrija_BednarikOCS_Andrija_Bednarik
OCS_Andrija_Bednarik
Vojvodina ICT Cluster
 
Outsourcing Center Serbia
Outsourcing Center SerbiaOutsourcing Center Serbia
Outsourcing Center Serbia
ITDogadjaji.com
 
Brussels Capital of Data Science
Brussels Capital of Data ScienceBrussels Capital of Data Science
Brussels Capital of Data Science
DigitYser
 
AI-SDV Meeting in Nice
AI-SDV Meeting in NiceAI-SDV Meeting in Nice
AI-SDV Meeting in Nice
Dr. Haxel Consult
 
Guide for being a Culture Hack Scotland data partner v1
Guide for being a Culture Hack Scotland data partner v1Guide for being a Culture Hack Scotland data partner v1
Guide for being a Culture Hack Scotland data partner v1
Sync
 
Personal Knowledge Mastery (PKM) and the Social Intranet
Personal Knowledge Mastery (PKM) and the Social IntranetPersonal Knowledge Mastery (PKM) and the Social Intranet
Personal Knowledge Mastery (PKM) and the Social Intranet
Intranätverk
 
2015 06 18 datascienc meetup privacy - update - philippe van impe
2015 06 18   datascienc meetup privacy - update - philippe van impe2015 06 18   datascienc meetup privacy - update - philippe van impe
2015 06 18 datascienc meetup privacy - update - philippe van impe
DigitYser
 
Data Activities in Austria
Data Activities in AustriaData Activities in Austria
Data Activities in Austria
Semantic Web Company
 
Data Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8thData Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8th
Jonathan Woodward
 
Sironta
SirontaSironta
Sironta
Miguel Vidal
 
Who We Are
Who We AreWho We Are
Who We Are
kurilo
 
ION Hangzhou - Opening Remarks
ION Hangzhou - Opening RemarksION Hangzhou - Opening Remarks
ION Hangzhou - Opening Remarks
Deploy360 Programme (Internet Society)
 
Kick-off meeting Linkflows project
Kick-off meeting Linkflows projectKick-off meeting Linkflows project
Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1
ISSIP
 
Allcu
AllcuAllcu
Gdsc kick off
Gdsc kick offGdsc kick off
Gdsc kick off
GDSCAISSMSIOIT
 
OxSDE.pptx
OxSDE.pptxOxSDE.pptx
OxSDE.pptx
sbonfa
 
Building valuable (online and offline) Data Science communities - Experience ...
Building valuable (online and offline) Data Science communities - Experience ...Building valuable (online and offline) Data Science communities - Experience ...
Building valuable (online and offline) Data Science communities - Experience ...
Institute of Contemporary Sciences
 
ION Belgrade - Opening Slides
ION Belgrade - Opening SlidesION Belgrade - Opening Slides
ION Belgrade - Opening Slides
Deploy360 Programme (Internet Society)
 

Similar to Introduction to data science club (20)

How to become the best datascientist in Europe
How to become the best datascientist in EuropeHow to become the best datascientist in Europe
How to become the best datascientist in Europe
 
OCS_Andrija_Bednarik
OCS_Andrija_BednarikOCS_Andrija_Bednarik
OCS_Andrija_Bednarik
 
Outsourcing Center Serbia
Outsourcing Center SerbiaOutsourcing Center Serbia
Outsourcing Center Serbia
 
Brussels Capital of Data Science
Brussels Capital of Data ScienceBrussels Capital of Data Science
Brussels Capital of Data Science
 
AI-SDV Meeting in Nice
AI-SDV Meeting in NiceAI-SDV Meeting in Nice
AI-SDV Meeting in Nice
 
Guide for being a Culture Hack Scotland data partner v1
Guide for being a Culture Hack Scotland data partner v1Guide for being a Culture Hack Scotland data partner v1
Guide for being a Culture Hack Scotland data partner v1
 
Personal Knowledge Mastery (PKM) and the Social Intranet
Personal Knowledge Mastery (PKM) and the Social IntranetPersonal Knowledge Mastery (PKM) and the Social Intranet
Personal Knowledge Mastery (PKM) and the Social Intranet
 
2015 06 18 datascienc meetup privacy - update - philippe van impe
2015 06 18   datascienc meetup privacy - update - philippe van impe2015 06 18   datascienc meetup privacy - update - philippe van impe
2015 06 18 datascienc meetup privacy - update - philippe van impe
 
Data Activities in Austria
Data Activities in AustriaData Activities in Austria
Data Activities in Austria
 
Data Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8thData Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8th
 
Sironta
SirontaSironta
Sironta
 
Who We Are
Who We AreWho We Are
Who We Are
 
ION Hangzhou - Opening Remarks
ION Hangzhou - Opening RemarksION Hangzhou - Opening Remarks
ION Hangzhou - Opening Remarks
 
Kick-off meeting Linkflows project
Kick-off meeting Linkflows projectKick-off meeting Linkflows project
Kick-off meeting Linkflows project
 
Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1Tutorial helsinki 20180313 v1
Tutorial helsinki 20180313 v1
 
Allcu
AllcuAllcu
Allcu
 
Gdsc kick off
Gdsc kick offGdsc kick off
Gdsc kick off
 
OxSDE.pptx
OxSDE.pptxOxSDE.pptx
OxSDE.pptx
 
Building valuable (online and offline) Data Science communities - Experience ...
Building valuable (online and offline) Data Science communities - Experience ...Building valuable (online and offline) Data Science communities - Experience ...
Building valuable (online and offline) Data Science communities - Experience ...
 
ION Belgrade - Opening Slides
ION Belgrade - Opening SlidesION Belgrade - Opening Slides
ION Belgrade - Opening Slides
 

More from Data Science Club

How to present campaign results to your boss
How to present campaign results to your bossHow to present campaign results to your boss
How to present campaign results to your boss
Data Science Club
 
Principles of Big Data Analytics Visualization
Principles of Big Data Analytics VisualizationPrinciples of Big Data Analytics Visualization
Principles of Big Data Analytics Visualization
Data Science Club
 
Batch (Spark) and Streaming (Kafka) Data-Preprocessing
Batch (Spark) and Streaming (Kafka) Data-PreprocessingBatch (Spark) and Streaming (Kafka) Data-Preprocessing
Batch (Spark) and Streaming (Kafka) Data-Preprocessing
Data Science Club
 
A Big (Query) Frog in a Small Pond, Jakub Motyl, BuffPanel
A Big (Query) Frog in a Small Pond, Jakub Motyl, BuffPanelA Big (Query) Frog in a Small Pond, Jakub Motyl, BuffPanel
A Big (Query) Frog in a Small Pond, Jakub Motyl, BuffPanel
Data Science Club
 
Why Successful Games Need Analytics
Why Successful Games Need AnalyticsWhy Successful Games Need Analytics
Why Successful Games Need Analytics
Data Science Club
 
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, ExponeaLive predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
Data Science Club
 

More from Data Science Club (6)

How to present campaign results to your boss
How to present campaign results to your bossHow to present campaign results to your boss
How to present campaign results to your boss
 
Principles of Big Data Analytics Visualization
Principles of Big Data Analytics VisualizationPrinciples of Big Data Analytics Visualization
Principles of Big Data Analytics Visualization
 
Batch (Spark) and Streaming (Kafka) Data-Preprocessing
Batch (Spark) and Streaming (Kafka) Data-PreprocessingBatch (Spark) and Streaming (Kafka) Data-Preprocessing
Batch (Spark) and Streaming (Kafka) Data-Preprocessing
 
A Big (Query) Frog in a Small Pond, Jakub Motyl, BuffPanel
A Big (Query) Frog in a Small Pond, Jakub Motyl, BuffPanelA Big (Query) Frog in a Small Pond, Jakub Motyl, BuffPanel
A Big (Query) Frog in a Small Pond, Jakub Motyl, BuffPanel
 
Why Successful Games Need Analytics
Why Successful Games Need AnalyticsWhy Successful Games Need Analytics
Why Successful Games Need Analytics
 
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, ExponeaLive predictions with schemaless data at scale. MLMU Kosice, Exponea
Live predictions with schemaless data at scale. MLMU Kosice, Exponea
 

Recently uploaded

一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
nuttdpt
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Walaa Eldin Moustafa
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
javier ramirez
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
Natural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptxNatural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptx
fkyes25
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
nyfuhyz
 

Recently uploaded (20)

一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
一比一原版(UCSF文凭证书)旧金山分校毕业证如何办理
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
The Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series DatabaseThe Building Blocks of QuestDB, a Time Series Database
The Building Blocks of QuestDB, a Time Series Database
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
Natural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptxNatural Language Processing (NLP), RAG and its applications .pptx
Natural Language Processing (NLP), RAG and its applications .pptx
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
一比一原版(UMN文凭证书)明尼苏达大学毕业证如何办理
 

Introduction to data science club

  • 1. INTRO INTO DATA SCIENCE CLUB 16th OCTOBER 2017
  • 2. TABLE OF CONTENT INTRO WHY DATA SCIENCE CLUB AND WHAT ARE THE GOALS? INTRO INTO DATA SCIENCE
  • 3.
  • 4.
  • 5.
  • 6.
  • 7. AI research isn't just in Silicon Valley - Alan Turing, born in London - DeepMind, founded in London - Amazon AI labs, Berlin - Yandex, Russia - Baido, China - ...
  • 8. Why Data Science Club? - Network business and academia for mutual good - Business - Dedicated to adding a value - Needs an access to research, talent and innovation - Academia - Dedicated to research and teaching - Can't do all the research and teaching alone
  • 9. Exponea is a company with ... - Team of Software Engineers that work on applied AI - 80% from FIIT and Matfyz - Data and interesting clients from all around the world - Lots of interesting problems to solve - Lots of relevant knowledge to share
  • 10.
  • 11.
  • 12. STU FIIT is an education institution with ... - A study programme Intelligent Software Systems - Software Engineering - Artificial Intelligence - A relevant research and team of experienced researchers - Lots of graduates with careers related to data science - Great facilities where all of us can meet
  • 13. What is Data Science Club? - Community of like minded people with interest in data, analytics, software engineering and AI - Regular meetups where we share knowledge and work on our challenges together
  • 14. When? - 6 times per semester in both summer and winter - Monday starting at 16:00 with agenda for 3 hours: - Expert talk(s) - Practical workshop - Networking, consulting, discussions
  • 15. Draft of Agenda Winter 1. Intro into Data Science 2. Data storage 3. SQL for data analytics 4. Map-Reduce 5. Stream data processing 6. Data visualisation Summer 1. Applications of machine learning 2. Process of machine learning 3. Classification 4. Recommendation engines 5. Bandit algorithms 6. Reinforcement learning Final agenda depends on your interest and availability of expert speakers
  • 16. Goals - Build an active community - 3+ universities, 10+ companies, 100+ members - Create and share content - Reach 2000+ viewers - Contribute to research and applications - 5+ publications by B&A authors in 2017/18 - 5+ new features in software products
  • 17. People - Jozo Kovac, Cofounder, CTO - Matus Cimerman, Head of AI - Peter Kovacs, AISW Engineer - Ondrej Brichta, AISW Engineer - Jakub Macina, AISW Engineer - Robert Lacok, AISW Engineer - Dalibor Meszaros, AISW Engineer - Lucia Siebestichova, Event Manager - Martina Kolibasova, QA Engineer
  • 18. Contacts - FB Page: https://www.facebook.com/ExponeaSocietyBratislava/ - Slack: https://community.exponea.com/ - Wiki: https://github.com/exponea/data-science-club/wiki - Git: https://github.com/exponea/data-science-club - Email: matus.cimerman@exponea.com
  • 19. MEET OUR TEAM WRITE HERE SOMETHING MY SEARCH FOR A FAMILY VACATION LETS START WITH A REAL WORLD PROBLEM
  • 20.
  • 21.
  • 22. Challenges - We pay for traffic, how to increase customer value? - Can customer history improve the quality of search? - Search must be fast, render everything under 100ms
  • 23. A quick analysis - A learn to rank problem
  • 24.
  • 26.
  • 27.
  • 29.
  • 30.
  • 31.
  • 33. Challenges - step by step • Collect data • Preprocess • Train a model • Evaluate • Deploy • Improve the model Data Model Data* InsightModel
  • 34. Collect data • If you have it, you’re done • If not, find a way to collect
  • 35. Preprocess • What: • Select features • Normalize • Fill • Aggregate... • How: • Store • Scale out processing
  • 36. Create a model • Learn about your domain • Go really simple first • Don’t reinvent the wheel
  • 37. Evaluate • What metrics make sense? • Test sets vs live data
  • 38. Deploy • Online API vs batch • Latency matters • Reliable • Monitor & evaluate • Retrain on new data
  • 39. Improve the model • Read more papers and blogs. Is anyone doing it better? • Lead, innovate and be the best
  • 40. Workshop 1. Set up your environment: dependencies, template & data 2. Preprocess: Clean, normalize, feature select (pandas/scikit-learn) 3. Train: Choose a model of your liking (scikit-learn) 4. Deploy: Create a REST API (flask)