SlideShare a Scribd company logo
Statoil/C-CORE Iceberg Classifier Challenge
Ship or iceberg, can you decide from space?
Who we are?
Kirill Zhdanovich
Inovia, Spotify, Maps.me, Wargaming
Competitive machine learning
Competition mechanics
● Data
● Model
● Submission
● Evaluation
● Leaderboard
Real world application vs Competitions
● Business problem
● Problem formalization
● Data collecting
● Data processing
● Modelling
● Evaluate model in production
● Deploy
Real world application vs Competitions
Aspect Real life Competition
Problem formalization Y N
Choice of target metric Y N
Inference Speed Y N
Data collecting Y Y/N
Model complexity Y Y/N
Target metric value Y Y
Kaggle
● Biggest prize - 1.500.000$
● Biggest amount of participants - 5.169
● Community(May 2016):
○ 536.000 kagglers
○ 194 country
Famous kagglers
Fchollet (François Chollet) - creator of Keras
Goodfellow (Ian Goodfellow) - author of Deep Learning book
H2o.ai - industry leader in ml platforms, company created from kagglers
DeepMind - some team member are famous kagglers(1, 2)
Why to participate
● Learning - getting job in ML after winning Kaggle competition
● Interesting challenges
● Meeting new people in ML/DS community
● Winning money
Where to start
● Kaggle
● Coursera: How to Win a Data Science Competition
● Deep learning with python
Who we are?
Andrii Sydorchuk
Sana Labs, Spotify, Google
Statoil/C-CORE Iceberg Classifier Challenge
Problem description
● Drifting icebergs present threats to navigation
● Companies use aerial reconnaissance and shore-based support to monitor
environmental conditions and assess risks from icebergs
● In remote areas with particularly harsh weather, these methods are not
feasible, and the only viable monitoring option is via satellite
● Build an algorithm that automatically identifies if a remotely sensed target is a
ship or iceberg
Background
● The Sentinel-1 satellite constellation is used to monitor land and ocean
● The C-Band radar can "see" through darkness, rain, cloud and even fog
● An object will appear as a bright spot because it reflects more radar energy
than its surroundings
Background
Data & evaluation
● band1, band2 - 75x75 sensor data, radar backscatter produced from
different polarizations
● incidence angle - the incidence angle of which the image was taken
● 1604 images in the train set (753 icebergs, 851 ships)
● 8424 images in the test set
● test set contains synthetic data
● evaluation - log loss between the predicted values and the ground truth
Data samples
Data samples
Data samples
Data samples
High level analysis
Working with images
● Convolutional neural networks
● Pretrained models (Xception, ResNet, VGG16)
● Data augmentation
Working with incidence angle
● Additional input to neural network
● Feature engineering
Initial approach
● Train custom CNN for the image + angle data
● Tune hyperparameters
● Tune data augmentation parameters
Visualize your data
Final approach
Train image data Test image data
CNN ensemble
CNN predictions Incidence angle data
features per inc. angle key KNN predictions
LightGBM classifier
Final predictions
CNN predictions
Final approach
1. Ensemble of CNNs with similar architecture and data augmentation
2. Take subset of CNNs that gives best score on cross validation
3. Group predictions from the previous step by incidence angle
a. For each group calculate mean, median, number of samples
4. Run KNN regressor on predictions from step 2) by incidence angle
5. Train LightGBM on the features from steps 2), 3), 4)
Questions?
Andrii Sydorchuk - sydorchuk.andriy@gmail.com
Kirill Zhdanovich - kzhdanovich@gmail.com
Competition link - https://www.kaggle.com/c/statoil-iceberg-classifier-challenge
Source code - https://github.com/asydorchuk/kaggle/tree/master/statoil

More Related Content

What's hot

OpenStreetMap in 3D - current developments
OpenStreetMap in 3D - current developmentsOpenStreetMap in 3D - current developments
OpenStreetMap in 3D - current developments
virtualcitySYSTEMS GmbH
 
Risk studies using OpenQuake-engine
Risk studies using OpenQuake-engineRisk studies using OpenQuake-engine
Risk studies using OpenQuake-engine
Global Earthquake Model Foundation
 
Nuclear Power Plant in Japan- The Evolution
Nuclear Power Plant in Japan- The EvolutionNuclear Power Plant in Japan- The Evolution
Nuclear Power Plant in Japan- The Evolution
Catherine Chen
 
Creating A Multi-wavelength Galactic Plane Atlas With Amazon Web Services
Creating A Multi-wavelength Galactic Plane Atlas With Amazon Web ServicesCreating A Multi-wavelength Galactic Plane Atlas With Amazon Web Services
Creating A Multi-wavelength Galactic Plane Atlas With Amazon Web Services
G. Bruce Berriman
 
Project 2019 05
Project 2019 05Project 2019 05
EMME Earthquake Model of Middle East
EMME Earthquake Model of Middle EastEMME Earthquake Model of Middle East
EMME Earthquake Model of Middle East
Global Earthquake Model Foundation
 
INSPIREd computing for EO Based Services
INSPIREd computing for EO Based Services INSPIREd computing for EO Based Services
INSPIREd computing for EO Based Services
Redazione Planetek Italia
 
Convolutional Patch Representations for Image Retrieval An unsupervised approach
Convolutional Patch Representations for Image Retrieval An unsupervised approachConvolutional Patch Representations for Image Retrieval An unsupervised approach
Convolutional Patch Representations for Image Retrieval An unsupervised approach
Universitat de Barcelona
 
LinkedIn Slide presentation_DSTL_179
LinkedIn Slide presentation_DSTL_179LinkedIn Slide presentation_DSTL_179
LinkedIn Slide presentation_DSTL_179Winson Lam, Ph.D.
 
Rasdaman use case
Rasdaman use case Rasdaman use case
Rasdaman use case
EOSC-hub project
 
2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...
2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...
2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...
GIS in the Rockies
 
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Beniamino Murgante
 
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryUsing Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Astraea, Inc.
 
Looking from Above: Object Detection and Other Computer Vision Tasks on Satel...
Looking from Above: Object Detection and Other Computer Vision Tasks on Satel...Looking from Above: Object Detection and Other Computer Vision Tasks on Satel...
Looking from Above: Object Detection and Other Computer Vision Tasks on Satel...
Xiaoyong Zhu
 
RasterFrames: Enabling Global-Scale Geospatial Machine Learning
RasterFrames: Enabling Global-Scale Geospatial Machine LearningRasterFrames: Enabling Global-Scale Geospatial Machine Learning
RasterFrames: Enabling Global-Scale Geospatial Machine Learning
Astraea, Inc.
 
GIS fundamentals - raster
GIS fundamentals - rasterGIS fundamentals - raster
GIS fundamentals - raster
Hans van der Kwast
 
Gal project - ASI Workshop for FP7 Galileo Projects
Gal project - ASI Workshop for FP7 Galileo ProjectsGal project - ASI Workshop for FP7 Galileo Projects
Gal project - ASI Workshop for FP7 Galileo ProjectsAngelo Amodio
 
Rendering Aurora
Rendering AuroraRendering Aurora
Rendering Aurora
rusted_coil
 
24 Konsep Terbaik Desain Gedung Pencakar Langit Dunia
24 Konsep Terbaik Desain Gedung Pencakar Langit Dunia24 Konsep Terbaik Desain Gedung Pencakar Langit Dunia
24 Konsep Terbaik Desain Gedung Pencakar Langit Dunia
badar masbadar
 
Ict 2019 v2
Ict 2019 v2Ict 2019 v2
Ict 2019 v2
FranoisCURNIER
 

What's hot (20)

OpenStreetMap in 3D - current developments
OpenStreetMap in 3D - current developmentsOpenStreetMap in 3D - current developments
OpenStreetMap in 3D - current developments
 
Risk studies using OpenQuake-engine
Risk studies using OpenQuake-engineRisk studies using OpenQuake-engine
Risk studies using OpenQuake-engine
 
Nuclear Power Plant in Japan- The Evolution
Nuclear Power Plant in Japan- The EvolutionNuclear Power Plant in Japan- The Evolution
Nuclear Power Plant in Japan- The Evolution
 
Creating A Multi-wavelength Galactic Plane Atlas With Amazon Web Services
Creating A Multi-wavelength Galactic Plane Atlas With Amazon Web ServicesCreating A Multi-wavelength Galactic Plane Atlas With Amazon Web Services
Creating A Multi-wavelength Galactic Plane Atlas With Amazon Web Services
 
Project 2019 05
Project 2019 05Project 2019 05
Project 2019 05
 
EMME Earthquake Model of Middle East
EMME Earthquake Model of Middle EastEMME Earthquake Model of Middle East
EMME Earthquake Model of Middle East
 
INSPIREd computing for EO Based Services
INSPIREd computing for EO Based Services INSPIREd computing for EO Based Services
INSPIREd computing for EO Based Services
 
Convolutional Patch Representations for Image Retrieval An unsupervised approach
Convolutional Patch Representations for Image Retrieval An unsupervised approachConvolutional Patch Representations for Image Retrieval An unsupervised approach
Convolutional Patch Representations for Image Retrieval An unsupervised approach
 
LinkedIn Slide presentation_DSTL_179
LinkedIn Slide presentation_DSTL_179LinkedIn Slide presentation_DSTL_179
LinkedIn Slide presentation_DSTL_179
 
Rasdaman use case
Rasdaman use case Rasdaman use case
Rasdaman use case
 
2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...
2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...
2018 GIS in the Rockies Vendor Showcase (Th): ERDAS Imagine What's New and Ti...
 
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
Data Usability Assessment for Remote Sensing Data: Accuracy of Interactive Da...
 
Using Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite ImageryUsing Deep Learning to Derive 3D Cities from Satellite Imagery
Using Deep Learning to Derive 3D Cities from Satellite Imagery
 
Looking from Above: Object Detection and Other Computer Vision Tasks on Satel...
Looking from Above: Object Detection and Other Computer Vision Tasks on Satel...Looking from Above: Object Detection and Other Computer Vision Tasks on Satel...
Looking from Above: Object Detection and Other Computer Vision Tasks on Satel...
 
RasterFrames: Enabling Global-Scale Geospatial Machine Learning
RasterFrames: Enabling Global-Scale Geospatial Machine LearningRasterFrames: Enabling Global-Scale Geospatial Machine Learning
RasterFrames: Enabling Global-Scale Geospatial Machine Learning
 
GIS fundamentals - raster
GIS fundamentals - rasterGIS fundamentals - raster
GIS fundamentals - raster
 
Gal project - ASI Workshop for FP7 Galileo Projects
Gal project - ASI Workshop for FP7 Galileo ProjectsGal project - ASI Workshop for FP7 Galileo Projects
Gal project - ASI Workshop for FP7 Galileo Projects
 
Rendering Aurora
Rendering AuroraRendering Aurora
Rendering Aurora
 
24 Konsep Terbaik Desain Gedung Pencakar Langit Dunia
24 Konsep Terbaik Desain Gedung Pencakar Langit Dunia24 Konsep Terbaik Desain Gedung Pencakar Langit Dunia
24 Konsep Terbaik Desain Gedung Pencakar Langit Dunia
 
Ict 2019 v2
Ict 2019 v2Ict 2019 v2
Ict 2019 v2
 

Similar to Electrolux meetup

Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite Imagery
RAHUL BHOJWANI
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Get Your Head in the Cloud - Lessons in GPU Computing with Schlumberger
Get Your Head in the Cloud - Lessons in GPU Computing with SchlumbergerGet Your Head in the Cloud - Lessons in GPU Computing with Schlumberger
Get Your Head in the Cloud - Lessons in GPU Computing with Schlumberger
inside-BigData.com
 
AI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open WorkshopAI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open Workshop
ExtremeEarth
 
Deep image retrieval learning global representations for image search
Deep image retrieval  learning global representations for image searchDeep image retrieval  learning global representations for image search
Deep image retrieval learning global representations for image search
Universitat Politècnica de Catalunya
 
Physics-ML のためのフレームワーク NVIDIA Modulus 最新事情
Physics-ML のためのフレームワーク NVIDIA Modulus 最新事情Physics-ML のためのフレームワーク NVIDIA Modulus 最新事情
Physics-ML のためのフレームワーク NVIDIA Modulus 最新事情
NVIDIA Japan
 
Centernet
CenternetCenternet
Centernet
Arithmer Inc.
 
Desktop Softwares for Unmanned Aerial Systems(UAS))
Desktop Softwares for Unmanned Aerial Systems(UAS))Desktop Softwares for Unmanned Aerial Systems(UAS))
Desktop Softwares for Unmanned Aerial Systems(UAS))
Kamal Shahi
 
Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...
Universitat de Barcelona
 
GDG Cloud Southlake #11 Steve McGhee Reliability Theory and Practice
GDG Cloud Southlake #11 Steve McGhee Reliability Theory and PracticeGDG Cloud Southlake #11 Steve McGhee Reliability Theory and Practice
GDG Cloud Southlake #11 Steve McGhee Reliability Theory and Practice
JamesAnderson599331
 
Holistic data application quality
Holistic data application qualityHolistic data application quality
Holistic data application quality
Lars Albertsson
 
impervious cover
impervious coverimpervious cover
impervious coverJames Yang
 
High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...
High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...
High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...
Spark Summit
 
understanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningunderstanding the planet using satellites and deep learning
understanding the planet using satellites and deep learning
Albert Pujol Torras
 
Data engineering in 10 years.pdf
Data engineering in 10 years.pdfData engineering in 10 years.pdf
Data engineering in 10 years.pdf
Lars Albertsson
 
DDI Dense Point Cloud Processing Presentation Oct2009
DDI Dense Point Cloud Processing Presentation Oct2009DDI Dense Point Cloud Processing Presentation Oct2009
DDI Dense Point Cloud Processing Presentation Oct2009
Direct Dimensions, Inc.
 
DDI Dense Point Cloud Processing Presentation Oct 2009
DDI Dense Point Cloud Processing Presentation Oct 2009DDI Dense Point Cloud Processing Presentation Oct 2009
DDI Dense Point Cloud Processing Presentation Oct 2009
Direct Dimensions, Inc.
 
byteLAKE's expertise across NVIDIA architectures and configurations
byteLAKE's expertise across NVIDIA architectures and configurationsbyteLAKE's expertise across NVIDIA architectures and configurations
byteLAKE's expertise across NVIDIA architectures and configurations
byteLAKE
 
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
Sergio Orts-Escolano
 

Similar to Electrolux meetup (20)

Semantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite ImagerySemantic Segmentation on Satellite Imagery
Semantic Segmentation on Satellite Imagery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Get Your Head in the Cloud - Lessons in GPU Computing with Schlumberger
Get Your Head in the Cloud - Lessons in GPU Computing with SchlumbergerGet Your Head in the Cloud - Lessons in GPU Computing with Schlumberger
Get Your Head in the Cloud - Lessons in GPU Computing with Schlumberger
 
AI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open WorkshopAI models for Ice Classification - ExtremeEarth Open Workshop
AI models for Ice Classification - ExtremeEarth Open Workshop
 
Deep image retrieval learning global representations for image search
Deep image retrieval  learning global representations for image searchDeep image retrieval  learning global representations for image search
Deep image retrieval learning global representations for image search
 
Physics-ML のためのフレームワーク NVIDIA Modulus 最新事情
Physics-ML のためのフレームワーク NVIDIA Modulus 最新事情Physics-ML のためのフレームワーク NVIDIA Modulus 最新事情
Physics-ML のためのフレームワーク NVIDIA Modulus 最新事情
 
Centernet
CenternetCenternet
Centernet
 
Desktop Softwares for Unmanned Aerial Systems(UAS))
Desktop Softwares for Unmanned Aerial Systems(UAS))Desktop Softwares for Unmanned Aerial Systems(UAS))
Desktop Softwares for Unmanned Aerial Systems(UAS))
 
Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...
 
GDG Cloud Southlake #11 Steve McGhee Reliability Theory and Practice
GDG Cloud Southlake #11 Steve McGhee Reliability Theory and PracticeGDG Cloud Southlake #11 Steve McGhee Reliability Theory and Practice
GDG Cloud Southlake #11 Steve McGhee Reliability Theory and Practice
 
Holistic data application quality
Holistic data application qualityHolistic data application quality
Holistic data application quality
 
impervious cover
impervious coverimpervious cover
impervious cover
 
High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...
High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...
High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...
 
understanding the planet using satellites and deep learning
understanding the planet using satellites and deep learningunderstanding the planet using satellites and deep learning
understanding the planet using satellites and deep learning
 
Data engineering in 10 years.pdf
Data engineering in 10 years.pdfData engineering in 10 years.pdf
Data engineering in 10 years.pdf
 
DDI Dense Point Cloud Processing Presentation Oct2009
DDI Dense Point Cloud Processing Presentation Oct2009DDI Dense Point Cloud Processing Presentation Oct2009
DDI Dense Point Cloud Processing Presentation Oct2009
 
DDI Dense Point Cloud Processing Presentation Oct 2009
DDI Dense Point Cloud Processing Presentation Oct 2009DDI Dense Point Cloud Processing Presentation Oct 2009
DDI Dense Point Cloud Processing Presentation Oct 2009
 
byteLAKE's expertise across NVIDIA architectures and configurations
byteLAKE's expertise across NVIDIA architectures and configurationsbyteLAKE's expertise across NVIDIA architectures and configurations
byteLAKE's expertise across NVIDIA architectures and configurations
 
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
A Three-Dimensional Representation method for Noisy Point Clouds based on Gro...
 

Recently uploaded

Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Boston Institute of Analytics
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
dwreak4tg
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
oz8q3jxlp
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
ewymefz
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
ewymefz
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 

Recently uploaded (20)

Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project PresentationPredicting Product Ad Campaign Performance: A Data Analysis Project Presentation
Predicting Product Ad Campaign Performance: A Data Analysis Project Presentation
 
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
一比一原版(BCU毕业证书)伯明翰城市大学毕业证如何办理
 
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
一比一原版(Deakin毕业证书)迪肯大学毕业证如何办理
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 

Electrolux meetup

  • 1. Statoil/C-CORE Iceberg Classifier Challenge Ship or iceberg, can you decide from space?
  • 2. Who we are? Kirill Zhdanovich Inovia, Spotify, Maps.me, Wargaming
  • 4. Competition mechanics ● Data ● Model ● Submission ● Evaluation ● Leaderboard
  • 5. Real world application vs Competitions ● Business problem ● Problem formalization ● Data collecting ● Data processing ● Modelling ● Evaluate model in production ● Deploy
  • 6. Real world application vs Competitions Aspect Real life Competition Problem formalization Y N Choice of target metric Y N Inference Speed Y N Data collecting Y Y/N Model complexity Y Y/N Target metric value Y Y
  • 7.
  • 8. Kaggle ● Biggest prize - 1.500.000$ ● Biggest amount of participants - 5.169 ● Community(May 2016): ○ 536.000 kagglers ○ 194 country
  • 9. Famous kagglers Fchollet (François Chollet) - creator of Keras Goodfellow (Ian Goodfellow) - author of Deep Learning book H2o.ai - industry leader in ml platforms, company created from kagglers DeepMind - some team member are famous kagglers(1, 2)
  • 10. Why to participate ● Learning - getting job in ML after winning Kaggle competition ● Interesting challenges ● Meeting new people in ML/DS community ● Winning money
  • 11. Where to start ● Kaggle ● Coursera: How to Win a Data Science Competition ● Deep learning with python
  • 12. Who we are? Andrii Sydorchuk Sana Labs, Spotify, Google
  • 14. Problem description ● Drifting icebergs present threats to navigation ● Companies use aerial reconnaissance and shore-based support to monitor environmental conditions and assess risks from icebergs ● In remote areas with particularly harsh weather, these methods are not feasible, and the only viable monitoring option is via satellite ● Build an algorithm that automatically identifies if a remotely sensed target is a ship or iceberg
  • 15. Background ● The Sentinel-1 satellite constellation is used to monitor land and ocean ● The C-Band radar can "see" through darkness, rain, cloud and even fog ● An object will appear as a bright spot because it reflects more radar energy than its surroundings
  • 17. Data & evaluation ● band1, band2 - 75x75 sensor data, radar backscatter produced from different polarizations ● incidence angle - the incidence angle of which the image was taken ● 1604 images in the train set (753 icebergs, 851 ships) ● 8424 images in the test set ● test set contains synthetic data ● evaluation - log loss between the predicted values and the ground truth
  • 22. High level analysis Working with images ● Convolutional neural networks ● Pretrained models (Xception, ResNet, VGG16) ● Data augmentation Working with incidence angle ● Additional input to neural network ● Feature engineering
  • 23. Initial approach ● Train custom CNN for the image + angle data ● Tune hyperparameters ● Tune data augmentation parameters
  • 25. Final approach Train image data Test image data CNN ensemble CNN predictions Incidence angle data features per inc. angle key KNN predictions LightGBM classifier Final predictions CNN predictions
  • 26. Final approach 1. Ensemble of CNNs with similar architecture and data augmentation 2. Take subset of CNNs that gives best score on cross validation 3. Group predictions from the previous step by incidence angle a. For each group calculate mean, median, number of samples 4. Run KNN regressor on predictions from step 2) by incidence angle 5. Train LightGBM on the features from steps 2), 3), 4)
  • 27. Questions? Andrii Sydorchuk - sydorchuk.andriy@gmail.com Kirill Zhdanovich - kzhdanovich@gmail.com Competition link - https://www.kaggle.com/c/statoil-iceberg-classifier-challenge Source code - https://github.com/asydorchuk/kaggle/tree/master/statoil