SlideShare a Scribd company logo
1 of 11
Download to read offline
2018.05.12 @osciiart Stage 1: 9th, Stage 2: 71st /3634
kaggle Tokyo Meetup #4
Lightning Talk
2018 Data Science Bowl
Who am I?
秋山理 Osamu Akiyama @osciiart
Biography
• 京都大学生命科学修士号
• 大阪大学医学部医学科5回 (31歳)
• 研究: 脳科学, BMI
• AIメディカル研究会 (AIMS)
paper
• Akiyama O. ASCII Art Synthesis with Convolutional Networks. NIPS 2017 Workshop,
Machine Learning for Creativity and Design. 2017.
• ASCII.jp: アスキーアートの精度はディープラーニングでどこまで上がるのか?
• VICE MOTHERBOARD: This Machine Learning Algorithm Can Turn Any Line Drawing Into ASCII Art
Kaggle status (@osciiart)
• 3 Silver, 1 Bronze
Other competition result
• DeepAnalytics バイエル薬品 医薬情報テキストマイニング 2nd / 127
• Bioinformatics Contest 2018 20th
2018 Data Science Bowl
Instance Segmentation
Evaluation
Pred Label
IoU > threshold -> True Positive
Average Precision (AP) =
mean AP (mAP) =
1.00
0.00
0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95
mAP
threshold
AP
2 Stage Competition
Strong generalization required
Train data: 665 Stage 1 Test data: 65
Stage 2 Test Data: 3019 (most of all is fake)
Mask R-CNN vs U-Net
2-stage detector
• Detection とSegmentationのprocessを分離
• 精度が高い (State-of-the-Art)
• Occlusion, Class imbalanceに対応できる
• 学習が難しい
1-stage detector
• そのままではInstanceを分離できない
• Simple and Fast
• Occlusion, Class imbalance に弱い
• Ensembleが適用しやすい
Ronneberger, O., Fischer, P., Brox, T. U-Net: Convolutional
Networks for Biomedical Image Segmentation. arXiv. 2015.
He, K., Gkioxari, G., Dollár, P., Girshick, R. Mask R-CNN. arXiv.
2017.
The Organizer Stands Like God
主催者の一人 Allen がコンペ開始からぶっちぎりの1位に君臨
Stage 1 で結局誰もAllenを追い抜けなかった
Allen が積極的に手法を公開したため、彼の手法をいかに再現するかの勝負の様相
Allenの手法がMask R-CNNのため多くの人がMask R-CNNに注目した
1st Stage LB
My Solution: (based on) Deep Watershed Transform
• 3 net in serial -> in parallel (for simplification)
• Binned depth classification -> normalized depth regression (for size augmentation)
Bai M, Urtasun R. Deep Watershed Transform for Instance Segmentation. arXiv. 2016.
SegNet
Direction
Net
Depth
Net
DeepLab
V3+’
• Augmentation
• Random cropping
• Resize (0.5 – 2.0)
• Rotation (-180° - 180°)
• Flip
• Hue, Saturation, Lightness
• TTA
• Mean diameter (25, 30, 35, 40, 45 pixel)
• Flip
• Rotation (0°, 90°,…, 270°)
Marvelous Article: Applying Deep Watershed Transform to Kaggle Data Science Bowl 2018
My Solution: Semi-supervised by GAN
(doesn’t work)
• Generator (labeled)
G
True Label
D
PredictionInput
Real PairAdv Loss
MSE Loss
D
Real Pair
or
Fake Pair
Adv Loss
Prediction
Input
True Label
Input
or
G
D
PredictionInput
Real PairAdv Loss
• Discriminator
• Generator (unlabeled)
1st place solution: U-Net on Steroids
• targets - we predict touching borders along with the masks to solve the problem as
instance segmentation
• loss function - that combines crossentropy and soft dice loss in such a way that pixel
imbalance doesn't affect the results
• very deep encoder-decoder architectures that also achieve state-of-the-art results in other
binary segmentation problems (SpaceNet, Inria and others)
• tricky postprocessing that combines watershed, morphological features and second-level
model with Gradient Boosted Trees (increased 0.015)
• task specific data augmentations
Result: Mask R-CNN vs U-Net
0.582
-
2nd Stage LB U-Net (touching border)
-
Mask-RCNN
U-Net (watershed, 2step)
Mask-RCNN
-
Mask-RCNN
Mask-RCNN
-
Mask-RCNN
Mask-RCNN
-
-
Mask-RCNN
-
-
U-Net (watershed, 1step)
6チームがAllenを超えることができた

More Related Content

Similar to kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl

2013추계학술대회 인쇄용
2013추계학술대회 인쇄용2013추계학술대회 인쇄용
2013추계학술대회 인쇄용
Byung Kook Ha
 
High-Performance Advanced Analytics with Spark-Alchemy
High-Performance Advanced Analytics with Spark-AlchemyHigh-Performance Advanced Analytics with Spark-Alchemy
High-Performance Advanced Analytics with Spark-Alchemy
Databricks
 
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Matthew Lease
 

Similar to kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl (20)

Jan2016 nabsys giab
Jan2016 nabsys giabJan2016 nabsys giab
Jan2016 nabsys giab
 
Feature Engineering Hands-On by Dmitry Larko
Feature Engineering Hands-On by Dmitry LarkoFeature Engineering Hands-On by Dmitry Larko
Feature Engineering Hands-On by Dmitry Larko
 
BSSML16 L3. Clusters and Anomaly Detection
BSSML16 L3. Clusters and Anomaly DetectionBSSML16 L3. Clusters and Anomaly Detection
BSSML16 L3. Clusters and Anomaly Detection
 
Feature Engineering for ML - Dmitry Larko, H2O.ai
Feature Engineering for ML - Dmitry Larko, H2O.aiFeature Engineering for ML - Dmitry Larko, H2O.ai
Feature Engineering for ML - Dmitry Larko, H2O.ai
 
2013추계학술대회 인쇄용
2013추계학술대회 인쇄용2013추계학술대회 인쇄용
2013추계학술대회 인쇄용
 
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...
 
BSSML17 - Clusters
BSSML17 - ClustersBSSML17 - Clusters
BSSML17 - Clusters
 
CipherKey Algorithm
CipherKey AlgorithmCipherKey Algorithm
CipherKey Algorithm
 
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORSADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS
 
High-Performance Advanced Analytics with Spark-Alchemy
High-Performance Advanced Analytics with Spark-AlchemyHigh-Performance Advanced Analytics with Spark-Alchemy
High-Performance Advanced Analytics with Spark-Alchemy
 
Structured Forests for Fast Edge Detection [Paper Presentation]
Structured Forests for Fast Edge Detection [Paper Presentation]Structured Forests for Fast Edge Detection [Paper Presentation]
Structured Forests for Fast Edge Detection [Paper Presentation]
 
Defect detection in circlips using image processing in ni lab view
Defect detection in circlips using image processing in ni lab viewDefect detection in circlips using image processing in ni lab view
Defect detection in circlips using image processing in ni lab view
 
How to program software and objects
How to program software and objectsHow to program software and objects
How to program software and objects
 
Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向Deep LearningフレームワークChainerと最近の技術動向
Deep LearningフレームワークChainerと最近の技術動向
 
Neural Art (English Version)
Neural Art (English Version)Neural Art (English Version)
Neural Art (English Version)
 
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...
 
Utah Big Mountain Conference: AncestryDNA, HBase, Hadoop (9-7-2013)
Utah Big Mountain Conference:  AncestryDNA, HBase, Hadoop (9-7-2013)Utah Big Mountain Conference:  AncestryDNA, HBase, Hadoop (9-7-2013)
Utah Big Mountain Conference: AncestryDNA, HBase, Hadoop (9-7-2013)
 
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)
 
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...
 
Follow the money with graphs
Follow the money with graphsFollow the money with graphs
Follow the money with graphs
 

Recently uploaded

Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
drjose256
 
21P35A0312 Internship eccccccReport.docx
21P35A0312 Internship eccccccReport.docx21P35A0312 Internship eccccccReport.docx
21P35A0312 Internship eccccccReport.docx
rahulmanepalli02
 

Recently uploaded (20)

Raashid final report on Embedded Systems
Raashid final report on Embedded SystemsRaashid final report on Embedded Systems
Raashid final report on Embedded Systems
 
Software Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdfSoftware Engineering Practical File Front Pages.pdf
Software Engineering Practical File Front Pages.pdf
 
Diploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdfDiploma Engineering Drawing Qp-2024 Ece .pdf
Diploma Engineering Drawing Qp-2024 Ece .pdf
 
History of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & ModernizationHistory of Indian Railways - the story of Growth & Modernization
History of Indian Railways - the story of Growth & Modernization
 
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...
 
Basics of Relay for Engineering Students
Basics of Relay for Engineering StudentsBasics of Relay for Engineering Students
Basics of Relay for Engineering Students
 
UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU
UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTUUNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU
UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU
 
Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools
Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and ToolsMaximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools
Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools
 
Adsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) pptAdsorption (mass transfer operations 2) ppt
Adsorption (mass transfer operations 2) ppt
 
21P35A0312 Internship eccccccReport.docx
21P35A0312 Internship eccccccReport.docx21P35A0312 Internship eccccccReport.docx
21P35A0312 Internship eccccccReport.docx
 
Worksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptxWorksharing and 3D Modeling with Revit.pptx
Worksharing and 3D Modeling with Revit.pptx
 
Autodesk Construction Cloud (Autodesk Build).pptx
Autodesk Construction Cloud (Autodesk Build).pptxAutodesk Construction Cloud (Autodesk Build).pptx
Autodesk Construction Cloud (Autodesk Build).pptx
 
NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...
NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...
NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...
 
What is Coordinate Measuring Machine? CMM Types, Features, Functions
What is Coordinate Measuring Machine? CMM Types, Features, FunctionsWhat is Coordinate Measuring Machine? CMM Types, Features, Functions
What is Coordinate Measuring Machine? CMM Types, Features, Functions
 
litvinenko_Henry_Intrusion_Hong-Kong_2024.pdf
litvinenko_Henry_Intrusion_Hong-Kong_2024.pdflitvinenko_Henry_Intrusion_Hong-Kong_2024.pdf
litvinenko_Henry_Intrusion_Hong-Kong_2024.pdf
 
Artificial Intelligence in due diligence
Artificial Intelligence in due diligenceArtificial Intelligence in due diligence
Artificial Intelligence in due diligence
 
Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1Research Methodolgy & Intellectual Property Rights Series 1
Research Methodolgy & Intellectual Property Rights Series 1
 
Dynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptxDynamo Scripts for Task IDs and Space Naming.pptx
Dynamo Scripts for Task IDs and Space Naming.pptx
 
Filters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility ApplicationsFilters for Electromagnetic Compatibility Applications
Filters for Electromagnetic Compatibility Applications
 
handbook on reinforce concrete and detailing
handbook on reinforce concrete and detailinghandbook on reinforce concrete and detailing
handbook on reinforce concrete and detailing
 

kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl

  • 1. 2018.05.12 @osciiart Stage 1: 9th, Stage 2: 71st /3634 kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl
  • 2. Who am I? 秋山理 Osamu Akiyama @osciiart Biography • 京都大学生命科学修士号 • 大阪大学医学部医学科5回 (31歳) • 研究: 脳科学, BMI • AIメディカル研究会 (AIMS) paper • Akiyama O. ASCII Art Synthesis with Convolutional Networks. NIPS 2017 Workshop, Machine Learning for Creativity and Design. 2017. • ASCII.jp: アスキーアートの精度はディープラーニングでどこまで上がるのか? • VICE MOTHERBOARD: This Machine Learning Algorithm Can Turn Any Line Drawing Into ASCII Art Kaggle status (@osciiart) • 3 Silver, 1 Bronze Other competition result • DeepAnalytics バイエル薬品 医薬情報テキストマイニング 2nd / 127 • Bioinformatics Contest 2018 20th
  • 3. 2018 Data Science Bowl Instance Segmentation
  • 4. Evaluation Pred Label IoU > threshold -> True Positive Average Precision (AP) = mean AP (mAP) = 1.00 0.00 0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95 mAP threshold AP
  • 5. 2 Stage Competition Strong generalization required Train data: 665 Stage 1 Test data: 65 Stage 2 Test Data: 3019 (most of all is fake)
  • 6. Mask R-CNN vs U-Net 2-stage detector • Detection とSegmentationのprocessを分離 • 精度が高い (State-of-the-Art) • Occlusion, Class imbalanceに対応できる • 学習が難しい 1-stage detector • そのままではInstanceを分離できない • Simple and Fast • Occlusion, Class imbalance に弱い • Ensembleが適用しやすい Ronneberger, O., Fischer, P., Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. arXiv. 2015. He, K., Gkioxari, G., Dollár, P., Girshick, R. Mask R-CNN. arXiv. 2017.
  • 7. The Organizer Stands Like God 主催者の一人 Allen がコンペ開始からぶっちぎりの1位に君臨 Stage 1 で結局誰もAllenを追い抜けなかった Allen が積極的に手法を公開したため、彼の手法をいかに再現するかの勝負の様相 Allenの手法がMask R-CNNのため多くの人がMask R-CNNに注目した 1st Stage LB
  • 8. My Solution: (based on) Deep Watershed Transform • 3 net in serial -> in parallel (for simplification) • Binned depth classification -> normalized depth regression (for size augmentation) Bai M, Urtasun R. Deep Watershed Transform for Instance Segmentation. arXiv. 2016. SegNet Direction Net Depth Net DeepLab V3+’ • Augmentation • Random cropping • Resize (0.5 – 2.0) • Rotation (-180° - 180°) • Flip • Hue, Saturation, Lightness • TTA • Mean diameter (25, 30, 35, 40, 45 pixel) • Flip • Rotation (0°, 90°,…, 270°) Marvelous Article: Applying Deep Watershed Transform to Kaggle Data Science Bowl 2018
  • 9. My Solution: Semi-supervised by GAN (doesn’t work) • Generator (labeled) G True Label D PredictionInput Real PairAdv Loss MSE Loss D Real Pair or Fake Pair Adv Loss Prediction Input True Label Input or G D PredictionInput Real PairAdv Loss • Discriminator • Generator (unlabeled)
  • 10. 1st place solution: U-Net on Steroids • targets - we predict touching borders along with the masks to solve the problem as instance segmentation • loss function - that combines crossentropy and soft dice loss in such a way that pixel imbalance doesn't affect the results • very deep encoder-decoder architectures that also achieve state-of-the-art results in other binary segmentation problems (SpaceNet, Inria and others) • tricky postprocessing that combines watershed, morphological features and second-level model with Gradient Boosted Trees (increased 0.015) • task specific data augmentations
  • 11. Result: Mask R-CNN vs U-Net 0.582 - 2nd Stage LB U-Net (touching border) - Mask-RCNN U-Net (watershed, 2step) Mask-RCNN - Mask-RCNN Mask-RCNN - Mask-RCNN Mask-RCNN - - Mask-RCNN - - U-Net (watershed, 1step) 6チームがAllenを超えることができた