kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl

•

2 likes•2,558 views

理

1. The document describes Osamu Akiyama's participation in the 2018 Data Science Bowl competition where he ranked 9th in Stage 1 and 71st overall. 2. It then discusses the competition format which involved a two-stage structure with limited training data, requiring strong generalization. Two popular approaches for the instance segmentation task were Mask R-CNN and U-Net. 3. Akiyama's solution was based on Deep Watershed Transform, using an ensemble of segmentation networks. However, his attempt to use GANs for semi-supervised learning was unsuccessful. The top solution used an enhanced U-Net model with techniques like touching border prediction and combined loss functions.

Engineering

2018.05.12 @osciiart Stage 1: 9th, Stage 2: 71st /3634
kaggle Tokyo Meetup #4
Lightning Talk
2018 Data Science Bowl

Who am I?
秋山理 Osamu Akiyama @osciiart
Biography
• 京都大学生命科学修士号
• 大阪大学医学部医学科5回 (31歳)
• 研究: 脳科学, BMI
• AIメディカル研究会 (AIMS)
paper
• Akiyama O. ASCII Art Synthesis with Convolutional Networks. NIPS 2017 Workshop,
Machine Learning for Creativity and Design. 2017.
• ASCII.jp: アスキーアートの精度はディープラーニングでどこまで上がるのか？
• VICE MOTHERBOARD: This Machine Learning Algorithm Can Turn Any Line Drawing Into ASCII Art
Kaggle status (@osciiart)
• 3 Silver, 1 Bronze
Other competition result
• DeepAnalytics バイエル薬品医薬情報テキストマイニング 2nd / 127
• Bioinformatics Contest 2018 20th

2018 Data Science Bowl
Instance Segmentation

Evaluation
Pred Label
IoU > threshold -> True Positive
Average Precision (AP) =
mean AP (mAP) =
1.00
0.00
0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95
mAP
threshold
AP

2 Stage Competition
Strong generalization required
Train data: 665 Stage 1 Test data: 65
Stage 2 Test Data: 3019 (most of all is fake)

Mask R-CNN vs U-Net
2-stage detector
• Detection とSegmentationのprocessを分離
• 精度が高い (State-of-the-Art)
• Occlusion, Class imbalanceに対応できる
• 学習が難しい
1-stage detector
• そのままではInstanceを分離できない
• Simple and Fast
• Occlusion, Class imbalance に弱い
• Ensembleが適用しやすい
Ronneberger, O., Fischer, P., Brox, T. U-Net: Convolutional
Networks for Biomedical Image Segmentation. arXiv. 2015.
He, K., Gkioxari, G., Dollár, P., Girshick, R. Mask R-CNN. arXiv.
2017.

The Organizer Stands Like God
主催者の一人 Allen がコンペ開始からぶっちぎりの1位に君臨
Stage 1 で結局誰もAllenを追い抜けなかった
Allen が積極的に手法を公開したため、彼の手法をいかに再現するかの勝負の様相
Allenの手法がMask R-CNNのため多くの人がMask R-CNNに注目した
1st Stage LB

My Solution: (based on) Deep Watershed Transform
• 3 net in serial -> in parallel (for simplification)
• Binned depth classification -> normalized depth regression (for size augmentation)
Bai M, Urtasun R. Deep Watershed Transform for Instance Segmentation. arXiv. 2016.
SegNet
Direction
Net
Depth
Net
DeepLab
V3+’
• Augmentation
• Random cropping
• Resize (0.5 – 2.0)
• Rotation (-180° - 180°)
• Flip
• Hue, Saturation, Lightness
• TTA
• Mean diameter (25, 30, 35, 40, 45 pixel)
• Flip
• Rotation (0°, 90°,…, 270°)
Marvelous Article: Applying Deep Watershed Transform to Kaggle Data Science Bowl 2018

My Solution: Semi-supervised by GAN
(doesn’t work)
• Generator (labeled)
G
True Label
D
PredictionInput
Real PairAdv Loss
MSE Loss
D
Real Pair
or
Fake Pair
Adv Loss
Prediction
Input
True Label
Input
or
G
D
PredictionInput
Real PairAdv Loss
• Discriminator
• Generator (unlabeled)

1st place solution: U-Net on Steroids
• targets - we predict touching borders along with the masks to solve the problem as
instance segmentation
• loss function - that combines crossentropy and soft dice loss in such a way that pixel
imbalance doesn't affect the results
• very deep encoder-decoder architectures that also achieve state-of-the-art results in other
binary segmentation problems (SpaceNet, Inria and others)
• tricky postprocessing that combines watershed, morphological features and second-level
model with Gradient Boosted Trees (increased 0.015)
• task specific data augmentations

Result: Mask R-CNN vs U-Net
0.582
-
2nd Stage LB U-Net (touching border)
-
Mask-RCNN
U-Net (watershed, 2step)
Mask-RCNN
-
Mask-RCNN
Mask-RCNN
-
Mask-RCNN
Mask-RCNN
-
-
Mask-RCNN
-
-
U-Net (watershed, 1step)
6チームがAllenを超えることができた

Similar to kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl

Jan2016 nabsys giab

GenomeInABottle

Feature Engineering Hands-On by Dmitry Larko

Sri Ambati

BSSML16 L3. Clusters and Anomaly Detection

BigML, Inc

This talk was given at H2O World 2018 NYC and can be viewed here: https://youtu.be/wcFdmQSX6hM Description: In this talk, Dmitry shares his approach to feature engineering which he used successfully in various Kaggle competitions. He covers common techniques used to convert your features into numeric representation used by ML algorithms. Speaker's Bio: Dmitry has more than 10 years of experience in IT. Starting with data warehousing and BI, now in big data and data science. He has a lot of experience in predictive analytics software development for different domains and tasks. He is also a Kaggle Grandmaster who loves to use his machine learning and data science skills on Kaggle competitions.

Feature Engineering for ML - Dmitry Larko, H2O.ai

Sri Ambati

2013추계학술대회 인쇄용

Byung Kook Ha

Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...

AIST

BSSML17 - Clusters

BigML, Inc

Converting a plain text into non readable format to maintain a confidentiality and integrity of data is called Encoding. And the technique used to decode that into readable format, is called Decryption. To encrypt and decrypt, algorithms we have developed. This entire theory, The whole technology is called Cryptography. Many algorithms were developed, many are Decoded, and many of the algorithms are still running nowadays also. So, here I came up with the new algorithm, with new technique, with new idea in algorithm. Sweety Gone | Kuldeep B. Vayadande "CipherKey Algorithm" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-1 , December 2020, URL: https://www.ijtsrd.com/papers/ijtsrd37924.pdf Paper URL : https://www.ijtsrd.com/computer-science/computer-network/37924/cipherkey-algorithm/sweety-gone

CipherKey Algorithm

ijtsrd

Current consumer depth sensors produce depth maps that are often noisy and lack sufficient detail. Enhancing the quality of the 3D depth data obtained from compact depth Kinect-like sensors is an increasingly popular research area. Although depth data is known to carry a signal-dependent noise, the state-of-the-art denoising methods tend to employ denoising techniques which are independent of the depth signal itself. In this paper, we present a novel adaptive denoising filter to enhance object recognition from 3D depth data. We evaluate the performance of our proposed denoising filter against other state-of-the-art filters based on the enhancement of object recognition accuracy achieved after denoising the raw data with each filter. In order to perform object recognition from depth data, we make use of Differential Histogram of Normal Vectors (DHONV) features along with a linear SVM. Experiments show that our proposed filter outperformed the state-of-the-art denoising methods.

ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS

Soma Boubou

Pre-aggregation is a powerful analytics technique as long as the measures being computed are reaggregable. Counts reaggregate with SUM, minimums with MIN, maximums with MAX, etc. The odd one out is distinct counts, which are not reaggregable. Traditionally, the non-reaggregability of distinct counts leads to an implicit restriction: whichever system computes distinct counts has to have access to the most granular data and touch every row at query time. Because of this, in typical analytics architectures, where fast query response times are required, raw data has to be duplicated between Spark and another system such as an RDBMS. This talk is for everyone who computes or consumes distinct counts and for everyone who doesn’t understand the magical power of HyperLogLog (HLL) sketches. We will break through the limits of traditional analytics architectures using the advanced HLL functionality and cross-system interoperability of the spark-alchemy open-source library, whose capabilities go beyond what is possible with OSS Spark, Redshift or even BigQuery. We will uncover patterns for 1000x gains in analytic query performance without data duplication and with significantly less capacity. We will explore real-world use cases from Swoop’s petabyte-scale systems, improve data privacy when running analytics over sensitive data, and even see how a real-time analytics frontend running in a browser can be provisioned with data directly from Spark.

High-Performance Advanced Analytics with Spark-Alchemy

Databricks

Structured Forests for Fast Edge Detection [Paper Presentation]

Mohammad Shaker

Defect detection in circlips using image processing in ni lab view

Sayali Bodhankar

How to program software and objects

Francisco Perez

Deep LearningフレームワークChainerと最近の技術動向

Shunta Saito

Neural Art (English Version)

Mark Chang

Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...

PyData

Utah Big Mountain Conference: AncestryDNA, HBase, Hadoop (9-7-2013)

William Yetman

Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)

Matthew Lease

Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...

Neo4j

Every year the financial industry loses billions because of fraud while in the meantime fraudsters are coming up with more and more sophisticated patterns. Financial institutions have to find the balance between fraud protection and negative customer experience. Fraudsters bury their patterns in lots of data, but the traditional technologies are not designed to detect fraud in real-time or to see patterns beyond the individual account. Analyzing relations with graph databases helps uncover these larger complex patterns and speeds up suspicious behavior identification. Furthermore, graph databases enable fast and effective real-time link queries and passing context to machine learning models. The earlier fraud pattern or network is identified, the faster the activity is blocked. As a result, losses and fines are minimized.

Follow the money with graphs

Stanka Dalekova

Similar to kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl (20)

Jan2016 nabsys giab

Feature Engineering Hands-On by Dmitry Larko

BSSML16 L3. Clusters and Anomaly Detection

Feature Engineering for ML - Dmitry Larko, H2O.ai

2013추계학술대회 인쇄용

Artem Baklanov - Votes Aggregation Techniques in Geo-Wiki Crowdsourcing Game:...

BSSML17 - Clusters

CipherKey Algorithm

ADAPTIVE FILTER FOR DENOISING 3D DATA CAPTURED BY DEPTH SENSORS

High-Performance Advanced Analytics with Spark-Alchemy

Structured Forests for Fast Edge Detection [Paper Presentation]

Defect detection in circlips using image processing in ni lab view

How to program software and objects

Deep LearningフレームワークChainerと最近の技術動向

Neural Art (English Version)

Gradient Boosted Regression Trees in Scikit Learn by Gilles Louppe & Peter Pr...

Utah Big Mountain Conference: AncestryDNA, HBase, Hadoop (9-7-2013)

Lecture 7: Data-Intensive Computing for Text Analysis (Fall 2011)

Knowledge Graphs and Graph Data Science: More Context, Better Predictions (Ne...

Follow the money with graphs

Recently uploaded

Raashid final report on Embedded Systems

RaashidFaiyazSheikh

Software Engineering Practical File Front Pages.pdf

ssuser5c9d4b1

Diploma Engineering Drawing Qp-2024 Ece .pdf

JNTUA

History of Indian Railways - the story of Growth & Modernization

Emaan Sharma

Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...

drjose256

Basics of Relay for Engineering Students

kannan348865

UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU

ankushspencer015

This presentation, crafted with expertise from the oil and gas industry, details the imperative techniques and tools used in the process of incident investigation. The presentation highlights the structured methods involving interviews with engaged parties, the importance of gathering documentary evidence, employing structured forms and checklists, the value of reviewing surveillance footage, the power of modern technology, and synergizing with subject-matter experts. It is designed for industry professionals keen on enhancing safety practices, regulatory compliance, and preventive measures within their operational environment. Valuable for health & safety officers, compliance managers, and operation leaders, it prioritises continuous learning and adherence to best practices to ensure workplace safety and operational excellence.

Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools

soginsider

Adsorption (mass transfer operations 2) ppt

jigup7320

21P35A0312 Internship eccccccReport.docx

rahulmanepalli02

Worksharing and 3D Modeling with Revit.pptx

Mustafa Ahmed

Autodesk Construction Cloud (Autodesk Build).pptx

Mustafa Ahmed

NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...

Amil baba

A Coordinate Measuring Machine (CMM) is a precision measuring metrology machine used in manufacturing and quality control processes to precisely measure the geometric characteristics of objects. It operates on the principle of coordinate geometry, employing a probe to collect data points from the surface of an object to determine the dimensions, shapes, and positions of features, such as holes, slots, and surfaces. CMMs are essential tools for ensuring that manufactured parts meet design specifications and quality standards.

What is Coordinate Measuring Machine? CMM Types, Features, Functions

VIEW

We investigated the applicability and efficiency of the MLMC approach to the Henry-like problem with uncertain porosity, permeability and recharge. These uncertain parameters were modelled by random fields with three independent random variables. Permeability is a function of porosity. Both functions are time-dependent, have multi-scale behaviour and are defined for two layers. The numerical solution for each random realisation was obtained using the well-known ug4 parallel multigrid solver. The number of random samples required at each level was estimated by calculating the decay of the variances and the computational cost for each level. The MLMC method was used to compute the expected value and variance of several QoIs, such as the solution at a few preselected points $(t,\bx)$, the solution integrated over a small subdomain, and the time evolution of the freshwater integral. We have found that some QoIs require only 2-3 mesh levels and samples from finer meshes would not significantly improve the result. Other QoIs require more grid levels.

litvinenko_Henry_Intrusion_Hong-Kong_2024.pdf

Alexander Litvinenko

Exploring AI's Impact: Key Features in Due Diligence In the realm of due diligence, AI emerges as a game-changer, revolutionizing traditional methods with its advanced features. AI-powered algorithms excel in data analysis, swiftly sifting through vast amounts of information for crucial insights. Automation streamlines document review processes, ensuring accuracy and efficiency. Moreover, AI enables predictive analytics, forecasting potential risks and opportunities with precision. With machine learning capabilities, AI continuously improves its performance, adapting to evolving trends and patterns. By integrating AI into due diligence practices, businesses gain a competitive edge, maximizing efficiency and making informed decisions swiftly. AI in due diligence is not just a tool; it's a transformational force driving businesses into the future. https://www.leewayhertz.com/ai-in-due-diligence/

Artificial Intelligence in due diligence

mahaffeycheryld

Research Methodolgy & Intellectual Property Rights Series 1

T.D. Shashikala

Dynamo Scripts for Task IDs and Space Naming.pptx

Mustafa Ahmed

In this short lecture, I explain the fundamentals of electromagnetic compatibility (EMC), the basic coupling model and coupling paths via cables, electric fields, magnetic fields and wave fields. We also look at electric vehicles as an example of systems with many conducted EMC problems due to power electronic devices such as rectifiers and inverters with non-linear components such as diodes and fast switching components such as MOSFETs or IGBTs. After a brief review of circuit analysis fundamentals and an experimental investigation of the frequency-dependent impedance of resistors, capacitors and inductors, we look at a simple low-pass filter. The transfer function is derived and measured.

Filters for Electromagnetic Compatibility Applications

Mathias Magdowski

handbook on reinforce concrete and detailing

AshishSingh1301

Recently uploaded (20)

Raashid final report on Embedded Systems

Software Engineering Practical File Front Pages.pdf

Diploma Engineering Drawing Qp-2024 Ece .pdf

History of Indian Railways - the story of Growth & Modernization

Tembisa Central Terminating Pills +27838792658 PHOMOLONG Top Abortion Pills F...

Basics of Relay for Engineering Students

UNIT-2 image enhancement.pdf Image Processing Unit 2 AKTU

Maximizing Incident Investigation Efficacy in Oil & Gas: Techniques and Tools

Adsorption (mass transfer operations 2) ppt

21P35A0312 Internship eccccccReport.docx

Worksharing and 3D Modeling with Revit.pptx

Autodesk Construction Cloud (Autodesk Build).pptx

NO1 Best Powerful Vashikaran Specialist Baba Vashikaran Specialist For Love V...

What is Coordinate Measuring Machine? CMM Types, Features, Functions

litvinenko_Henry_Intrusion_Hong-Kong_2024.pdf

Artificial Intelligence in due diligence

Research Methodolgy & Intellectual Property Rights Series 1

Dynamo Scripts for Task IDs and Space Naming.pptx

Filters for Electromagnetic Compatibility Applications

handbook on reinforce concrete and detailing

kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl

1. 2018.05.12 @osciiart Stage 1: 9th, Stage 2: 71st /3634 kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl

2. Who am I? 秋山理 Osamu Akiyama @osciiart Biography • 京都大学生命科学修士号 • 大阪大学医学部医学科5回 (31歳) • 研究: 脳科学, BMI • AIメディカル研究会 (AIMS) paper • Akiyama O. ASCII Art Synthesis with Convolutional Networks. NIPS 2017 Workshop, Machine Learning for Creativity and Design. 2017. • ASCII.jp: アスキーアートの精度はディープラーニングでどこまで上がるのか？ • VICE MOTHERBOARD: This Machine Learning Algorithm Can Turn Any Line Drawing Into ASCII Art Kaggle status (@osciiart) • 3 Silver, 1 Bronze Other competition result • DeepAnalytics バイエル薬品医薬情報テキストマイニング 2nd / 127 • Bioinformatics Contest 2018 20th

3. 2018 Data Science Bowl Instance Segmentation

4. Evaluation Pred Label IoU > threshold -> True Positive Average Precision (AP) = mean AP (mAP) = 1.00 0.00 0.50 0.55 0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95 mAP threshold AP

5. 2 Stage Competition Strong generalization required Train data: 665 Stage 1 Test data: 65 Stage 2 Test Data: 3019 (most of all is fake)

6. Mask R-CNN vs U-Net 2-stage detector • Detection とSegmentationのprocessを分離 • 精度が高い (State-of-the-Art) • Occlusion, Class imbalanceに対応できる • 学習が難しい 1-stage detector • そのままではInstanceを分離できない • Simple and Fast • Occlusion, Class imbalance に弱い • Ensembleが適用しやすい Ronneberger, O., Fischer, P., Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. arXiv. 2015. He, K., Gkioxari, G., Dollár, P., Girshick, R. Mask R-CNN. arXiv. 2017.

7. The Organizer Stands Like God 主催者の一人 Allen がコンペ開始からぶっちぎりの1位に君臨 Stage 1 で結局誰もAllenを追い抜けなかった Allen が積極的に手法を公開したため、彼の手法をいかに再現するかの勝負の様相 Allenの手法がMask R-CNNのため多くの人がMask R-CNNに注目した 1st Stage LB

8. My Solution: (based on) Deep Watershed Transform • 3 net in serial -> in parallel (for simplification) • Binned depth classification -> normalized depth regression (for size augmentation) Bai M, Urtasun R. Deep Watershed Transform for Instance Segmentation. arXiv. 2016. SegNet Direction Net Depth Net DeepLab V3+’ • Augmentation • Random cropping • Resize (0.5 – 2.0) • Rotation (-180° - 180°) • Flip • Hue, Saturation, Lightness • TTA • Mean diameter (25, 30, 35, 40, 45 pixel) • Flip • Rotation (0°, 90°,…, 270°) Marvelous Article: Applying Deep Watershed Transform to Kaggle Data Science Bowl 2018

9. My Solution: Semi-supervised by GAN (doesn’t work) • Generator (labeled) G True Label D PredictionInput Real PairAdv Loss MSE Loss D Real Pair or Fake Pair Adv Loss Prediction Input True Label Input or G D PredictionInput Real PairAdv Loss • Discriminator • Generator (unlabeled)

10. 1st place solution: U-Net on Steroids • targets - we predict touching borders along with the masks to solve the problem as instance segmentation • loss function - that combines crossentropy and soft dice loss in such a way that pixel imbalance doesn't affect the results • very deep encoder-decoder architectures that also achieve state-of-the-art results in other binary segmentation problems (SpaceNet, Inria and others) • tricky postprocessing that combines watershed, morphological features and second-level model with Gradient Boosted Trees (increased 0.015) • task specific data augmentations

11. Result: Mask R-CNN vs U-Net 0.582 - 2nd Stage LB U-Net (touching border) - Mask-RCNN U-Net (watershed, 2step) Mask-RCNN - Mask-RCNN Mask-RCNN - Mask-RCNN Mask-RCNN - - Mask-RCNN - - U-Net (watershed, 1step) 6チームがAllenを超えることができた

kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl

Recommended

Recommended

More Related Content

Similar to kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl

Similar to kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl (20)

Recently uploaded

Recently uploaded (20)

kaggle Tokyo Meetup #4 Lightning Talk 2018 Data Science Bowl