Classification Project_Kaggle (Shelter Animals)
재완 최
A machine learning project that builds a classification model for animals held in an animal shelter (Fast Campus Data Science School, 5th cohort: 최재완, 오윤경, 진승완)
1.
Classification Project, 28 July 2017
2.
0. prologue
3.
Team introduction
4.
5.
진승완, 오윤경
6.
There is one more member.
7.
최재완
8.
Shelter Animal Outcomes from Kaggle
9.
Mission
10.
Transfer, euthanasia
11.
12.
1. EDA
13.
Columns: AnimalID, Name, DateTime, OutcomeType, OutcomeSubtype, AnimalType, SexuponOutcome, AgeuponOutcome, Breed, Color
14.
The columns are divided into independent variables and the dependent variable.
15.
The columns are marked as independent or dependent variables, and as excluded or included.
16.
Rows: 26,729 animals (15,595 dogs and 11,134 cats)
17.
Outcome counts by species:
Dogs: Adoption 6,497; Transfer 3,917; Return_to_owner 4,286; Euthanasia 845; Died 50
Cats: Adoption 4,272; Transfer 5,505; Return_to_owner 500; Euthanasia 710; Died 147
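As a quick consistency check, the chart values can be summed per species and turned into shares. This is only a sketch built on the numbers shown on this slide; the names `COUNTS` and `outcome_shares` are illustrative and do not come from the project.

```python
# Outcome counts per species, copied from the chart on this slide.
OUTCOMES = ["Adoption", "Transfer", "Return_to_owner", "Euthanasia", "Died"]
COUNTS = {
    "Dogs": [6497, 3917, 4286, 845, 50],
    "Cats": [4272, 5505, 500, 710, 147],
}

def outcome_shares(counts):
    """Return each outcome's share of the species total, as a fraction."""
    total = sum(counts)
    return [c / total for c in counts]

# Per-species totals; these should match the row counts on the previous slide.
totals = {species: sum(counts) for species, counts in COUNTS.items()}

# Outcome name -> share, for each species.
shares = {
    species: dict(zip(OUTCOMES, outcome_shares(counts)))
    for species, counts in COUNTS.items()
}

print(totals)  # {'Dogs': 15595, 'Cats': 11134}
```

The totals add up to the 26,729 animals reported for the train set, which suggests the chart covers every row.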
18.
Data cleaning: the train set and test set are combined and preprocessed together.
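One way to preprocess both sets in a single pass is to tag each row with its origin, clean the combined rows, and split them again afterwards. The sketch below uses plain Python dictionaries and hypothetical toy rows; the helper names `combine` and `split` and the `_source` tag are assumptions for illustration, and with pandas the same idea is usually written with `pandas.concat`.

```python
def combine(train_rows, test_rows):
    """Tag each row with its origin and return one combined list (copies)."""
    combined = [dict(row, _source="train") for row in train_rows]
    combined += [dict(row, _source="test") for row in test_rows]
    return combined

def split(combined):
    """Undo combine(): separate the rows again and drop the origin tag."""
    train, test = [], []
    for row in combined:
        row = dict(row)
        source = row.pop("_source")
        (train if source == "train" else test).append(row)
    return train, test

# Hypothetical toy rows; the real data has the columns listed earlier.
train = [{"Name": "Max", "OutcomeType": "Adoption"}]
test = [{"Name": None}]

combined = combine(train, test)
for row in combined:  # one cleaning pass over both sets
    row["HasName"] = 0 if row["Name"] is None else 1

clean_train, clean_test = split(combined)
```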
19.
Name: qualitative data giving the name of the dog or cat. Many animals have no name.
NaN (no name): 10,916 (29%); has a name: 27,269 (71%)
20.
Name: outcome counts (Adoption, Died, Euthanasia, Return_to_owner, Transfer) for animals without a name vs. with a name. Animals with a name are more likely to be adopted or returned to their owner.
21.
Name: [Conclusion] Encode a missing Name as 0 and a present Name as 1.
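The 0/1 rule for Name could look like the following minimal sketch. The function name `has_name` is hypothetical, and treating a blank string as missing is an assumption that the slides do not state.

```python
import math

def has_name(name):
    """Return 0 if the name is missing (None, NaN, or blank), else 1."""
    if name is None:
        return 0
    if isinstance(name, float) and math.isnan(name):
        return 0
    return 1 if str(name).strip() else 0

names = ["Max", None, float("nan"), "  ", "Bella"]
flags = [has_name(n) for n in names]
print(flags)  # [1, 0, 0, 0, 1]
```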
22.
AgeuponOutcome: the age of the dog or cat; quantitative data expressed in weeks, months, or years.
[Conclusion] Convert every value to days.
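A conversion to days might be sketched as below. The conversion factors (7 days per week, 30 per month, 365 per year), the assumed input form `"<number> <unit>"`, and the name `age_to_days` are all assumptions; the slides do not say which factors the team used.

```python
# Assumed conversion factors; the slides do not state the ones used.
DAYS_PER_UNIT = {"day": 1, "week": 7, "month": 30, "year": 365}

def age_to_days(text):
    """Convert strings such as '2 years' or '3 weeks' to a number of days.

    Returns None when the value is missing or not in '<number> <unit>' form.
    """
    if not isinstance(text, str):
        return None
    parts = text.strip().lower().split()
    if len(parts) != 2 or not parts[0].isdigit():
        return None
    unit = parts[1].rstrip("s")  # 'years' -> 'year'
    if unit not in DAYS_PER_UNIT:
        return None
    return int(parts[0]) * DAYS_PER_UNIT[unit]

ages = ["1 year", "2 months", "3 weeks", "5 days", None]
print([age_to_days(a) for a in ages])  # [365, 60, 21, 5, None]
```

Returning `None` for unparseable values keeps missing ages visible so they can be imputed in a later step.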
23.
Adoption by Age_upon_outcome (age bins 1 to 18, in order):
Dogs: 3780, 1161, 484, 281, 237, 163, 107, 109, 58, 60, 14, 23, 10, 3, 3, 4, 0, 0
Cats: 3583, 158, 118, 59, 64, 41, 44, 57, 28, 48, 18, 16, 19, 9, 7, 2, 0, 1
24.
AnimalType: the kind of animal; qualitative data that is either dog or cat.
[Conclusion] Dummy-encode every value as 0 or 1.
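A dummy encoding for AnimalType can be as small as a lookup table. Which species gets 0 and which gets 1 is an assumption here, as are the names `ANIMAL_TYPE_CODE` and `encode_animal_type`.

```python
ANIMAL_TYPE_CODE = {"Dog": 0, "Cat": 1}  # assumed assignment of 0 and 1

def encode_animal_type(animal_type):
    """Map 'Dog'/'Cat' to 0/1; raise on anything else so bad rows surface."""
    try:
        return ANIMAL_TYPE_CODE[animal_type]
    except KeyError:
        raise ValueError(f"unexpected AnimalType: {animal_type!r}") from None

codes = [encode_animal_type(a) for a in ["Dog", "Cat", "Cat"]]
print(codes)  # [0, 1, 1]
```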
25.
SexuponOutcome: the sex of the animal; qualitative data split by sex and by whether the animal is neutered.
26.
SexuponOutcome: outcome share by neuter status:
Intact: Adoption 5.13%; Died 1.92%; Euthanasia 12.48%; Return_to_owner 11.06%; Transfer 69.41%
Sterilized: Adoption 55.96%; Died 0.20%; Euthanasia 3.10%; Return_to_owner 21.48%; Transfer 19.26%
Unknown: Adoption 0.00%; Died 10.68%; Euthanasia 43.16%; Return_to_owner 5.56%; Transfer 40.60%
Sterilized animals are more likely to be adopted or returned to their owner.
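The Intact / Sterilized / Unknown grouping can be derived by splitting each SexuponOutcome value into a sex and a neuter status. The sketch assumes raw values of the form `"Neutered Male"`, `"Spayed Female"`, `"Intact Male"`, `"Intact Female"`, and `"Unknown"`; the function name `split_sex` is hypothetical.

```python
def split_sex(value):
    """Split a SexuponOutcome value into (sex, neuter_status)."""
    if not isinstance(value, str) or value.strip().lower() == "unknown":
        return ("Unknown", "Unknown")
    status_word, _, sex = value.strip().partition(" ")
    # Neutered (male) and Spayed (female) both count as Sterilized.
    status = {
        "Neutered": "Sterilized",
        "Spayed": "Sterilized",
        "Intact": "Intact",
    }.get(status_word, "Unknown")
    return (sex if sex in ("Male", "Female") else "Unknown", status)

print(split_sex("Neutered Male"))  # ('Male', 'Sterilized')
print(split_sex("Intact Female"))  # ('Female', 'Intact')
```

Keeping sex and neuter status as separate features lets a model use the strong neuter-status signal shown in the chart independently of sex.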
27.
Breed: the breed of the dog or cat, in three forms:
1) Purebred (e.g. Pit Bull)
2) Mix (e.g. Pit Bull Mix)
3) Two-breed cross (e.g. Pit Bull / Shetland Sheepdog)
There are too many distinct values. What is the difference between a Mix and a two-breed cross?
28.
Let's look at how the outcome pattern differs between purebred and Mix animals.
29.
Chart: outcome shares (Adoption, Died, Euthanasia, Return_to_owner, Transfer) for purebred vs. Mix. The two series read 0.38, 0.45, 0.08, 0.07, 0.02 and 0.37, 0.52, 0.04, 0.06, 0.01.
Huh? They look similar.
30.
31.
Breed: outcome share by breed type:
Purebred: Adoption 38.98%; Died 0.79%; Euthanasia 6.03%; Return_to_owner 17.51%; Transfer 36.69%
Crossbreed: Adoption 50.46%; Died 0.29%; Euthanasia 4.15%; Return_to_owner 20.96%; Transfer 24.13%
32.
Breed: treat purebred and Mix as the same breed. Treat all two-breed crosses collectively as "crossbreed".
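The breed rule could be sketched as follows, assuming a two-breed cross is marked by a `/` and a Mix by a trailing `" Mix"`, as in the examples on the earlier Breed slide. The function name `simplify_breed` and the label `"Crossbreed"` are illustrative choices.

```python
def simplify_breed(breed):
    """Collapse a Breed value following the rule stated on the slide."""
    breed = breed.strip()
    if "/" in breed:  # two-breed cross
        return "Crossbreed"
    if breed.endswith(" Mix"):  # 'Pit Bull Mix' -> 'Pit Bull'
        return breed[: -len(" Mix")]
    return breed

examples = ["Pit Bull", "Pit Bull Mix", "Pit Bull / Shetland Sheepdog"]
print([simplify_breed(b) for b in examples])
# ['Pit Bull', 'Pit Bull', 'Crossbreed']
```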
33.
Color: qualitative data giving the color of the dog or cat, in two forms:
1) Single color (e.g. Brown)
2) Mixed color (e.g. Brown / Black)
[Conclusion] Dummy-encode the two forms as 0 and 1.
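A minimal sketch of the Color encoding, assuming a mixed color is identified by a `/` as in the example above. Assigning 1 to mixed and 0 to single colors is an assumption, and `is_mixed_color` is a hypothetical name.

```python
def is_mixed_color(color):
    """Return 1 for a mixed color such as 'Brown / Black', else 0."""
    return 1 if "/" in color else 0

colors = ["Brown", "Brown / Black", "Black/White"]
print([is_mixed_color(c) for c in colors])  # [0, 1, 1]
```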
34.
Color: outcome share by color type:
Unique Color: Adoption 38.68%; Died 0.83%; Euthanasia 5.64%; Return_to_owner 14.81%; Transfer 40.04%
Mixed Color: Adoption 41.77%; Died 0.65%; Euthanasia 5.98%; Return_to_owner 20.75%; Transfer 30.85%
35.
2. Trained Model
36.
KNN, SVM, Random Forest, XGBoost
37.
2-1. KNN
38.
39.
2-2. SVM
40.
41.
2-3. Random Forest
42.
43.
2-4. XGBoost
44.
45.
Feature Importance
46.
Score by model:
Accuracy: KNN 0.62; SVM 0.64; Random Forest 0.63; XGBoost 0.64
f1-score: KNN 0.62; SVM 0.62; Random Forest 0.62; XGBoost 0.66
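For reference, the two metrics can be computed from scratch as in the sketch below. The slides do not say how the multi-class f1-score was averaged, so the macro average here is an assumption, and the labels are hypothetical toy data. In practice, `accuracy_score` and `f1_score` from `sklearn.metrics` serve the same purpose.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores."""
    scores = []
    for label in sorted(set(y_true) | set(y_pred)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

# Hypothetical toy labels, not results from the project.
y_true = ["Adoption", "Adoption", "Transfer", "Transfer"]
y_pred = ["Adoption", "Transfer", "Transfer", "Transfer"]

print(accuracy(y_true, y_pred))  # 0.75
```

Because the outcome classes are imbalanced (few "Died" rows), accuracy and an averaged F1 can rank models differently, which is consistent with the two rows of scores above.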
47.
48.
3. Prediction
49.
50.
Thank You