2. 목차
1
2
3
STEP 1
STEP 2
STEP 3
https://dacon.io 2
EDA & 데이터 전처리
모델 구축 & 검증
결과 및 결언
EDA & 데이터 전처리
모델 구축 & 검증
결과 및 결언
• Single Model
• Multi Model
• Conclusion
• Suggestions
• Library & Loading
• Cleansing
• Feature Engineering
• Feature Selection
7. https://dacon.io 7
1. EDA & 데이터 전처리 : Feature Engineering
Before After
일사량
Time
일사량
Time
일사량 일사량
Y 평균
or Y18
Y 평균
or Y18
해 뜨기 전 해 진 후
8. https://dacon.io 8
1. EDA & 데이터 전처리 : Feature Selection
기온 vs mean of Ys / Y18
●파란색 : Train1 Data
●빨간색 : Train2 Data
일사량 vs mean of Ys / Y18
습도 vs mean of Ys / Y18
기온
일사량
습도
15. https://dacon.io 15
2. 모델 구축 & 검증 : Single Model
X['time'] = X.index % 144
Day Time : [34 – 100] Night Time : [0 – 33] & [101 - 143]
test
Y18
Day index = 33 / Night index = 100
17. https://dacon.io 17
2. 모델 구축 & 검증 : Multi Model
Day index = 21 / Night index = 91 Day index = 21 / Night index = 93 Day index = 21 / Night index = 95
Day index = 43 / Night index = 103 Day index = 43 / Night index = 105 Day index = 43 / Night index = 107
·····
Y18
Day index : np.arange(21, 44, 2) <- 12개
Night index : np.arange(91, 108, 2) <- 9개
# of Sub models : 12 * 9 = 108
"Sub model"
18. https://dacon.io 18
2. 모델 구축 & 검증 : Single Model
Day index = 21 / Night index = 91
·····
Fit
Y00
-
Y17
X
Day Time : [15 – 85]
Predict
Y00
-
Y17
X
Train1 :
Train2 :
Fit
Y00
-
Y17
X
Day Time : [18 – 85]
Predict
Y00
-
Y17
X
Train1 :
Train2 :
Fit
Y00
-
Y17
X
Day Time : [21 – 85]
Predict
Y00
-
Y17
X
Train1 :
Train2 :
Fit
Y00
-
Y17
X
Day Time : [21 – 97]
Predict
Y00
-
Y17
X
Train1 :
Train2 :
Fit
Y00
-
Y17
X
Day Time : [24 – 97]
Predict
Y00
-
Y17
X
Train1 :
Train2 :
Fit
Y00
-
Y17
X
Day Time : [27 – 97]
Predict
Y00
-
Y17
X
Train1 :
Train2 :
Day index extended = [15, 18, 21, 24, 27] / Night index extended = [85, 88, 91, 94, 97]
np.meshgrid([15, 18, 21, 24, 27], [85, 88, 91, 94, 97]) <- 총 25개
"Sub-sub model"
↑ Day 예시
Night도 똑같음
Y00
-
Y17
# Sub-sub models : 5 * 5 = 25
19. https://dacon.io 19
2. 모델 구축 & 검증 : Multi Model
Day index = 21 / Night index = 91 Day index = 21 / Night index = 93 Day index = 21 / Night index = 95
Day index = 43 / Night index = 103 Day index = 43 / Night index = 105 Day index = 43 / Night index = 107
·····
Y18
× 25 × 25 × 25 × 25 × 25 × 25
× 25× 25× 25× 25× 25× 25
# Sub models : 12 * 9 = 108 108 (# Sub models) * 2 (day, night)
* 25 (# Sub-sub models) * 18 (Y00~Y17)
= 97200
20. https://dacon.io 20
2. 모델 구축 & 검증 : Multi Model
·
·
·
다시 Fit 할 필요 없이 바로 Predict 가능
일반적인 model
Train_X Train_YFit
Predict Test Y_pred
Predict2 Test2 Y_pred2
21. https://dacon.io 21
2. 모델 구축 & 검증 : Multi Model
Day index = 21 / Night index = 91 Day index = 21 / Night index = 93 Day index = 21 / Night index = 95
Day index = 43 / Night index = 103 Day index = 43 / Night index = 105 Day index = 43 / Night index = 107
·····
Y18
Test2 Test2 Test2 Test2 Test2 Test2
Test2Test2Test2Test2Test2Test2
× 25 × 25 × 25 × 25 × 25 × 25
× 25× 25× 25× 25× 25× 25
Test2다시 Training...?
108 (# Sub models) * 2 (day, night)
* 25 (# Sub-sub models) * 18 (Y00~Y17)
= 97200
24. https://dacon.io 24
2. 모델 구축 & 검증 : Multi Model
Coefficients of
Mapping
X -> Y18
Y00
-
Y17
Predict
Fit
Y00
-
Y17
XTrain 1 :
XTrain 2 : Y18Fit
× 25
25. 25
2. 모델 구축 & 검증 : Multi Model
Day Time : [34 – 100] Night Time : [0 – 33] & [101 - 143]
DAY
Coefs
NIGHT
Coefs
Time X00 X01 ··· neg_insol Ys_intercept Y18_intercept
0 1.3 0.0 ··· 0.0 3.8 1.2
1 1.3 0.0 ··· 0.0 3.8 1.2
··· 1.3 0.0 ··· 0.0 3.8 1.2
34 0.0 5.5 ··· -2.4 1.3 0.5
··· 0.0 5.5 ··· -2.4 1.3 0.5
100 0.0 5.5 ··· -2.4 1.3 0.5
··· 1.3 0.0 ··· 0.0 3.8 1.2
143 1.3 0.0 ··· 0.0 3.8 1.2
Day coefs
Night coefs
Night coefs
Coef_df :
Coefs_df
"Sub model"
26. https://dacon.io 26
2. 모델 구축 & 검증 : Multi Model
·····
Coefs_df
Day index = 21 / Night index = 93 Day index = 21 / Night index = 95Day index = 21 / Night index = 91
Day index = 43 / Night index = 103 Day index = 43 / Night index = 105 Day index = 43 / Night index = 107
# Sub models : 12 * 9 = 108