SlideShare a Scribd company logo
1 of 16
Predictive System




Kyungseok, Song
๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ



๏ƒ  ๊ต์‚ฌํ•™์Šต (Supervised Learning)
 โ€“ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ(Training Data) ๋กœ ๋ถ€ํ„ฐ ํ•˜๋‚˜์˜ ํ•จ์ˆ˜๋ฅผ ์œ ์ถ”ํ•ด ๋‚ด๊ธฐ ์œ„ํ•œ ๊ธฐ๊ณ„ํ•™์Šต(Machine
  Learning)์˜ ํ•œ ๋ฐฉ๋ฒ•

๏ƒ  ์˜ˆ์ธก๋ชจ๋ธ
 โ€“ ์—ฐ์†ํ˜• : ๋ถ„๋ฅ˜(calssification)
 โ€“ ์ด์‚ฐํ˜• : ํšŒ๊ท€๋ถ„์„(regression)

๏ƒ ์˜ˆ์ œ๋ฅผ ํ†ตํ•œ ๊ฒฐ์ • ํŠธ๋ฆฌ ๋ฐฐ์šฐ๊ธฐ
 โ€“ ์ค‘์ฒฉ๋œ if-then ํ˜•์‹์˜ ๋ชจ๋ธ๋ง
๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ


๏ƒ  ์ •๋ณด ์—”ํŠธ๋กœํ”ผ(information entropy)
 โ€“ ๊ฐ’์˜ ๋ถ„ํฌ์™€ ๊ด€๋ จ๋œ ๋ฌด์งˆ์„œ ์ •๋„ ์ธก์ •.
๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ


๏ƒ  Dateset
๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ


๏ƒ  ๋ช‡๋ฒˆ์˜ ๋ถ„ํ• ์„ ๊ฑฐ์น˜๋ฉดโ€ฆ
๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ


๏ƒ  ๋‚˜์ด๋ธŒ ๋ฒ ์ด์ง€์–ธ ๋ถ„๋ฅ˜๊ธฐ
โ€“ simple probabilistic classification
๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ


๏ƒ  ๋ฒ ์ด์ง€์–ธ ๋„คํŠธ์›Œํฌ(๋ฏฟ์Œ ๋„คํŠธ์›Œํฌ ๋˜๋Š” ํ™•๋ฅ ์  ๋„คํŠธ์›Œํฌ)
โ€“ ๋‚˜์ด๋ธŒ ๋ฒ ์ด์ง€์–ธ ๋ถ„์„ ๋ฐฉ๋ฒ•์˜ ๊ทธ๋ž˜ํ”ฝ ํ‘œํ˜„.
์™ธ์นด API๋ฅผ ํ™œ์šฉํ•œ ๋ธ”๋กœ๊ทธ ๊ธ€ ๋ถ„๋ฅ˜



๏ƒ  Diagram
์™ธ์นด API๋ฅผ ํ™œ์šฉํ•œ ๋ธ”๋กœ๊ทธ ๊ธ€ ๋ถ„๋ฅ˜



๏ƒ  ๋ธ”๋กœ๊ทธ ๊ธ€์„ ๋ถ„๋ฅ˜ํ•˜๊ธฐ ์œ„ํ•œ ๋ฐ์ดํ„ฐ์…‹ ๋นŒ๋“œ

โ€“ ํ…Œ๊ทธ์…‹๊ณผ ๊ด€๋ จ๋œ ๋ธ”๋กœ๊ทธ ๊ธ€์„ ๊ฐ€์ ธ์˜จ๋‹ค (๊ด€์‹ฌํƒœ๊ทธ, ๊ด€์‹ฌ์—†๋Š”ํƒœ๊ทธ๋กœ ์ด๋ฃจ์–ด์ ธ
 ์žˆ๋‹ค)

โ€“ ๋ธ”๋กœ๊ทธ ๊ธ€์„ ํŒŒ์‹ฑํ•˜์—ฌ ํ…€๋ฒกํ„ฐ๋กœ ๋งŒ๋“ ๋‹ค.ํ…€ ๋ฒกํ„ฐ๋Š” ๊ด€์‹ฌ์—ฌ๋ถ€๋ฅผ ์˜๋ฏธํ•˜๋Š” ์˜ˆ์ธก
 ๊ฐ’๊ณผ ์—ฐ๊ด€.

โ€“ ๊ฐ ํ…€๋ฒกํ„ฐ๋ฅผ ์™ธ์นด Instance ๊ฐ์ฒด๋กœ ๋ณ€ํ™˜ํ•œ๋‹ค. ์ด๋Ÿฐ Instance ๊ฐ์ฒด๊ฐ€ ๋ชจ์—ฌ์„œ
 ๋ฐ์ดํ„ฐ์…‹์„ ๊ตฌ์„ฑํ•˜๋ฉฐ, ์ด๋Š” Instances ๊ฐ์ฒด๋กœ ํ‘œํ˜„๋œ๋‹ค.

โ€“ ๋ช…๋ชฉํ˜• ์†์„ฑ์ธ ํƒœ๊ทธ๋Š”, ๋ธ”๋กœ๊ทธ ๊ธ€์— ํƒœ๊ทธ๊ฐ€ ์กด์žฌํ•˜๋ฉด ๊ฐ’์ด ์ฐธ์ด๊ณ , ์กด์žฌํ•˜์ง€
 ์•Š์œผ๋ฉด ๊ฐ’์ด boolean ์†์„์œผ๋กœ ๊ฐ ํƒœ๊ทธ๋ฅผ ๋ณ€ํ™˜
์™ธ์นด API๋ฅผ ํ™œ์šฉํ•œ ๋ธ”๋กœ๊ทธ ๊ธ€ ๋ถ„๋ฅ˜



๏ƒ  ๋ถ„๋ฅ˜๊ธฐ ํด๋ž˜์Šค ๊ตฌ์ถ•
ํšŒ๊ท€ ๋ถ„์„์˜ ๊ธฐ์ดˆ


๏ƒ  ์„ ํ˜• ํšŒ๊ท€ ๋ถ„์„
 โ€“ Y=aX+b

๏ƒ  ์žฌ์‚ฐ์„ ๋งŽ์ด ๋ณด์œ ํ•œ ์‚ฌ์šฉ์ž์˜ ๋ฐ์ดํ„ฐ




๏ƒ ํšŒ๊ท€๋ถ„์„์„ ์‚ฌ์šฉํ•œ ๊ฒฐ๊ณผ
์™ธ์นด๋ฅผ ์‚ฌ์šฉํ•œ ํšŒ๊ท€ ๋ถ„์„


๏ƒ  Codesโ€ฆ.
JDM์„ ์‚ฌ์šฉํ•œ ๋ถ„๋ฅ˜์™€ ํšŒ๊ท€ ๋ถ„์„

๏ƒ  ์ฃผ์š” JDM ๊ต์‚ฌ ํ•™์Šต ๊ด€๋ จ ํด๋ž˜์Šค
JDM์„ ์‚ฌ์šฉํ•œ ๋ถ„๋ฅ˜์™€ ํšŒ๊ท€ ๋ถ„์„

๏ƒ  ์ฃผ์š” JDM ๊ต์‚ฌ ํ•™์Šต ๊ด€๋ จ ํด๋ž˜์Šค
JDM์„ ์‚ฌ์šฉํ•œ ๋ถ„๋ฅ˜์™€ ํšŒ๊ท€ ๋ถ„์„

๏ƒ  ์ฃผ์š” JDM ๊ต์‚ฌ ํ•™์Šต ๊ด€๋ จ ํด๋ž˜์Šค
JDM์„ ์‚ฌ์šฉํ•œ ๋ถ„๋ฅ˜์™€ ํšŒ๊ท€ ๋ถ„์„

๏ƒ  JDM API๋ฅผ ์ด์šฉํ•œ ๊ต์‚ฌ ํ•™์Šต ์„ค์ •
 โ€“   ๋ถ„๋ฅ˜ ์„ค์ • ๊ฐ์ฒด ์ƒ์„ฑ
 โ€“   JDM API๋ฅผ ์ด์šฉํ•œ ๋ถ„๋ฅ˜   ํƒœ์Šคํฌ ์ƒ์„ฑ
 โ€“   JDM API๋ฅผ ์ด์šฉํ•œ ๋ถ„๋ฅ˜   ํƒœ์Šคํฌ ์‹คํ–‰
 โ€“   JDM API๋ฅผ ์ด์šฉํ•œ ๋ถ„๋ฅ˜   ๋ชจ๋ธ ๊ฐ€์ ธ์˜ค๊ธฐ
 โ€“   JDM API๋ฅผ ์ด์šฉํ•œ ๋ชจ๋ธ   ํ…Œ์ŠคํŠธ

More Related Content

Viewers also liked

2012-02-15 Building Student Advocacy for OER (Connexions Conference)
2012-02-15 Building Student Advocacy for OER (Connexions Conference)2012-02-15 Building Student Advocacy for OER (Connexions Conference)
2012-02-15 Building Student Advocacy for OER (Connexions Conference)Nicole Allen
ย 
Ziua Pedagogului
Ziua PedagoguluiZiua Pedagogului
Ziua PedagoguluiAdela Negura
ย 
Dimple bawri oct 11
Dimple bawri oct 11Dimple bawri oct 11
Dimple bawri oct 11Mayuri Pitale
ย 
Intro to n screen-rev
Intro to n screen-revIntro to n screen-rev
Intro to n screen-revYoun Sang Jang
ย 
Filosofia 1
Filosofia 1Filosofia 1
Filosofia 1checho995
ย 
USGBC CO Southern Branch
USGBC CO Southern BranchUSGBC CO Southern Branch
USGBC CO Southern BranchZachary Collins
ย 
Gadgets(autosaved)
Gadgets(autosaved)Gadgets(autosaved)
Gadgets(autosaved)Tushar Upadhyay
ย 
Radio Sua Voz
Radio Sua VozRadio Sua Voz
Radio Sua VozDaniel_Cajobi
ย 
Pricing by Preston McAfee
Pricing by Preston McAfeePricing by Preston McAfee
Pricing by Preston McAfeeGene Chuang
ย 
Open Ed Advocacy: Ideas That Work (Tacoma CC, 7 Nov 2014)
Open Ed Advocacy: Ideas That Work (Tacoma CC, 7 Nov 2014)Open Ed Advocacy: Ideas That Work (Tacoma CC, 7 Nov 2014)
Open Ed Advocacy: Ideas That Work (Tacoma CC, 7 Nov 2014)Nicole Allen
ย 
Learning is Open #UWdtl
Learning is Open #UWdtlLearning is Open #UWdtl
Learning is Open #UWdtlNicole Allen
ย 
Penn State #OERSummit16 Keynote
Penn State #OERSummit16 KeynotePenn State #OERSummit16 Keynote
Penn State #OERSummit16 KeynoteNicole Allen
ย 
Hun bgali ariunsanaa
Hun bgali ariunsanaaHun bgali ariunsanaa
Hun bgali ariunsanaatsendeehuu
ย 

Viewers also liked (16)

2012-02-15 Building Student Advocacy for OER (Connexions Conference)
2012-02-15 Building Student Advocacy for OER (Connexions Conference)2012-02-15 Building Student Advocacy for OER (Connexions Conference)
2012-02-15 Building Student Advocacy for OER (Connexions Conference)
ย 
Eb aug11
Eb aug11Eb aug11
Eb aug11
ย 
Ziua Pedagogului
Ziua PedagoguluiZiua Pedagogului
Ziua Pedagogului
ย 
Dimple bawri oct 11
Dimple bawri oct 11Dimple bawri oct 11
Dimple bawri oct 11
ย 
Intro to n screen-rev
Intro to n screen-revIntro to n screen-rev
Intro to n screen-rev
ย 
Filosofia 1
Filosofia 1Filosofia 1
Filosofia 1
ย 
USGBC CO Southern Branch
USGBC CO Southern BranchUSGBC CO Southern Branch
USGBC CO Southern Branch
ย 
Gadgets(autosaved)
Gadgets(autosaved)Gadgets(autosaved)
Gadgets(autosaved)
ย 
Tp12 1
Tp12 1Tp12 1
Tp12 1
ย 
Proyecto de vida
Proyecto de vidaProyecto de vida
Proyecto de vida
ย 
Radio Sua Voz
Radio Sua VozRadio Sua Voz
Radio Sua Voz
ย 
Pricing by Preston McAfee
Pricing by Preston McAfeePricing by Preston McAfee
Pricing by Preston McAfee
ย 
Open Ed Advocacy: Ideas That Work (Tacoma CC, 7 Nov 2014)
Open Ed Advocacy: Ideas That Work (Tacoma CC, 7 Nov 2014)Open Ed Advocacy: Ideas That Work (Tacoma CC, 7 Nov 2014)
Open Ed Advocacy: Ideas That Work (Tacoma CC, 7 Nov 2014)
ย 
Learning is Open #UWdtl
Learning is Open #UWdtlLearning is Open #UWdtl
Learning is Open #UWdtl
ย 
Penn State #OERSummit16 Keynote
Penn State #OERSummit16 KeynotePenn State #OERSummit16 Keynote
Penn State #OERSummit16 Keynote
ย 
Hun bgali ariunsanaa
Hun bgali ariunsanaaHun bgali ariunsanaa
Hun bgali ariunsanaa
ย 

Similar to 10 predictionsystem - pptx

แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซแ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซTae Young Lee
ย 
แ„‡แ…กแ†ฏแ„‘แ…ญแ„Œแ…กแ„…แ…ญ 11แ„Œแ…กแ†ผ
แ„‡แ…กแ†ฏแ„‘แ…ญแ„Œแ…กแ„…แ…ญ 11แ„Œแ…กแ†ผแ„‡แ…กแ†ฏแ„‘แ…ญแ„Œแ…กแ„…แ…ญ 11แ„Œแ…กแ†ผ
แ„‡แ…กแ†ฏแ„‘แ…ญแ„Œแ…กแ„…แ…ญ 11แ„Œแ…กแ†ผJuhui Park
ย 
์•„ํ•ดํŒ€ ์Šคํ„ฐ๋”” Ormแ„‹แ…ณแ†ซ แ„‹แ…ฅแ„„แ…ฅแ‡‚แ„€แ…ฆ แ„€แ…ขแ†จแ„Žแ…ฆแ„…แ…ณแ†ฏ แ„†แ…ขแ„‘แ…ตแ†ผแ„’แ…กแ†ฏแ„แ…ก
์•„ํ•ดํŒ€ ์Šคํ„ฐ๋”” Ormแ„‹แ…ณแ†ซ แ„‹แ…ฅแ„„แ…ฅแ‡‚แ„€แ…ฆ แ„€แ…ขแ†จแ„Žแ…ฆแ„…แ…ณแ†ฏ แ„†แ…ขแ„‘แ…ตแ†ผแ„’แ…กแ†ฏแ„แ…ก์•„ํ•ดํŒ€ ์Šคํ„ฐ๋”” Ormแ„‹แ…ณแ†ซ แ„‹แ…ฅแ„„แ…ฅแ‡‚แ„€แ…ฆ แ„€แ…ขแ†จแ„Žแ…ฆแ„…แ…ณแ†ฏ แ„†แ…ขแ„‘แ…ตแ†ผแ„’แ…กแ†ฏแ„แ…ก
์•„ํ•ดํŒ€ ์Šคํ„ฐ๋”” Ormแ„‹แ…ณแ†ซ แ„‹แ…ฅแ„„แ…ฅแ‡‚แ„€แ…ฆ แ„€แ…ขแ†จแ„Žแ…ฆแ„…แ…ณแ†ฏ แ„†แ…ขแ„‘แ…ตแ†ผแ„’แ…กแ†ฏแ„แ…กSunghyun Roh
ย 
PySpark ๋ฐฐ์šฐ๊ธฐ Ch 06. ML ํŒจํ‚ค์ง€ ์†Œ๊ฐœํ•˜๊ธฐ
PySpark ๋ฐฐ์šฐ๊ธฐ Ch 06. ML ํŒจํ‚ค์ง€ ์†Œ๊ฐœํ•˜๊ธฐPySpark ๋ฐฐ์šฐ๊ธฐ Ch 06. ML ํŒจํ‚ค์ง€ ์†Œ๊ฐœํ•˜๊ธฐ
PySpark ๋ฐฐ์šฐ๊ธฐ Ch 06. ML ํŒจํ‚ค์ง€ ์†Œ๊ฐœํ•˜๊ธฐ์ฐฌํฌ ์ด
ย 
UiPath์™€ Python ML(Machine Learning) ๋ชจ๋ธ ์—ฐ๋™
UiPath์™€ Python ML(Machine Learning) ๋ชจ๋ธ ์—ฐ๋™UiPath์™€ Python ML(Machine Learning) ๋ชจ๋ธ ์—ฐ๋™
UiPath์™€ Python ML(Machine Learning) ๋ชจ๋ธ ์—ฐ๋™TIMEGATE
ย 
Spark Day 2017 Machine Learning & Deep Learning With Spark
Spark Day 2017 Machine Learning & Deep Learning With SparkSpark Day 2017 Machine Learning & Deep Learning With Spark
Spark Day 2017 Machine Learning & Deep Learning With SparkSangHoon Lee
ย 

Similar to 10 predictionsystem - pptx (6)

แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซแ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
แ„€แ…ฅแ†ทแ„‰แ…ขแ†จแ„‹แ…ฆแ†ซแ„Œแ…ตแ†ซแ„‹แ…ฆ แ„Œแ…ฅแ†จแ„‹แ…ญแ†ผแ„ƒแ…ฌแ†ซ แ„ƒแ…ตแ†ธแ„…แ…ฅแ„‚แ…ตแ†ผ แ„†แ…ฉแ„ƒแ…ฆแ†ฏ แ„‡แ…กแ†ผแ„‡แ…ฅแ†ธแ„…แ…ฉแ†ซ
ย 
แ„‡แ…กแ†ฏแ„‘แ…ญแ„Œแ…กแ„…แ…ญ 11แ„Œแ…กแ†ผ
แ„‡แ…กแ†ฏแ„‘แ…ญแ„Œแ…กแ„…แ…ญ 11แ„Œแ…กแ†ผแ„‡แ…กแ†ฏแ„‘แ…ญแ„Œแ…กแ„…แ…ญ 11แ„Œแ…กแ†ผ
แ„‡แ…กแ†ฏแ„‘แ…ญแ„Œแ…กแ„…แ…ญ 11แ„Œแ…กแ†ผ
ย 
์•„ํ•ดํŒ€ ์Šคํ„ฐ๋”” Ormแ„‹แ…ณแ†ซ แ„‹แ…ฅแ„„แ…ฅแ‡‚แ„€แ…ฆ แ„€แ…ขแ†จแ„Žแ…ฆแ„…แ…ณแ†ฏ แ„†แ…ขแ„‘แ…ตแ†ผแ„’แ…กแ†ฏแ„แ…ก
์•„ํ•ดํŒ€ ์Šคํ„ฐ๋”” Ormแ„‹แ…ณแ†ซ แ„‹แ…ฅแ„„แ…ฅแ‡‚แ„€แ…ฆ แ„€แ…ขแ†จแ„Žแ…ฆแ„…แ…ณแ†ฏ แ„†แ…ขแ„‘แ…ตแ†ผแ„’แ…กแ†ฏแ„แ…ก์•„ํ•ดํŒ€ ์Šคํ„ฐ๋”” Ormแ„‹แ…ณแ†ซ แ„‹แ…ฅแ„„แ…ฅแ‡‚แ„€แ…ฆ แ„€แ…ขแ†จแ„Žแ…ฆแ„…แ…ณแ†ฏ แ„†แ…ขแ„‘แ…ตแ†ผแ„’แ…กแ†ฏแ„แ…ก
์•„ํ•ดํŒ€ ์Šคํ„ฐ๋”” Ormแ„‹แ…ณแ†ซ แ„‹แ…ฅแ„„แ…ฅแ‡‚แ„€แ…ฆ แ„€แ…ขแ†จแ„Žแ…ฆแ„…แ…ณแ†ฏ แ„†แ…ขแ„‘แ…ตแ†ผแ„’แ…กแ†ฏแ„แ…ก
ย 
PySpark ๋ฐฐ์šฐ๊ธฐ Ch 06. ML ํŒจํ‚ค์ง€ ์†Œ๊ฐœํ•˜๊ธฐ
PySpark ๋ฐฐ์šฐ๊ธฐ Ch 06. ML ํŒจํ‚ค์ง€ ์†Œ๊ฐœํ•˜๊ธฐPySpark ๋ฐฐ์šฐ๊ธฐ Ch 06. ML ํŒจํ‚ค์ง€ ์†Œ๊ฐœํ•˜๊ธฐ
PySpark ๋ฐฐ์šฐ๊ธฐ Ch 06. ML ํŒจํ‚ค์ง€ ์†Œ๊ฐœํ•˜๊ธฐ
ย 
UiPath์™€ Python ML(Machine Learning) ๋ชจ๋ธ ์—ฐ๋™
UiPath์™€ Python ML(Machine Learning) ๋ชจ๋ธ ์—ฐ๋™UiPath์™€ Python ML(Machine Learning) ๋ชจ๋ธ ์—ฐ๋™
UiPath์™€ Python ML(Machine Learning) ๋ชจ๋ธ ์—ฐ๋™
ย 
Spark Day 2017 Machine Learning & Deep Learning With Spark
Spark Day 2017 Machine Learning & Deep Learning With SparkSpark Day 2017 Machine Learning & Deep Learning With Spark
Spark Day 2017 Machine Learning & Deep Learning With Spark
ย 

10 predictionsystem - pptx

  • 2. ๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ ๏ƒ  ๊ต์‚ฌํ•™์Šต (Supervised Learning) โ€“ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ(Training Data) ๋กœ ๋ถ€ํ„ฐ ํ•˜๋‚˜์˜ ํ•จ์ˆ˜๋ฅผ ์œ ์ถ”ํ•ด ๋‚ด๊ธฐ ์œ„ํ•œ ๊ธฐ๊ณ„ํ•™์Šต(Machine Learning)์˜ ํ•œ ๋ฐฉ๋ฒ• ๏ƒ  ์˜ˆ์ธก๋ชจ๋ธ โ€“ ์—ฐ์†ํ˜• : ๋ถ„๋ฅ˜(calssification) โ€“ ์ด์‚ฐํ˜• : ํšŒ๊ท€๋ถ„์„(regression) ๏ƒ ์˜ˆ์ œ๋ฅผ ํ†ตํ•œ ๊ฒฐ์ • ํŠธ๋ฆฌ ๋ฐฐ์šฐ๊ธฐ โ€“ ์ค‘์ฒฉ๋œ if-then ํ˜•์‹์˜ ๋ชจ๋ธ๋ง
  • 3. ๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ ๏ƒ  ์ •๋ณด ์—”ํŠธ๋กœํ”ผ(information entropy) โ€“ ๊ฐ’์˜ ๋ถ„ํฌ์™€ ๊ด€๋ จ๋œ ๋ฌด์งˆ์„œ ์ •๋„ ์ธก์ •.
  • 5. ๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ ๏ƒ  ๋ช‡๋ฒˆ์˜ ๋ถ„ํ• ์„ ๊ฑฐ์น˜๋ฉดโ€ฆ
  • 6. ๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ ๏ƒ  ๋‚˜์ด๋ธŒ ๋ฒ ์ด์ง€์–ธ ๋ถ„๋ฅ˜๊ธฐ โ€“ simple probabilistic classification
  • 7. ๋ถ„๋ฅ˜์— ๋Œ€ํ•œ ๊ธฐ์ดˆ ๏ƒ  ๋ฒ ์ด์ง€์–ธ ๋„คํŠธ์›Œํฌ(๋ฏฟ์Œ ๋„คํŠธ์›Œํฌ ๋˜๋Š” ํ™•๋ฅ ์  ๋„คํŠธ์›Œํฌ) โ€“ ๋‚˜์ด๋ธŒ ๋ฒ ์ด์ง€์–ธ ๋ถ„์„ ๋ฐฉ๋ฒ•์˜ ๊ทธ๋ž˜ํ”ฝ ํ‘œํ˜„.
  • 8. ์™ธ์นด API๋ฅผ ํ™œ์šฉํ•œ ๋ธ”๋กœ๊ทธ ๊ธ€ ๋ถ„๋ฅ˜ ๏ƒ  Diagram
  • 9. ์™ธ์นด API๋ฅผ ํ™œ์šฉํ•œ ๋ธ”๋กœ๊ทธ ๊ธ€ ๋ถ„๋ฅ˜ ๏ƒ  ๋ธ”๋กœ๊ทธ ๊ธ€์„ ๋ถ„๋ฅ˜ํ•˜๊ธฐ ์œ„ํ•œ ๋ฐ์ดํ„ฐ์…‹ ๋นŒ๋“œ โ€“ ํ…Œ๊ทธ์…‹๊ณผ ๊ด€๋ จ๋œ ๋ธ”๋กœ๊ทธ ๊ธ€์„ ๊ฐ€์ ธ์˜จ๋‹ค (๊ด€์‹ฌํƒœ๊ทธ, ๊ด€์‹ฌ์—†๋Š”ํƒœ๊ทธ๋กœ ์ด๋ฃจ์–ด์ ธ ์žˆ๋‹ค) โ€“ ๋ธ”๋กœ๊ทธ ๊ธ€์„ ํŒŒ์‹ฑํ•˜์—ฌ ํ…€๋ฒกํ„ฐ๋กœ ๋งŒ๋“ ๋‹ค.ํ…€ ๋ฒกํ„ฐ๋Š” ๊ด€์‹ฌ์—ฌ๋ถ€๋ฅผ ์˜๋ฏธํ•˜๋Š” ์˜ˆ์ธก ๊ฐ’๊ณผ ์—ฐ๊ด€. โ€“ ๊ฐ ํ…€๋ฒกํ„ฐ๋ฅผ ์™ธ์นด Instance ๊ฐ์ฒด๋กœ ๋ณ€ํ™˜ํ•œ๋‹ค. ์ด๋Ÿฐ Instance ๊ฐ์ฒด๊ฐ€ ๋ชจ์—ฌ์„œ ๋ฐ์ดํ„ฐ์…‹์„ ๊ตฌ์„ฑํ•˜๋ฉฐ, ์ด๋Š” Instances ๊ฐ์ฒด๋กœ ํ‘œํ˜„๋œ๋‹ค. โ€“ ๋ช…๋ชฉํ˜• ์†์„ฑ์ธ ํƒœ๊ทธ๋Š”, ๋ธ”๋กœ๊ทธ ๊ธ€์— ํƒœ๊ทธ๊ฐ€ ์กด์žฌํ•˜๋ฉด ๊ฐ’์ด ์ฐธ์ด๊ณ , ์กด์žฌํ•˜์ง€ ์•Š์œผ๋ฉด ๊ฐ’์ด boolean ์†์„์œผ๋กœ ๊ฐ ํƒœ๊ทธ๋ฅผ ๋ณ€ํ™˜
  • 10. ์™ธ์นด API๋ฅผ ํ™œ์šฉํ•œ ๋ธ”๋กœ๊ทธ ๊ธ€ ๋ถ„๋ฅ˜ ๏ƒ  ๋ถ„๋ฅ˜๊ธฐ ํด๋ž˜์Šค ๊ตฌ์ถ•
  • 11. ํšŒ๊ท€ ๋ถ„์„์˜ ๊ธฐ์ดˆ ๏ƒ  ์„ ํ˜• ํšŒ๊ท€ ๋ถ„์„ โ€“ Y=aX+b ๏ƒ  ์žฌ์‚ฐ์„ ๋งŽ์ด ๋ณด์œ ํ•œ ์‚ฌ์šฉ์ž์˜ ๋ฐ์ดํ„ฐ ๏ƒ ํšŒ๊ท€๋ถ„์„์„ ์‚ฌ์šฉํ•œ ๊ฒฐ๊ณผ
  • 13. JDM์„ ์‚ฌ์šฉํ•œ ๋ถ„๋ฅ˜์™€ ํšŒ๊ท€ ๋ถ„์„ ๏ƒ  ์ฃผ์š” JDM ๊ต์‚ฌ ํ•™์Šต ๊ด€๋ จ ํด๋ž˜์Šค
  • 14. JDM์„ ์‚ฌ์šฉํ•œ ๋ถ„๋ฅ˜์™€ ํšŒ๊ท€ ๋ถ„์„ ๏ƒ  ์ฃผ์š” JDM ๊ต์‚ฌ ํ•™์Šต ๊ด€๋ จ ํด๋ž˜์Šค
  • 15. JDM์„ ์‚ฌ์šฉํ•œ ๋ถ„๋ฅ˜์™€ ํšŒ๊ท€ ๋ถ„์„ ๏ƒ  ์ฃผ์š” JDM ๊ต์‚ฌ ํ•™์Šต ๊ด€๋ จ ํด๋ž˜์Šค
  • 16. JDM์„ ์‚ฌ์šฉํ•œ ๋ถ„๋ฅ˜์™€ ํšŒ๊ท€ ๋ถ„์„ ๏ƒ  JDM API๋ฅผ ์ด์šฉํ•œ ๊ต์‚ฌ ํ•™์Šต ์„ค์ • โ€“ ๋ถ„๋ฅ˜ ์„ค์ • ๊ฐ์ฒด ์ƒ์„ฑ โ€“ JDM API๋ฅผ ์ด์šฉํ•œ ๋ถ„๋ฅ˜ ํƒœ์Šคํฌ ์ƒ์„ฑ โ€“ JDM API๋ฅผ ์ด์šฉํ•œ ๋ถ„๋ฅ˜ ํƒœ์Šคํฌ ์‹คํ–‰ โ€“ JDM API๋ฅผ ์ด์šฉํ•œ ๋ถ„๋ฅ˜ ๋ชจ๋ธ ๊ฐ€์ ธ์˜ค๊ธฐ โ€“ JDM API๋ฅผ ์ด์šฉํ•œ ๋ชจ๋ธ ํ…Œ์ŠคํŠธ

Editor's Notes

  1. ๊ต์‚ฌํ•™์Šต(์ง€๋„ํ•™์Šต)Supervised Learning์€ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ(Training Data)๋กœ๋ถ€ํ„ฐ ํ•˜๋‚˜์˜ ํ•จ์ˆ˜๋ฅผ ์œ ์ถ”ํ•ด๋‚ด๊ธฐ ์œ„ํ•œ ๊ธฐ๊ณ„ํ•™์Šต(Machine Learning)์˜ ํ•œ ๋ฐฉ๋ฒ•์ด๋‹ค. ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๋Š” ์ผ๋ฐ˜์ ์œผ๋กœ ์ž…๋ ฅ ๊ฐ์ฒด์— ๋Œ€ํ•œ ์†์„ฑ์„ ๋ฒกํ„ฐ ํ˜•ํƒœ๋กœ ํฌํ•จํ•˜๊ณ  ์žˆ์œผ๋ฉฐ ๊ฐ๊ฐ์˜ ๋ฒกํ„ฐ์— ๋Œ€ํ•ด ์›ํ•˜๋Š” ๊ฒฐ๊ณผ๊ฐ€ ๋ฌด์—‡์ธ์ง€ ํ‘œ์‹œ๋˜์–ด ์žˆ๋‹ค. ์ด๋ ‡๊ฒŒ ์œ ์ถ”๋œ ํ•จ์ˆ˜ ์ค‘ ์—ฐ์†์ ์ธ ๊ฐ’์„ ์ถœ๋ ฅํ•˜๋Š” ๊ฒƒ์„ ํšŒ๊ท€๋ถ„์„(Regression)์ด๋ผ ํ•˜๊ณ  ์ฃผ์–ด์ง„ ์ž…๋ ฅ ๋ฒกํ„ฐ๊ฐ€ ์–ด๋–ค ์ข…๋ฅ˜์˜ ๊ฐ’์ธ์ง€ ํ‘œ์‹ํ•˜๋Š” ๊ฒƒ์„ ๋ถ„๋ฅ˜(Classification)๋ผ ํ•œ๋‹ค.- ์ง€๋„ ํ•™์Šต๊ธฐ(Supervised Learner)๊ฐ€ ํ•˜๋Š” ์ž‘์—…์€ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ์ฃผ์–ด์ง„ ๋ฐ์ดํ„ฐ์— ๋Œ€ํ•ด ์˜ˆ์ธกํ•˜๊ณ ์ž ํ•˜๋Š” ๊ฐ’์„ ์˜ฌ๋ฐ”๋กœ ์ถ”์ธกํ•ด๋‚ด๋Š” ๊ฒƒ์ด๋‹ค. ์ด ๋ชฉํ‘œ๋ฅผ ๋‹ฌ์„ฑํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” ํ•™์Šต๊ธฐ๊ฐ€ โ€œ์•Œ๋งž์€โ€ ๋ฐฉ๋ฒ•์„ ํ†ตํ•˜์—ฌ ๊ธฐ์กด์˜ ํ›ˆ๋ จ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ ๋‚˜ํƒ€๋‚˜์ง€ ์•Š๋˜ ์ƒํ™ฉ๊นŒ์ง€๋„ ์ผ๋ฐ˜ํ™”ํ•˜์—ฌ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ์–ด์•ผ ํ•œ๋‹ค. ์‚ฌ๋žŒ๊ณผ ๋™๋ฌผ์— ๋Œ€์‘ํ•˜๋Š” ์‹ฌ๋ฆฌํ•™์œผ๋กœ๋Š” ๊ฐœ๋…ํ•™์Šต(Concept Learning)์„ ์˜ˆ๋กœ ๋“ค ์ˆ˜ ์žˆ๋‹ค.- Supervised Learning์„์ด์šฉํ•œ์•Œ๊ณ ๋ฆฌ์ฆ˜. ์„œํฌํŠธ ๋ฒกํ„ฐ ๋จธ์‹ (support vector machine). ์€๋‹‰ ๋งˆ๋ฅด์ฝ”ํ”„ ๋ชจ๋ธ(Hidden Markov model). ํšŒ๊ท€ ๋ถ„์„(Regression). ์‹ ๊ฒฝ๋ง(Neural network). ๋‚˜์ด๋ธŒ ๋ฉ”์ด์ฆˆ ๋ถ„๋ฅ˜(Naรฏve Bayes Classification)์˜ˆ์ธก๋ชจ๋ธ์„ ์‚ฌ์šฉํ• ๋•Œ๋Š” ํ•™์Šต, ์—ํ”Œ๋ฆฌ์ผ€์ด์…˜์˜ 2๋‹จ๊ณ„๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.- ํ•™์Šต๋‹จ๊ณ„์—์„œ๋Š” ์ž…๋ ฅ๊ณผ ์ถœ๋ ฅ ์†์„ฑ์„ ๊ฐ€์ง„ ๋ฐ์ดํ„ฐ ์…‹์ด ์ฃผ์–ด์ง€๊ณ , ๊ทธ ๋ฐ์ดํ„ฐ ์…‹์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์ˆ˜ํ•™์ ์ธ ๋ชจ๋ธ์„ ๋นŒ๋“œํ•ฉ๋‹ˆ๋‹ค.- ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์—์„œ ๋ฌด์–ธ๊ฐ€๋ฅผ ์˜ˆ์ธกํ• ๋•Œ๋Š” ์ด ์ˆ˜ํ•™ ๋ชจ๋ธ์„ ์ด์šฉํ•ฉ๋‹ˆ๋‹ค.=๏ƒจ ์ƒ๋Œ€์ ์œผ๋กœ ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์—์„œ ์˜ˆ์ธก์€ ๋น ๋ฅด๊ธฐ๋•Œ๋ฌธ์— ์‹ค์‹œ๊ฐ„ ์˜ˆ์ธก์˜ ์ ์šฉ์ด ๊ฐ€๋Šฅํ•ฉ๋‹ˆ๋‹ค. ๋ฐ˜๋Œ€๋กœ ํ•™์Šต๋‹จ๊ณ„์—์„œ๋Š” ๋Š๋ฆฌ๊ธฐ๋•Œ๋ฌธ์— ๋น„๋™๊ธฐ์ ์ธ ์ˆ˜ํ–‰์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.-
  2. ๋‚˜์ด๋ธŒ ๋ฒ ์ด์ง€์–ธ ๋ถ„๋ฅ˜๊ธฐ๋‹จ์ˆœํ•œ ํ™•๋ฅ ์  ๋ถ„๋ฅ˜๋ฒ• ์ด๋ผ๊ณ  ํ•ฉ๋‹ˆ๋‹ค. ํ™•๋ฅ  ๋ชจ๋ธ์€ ๋ฒ ์ด์ง€์˜ ์ •๋ฆฌ(http://www.aistudy.com/math/bayes_theorem.htm)์—์„œ ์œ ๋„ ๋˜์—ˆ๊ณ , ํ•ต์‹ฌ์ ์ธ ๋‚ด์šฉ์€ ์ƒˆ๋กœ์šด ์ •๋ณด๋ฅผ ์ด์šฉํ•˜์—ฌ ์ ์ง„์ ์œผ๋กœ ํ™•๋ฅ ์˜ ๊ฐœ์„ ์ž‘์—…์ด ์ด๋ฃจ์–ด ์ง„๋‹ค๊ณ  ํ•ฉ๋‹ˆ๋‹ค.
  3. Bayesian network ๋˜๋Š” Bayesian belief network ๋Š” ๋ณ€์ˆ˜๋ฅผ ํ‘œํ˜„ํ•˜๋Š” ๋…ธ๋“œ (node) ์™€ ๋ณ€์ˆ˜๋“ค ๊ฐ„์˜ ์˜์กด๊ด€๊ณ„๋ฅผ ํ‘œํ˜„ํ•˜๋Š” ํ˜ธ (arc) ์˜ ๋ฐฉํ–ฅ์„ฑ ๋น„์ˆœํ™˜ ๊ทธ๋ž˜ํ”„ (directed acyclic graph) ์ด๋‹ค. ๋…ธ๋“œ A ์—์„œ ๋…ธ๋“œ B ๊นŒ์ง€์˜ ํ˜ธ๊ฐ€ ์žˆ๋‹ค๋ฉด A ๋Š” B ์˜ parent ๋ผ๊ณ  ๋ถ€๋ฅธ๋‹ค. ๋…ธ๋“œ๊ฐ€ ๊ฐ’์ด ์ฃผ์–ด์ ธ ์žˆ๋‹ค๋ฉด evidence node ๋ผ๊ณ  ๋ถ€๋ฅธ๋‹ค. ํ•˜๋‚˜์˜ ๋…ธ๋“œ๋Š” ์ธก์ •๊ฐ’, ์ธ์ˆ˜, ์ˆจ๊ฒจ์ง„ (latent)ย ๋ณ€์ˆ˜, ๊ฐ€์„ค ๋“ฑ์˜ ์–ด๋–ค ์ข…๋ฅ˜์˜ ๋ณ€์ˆ˜์ผ ์ˆ˜๋„ ์žˆ๋‹ค. ๋…ธ๋“œ๋Š” ์ž„์˜์˜ ๋ณ€์ˆ˜๋ฅผ ํ‘œํ˜„ํ•˜๋Š”๋ฐ ์ œ์•ฝ์ด ์—†๋‹ค ; ์ด๊ฒƒ์ดย Bayesian network ์— ๋Œ€ํ•ด์„œ "Bayesian" ์ด๋ผ๋Š” ๊ฒƒ์ด๋‹ค (Nodes are not restricted to representing random variables; this is what is "Bayesian" about a Bayesian network).
  4. WEKABlogClassfier ํด๋ž˜์Šค๋ฅผ ์ƒ์†ํ•˜์—ฌ ์™ธ์นด ํšŒ์นด ํšŒ๊ท€ ๋ถ„์„ API๋“ค์„ ํ˜ธ์ถœํ•˜๋Š” WEKABlogPredictor ํด๋ž˜์Šค๋ฅผ ๋งŒ๋“ญ๋‹ˆ๋‹ค.
  5. WEKAPredictiveBlogDataSetCreatorImpl ์˜ ๊ตฌํ˜„์„ ์„ค๋ช…ํ•˜๋Š” ํŒŒํŠธ ์ž…๋‹ˆ๋‹ค.
  6. ํšŒ๊ท€๋ถ„์„X,Y ๋ผ๋Š” ๋‘ ๋ณ€์ˆ˜๊ฐ€ ์žˆ์„๋•Œ, ์ƒ๊ด€๋ถ„์„์„ ํ†ตํ•ด์„œ Y~X ๊ฐ€ ์„œ๋กœ ์ƒ๊ด€๊ด€๊ณ„๊ฐ€ ์žˆ์„์„ ์•Œ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.์—ฌ๊ธฐ์„œ ๋” ๋‚˜์•„๊ฐ€ Y=aX+b ๋ผ๋Š” ๋ฐฉ์ •์‹์„ ์–ป์„ ์ˆ˜ ์žˆ์œผ๋ฉด,X๋ผ๋Š” ๋ณ€์ˆ˜๋กœ Y์˜ ๊ฐ’์„ ์˜ˆ์ธกํ•  ์ˆ˜ ์žˆ๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.์ด๋Ÿฐ ๋ฐฉ์ •์‹์„ ํšŒ๊ท€๋ฐฉ์ ์‹์ด๋ผ๊ณ  ํ•˜๋ฉฐ, ์ด๋Ÿฐ ๊ณผ์ •์„ ํšŒ๊ท€ ๋ถ„์„์ด๋ผ๊ณ  ํ•ฉ๋‹ˆ๋‹ค.
  7. WEKABlogClassifier ํด๋ž˜์Šค๋ฅผ ์ƒ์†๋ฐ›์•„ ๊ฐœ๋ฐœํ•œ๋‹ค.์—ฐ์†ํ˜• ์†์„ฑ์„ ๊ฐ€์ง„ Instance๋ฅผ ์ƒ์„ฑํ•ด์•ผ ํ•œ๋‹ค.