SlideShare a Scribd company logo
Enhancing Environmental Data
Forecasting Performance by
Utilizing Multi-region Data with
Hard-parameter sharing
Electronics and Telecommunications Research Institute
Jang-Ho Choi, Miyoung Jang, Taewhi Lee, Jongho Won and Jiyong Kim
janghochoi@etri.re.kr
Table of Contents
• Introduction
• Challenges
• Proposed Method
• Experimental Studies
• Q&A
Time-series Forecasting
• “Time series analysis comprises methods for analyzing time
series data in order to extract meaningful statistics and other
characteristics of the data.”
• “Time series forecasting is the use of a model to predict
future values based on previously observed values.”
• “Forecasting involves taking models fit on historical data and using
them to predict future observations.”
https://en.wikipedia.org/wiki/Time_series
https://machinelearningmastery.com/time-series-forecasting/
Environmental Data
• Environmental data, i.e. air quality and water quality data, are
generally multivariate, time-series data.
• They are generally collected from multiple sites.
<Water/Air Quality Monitoring Stations>
Challenges
• Datasets are not only scarce and expensive, but also often
severely imbalanced in practice.
• Insufficient and imbalance datasets can lead to biased models,
resulting poor prediction performance.
0
10
20
30
40
50
60
70
80
90
Chl-a (mg/m3)
Histogram of Chlorophyll-a Concentration
(mg/m3, Binwidth=1)
Chlorophyll-a Concentration in Daecheong Lake (2012)
#
Chl-a concentrations
Detour: Handling Data imbalance and
insufficiency
Utilizing Multi-region Datasets
• Training Regional Dataset (JangGye) vs. Additional Dataset
(JangGye + DaeCheong)
R2 MSE
Micro Acc
.
Macro Ac
c.
JangGye 0.5297 36.16 73.98% 66.72%
JangGye
+DaeCheong
0.4207 44.55 68.65% 49.04%
Difference 26.19% -18.83% 7.76% 36.05%
<Chlorophyll-a Prediction Results (JangGye)>
Hard-parameter Sharing
• In order to preserve region-specific
characteristic, we adopt hard-
parameter sharing from multi-task
learning [7].
• Hard-parameter sharing is a
technique to place common
hidden layers across multiple tasks,
while keeping task-specific layers.
Hard-parameter sharing is known
to reduce overfitting, but it is
generally restricted to tasks that
are closely related one another [8]. [8]
Multi-headed Networks
• Common layers fit on all data from multiple regions
• Region specific layers fit on their own regional data
<Iterative Training Process>
Common
Layers
R1 Layers
R2 Layers
R3 Layers
R4 Layers
R1
Data
R2
Data
R3
Data
R4
Data
입력 데이터
Experimental Studies
Multi-headed Neural Network Model
Algal Bloom
• “An algal bloom or algae bloom is a rapid increase or accumulation in the population of algae in freshwater or ma
rine water systems” (Wikipedia)
• For last few decades, algal bloom has been a serious problem, leading imbalance of organisms in the water.
• Severe algal bloom directly affects water quality as some of algae such as cyanobacteria produce toxic that are har
mful to other species—including humans.
YTN Joongang
Chlorophyll-a
• “Chlorophyll-a is a specific form of chlorophyll used in oxygenic p
hotosynthesis. It absorbs most energy from wavelengths of violet-
blue and orange-red light. It also reflects green-yellow light, and a
s such contributes to the observed green color of most plants.” (W
ikipedia)
• Korean Government has been operating the algal warning system
since 1998 based on chlorophyll-a concentration and the number
of cyanophyceae cells.
• Although using the concentration of chlorophyll-a as the primary indicator
has been debatable, it is still widely used as it can be measured at low cos
t, whereas counting cyanophyceae cells incur considerable human labor.
• Unfortunately, water quality datasets are not only scarce, but also
skewed and imbalanced due to their nature.
Datasets
• Water Quality Monitoring Station
(http://www.koreawqi.go.kr/)
• Daecheong, Janggye, Chungam, Namgang
• 2012/7/1 ~ 2018/3/31
• 8 Water Quality Index Variables:
• Chlorophyll-a
• Water Temperature, pH
• Total Electrical conductivity
• Dissolved Oxygen, Total Organic Carbons
• Total Nitrogen, Total Phosphorus
Class 0 Class 1 Class 2 Class 3
Chl-A 0~15mg/m3 15~25mg/m3 25~100mg/m3 100mg/m3+
JangGye 1217 255 142 0
DaeCheong 1451 164 70 2
CheongAm 794 523 535 10
NamGang 1478 156 365 5
Experiment Settings
f(x)
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
y y
f(x)
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
y y y y y y
f(x)
g1(z)
g2(z)
g3(z)
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
x1
x2
x3
x4
…
xn
y
y
y
y
y
y
Single region Data
Experiment Results
R2 MSE Micro Acc. Macro Acc.
Baseline
(Single Region) 0.5297 36.16 73.98% 66.72%
Baseline
(Multiple Regions) 0.4207 44.55 68.65% 49.04%
Multi-headed
(Multiple Regions) 0.5633 32.70 75.36% 70.04%
<Chlorophyll-A Prediction Results (JangGye)>
Baseline Model with Single Region (upper)
vs. Multi-head Neural Network Model (lower)
Class 0 Class 1 Class 2
Chl-A 0~15mg/m3 15~25mg/m3 25~100mg/m3
Baseline
(Single Region) 83.00% 59.26% 57.89%
Baseline
(Multiple Regions) 91.50% 34.57% 21.05%
Multi-headed
(Multiple Regions) 83.72% 58.89% 67.50%
<Performance Comparison (Class Accuracy)>
<Performance Comparison (R2, MSE, Micro and Macro Accuracy)>
Acknowledgement
• This work was supported by the Institute of Information &
communications Technology Planning & Evaluation (IITP)
grant funded by the Korea government(MSIT) (No. 2018-0-
00219, Space-time complex artificial intelligence blue-green
algae prediction technology based on direct-readable water
quality complex sensor and hyperspectral image).
References
1) Agrawal, S., Barrington, L., Bromberg, C., Burge, J., Gazen, C., and Hickey, J., “Machine Learning for Precipitation
Nowcasting from Radar Images,” arXiv:cs.CV/1912.12132, 2019
2) Kurt Hornik, “Approximation capabilities of multilayer feedforward networks,” Neural Networks Volume 4, Issue 2,
Pages 251-257, 1991
3) Choi, J., Park, J., Park, H. and Min, O. (2017), DART: Fast and Efficient Distributed Stream Processing Framework
for Internet of Things. ETRI Journal, 39: 202-212. doi:10.4218/etrij.17.2816.0109
4) Salman H. Khan et al., Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data, IEEE
Transactions on Neural Networks and Learning Systems, Vol. 29, Issue: 8 (2015)
5) N. V. Chawla et al., SMOTE: Synthetic Minority Over-sampling Technique, Journal of Artificial Intelligence
Research, Vol. 16 (2002)
6) J. Choi, J. Kim, J. Won and O. Min, "Modelling Chlorophyll-a Concentration using Deep Neural Networks
considering Extreme Data Imbalance and Skewness," 2019 21st International Conference on Advanced
Communication Technology (ICACT) doi: 10.23919/ICACT.2019.8702027.
7) Caruana, R. "Multitask learning: A knowledge-based source of inductive bias." Proceedings of the Tenth
International Conference on Machine Learning. 1993.
8) Sebastian Ruder, “An Overview of Multi-Task Learning in Deep Neural Networks,” arXiv:cs.LG/1706.05098, 2017
Q&A
Thank you!

More Related Content

Similar to Choi_ICONI2020_발표자료_v1.0.pdf

Final capstone presentation
Final capstone presentationFinal capstone presentation
Final capstone presentation
ColinRichter3
 
For presentation capstone
For presentation   capstoneFor presentation   capstone
For presentation capstone
MichaelSmith808947
 
Capstone Presentation
Capstone PresentationCapstone Presentation
Capstone Presentation
Alexis900637
 
A novel fuzzy rule based system for assessment of ground water potability: A ...
A novel fuzzy rule based system for assessment of ground water potability: A ...A novel fuzzy rule based system for assessment of ground water potability: A ...
A novel fuzzy rule based system for assessment of ground water potability: A ...
IOSR Journals
 
Developing a stochastic simulation model for the generation of residential wa...
Developing a stochastic simulation model for the generation of residential wa...Developing a stochastic simulation model for the generation of residential wa...
Developing a stochastic simulation model for the generation of residential wa...
Environmental Intelligence Lab
 
Developing a stochastic simulation model for the generation of residential wa...
Developing a stochastic simulation model for the generation of residential wa...Developing a stochastic simulation model for the generation of residential wa...
Developing a stochastic simulation model for the generation of residential wa...
SmartH2O
 
WATER QUALITY PREDICTION
WATER QUALITY PREDICTIONWATER QUALITY PREDICTION
WATER QUALITY PREDICTION
Fasil47
 
Performance assessment of water filtration plants in pakistan - JBES
Performance assessment of water filtration plants in pakistan - JBESPerformance assessment of water filtration plants in pakistan - JBES
Performance assessment of water filtration plants in pakistan - JBES
Innspub Net
 
Final ppt dissertation
Final ppt dissertationFinal ppt dissertation
Final ppt dissertation
Lokesh Saini
 
IRJET- ANN-Based Modeling for Coagulant Dosage in Drinking Water Treatment Plant
IRJET- ANN-Based Modeling for Coagulant Dosage in Drinking Water Treatment PlantIRJET- ANN-Based Modeling for Coagulant Dosage in Drinking Water Treatment Plant
IRJET- ANN-Based Modeling for Coagulant Dosage in Drinking Water Treatment Plant
IRJET Journal
 
Chlorine Dose Determination in Water Distribution System of Jabalpur City usi...
Chlorine Dose Determination in Water Distribution System of Jabalpur City usi...Chlorine Dose Determination in Water Distribution System of Jabalpur City usi...
Chlorine Dose Determination in Water Distribution System of Jabalpur City usi...
IRJET Journal
 
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET Journal
 
C6.03: User engagement within the ocean-colour climate change initiative - Sh...
C6.03: User engagement within the ocean-colour climate change initiative - Sh...C6.03: User engagement within the ocean-colour climate change initiative - Sh...
C6.03: User engagement within the ocean-colour climate change initiative - Sh...
Blue Planet Symposium
 
Assessing the importance of geo hydrological data acquisition in the developm...
Assessing the importance of geo hydrological data acquisition in the developm...Assessing the importance of geo hydrological data acquisition in the developm...
Assessing the importance of geo hydrological data acquisition in the developm...
Alexander Decker
 
Fones G - UEI Day 1 - Kochi Jan18
Fones G - UEI Day 1 - Kochi Jan18Fones G - UEI Day 1 - Kochi Jan18
Fones G - UEI Day 1 - Kochi Jan18
India UK Water Centre (IUKWC)
 
Industry released Water pollution alarming system
Industry released Water pollution alarming systemIndustry released Water pollution alarming system
Industry released Water pollution alarming system
dineshkumar1088888
 
Kr3518331839
Kr3518331839Kr3518331839
Kr3518331839
IJERA Editor
 
Seadome as alternative structure for shoreline protection and ecological enha...
Seadome as alternative structure for shoreline protection and ecological enha...Seadome as alternative structure for shoreline protection and ecological enha...
Seadome as alternative structure for shoreline protection and ecological enha...
Geoff Sedgman
 
2.2 IUKWC Workshop Freshwater EO - Andrew Tyler - Jun17
2.2 IUKWC Workshop Freshwater EO - Andrew Tyler - Jun172.2 IUKWC Workshop Freshwater EO - Andrew Tyler - Jun17
2.2 IUKWC Workshop Freshwater EO - Andrew Tyler - Jun17
India UK Water Centre (IUKWC)
 
Catalyst-Spring-2015
Catalyst-Spring-2015Catalyst-Spring-2015
Catalyst-Spring-2015Ellie Gadd
 

Similar to Choi_ICONI2020_발표자료_v1.0.pdf (20)

Final capstone presentation
Final capstone presentationFinal capstone presentation
Final capstone presentation
 
For presentation capstone
For presentation   capstoneFor presentation   capstone
For presentation capstone
 
Capstone Presentation
Capstone PresentationCapstone Presentation
Capstone Presentation
 
A novel fuzzy rule based system for assessment of ground water potability: A ...
A novel fuzzy rule based system for assessment of ground water potability: A ...A novel fuzzy rule based system for assessment of ground water potability: A ...
A novel fuzzy rule based system for assessment of ground water potability: A ...
 
Developing a stochastic simulation model for the generation of residential wa...
Developing a stochastic simulation model for the generation of residential wa...Developing a stochastic simulation model for the generation of residential wa...
Developing a stochastic simulation model for the generation of residential wa...
 
Developing a stochastic simulation model for the generation of residential wa...
Developing a stochastic simulation model for the generation of residential wa...Developing a stochastic simulation model for the generation of residential wa...
Developing a stochastic simulation model for the generation of residential wa...
 
WATER QUALITY PREDICTION
WATER QUALITY PREDICTIONWATER QUALITY PREDICTION
WATER QUALITY PREDICTION
 
Performance assessment of water filtration plants in pakistan - JBES
Performance assessment of water filtration plants in pakistan - JBESPerformance assessment of water filtration plants in pakistan - JBES
Performance assessment of water filtration plants in pakistan - JBES
 
Final ppt dissertation
Final ppt dissertationFinal ppt dissertation
Final ppt dissertation
 
IRJET- ANN-Based Modeling for Coagulant Dosage in Drinking Water Treatment Plant
IRJET- ANN-Based Modeling for Coagulant Dosage in Drinking Water Treatment PlantIRJET- ANN-Based Modeling for Coagulant Dosage in Drinking Water Treatment Plant
IRJET- ANN-Based Modeling for Coagulant Dosage in Drinking Water Treatment Plant
 
Chlorine Dose Determination in Water Distribution System of Jabalpur City usi...
Chlorine Dose Determination in Water Distribution System of Jabalpur City usi...Chlorine Dose Determination in Water Distribution System of Jabalpur City usi...
Chlorine Dose Determination in Water Distribution System of Jabalpur City usi...
 
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
IRJET- Modelling BOD and COD using Artificial Neural Network with Factor Anal...
 
C6.03: User engagement within the ocean-colour climate change initiative - Sh...
C6.03: User engagement within the ocean-colour climate change initiative - Sh...C6.03: User engagement within the ocean-colour climate change initiative - Sh...
C6.03: User engagement within the ocean-colour climate change initiative - Sh...
 
Assessing the importance of geo hydrological data acquisition in the developm...
Assessing the importance of geo hydrological data acquisition in the developm...Assessing the importance of geo hydrological data acquisition in the developm...
Assessing the importance of geo hydrological data acquisition in the developm...
 
Fones G - UEI Day 1 - Kochi Jan18
Fones G - UEI Day 1 - Kochi Jan18Fones G - UEI Day 1 - Kochi Jan18
Fones G - UEI Day 1 - Kochi Jan18
 
Industry released Water pollution alarming system
Industry released Water pollution alarming systemIndustry released Water pollution alarming system
Industry released Water pollution alarming system
 
Kr3518331839
Kr3518331839Kr3518331839
Kr3518331839
 
Seadome as alternative structure for shoreline protection and ecological enha...
Seadome as alternative structure for shoreline protection and ecological enha...Seadome as alternative structure for shoreline protection and ecological enha...
Seadome as alternative structure for shoreline protection and ecological enha...
 
2.2 IUKWC Workshop Freshwater EO - Andrew Tyler - Jun17
2.2 IUKWC Workshop Freshwater EO - Andrew Tyler - Jun172.2 IUKWC Workshop Freshwater EO - Andrew Tyler - Jun17
2.2 IUKWC Workshop Freshwater EO - Andrew Tyler - Jun17
 
Catalyst-Spring-2015
Catalyst-Spring-2015Catalyst-Spring-2015
Catalyst-Spring-2015
 

Recently uploaded

Global Peatlands Map and Hotspot Explanation Atlas
Global Peatlands Map and Hotspot Explanation AtlasGlobal Peatlands Map and Hotspot Explanation Atlas
Global Peatlands Map and Hotspot Explanation Atlas
Global Landscapes Forum (GLF)
 
Alert-driven Community-based Forest monitoring: A case of the Peruvian Amazon
Alert-driven Community-based Forest monitoring: A case of the Peruvian AmazonAlert-driven Community-based Forest monitoring: A case of the Peruvian Amazon
Alert-driven Community-based Forest monitoring: A case of the Peruvian Amazon
CIFOR-ICRAF
 
Q&A with the Experts: The Food Service Playbook
Q&A with the Experts: The Food Service PlaybookQ&A with the Experts: The Food Service Playbook
Q&A with the Experts: The Food Service Playbook
World Resources Institute (WRI)
 
Celebrating World-environment-day-2024.pdf
Celebrating  World-environment-day-2024.pdfCelebrating  World-environment-day-2024.pdf
Celebrating World-environment-day-2024.pdf
rohankumarsinghrore1
 
一比一原版(UMTC毕业证书)明尼苏达大学双城分校毕业证如何办理
一比一原版(UMTC毕业证书)明尼苏达大学双城分校毕业证如何办理一比一原版(UMTC毕业证书)明尼苏达大学双城分校毕业证如何办理
一比一原版(UMTC毕业证书)明尼苏达大学双城分校毕业证如何办理
zm9ajxup
 
DENR-PENRO-Bulacan-Presentation Philippine EIS
DENR-PENRO-Bulacan-Presentation Philippine EISDENR-PENRO-Bulacan-Presentation Philippine EIS
DENR-PENRO-Bulacan-Presentation Philippine EIS
MarlonJayBayag
 
"Understanding the Carbon Cycle: Processes, Human Impacts, and Strategies for...
"Understanding the Carbon Cycle: Processes, Human Impacts, and Strategies for..."Understanding the Carbon Cycle: Processes, Human Impacts, and Strategies for...
"Understanding the Carbon Cycle: Processes, Human Impacts, and Strategies for...
MMariSelvam4
 
Epcon is One of the World's leading Manufacturing Companies.
Epcon is One of the World's leading Manufacturing Companies.Epcon is One of the World's leading Manufacturing Companies.
Epcon is One of the World's leading Manufacturing Companies.
EpconLP
 
Promoting Multilateral Cooperation for Sustainable Peatland management
Promoting Multilateral Cooperation for Sustainable Peatland managementPromoting Multilateral Cooperation for Sustainable Peatland management
Promoting Multilateral Cooperation for Sustainable Peatland management
Global Landscapes Forum (GLF)
 
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business VenturesWillie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
greendigital
 
Characterization and the Kinetics of drying at the drying oven and with micro...
Characterization and the Kinetics of drying at the drying oven and with micro...Characterization and the Kinetics of drying at the drying oven and with micro...
Characterization and the Kinetics of drying at the drying oven and with micro...
Open Access Research Paper
 
Overview of the Global Peatlands Assessment
Overview of the Global Peatlands AssessmentOverview of the Global Peatlands Assessment
Overview of the Global Peatlands Assessment
Global Landscapes Forum (GLF)
 
Peatland Management in Indonesia, Science to Policy and Knowledge Education
Peatland Management in Indonesia, Science to Policy and Knowledge EducationPeatland Management in Indonesia, Science to Policy and Knowledge Education
Peatland Management in Indonesia, Science to Policy and Knowledge Education
Global Landscapes Forum (GLF)
 
How about Huawei mobile phone-www.cfye-commerce.shop
How about Huawei mobile phone-www.cfye-commerce.shopHow about Huawei mobile phone-www.cfye-commerce.shop
How about Huawei mobile phone-www.cfye-commerce.shop
laozhuseo02
 
Climate Change All over the World .pptx
Climate Change All over the World  .pptxClimate Change All over the World  .pptx
Climate Change All over the World .pptx
sairaanwer024
 
Wildlife-AnIntroduction.pdf so that you know more about our environment
Wildlife-AnIntroduction.pdf so that you know more about our environmentWildlife-AnIntroduction.pdf so that you know more about our environment
Wildlife-AnIntroduction.pdf so that you know more about our environment
amishajha2407
 
Artificial Reefs by Kuddle Life Foundation - May 2024
Artificial Reefs by Kuddle Life Foundation - May 2024Artificial Reefs by Kuddle Life Foundation - May 2024
Artificial Reefs by Kuddle Life Foundation - May 2024
punit537210
 
Improving the Management of Peatlands and the Capacities of Stakeholders in I...
Improving the Management of Peatlands and the Capacities of Stakeholders in I...Improving the Management of Peatlands and the Capacities of Stakeholders in I...
Improving the Management of Peatlands and the Capacities of Stakeholders in I...
Global Landscapes Forum (GLF)
 
Drip Irrigation technology with solar power
Drip Irrigation technology with solar powerDrip Irrigation technology with solar power
Drip Irrigation technology with solar power
anikchanda4
 
Top 8 Strategies for Effective Sustainable Waste Management.pdf
Top 8 Strategies for Effective Sustainable Waste Management.pdfTop 8 Strategies for Effective Sustainable Waste Management.pdf
Top 8 Strategies for Effective Sustainable Waste Management.pdf
Jhon Wick
 

Recently uploaded (20)

Global Peatlands Map and Hotspot Explanation Atlas
Global Peatlands Map and Hotspot Explanation AtlasGlobal Peatlands Map and Hotspot Explanation Atlas
Global Peatlands Map and Hotspot Explanation Atlas
 
Alert-driven Community-based Forest monitoring: A case of the Peruvian Amazon
Alert-driven Community-based Forest monitoring: A case of the Peruvian AmazonAlert-driven Community-based Forest monitoring: A case of the Peruvian Amazon
Alert-driven Community-based Forest monitoring: A case of the Peruvian Amazon
 
Q&A with the Experts: The Food Service Playbook
Q&A with the Experts: The Food Service PlaybookQ&A with the Experts: The Food Service Playbook
Q&A with the Experts: The Food Service Playbook
 
Celebrating World-environment-day-2024.pdf
Celebrating  World-environment-day-2024.pdfCelebrating  World-environment-day-2024.pdf
Celebrating World-environment-day-2024.pdf
 
一比一原版(UMTC毕业证书)明尼苏达大学双城分校毕业证如何办理
一比一原版(UMTC毕业证书)明尼苏达大学双城分校毕业证如何办理一比一原版(UMTC毕业证书)明尼苏达大学双城分校毕业证如何办理
一比一原版(UMTC毕业证书)明尼苏达大学双城分校毕业证如何办理
 
DENR-PENRO-Bulacan-Presentation Philippine EIS
DENR-PENRO-Bulacan-Presentation Philippine EISDENR-PENRO-Bulacan-Presentation Philippine EIS
DENR-PENRO-Bulacan-Presentation Philippine EIS
 
"Understanding the Carbon Cycle: Processes, Human Impacts, and Strategies for...
"Understanding the Carbon Cycle: Processes, Human Impacts, and Strategies for..."Understanding the Carbon Cycle: Processes, Human Impacts, and Strategies for...
"Understanding the Carbon Cycle: Processes, Human Impacts, and Strategies for...
 
Epcon is One of the World's leading Manufacturing Companies.
Epcon is One of the World's leading Manufacturing Companies.Epcon is One of the World's leading Manufacturing Companies.
Epcon is One of the World's leading Manufacturing Companies.
 
Promoting Multilateral Cooperation for Sustainable Peatland management
Promoting Multilateral Cooperation for Sustainable Peatland managementPromoting Multilateral Cooperation for Sustainable Peatland management
Promoting Multilateral Cooperation for Sustainable Peatland management
 
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business VenturesWillie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
Willie Nelson Net Worth: A Journey Through Music, Movies, and Business Ventures
 
Characterization and the Kinetics of drying at the drying oven and with micro...
Characterization and the Kinetics of drying at the drying oven and with micro...Characterization and the Kinetics of drying at the drying oven and with micro...
Characterization and the Kinetics of drying at the drying oven and with micro...
 
Overview of the Global Peatlands Assessment
Overview of the Global Peatlands AssessmentOverview of the Global Peatlands Assessment
Overview of the Global Peatlands Assessment
 
Peatland Management in Indonesia, Science to Policy and Knowledge Education
Peatland Management in Indonesia, Science to Policy and Knowledge EducationPeatland Management in Indonesia, Science to Policy and Knowledge Education
Peatland Management in Indonesia, Science to Policy and Knowledge Education
 
How about Huawei mobile phone-www.cfye-commerce.shop
How about Huawei mobile phone-www.cfye-commerce.shopHow about Huawei mobile phone-www.cfye-commerce.shop
How about Huawei mobile phone-www.cfye-commerce.shop
 
Climate Change All over the World .pptx
Climate Change All over the World  .pptxClimate Change All over the World  .pptx
Climate Change All over the World .pptx
 
Wildlife-AnIntroduction.pdf so that you know more about our environment
Wildlife-AnIntroduction.pdf so that you know more about our environmentWildlife-AnIntroduction.pdf so that you know more about our environment
Wildlife-AnIntroduction.pdf so that you know more about our environment
 
Artificial Reefs by Kuddle Life Foundation - May 2024
Artificial Reefs by Kuddle Life Foundation - May 2024Artificial Reefs by Kuddle Life Foundation - May 2024
Artificial Reefs by Kuddle Life Foundation - May 2024
 
Improving the Management of Peatlands and the Capacities of Stakeholders in I...
Improving the Management of Peatlands and the Capacities of Stakeholders in I...Improving the Management of Peatlands and the Capacities of Stakeholders in I...
Improving the Management of Peatlands and the Capacities of Stakeholders in I...
 
Drip Irrigation technology with solar power
Drip Irrigation technology with solar powerDrip Irrigation technology with solar power
Drip Irrigation technology with solar power
 
Top 8 Strategies for Effective Sustainable Waste Management.pdf
Top 8 Strategies for Effective Sustainable Waste Management.pdfTop 8 Strategies for Effective Sustainable Waste Management.pdf
Top 8 Strategies for Effective Sustainable Waste Management.pdf
 

Choi_ICONI2020_발표자료_v1.0.pdf

  • 1. Enhancing Environmental Data Forecasting Performance by Utilizing Multi-region Data with Hard-parameter sharing Electronics and Telecommunications Research Institute Jang-Ho Choi, Miyoung Jang, Taewhi Lee, Jongho Won and Jiyong Kim janghochoi@etri.re.kr
  • 2. Table of Contents • Introduction • Challenges • Proposed Method • Experimental Studies • Q&A
  • 3. Time-series Forecasting • “Time series analysis comprises methods for analyzing time series data in order to extract meaningful statistics and other characteristics of the data.” • “Time series forecasting is the use of a model to predict future values based on previously observed values.” • “Forecasting involves taking models fit on historical data and using them to predict future observations.” https://en.wikipedia.org/wiki/Time_series https://machinelearningmastery.com/time-series-forecasting/
  • 4. Environmental Data • Environmental data, i.e. air quality and water quality data, are generally multivariate, time-series data. • They are generally collected from multiple sites. <Water/Air Quality Monitoring Stations>
  • 5. Challenges • Datasets are not only scarce and expensive, but also often severely imbalanced in practice. • Insufficient and imbalance datasets can lead to biased models, resulting poor prediction performance. 0 10 20 30 40 50 60 70 80 90 Chl-a (mg/m3) Histogram of Chlorophyll-a Concentration (mg/m3, Binwidth=1) Chlorophyll-a Concentration in Daecheong Lake (2012) # Chl-a concentrations
  • 6. Detour: Handling Data imbalance and insufficiency
  • 7. Utilizing Multi-region Datasets • Training Regional Dataset (JangGye) vs. Additional Dataset (JangGye + DaeCheong) R2 MSE Micro Acc . Macro Ac c. JangGye 0.5297 36.16 73.98% 66.72% JangGye +DaeCheong 0.4207 44.55 68.65% 49.04% Difference 26.19% -18.83% 7.76% 36.05% <Chlorophyll-a Prediction Results (JangGye)>
  • 8. Hard-parameter Sharing • In order to preserve region-specific characteristic, we adopt hard- parameter sharing from multi-task learning [7]. • Hard-parameter sharing is a technique to place common hidden layers across multiple tasks, while keeping task-specific layers. Hard-parameter sharing is known to reduce overfitting, but it is generally restricted to tasks that are closely related one another [8]. [8]
  • 9. Multi-headed Networks • Common layers fit on all data from multiple regions • Region specific layers fit on their own regional data <Iterative Training Process> Common Layers R1 Layers R2 Layers R3 Layers R4 Layers R1 Data R2 Data R3 Data R4 Data 입력 데이터
  • 11. Algal Bloom • “An algal bloom or algae bloom is a rapid increase or accumulation in the population of algae in freshwater or ma rine water systems” (Wikipedia) • For last few decades, algal bloom has been a serious problem, leading imbalance of organisms in the water. • Severe algal bloom directly affects water quality as some of algae such as cyanobacteria produce toxic that are har mful to other species—including humans. YTN Joongang
  • 12. Chlorophyll-a • “Chlorophyll-a is a specific form of chlorophyll used in oxygenic p hotosynthesis. It absorbs most energy from wavelengths of violet- blue and orange-red light. It also reflects green-yellow light, and a s such contributes to the observed green color of most plants.” (W ikipedia) • Korean Government has been operating the algal warning system since 1998 based on chlorophyll-a concentration and the number of cyanophyceae cells. • Although using the concentration of chlorophyll-a as the primary indicator has been debatable, it is still widely used as it can be measured at low cos t, whereas counting cyanophyceae cells incur considerable human labor. • Unfortunately, water quality datasets are not only scarce, but also skewed and imbalanced due to their nature.
  • 13. Datasets • Water Quality Monitoring Station (http://www.koreawqi.go.kr/) • Daecheong, Janggye, Chungam, Namgang • 2012/7/1 ~ 2018/3/31 • 8 Water Quality Index Variables: • Chlorophyll-a • Water Temperature, pH • Total Electrical conductivity • Dissolved Oxygen, Total Organic Carbons • Total Nitrogen, Total Phosphorus Class 0 Class 1 Class 2 Class 3 Chl-A 0~15mg/m3 15~25mg/m3 25~100mg/m3 100mg/m3+ JangGye 1217 255 142 0 DaeCheong 1451 164 70 2 CheongAm 794 523 535 10 NamGang 1478 156 365 5
  • 14. Experiment Settings f(x) x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn y y f(x) x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn y y y y y y f(x) g1(z) g2(z) g3(z) x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn x1 x2 x3 x4 … xn y y y y y y Single region Data
  • 15. Experiment Results R2 MSE Micro Acc. Macro Acc. Baseline (Single Region) 0.5297 36.16 73.98% 66.72% Baseline (Multiple Regions) 0.4207 44.55 68.65% 49.04% Multi-headed (Multiple Regions) 0.5633 32.70 75.36% 70.04% <Chlorophyll-A Prediction Results (JangGye)> Baseline Model with Single Region (upper) vs. Multi-head Neural Network Model (lower) Class 0 Class 1 Class 2 Chl-A 0~15mg/m3 15~25mg/m3 25~100mg/m3 Baseline (Single Region) 83.00% 59.26% 57.89% Baseline (Multiple Regions) 91.50% 34.57% 21.05% Multi-headed (Multiple Regions) 83.72% 58.89% 67.50% <Performance Comparison (Class Accuracy)> <Performance Comparison (R2, MSE, Micro and Macro Accuracy)>
  • 16. Acknowledgement • This work was supported by the Institute of Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT) (No. 2018-0- 00219, Space-time complex artificial intelligence blue-green algae prediction technology based on direct-readable water quality complex sensor and hyperspectral image).
  • 17. References 1) Agrawal, S., Barrington, L., Bromberg, C., Burge, J., Gazen, C., and Hickey, J., “Machine Learning for Precipitation Nowcasting from Radar Images,” arXiv:cs.CV/1912.12132, 2019 2) Kurt Hornik, “Approximation capabilities of multilayer feedforward networks,” Neural Networks Volume 4, Issue 2, Pages 251-257, 1991 3) Choi, J., Park, J., Park, H. and Min, O. (2017), DART: Fast and Efficient Distributed Stream Processing Framework for Internet of Things. ETRI Journal, 39: 202-212. doi:10.4218/etrij.17.2816.0109 4) Salman H. Khan et al., Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data, IEEE Transactions on Neural Networks and Learning Systems, Vol. 29, Issue: 8 (2015) 5) N. V. Chawla et al., SMOTE: Synthetic Minority Over-sampling Technique, Journal of Artificial Intelligence Research, Vol. 16 (2002) 6) J. Choi, J. Kim, J. Won and O. Min, "Modelling Chlorophyll-a Concentration using Deep Neural Networks considering Extreme Data Imbalance and Skewness," 2019 21st International Conference on Advanced Communication Technology (ICACT) doi: 10.23919/ICACT.2019.8702027. 7) Caruana, R. "Multitask learning: A knowledge-based source of inductive bias." Proceedings of the Tenth International Conference on Machine Learning. 1993. 8) Sebastian Ruder, “An Overview of Multi-Task Learning in Deep Neural Networks,” arXiv:cs.LG/1706.05098, 2017