This document discusses the applications and methodologies of machine learning within various scientific fields, including life sciences and bioinformatics, emphasizing the importance of data-driven science. It highlights the challenges posed by data deluge and the necessity of understanding the quality and coverage of data for successful machine learning outcomes. Additionally, it outlines different types of machine learning, including supervised, unsupervised, and reinforcement learning, and their roles in automating data analysis and pattern recognition.
Introduction to machine learning as a crucial tool in various domains, highlighting the presenter's background and current association.
Focus on machine learning with sparse structures, applications in science data, bioinformatics, and data mining.
Overview of the data-driven economy, exploration of machine learning capabilities, necessary care to prevent misuse.
Detailed discussion on machine learning types, especially supervised learning, including models and training methods.
Explanation of the training process, including handling data, setting parameters, and evaluating model performance.
Challenges faced in data quality, hyperparameters tuning, understanding training vs. test errors, and the importance of validation.
Summary of machine learning's role in a data-driven society, emphasizing the need to recognize its potential and limitations.Definition of machine learning and its applications in engineering and mathematics, highlighting its relationship with artificial intelligence.
データ駆動科学 (Data-‐‑‒Driven Sciences)
Thegrand aim of science is to cover the greatest
number of experimental facts by logical deduction
from the smallest number of hypotheses or axioms.
─── Albert Einstein
experimental factshypotheses/axioms
deduction
induction (or abduction)
ここは現在は優れた科学者(⼈人間)がセンスでやっている
(と⾔言うか、これこそが科学者の腕の⾒見見せ所?)
15.
データ駆動科学 (Data-‐‑‒Driven Sciences)
Thegrand aim of science is to cover the greatest
number of experimental facts by logical deduction
from the smallest number of hypotheses or axioms.
─── Albert Einstein
experimental factshypotheses/axioms
deduction
induction (or abduction)
「facts」が蓄積されてくれば、このinductionが機能しうる
(優れた科学者の経験と勘はその試⾏行行錯誤で醸成される?)
教師つき学習の⼆二つのタイプ
① 判別・分類 (classification)
② 回帰 (regression)
X y
(特徴ベクトル/説明変数) (応答変数/⽬目的変数)
?
yが離離散値 (典型例例は2値)
yes/no、true/false、positive/negative
例例) ⼿手書き⽂文字認識識、脱離離者予測、顔認識識、…
yが連続値
例例) 広告クリック率率率予測、降降⾬雨確率率率予測、…