SlideShare a Scribd company logo
1 of 4
ICE0403: Data & Text Mining
                                          Fall 2007




Instructor                Dr. Ji-Ae Shin
       Email              jiae@icu.ac.kr ( The best to reach me is via emails)
       Office       F612 (x 6172)


Objective of the Course
        To study machine learning techniques which are commonly used to extract
information/patterns in (textual) data, such as Inductive Logic Programming, Clustering,
Statistical Logic Networks, and Decision Trees.


Prerequisites
      Algorithms (A-); Artificial Intelligence (A-); Discrete Mathematics I;
      Probability & Statistics; Scripting Languages


References
       1.    (Textbook) I. Witten & E. Frank, Data Mining (2nd ed.), Elsevier, 2005
       2.    (Textbook) T. Mitchell, Machine Learning, McGraw-Hill, 1997
       3.    D. J. Hand, H. Mannila and P. Smyth. Principles of Data Mining (Adaptive
             Computation and Machine Learning), MIT Press, 2001
       4.    T. Hastie, R. Tibshirani, & J. H. Friedman, The Elements of Statistical
             Learning : Data Mining, Inference, and Prediction Springer Verlag, 2001.
       5.    R. O. Duda, P. E. Hart, & D. G. Stork, Pattern Classification Wiley-
             Interscience, 2000.
       6.    P. Langley, Elements of Machine Learning, Morgan Kaufman Publishers, San
             Fancisco, CA, 1995.
       7.    S. M. Weiss & C. A. Kulikowski, Computer Systems that Learn, Morgan
             Kaufman Publishers, San Fancisco, CA, 1991.
       8.    W. Shavlik & T. G. Dietterich (Eds.), Readings in Machine Learning, Morgan
             Kaufman Publishers, San Fancisco, CA, 1990.


Grading Policy
Homework (3: Theory & Practice)           36%
     Term Project                                     24%
     Class Participation                              10%
     Final Exam                                       30%


Topics in Consideration

   1.   Introduction
        Definition of learning systems. Goals and applications of machine learning.
        Aspects of developing a learning system: training data, concept representation,
        function approximation.

   2.   Inductive                            Classification
        The concept learning task. Concept learning as search through a hypothesis
        space. General-to-specific ordering of hypotheses. Finding maximally specific
        hypotheses. Version spaces and the candidate elimination algorithm. Learning
        conjunctive concepts. The importance of inductive bias.

   3.   Decision                      Tree                    Learning
        Representing concepts as decision trees. Recursive induction of decision trees.
        Picking the best splitting attribute: entropy and information gain. Searching for
        simple trees and computational complexity. Occam's razor. Overfitting, noisy
        data, and pruning.

   4.   Experimental         Evaluation        of        Learning         Algorithms
        Measuring the accuracy of learned hypotheses. Comparing learning algorithms:
        cross-validation, learning curves, and statistical hypothesis testing.

   5.   Rule        Learning:         Propositional          and        First-Order
        Translating decision trees into rules. Heuristic rule induction using separate and
        conquer and information gain. First-order Horn-clause induction (Inductive
        Logic Programming) and Foil. Learning recursive rules. Inverse resolution,
        Golem, and Progol.

   6.   Artificial   Neural       Networks        (if       time       permits)
        Neurons and biological motivation. Linear threshold units. Perceptrons:
representational limitation and gradient descent training. Multilayer networks
      and backpropagation. Hidden layers and constructing intermediate, distributed
      representations. Overfitting, learning network structure, recurrent networks.

7.    Support Vector Machines
      Maximum margin linear separators. Quadractic programming solution to finding
      maximum margin separators. Kernels for learning non-linear functions.

8.    Bayesian Learning
      Probability theory and Bayes rule. Naive Bayes learning algorithm. Parameter
      smoothing. Bayes nets and Markov nets for representing dependencies.

9.    Instance-Based                           Learning
      Constructing explicit generalizations versus comparing to past specific
      examples. k-Nearest-neighbor algorithm. Case-based learning.

10.   Text           Classification         (if          time            permits)
      Bag of words representation. Vector space model and cosine similarity.
      Relevance feedback and Rocchio algorithm. Versions of nearest neighbor and
      Naive Bayes for text.

11.   Clustering           and           Unsupervised            Learning
      Learning from unclassified data. Clustering. Hierarchical Aglomerative
      Clustering. k-means partitional clustering. Expectation maximization (EM) for
      soft clustering. Semi-supervised learning with EM using labeled and unlabled
      data.

12.   Language              Learning          (if           time           permits)
      Classification problems in language: word-sense disambiguation, sequence
      labeling. Hidden Markov models (HMM's). Veterbi algorithm for determining
      most-probable state sequences. Forward-backward EM algorithm for training the
      parameters of HMM's. Use of HMM's for speech recognition, part-of-speech
      tagging, and information extraction. Conditional random fields (CRF's).
      Probabilistic context-free grammars (PCFG). Parsing and learning with PCFGs.
      Lexicalized PCFGs.
13.   Using     Prior  Knowledge      in    Learning     (if  time     permits)
      Chapter 11, Chapter 12. Explanation-based learning. Learning in planning and
      problem-solving. Knowledge-based learning and theory refinement. Transfer
      learning.

More Related Content

What's hot

32_Nov07_MachineLear..
32_Nov07_MachineLear..32_Nov07_MachineLear..
32_Nov07_MachineLear..butest
 
Introduction to Machine Learning.
Introduction to Machine Learning.Introduction to Machine Learning.
Introduction to Machine Learning.butest
 
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...Waqas Tariq
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information RetrievalDustin Smith
 
Practical deepllearningv1
Practical deepllearningv1Practical deepllearningv1
Practical deepllearningv1arthi v
 
17 1 knowledge-based system
17 1 knowledge-based system17 1 knowledge-based system
17 1 knowledge-based systemTianlu Wang
 
Arcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginnersArcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginnersarcomem
 
SFScon18 - Gabriele Sottocornola - Probabilistic Topic Models with MALLET
SFScon18 - Gabriele Sottocornola - Probabilistic Topic Models with MALLETSFScon18 - Gabriele Sottocornola - Probabilistic Topic Models with MALLET
SFScon18 - Gabriele Sottocornola - Probabilistic Topic Models with MALLETSouth Tyrol Free Software Conference
 
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...Hiroki Shimanaka
 
Blei lafferty2009
Blei lafferty2009Blei lafferty2009
Blei lafferty2009Ajay Ohri
 
Ekaw ontology learning for cost effective large-scale semantic annotation
Ekaw ontology learning for cost effective large-scale semantic annotationEkaw ontology learning for cost effective large-scale semantic annotation
Ekaw ontology learning for cost effective large-scale semantic annotationShahab Mokarizadeh
 
Islamic University Pattern Recognition & Neural Network 2019
Islamic University Pattern Recognition & Neural Network 2019 Islamic University Pattern Recognition & Neural Network 2019
Islamic University Pattern Recognition & Neural Network 2019 Rakibul Hasan Pranto
 

What's hot (20)

[IJET-V1I6P17] Authors : Mrs.R.Kalpana, Mrs.P.Padmapriya
[IJET-V1I6P17] Authors : Mrs.R.Kalpana, Mrs.P.Padmapriya[IJET-V1I6P17] Authors : Mrs.R.Kalpana, Mrs.P.Padmapriya
[IJET-V1I6P17] Authors : Mrs.R.Kalpana, Mrs.P.Padmapriya
 
32_Nov07_MachineLear..
32_Nov07_MachineLear..32_Nov07_MachineLear..
32_Nov07_MachineLear..
 
Introduction to Machine Learning.
Introduction to Machine Learning.Introduction to Machine Learning.
Introduction to Machine Learning.
 
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
Language Combinatorics: A Sentence Pattern Extraction Architecture Based on C...
 
Sementic nets
Sementic netsSementic nets
Sementic nets
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information Retrieval
 
E43022023
E43022023E43022023
E43022023
 
Reasoning in AI
Reasoning in AIReasoning in AI
Reasoning in AI
 
Lesson 19
Lesson 19Lesson 19
Lesson 19
 
Techniques Machine Learning
Techniques Machine LearningTechniques Machine Learning
Techniques Machine Learning
 
Practical deepllearningv1
Practical deepllearningv1Practical deepllearningv1
Practical deepllearningv1
 
17 1 knowledge-based system
17 1 knowledge-based system17 1 knowledge-based system
17 1 knowledge-based system
 
Arcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginnersArcomem training Topic Analysis Models beginners
Arcomem training Topic Analysis Models beginners
 
SFScon18 - Gabriele Sottocornola - Probabilistic Topic Models with MALLET
SFScon18 - Gabriele Sottocornola - Probabilistic Topic Models with MALLETSFScon18 - Gabriele Sottocornola - Probabilistic Topic Models with MALLET
SFScon18 - Gabriele Sottocornola - Probabilistic Topic Models with MALLET
 
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
[Paper Reading] Supervised Learning of Universal Sentence Representations fro...
 
number theory Rosen
number theory Rosen   number theory Rosen
number theory Rosen
 
Blei lafferty2009
Blei lafferty2009Blei lafferty2009
Blei lafferty2009
 
Ekaw ontology learning for cost effective large-scale semantic annotation
Ekaw ontology learning for cost effective large-scale semantic annotationEkaw ontology learning for cost effective large-scale semantic annotation
Ekaw ontology learning for cost effective large-scale semantic annotation
 
How to write a paper
How to write a paperHow to write a paper
How to write a paper
 
Islamic University Pattern Recognition & Neural Network 2019
Islamic University Pattern Recognition & Neural Network 2019 Islamic University Pattern Recognition & Neural Network 2019
Islamic University Pattern Recognition & Neural Network 2019
 

Viewers also liked

Dealing with Diversity: Understanding WCF Communication Options in ...
Dealing with Diversity: Understanding WCF Communication Options in ...Dealing with Diversity: Understanding WCF Communication Options in ...
Dealing with Diversity: Understanding WCF Communication Options in ...butest
 
MapReduce: Distributed Computing for Machine Learning
MapReduce: Distributed Computing for Machine LearningMapReduce: Distributed Computing for Machine Learning
MapReduce: Distributed Computing for Machine Learningbutest
 
GRM98012IGR.doc
GRM98012IGR.docGRM98012IGR.doc
GRM98012IGR.docbutest
 
Comparison of relational and attribute-IEEE-1999-published ...
Comparison of relational and attribute-IEEE-1999-published ...Comparison of relational and attribute-IEEE-1999-published ...
Comparison of relational and attribute-IEEE-1999-published ...butest
 
Machine Learning for Adversarial Agent Microworlds
Machine Learning for Adversarial Agent MicroworldsMachine Learning for Adversarial Agent Microworlds
Machine Learning for Adversarial Agent Microworldsbutest
 
Garbage Collection, Program Comprehension and Machine Learning
Garbage Collection, Program Comprehension and Machine LearningGarbage Collection, Program Comprehension and Machine Learning
Garbage Collection, Program Comprehension and Machine Learningbutest
 
pptx - Distributed Parallel Inference on Large Factor Graphs
pptx - Distributed Parallel Inference on Large Factor Graphspptx - Distributed Parallel Inference on Large Factor Graphs
pptx - Distributed Parallel Inference on Large Factor Graphsbutest
 
An Eye On Google, Executive Summary Presentation
An Eye On Google, Executive Summary PresentationAn Eye On Google, Executive Summary Presentation
An Eye On Google, Executive Summary PresentationKetzirah Lesser
 
Shrm poll diversity_final
Shrm poll diversity_finalShrm poll diversity_final
Shrm poll diversity_finalshrm
 

Viewers also liked (9)

Dealing with Diversity: Understanding WCF Communication Options in ...
Dealing with Diversity: Understanding WCF Communication Options in ...Dealing with Diversity: Understanding WCF Communication Options in ...
Dealing with Diversity: Understanding WCF Communication Options in ...
 
MapReduce: Distributed Computing for Machine Learning
MapReduce: Distributed Computing for Machine LearningMapReduce: Distributed Computing for Machine Learning
MapReduce: Distributed Computing for Machine Learning
 
GRM98012IGR.doc
GRM98012IGR.docGRM98012IGR.doc
GRM98012IGR.doc
 
Comparison of relational and attribute-IEEE-1999-published ...
Comparison of relational and attribute-IEEE-1999-published ...Comparison of relational and attribute-IEEE-1999-published ...
Comparison of relational and attribute-IEEE-1999-published ...
 
Machine Learning for Adversarial Agent Microworlds
Machine Learning for Adversarial Agent MicroworldsMachine Learning for Adversarial Agent Microworlds
Machine Learning for Adversarial Agent Microworlds
 
Garbage Collection, Program Comprehension and Machine Learning
Garbage Collection, Program Comprehension and Machine LearningGarbage Collection, Program Comprehension and Machine Learning
Garbage Collection, Program Comprehension and Machine Learning
 
pptx - Distributed Parallel Inference on Large Factor Graphs
pptx - Distributed Parallel Inference on Large Factor Graphspptx - Distributed Parallel Inference on Large Factor Graphs
pptx - Distributed Parallel Inference on Large Factor Graphs
 
An Eye On Google, Executive Summary Presentation
An Eye On Google, Executive Summary PresentationAn Eye On Google, Executive Summary Presentation
An Eye On Google, Executive Summary Presentation
 
Shrm poll diversity_final
Shrm poll diversity_finalShrm poll diversity_final
Shrm poll diversity_final
 

Similar to Course Syllabus

Project Proposal Topics Modeling (Ir)
Project Proposal    Topics Modeling (Ir)Project Proposal    Topics Modeling (Ir)
Project Proposal Topics Modeling (Ir)Svitlana volkova
 
Introduction to Machine Learning* Prof. D. Spears
Introduction to Machine Learning* Prof. D. SpearsIntroduction to Machine Learning* Prof. D. Spears
Introduction to Machine Learning* Prof. D. Spearsbutest
 
kantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.pptkantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.pptbutest
 
Brief Tour of Machine Learning
Brief Tour of Machine LearningBrief Tour of Machine Learning
Brief Tour of Machine Learningbutest
 
Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...BaoTramDuong2
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
Proposed-curricula-MCSEwithSyllabus_24_...
Proposed-curricula-MCSEwithSyllabus_24_...Proposed-curricula-MCSEwithSyllabus_24_...
Proposed-curricula-MCSEwithSyllabus_24_...butest
 
Continuous Learning Algorithms - a Research Proposal Paper
Continuous Learning Algorithms - a Research Proposal PaperContinuous Learning Algorithms - a Research Proposal Paper
Continuous Learning Algorithms - a Research Proposal Papertjb910
 
Lexicon base approch
Lexicon base approchLexicon base approch
Lexicon base approchanil maurya
 
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...summersocialwebshop
 
Introduction.doc
Introduction.docIntroduction.doc
Introduction.docbutest
 
Lec 0 about the course
Lec 0 about the courseLec 0 about the course
Lec 0 about the courseEyob Sisay
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401butest
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Miningbutest
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"ieee_cis_cyprus
 

Similar to Course Syllabus (20)

What is AI ML NLP and how to apply them
What is AI ML NLP and how to apply themWhat is AI ML NLP and how to apply them
What is AI ML NLP and how to apply them
 
Project Proposal Topics Modeling (Ir)
Project Proposal    Topics Modeling (Ir)Project Proposal    Topics Modeling (Ir)
Project Proposal Topics Modeling (Ir)
 
AI Presentation 1
AI Presentation 1AI Presentation 1
AI Presentation 1
 
Introduction to Machine Learning* Prof. D. Spears
Introduction to Machine Learning* Prof. D. SpearsIntroduction to Machine Learning* Prof. D. Spears
Introduction to Machine Learning* Prof. D. Spears
 
kantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.pptkantorNSF-NIJ-ISI-03-06-04.ppt
kantorNSF-NIJ-ISI-03-06-04.ppt
 
Brief Tour of Machine Learning
Brief Tour of Machine LearningBrief Tour of Machine Learning
Brief Tour of Machine Learning
 
Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
Proposed-curricula-MCSEwithSyllabus_24_...
Proposed-curricula-MCSEwithSyllabus_24_...Proposed-curricula-MCSEwithSyllabus_24_...
Proposed-curricula-MCSEwithSyllabus_24_...
 
Continuous Learning Algorithms - a Research Proposal Paper
Continuous Learning Algorithms - a Research Proposal PaperContinuous Learning Algorithms - a Research Proposal Paper
Continuous Learning Algorithms - a Research Proposal Paper
 
6. ME Syllabus-converted.pdf
6. ME Syllabus-converted.pdf6. ME Syllabus-converted.pdf
6. ME Syllabus-converted.pdf
 
Lexicon base approch
Lexicon base approchLexicon base approch
Lexicon base approch
 
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
Jana Diesner, "Words and Networks: Considering the Content of Text Data for N...
 
Ai notes
Ai notesAi notes
Ai notes
 
Introduction.doc
Introduction.docIntroduction.doc
Introduction.doc
 
Lec 0 about the course
Lec 0 about the courseLec 0 about the course
Lec 0 about the course
 
Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401Machine Learning: Foundations Course Number 0368403401
Machine Learning: Foundations Course Number 0368403401
 
Presentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data MiningPresentation on Machine Learning and Data Mining
Presentation on Machine Learning and Data Mining
 
Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"Xin Yao: "What can evolutionary computation do for you?"
Xin Yao: "What can evolutionary computation do for you?"
 

More from butest

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEbutest
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jacksonbutest
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...butest
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALbutest
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer IIbutest
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazzbutest
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.docbutest
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1butest
 
Facebook
Facebook Facebook
Facebook butest
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...butest
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...butest
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTbutest
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docbutest
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docbutest
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.docbutest
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!butest
 

More from butest (20)

EL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBEEL MODELO DE NEGOCIO DE YOUTUBE
EL MODELO DE NEGOCIO DE YOUTUBE
 
1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同1. MPEG I.B.P frame之不同
1. MPEG I.B.P frame之不同
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Timeline: The Life of Michael Jackson
Timeline: The Life of Michael JacksonTimeline: The Life of Michael Jackson
Timeline: The Life of Michael Jackson
 
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
Popular Reading Last Updated April 1, 2010 Adams, Lorraine The ...
 
LESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIALLESSONS FROM THE MICHAEL JACKSON TRIAL
LESSONS FROM THE MICHAEL JACKSON TRIAL
 
Com 380, Summer II
Com 380, Summer IICom 380, Summer II
Com 380, Summer II
 
PPT
PPTPPT
PPT
 
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet JazzThe MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
The MYnstrel Free Press Volume 2: Economic Struggles, Meet Jazz
 
MICHAEL JACKSON.doc
MICHAEL JACKSON.docMICHAEL JACKSON.doc
MICHAEL JACKSON.doc
 
Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1Social Networks: Twitter Facebook SL - Slide 1
Social Networks: Twitter Facebook SL - Slide 1
 
Facebook
Facebook Facebook
Facebook
 
Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...Executive Summary Hare Chevrolet is a General Motors dealership ...
Executive Summary Hare Chevrolet is a General Motors dealership ...
 
Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...Welcome to the Dougherty County Public Library's Facebook and ...
Welcome to the Dougherty County Public Library's Facebook and ...
 
NEWS ANNOUNCEMENT
NEWS ANNOUNCEMENTNEWS ANNOUNCEMENT
NEWS ANNOUNCEMENT
 
C-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.docC-2100 Ultra Zoom.doc
C-2100 Ultra Zoom.doc
 
MAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.docMAC Printing on ITS Printers.doc.doc
MAC Printing on ITS Printers.doc.doc
 
Mac OS X Guide.doc
Mac OS X Guide.docMac OS X Guide.doc
Mac OS X Guide.doc
 
hier
hierhier
hier
 
WEB DESIGN!
WEB DESIGN!WEB DESIGN!
WEB DESIGN!
 

Course Syllabus

  • 1. ICE0403: Data & Text Mining Fall 2007 Instructor Dr. Ji-Ae Shin Email jiae@icu.ac.kr ( The best to reach me is via emails) Office F612 (x 6172) Objective of the Course To study machine learning techniques which are commonly used to extract information/patterns in (textual) data, such as Inductive Logic Programming, Clustering, Statistical Logic Networks, and Decision Trees. Prerequisites Algorithms (A-); Artificial Intelligence (A-); Discrete Mathematics I; Probability & Statistics; Scripting Languages References 1. (Textbook) I. Witten & E. Frank, Data Mining (2nd ed.), Elsevier, 2005 2. (Textbook) T. Mitchell, Machine Learning, McGraw-Hill, 1997 3. D. J. Hand, H. Mannila and P. Smyth. Principles of Data Mining (Adaptive Computation and Machine Learning), MIT Press, 2001 4. T. Hastie, R. Tibshirani, & J. H. Friedman, The Elements of Statistical Learning : Data Mining, Inference, and Prediction Springer Verlag, 2001. 5. R. O. Duda, P. E. Hart, & D. G. Stork, Pattern Classification Wiley- Interscience, 2000. 6. P. Langley, Elements of Machine Learning, Morgan Kaufman Publishers, San Fancisco, CA, 1995. 7. S. M. Weiss & C. A. Kulikowski, Computer Systems that Learn, Morgan Kaufman Publishers, San Fancisco, CA, 1991. 8. W. Shavlik & T. G. Dietterich (Eds.), Readings in Machine Learning, Morgan Kaufman Publishers, San Fancisco, CA, 1990. Grading Policy
  • 2. Homework (3: Theory & Practice) 36% Term Project 24% Class Participation 10% Final Exam 30% Topics in Consideration 1. Introduction Definition of learning systems. Goals and applications of machine learning. Aspects of developing a learning system: training data, concept representation, function approximation. 2. Inductive Classification The concept learning task. Concept learning as search through a hypothesis space. General-to-specific ordering of hypotheses. Finding maximally specific hypotheses. Version spaces and the candidate elimination algorithm. Learning conjunctive concepts. The importance of inductive bias. 3. Decision Tree Learning Representing concepts as decision trees. Recursive induction of decision trees. Picking the best splitting attribute: entropy and information gain. Searching for simple trees and computational complexity. Occam's razor. Overfitting, noisy data, and pruning. 4. Experimental Evaluation of Learning Algorithms Measuring the accuracy of learned hypotheses. Comparing learning algorithms: cross-validation, learning curves, and statistical hypothesis testing. 5. Rule Learning: Propositional and First-Order Translating decision trees into rules. Heuristic rule induction using separate and conquer and information gain. First-order Horn-clause induction (Inductive Logic Programming) and Foil. Learning recursive rules. Inverse resolution, Golem, and Progol. 6. Artificial Neural Networks (if time permits) Neurons and biological motivation. Linear threshold units. Perceptrons:
  • 3. representational limitation and gradient descent training. Multilayer networks and backpropagation. Hidden layers and constructing intermediate, distributed representations. Overfitting, learning network structure, recurrent networks. 7. Support Vector Machines Maximum margin linear separators. Quadractic programming solution to finding maximum margin separators. Kernels for learning non-linear functions. 8. Bayesian Learning Probability theory and Bayes rule. Naive Bayes learning algorithm. Parameter smoothing. Bayes nets and Markov nets for representing dependencies. 9. Instance-Based Learning Constructing explicit generalizations versus comparing to past specific examples. k-Nearest-neighbor algorithm. Case-based learning. 10. Text Classification (if time permits) Bag of words representation. Vector space model and cosine similarity. Relevance feedback and Rocchio algorithm. Versions of nearest neighbor and Naive Bayes for text. 11. Clustering and Unsupervised Learning Learning from unclassified data. Clustering. Hierarchical Aglomerative Clustering. k-means partitional clustering. Expectation maximization (EM) for soft clustering. Semi-supervised learning with EM using labeled and unlabled data. 12. Language Learning (if time permits) Classification problems in language: word-sense disambiguation, sequence labeling. Hidden Markov models (HMM's). Veterbi algorithm for determining most-probable state sequences. Forward-backward EM algorithm for training the parameters of HMM's. Use of HMM's for speech recognition, part-of-speech tagging, and information extraction. Conditional random fields (CRF's). Probabilistic context-free grammars (PCFG). Parsing and learning with PCFGs. Lexicalized PCFGs.
  • 4. 13. Using Prior Knowledge in Learning (if time permits) Chapter 11, Chapter 12. Explanation-based learning. Learning in planning and problem-solving. Knowledge-based learning and theory refinement. Transfer learning.