SlideShare a Scribd company logo
1 of 12
Download to read offline
Outline Introductions My research and contributions Additional information References
Summary of research activities
Tianpei Xie,
Advisor: Alfred O. Hero
1 / 12
Outline Introductions My research and contributions Additional information References
1 Introductions
Backgrounds
Robust learning from multiple sources
2 My research and contributions
3 Additional information
2 / 12
Outline Introductions My research and contributions Additional information References
Backgrounds
• With the advent of Big Data era, we experienced a great explosion in terms of
1 the amount of data that is public available;
2 the diversity of multiple data sources that are accessible simultaneously;
3 the power of computational resource.
Figure : One of the central neighborhood in Twitter
Network. https://dhs.standford.edu/
gephi-workshop/twitter-network-gallery/
Figure : Multi-modality data source
3 / 12
Outline Introductions My research and contributions Additional information References
Beyond the conventional machine learning
New challenges in machine learning area:
• Robustness of model in terms of low quality training data;
• Learning from multiple information sources;
• Ability in handling data inconsistency and high dimensionality.
• Interest: single-source learning with clean training set ⇒ robust
learning from multiple sources using information theory.
4 / 12
Outline Introductions My research and contributions Additional information References
Previous work
Robust learning
• Robust learning via surrogate loss e.g. [Bartlett and Mendelson,
2003], [Bousquet and Elisseeff, 2002], [Tyler, 2008], ROD [Xu et al.,
2006].
• Anomaly detection e.g. SVDD [Sch¨olkopf et al., 1999], GEM [Hero,
2006].
• Cons: sensitive to outlier in training sample, and solves a non-convex
optimization.
Learning from multiple source (Multi-view learning)
• Co-regularization on Euclidean feature space e.g. CCA [Hardoon
et al., 2004], SVM-2K [Farquhar et al., 2005], Neural Nets [Ngiam et al.,
2011]
• Cons: lack of ability to handle data with high uncertainty, high
dimensionality and between-view inconsistency.
5 / 12
Outline Introductions My research and contributions Additional information References
Contributions
Robust learning:
• Proposed the GEM-MED algorithm [Xie et al., 2014] as a joint classification +
anomaly detection on noisy training set.
• Rely on the GEM estimator [Hero, 2006], a non-parametric entropy estimator.
(a)
0 0.2 0.4 0.6 0.8 1
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
recall
precision
ROD−0.02
ROD−0.1
ROD−0.2
ROD−0.3
ROD−0.6
GEM−MED
(b)
Figure : (a) The low-entropy region estimated by GEM [Hero, 2006] to separate outlier (red triangle) from the
nominal (circle and square) (b) The Precision-Recall curve for anomaly detection under given corruption rate
for GEM-MED and ROD.
• It only needs to solve a convex problem.
6 / 12
Outline Introductions My research and contributions Additional information References
Our Contributions
Multi-view learning on statistical manifold:
• Assume data is given by parametric probability density function (p.d.f.) (data
with uncertainty.) and lies in a statistical manifold (space of all parametric p.d.f.).
• Proposed the CMV-MED algorithm [Xie et al., 2015] as the Co-regularization on
Statistical manifold, i.e. learning multiple models from the p.d.f. data.
• A robust consensus measure to quantify the between-view inconsistency using
the information divergence between p.d.fs .
2
classifier 1
1
-2-2
classifier 2
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
2
distance
(a)
(b)
Figure : (a) The proposed stochastic consensus constraint on statistical manifold as a robust inconsistency
measure. (b) The interpretation of GEM-MED as averaging multiple statistical models on the manifold.
7 / 12
Outline Introductions My research and contributions Additional information References
Current research: Node prediction in network
• Learning to predict node attributes by combining the network graph
topology and node distribution
... ... ...
? ?
personal info. friendship
node attribute
(meta data)
edge structure
ID
⇔
8 / 12
Outline Introductions My research and contributions Additional information References
List of publications
List of relevant publications:
1 Xie, Tianpei, Nasser M. Nasrabadi, and Alfred O. Hero. ”Semi-supervised Multi-view learning
on statistical manifold via stochastic consensus constraints.” in preparation.
2 Xie, Tianpei, Nasser M. Nasrabadi, and Alfred O. Hero. ”Learning to classify with possible
sensor failures.” submitted to IEEE Transaction on Signal Processing, 2016
3 Xie, Tianpei, Nasser M. Nasrabadi, and Alfred O. Hero. ”Semi-supervised multi-sensor
classification via consensus-based Multi-View Maximum Entropy Discrimination.” In
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on,
pp. 1936-1940. IEEE, 2015.
4 Xie, Tianpei, Nasser M. Nasrabadi, and Alfred O. Hero. ”Learning to classify with possible
sensor failures.” In Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE
International Conference on, pp. 2395-2399. IEEE, 2014.
9 / 12
Outline Introductions My research and contributions Additional information References
Websites and detailed information
• Contact:
Tianpei (Luke) Xie
Department of Electrical and Computing Engineering, University of
Michigan, Ann Arbor,
Office: 4313, EECS Building
TEL : 734-546-8048
Email: tianpei@umich.edu
• LinkedIn: personal webpage
https://www.linkedin.com/in/tianpeiluke
• Research: details for my research activities
http://tbayes.eecs.umich.edu/tianpei/research_main
• Github: my codes available
https://github.com/TianpeiLuke
10 / 12
Outline Introductions My research and contributions Additional information References
Peter L Bartlett and Shahar Mendelson. Rademacher and Gaussian
complexities: Risk bounds and structural results. The Journal of Machine
Learning Research, 3:463–482, 2003.
Olivier Bousquet and Andr´e Elisseeff. Stability and generalization. The
Journal of Machine Learning Research, 2:499–526, 2002.
Jason Farquhar, David Hardoon, Hongying Meng, John S Shawe-taylor, and
Sandor Szedmak. Two view learning: SVM-2K, theory and practice.
Advances in neural information processing systems, pages 355–362, 2005.
David Hardoon, Sandor Szedmak, and John Shawe-Taylor. Canonical
correlation analysis: An overview with application to learning methods.
Neural computation, 16(12):2639–2664, 2004.
Alfred O Hero. Geometric entropy minimization (GEM) for anomaly detection
and localization. Advances in Neural Information Processing Systems,
pages 585–592, 2006.
Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and
Andrew Y Ng. Multimodal deep learning. Proceedings of the 28th
International Conference on Machine Learning (ICML-11), pages 689–696,
2011.
Bernhard Sch¨olkopf, Robert C Williamson, Alex J Smola, John Shawe-Taylor,
and John C Platt. Support vector method for novelty detection. Advances
In Neural Information Processing Systems, 12:582–588, 1999.
David E Tyler. Robust statistics: Theory and methods. Journal of the
American Statistical Association, 103(482):888–889, 2008. 11 / 12
Outline Introductions My research and contributions Additional information References
Thank you!
12 / 12

More Related Content

What's hot

Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Enayat Rajabi
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPISteven Miller
 
Lec1-Into
Lec1-IntoLec1-Into
Lec1-Intobutest
 
IBM Watson Classroom Experience
IBM Watson Classroom ExperienceIBM Watson Classroom Experience
IBM Watson Classroom ExperienceSteven Miller
 
Web Scale Information Extraction (ISWC2013 tutorial)
Web Scale Information Extraction (ISWC2013 tutorial)Web Scale Information Extraction (ISWC2013 tutorial)
Web Scale Information Extraction (ISWC2013 tutorial)Ziqi Zhang
 
Best practices data collection
Best practices data collectionBest practices data collection
Best practices data collectionSherry Lake
 
Evaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for fileEvaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for fileeSAT Publishing House
 
Evaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classificationEvaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classificationeSAT Journals
 
Brain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible NeuroscinceBrain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible NeuroscinceKrzysztof Gorgolewski
 
Enhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization TechniquesEnhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization Techniquesveningstonk
 
A Review on Neural Network Question Answering Systems
A Review on Neural Network Question Answering SystemsA Review on Neural Network Question Answering Systems
A Review on Neural Network Question Answering Systemsijaia
 
IRJET- Characteristics of Research Process and Methods for Web-Based Rese...
IRJET-  	  Characteristics of Research Process and Methods for Web-Based Rese...IRJET-  	  Characteristics of Research Process and Methods for Web-Based Rese...
IRJET- Characteristics of Research Process and Methods for Web-Based Rese...IRJET Journal
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objectsseanb
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Predictionijtsrd
 
Semantic tagging for documents using 'short text' information
Semantic tagging for documents using 'short text' informationSemantic tagging for documents using 'short text' information
Semantic tagging for documents using 'short text' informationcsandit
 
Elsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge GraphElsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge GraphPaul Groth
 
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...Yandex
 

What's hot (20)

Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)Interlinking educational data to Web of Data (Thesis presentation)
Interlinking educational data to Web of Data (Thesis presentation)
 
Data Science for Every Student at RPI
Data Science for Every Student at RPIData Science for Every Student at RPI
Data Science for Every Student at RPI
 
Lec1-Into
Lec1-IntoLec1-Into
Lec1-Into
 
IBM Watson Classroom Experience
IBM Watson Classroom ExperienceIBM Watson Classroom Experience
IBM Watson Classroom Experience
 
Web Scale Information Extraction (ISWC2013 tutorial)
Web Scale Information Extraction (ISWC2013 tutorial)Web Scale Information Extraction (ISWC2013 tutorial)
Web Scale Information Extraction (ISWC2013 tutorial)
 
Best practices data collection
Best practices data collectionBest practices data collection
Best practices data collection
 
Evaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for fileEvaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for file
 
Evaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classificationEvaluating the efficiency of rule techniques for file classification
Evaluating the efficiency of rule techniques for file classification
 
Brain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible NeuroscinceBrain Imaging Data Structure and Center for Reproducible Neuroscince
Brain Imaging Data Structure and Center for Reproducible Neuroscince
 
Enhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization TechniquesEnhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization Techniques
 
A Review on Neural Network Question Answering Systems
A Review on Neural Network Question Answering SystemsA Review on Neural Network Question Answering Systems
A Review on Neural Network Question Answering Systems
 
Contrast Pattern Aided Regression and Classification
Contrast Pattern Aided Regression and ClassificationContrast Pattern Aided Regression and Classification
Contrast Pattern Aided Regression and Classification
 
IRJET- Characteristics of Research Process and Methods for Web-Based Rese...
IRJET-  	  Characteristics of Research Process and Methods for Web-Based Rese...IRJET-  	  Characteristics of Research Process and Methods for Web-Based Rese...
IRJET- Characteristics of Research Process and Methods for Web-Based Rese...
 
Metadata for Research Objects
Metadata for Research ObjectsMetadata for Research Objects
Metadata for Research Objects
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
 
Semantic tagging for documents using 'short text' information
Semantic tagging for documents using 'short text' informationSemantic tagging for documents using 'short text' information
Semantic tagging for documents using 'short text' information
 
Elsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge GraphElsevier’s Healthcare Knowledge Graph
Elsevier’s Healthcare Knowledge Graph
 
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
Дмитрий Ветров. Математика больших данных: тензоры, нейросети, байесовский вы...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Viva
VivaViva
Viva
 

Viewers also liked

Open Innovation - MBA Mondragon Unibertsitatea 2009
Open Innovation - MBA Mondragon Unibertsitatea 2009Open Innovation - MBA Mondragon Unibertsitatea 2009
Open Innovation - MBA Mondragon Unibertsitatea 2009MIK Research
 
Erik Huesca Global IPv6 Summit México 2009
Erik Huesca Global IPv6 Summit México 2009Erik Huesca Global IPv6 Summit México 2009
Erik Huesca Global IPv6 Summit México 2009Jaime Olmos
 
Instalando nagios kuman hoy luis
Instalando nagios kuman hoy luisInstalando nagios kuman hoy luis
Instalando nagios kuman hoy luisLuis Kuman
 
Desarrollo de un Sistema de Juego Ubicuo bajo Plataforma Android
Desarrollo de un Sistema de Juego Ubicuo bajo Plataforma AndroidDesarrollo de un Sistema de Juego Ubicuo bajo Plataforma Android
Desarrollo de un Sistema de Juego Ubicuo bajo Plataforma AndroidJuan Pizarro
 
Feria ganadera de jutiapa
Feria ganadera de jutiapaFeria ganadera de jutiapa
Feria ganadera de jutiapaEduardo Carias
 
Contaminación ambiental teresa
Contaminación ambiental   teresaContaminación ambiental   teresa
Contaminación ambiental teresaTEREART
 
Introducción al SEO en español
Introducción al SEO en españolIntroducción al SEO en español
Introducción al SEO en españolXanarts
 
Generaciones De Los Sistemas Operativos
Generaciones De Los Sistemas OperativosGeneraciones De Los Sistemas Operativos
Generaciones De Los Sistemas OperativosEdward Loja
 
Trabajo Del Martin
Trabajo Del MartinTrabajo Del Martin
Trabajo Del Martinmartin
 

Viewers also liked (17)

Open Innovation - MBA Mondragon Unibertsitatea 2009
Open Innovation - MBA Mondragon Unibertsitatea 2009Open Innovation - MBA Mondragon Unibertsitatea 2009
Open Innovation - MBA Mondragon Unibertsitatea 2009
 
E-MAIL
E-MAILE-MAIL
E-MAIL
 
Abc Digital
Abc DigitalAbc Digital
Abc Digital
 
precio
precioprecio
precio
 
Erik Huesca Global IPv6 Summit México 2009
Erik Huesca Global IPv6 Summit México 2009Erik Huesca Global IPv6 Summit México 2009
Erik Huesca Global IPv6 Summit México 2009
 
Instalando nagios kuman hoy luis
Instalando nagios kuman hoy luisInstalando nagios kuman hoy luis
Instalando nagios kuman hoy luis
 
C:\fakepath\bla3
C:\fakepath\bla3C:\fakepath\bla3
C:\fakepath\bla3
 
Desarrollo de un Sistema de Juego Ubicuo bajo Plataforma Android
Desarrollo de un Sistema de Juego Ubicuo bajo Plataforma AndroidDesarrollo de un Sistema de Juego Ubicuo bajo Plataforma Android
Desarrollo de un Sistema de Juego Ubicuo bajo Plataforma Android
 
Feria ganadera de jutiapa
Feria ganadera de jutiapaFeria ganadera de jutiapa
Feria ganadera de jutiapa
 
Contaminación ambiental teresa
Contaminación ambiental   teresaContaminación ambiental   teresa
Contaminación ambiental teresa
 
Concepto de fraccion
Concepto de fraccionConcepto de fraccion
Concepto de fraccion
 
Capitulos 60 64
Capitulos 60  64Capitulos 60  64
Capitulos 60 64
 
Introducción al SEO en español
Introducción al SEO en españolIntroducción al SEO en español
Introducción al SEO en español
 
Generaciones De Los Sistemas Operativos
Generaciones De Los Sistemas OperativosGeneraciones De Los Sistemas Operativos
Generaciones De Los Sistemas Operativos
 
60 Segundos
60 Segundos60 Segundos
60 Segundos
 
Trabajo Del Martin
Trabajo Del MartinTrabajo Del Martin
Trabajo Del Martin
 
Population Risk Scores and Plan Design
Population Risk Scores and Plan DesignPopulation Risk Scores and Plan Design
Population Risk Scores and Plan Design
 

Similar to tianpei_research_summary

Hattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in MaterialsHattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in MaterialsJason Hattrick-Simpers
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...Armin Haller
 
Pemanfaatan Big Data Dalam Riset 2023.pptx
Pemanfaatan Big Data Dalam Riset 2023.pptxPemanfaatan Big Data Dalam Riset 2023.pptx
Pemanfaatan Big Data Dalam Riset 2023.pptxelisarosa29
 
Research trends qualitative analysis in cscl
Research trends  qualitative analysis in csclResearch trends  qualitative analysis in cscl
Research trends qualitative analysis in csclMerlien Institute
 
What's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsWhat's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsStefan Dietze
 
surveyofdnnlearning.pdf
surveyofdnnlearning.pdfsurveyofdnnlearning.pdf
surveyofdnnlearning.pdfAnkita Tiwari
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 
Creating an Urban Legend: A System for Electrophysiology Data Management and ...
Creating an Urban Legend: A System for Electrophysiology Data Management and ...Creating an Urban Legend: A System for Electrophysiology Data Management and ...
Creating an Urban Legend: A System for Electrophysiology Data Management and ...Anita de Waard
 
Metadata and Metrics to Support Open Access
Metadata and Metrics to Support Open AccessMetadata and Metrics to Support Open Access
Metadata and Metrics to Support Open AccessMicah Altman
 
Project E: Citation
Project E: CitationProject E: Citation
Project E: CitationLizLyon
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data MiningAbcdDcba12
 
Emerging Data Citation Infrastructure
Emerging Data Citation InfrastructureEmerging Data Citation Infrastructure
Emerging Data Citation InfrastructureMicah Altman
 
On distributed fuzzy decision trees for big data
On distributed fuzzy decision trees for big dataOn distributed fuzzy decision trees for big data
On distributed fuzzy decision trees for big datanexgentechnology
 
A Reuse-based Lightweight Method for Developing Linked Data Ontologies and Vo...
A Reuse-based Lightweight Method for Developing Linked Data Ontologies and Vo...A Reuse-based Lightweight Method for Developing Linked Data Ontologies and Vo...
A Reuse-based Lightweight Method for Developing Linked Data Ontologies and Vo...María Poveda Villalón
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Jisc
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
 
Thoughts on Knowledge Graphs & Deeper Provenance
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper ProvenancePaul Groth
 

Similar to tianpei_research_summary (20)

Hattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in MaterialsHattrick-Simpers MRS Webinar on AI in Materials
Hattrick-Simpers MRS Webinar on AI in Materials
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
What Are Links in Linked Open Data? A Characterization and Evaluation of Link...
 
Pemanfaatan Big Data Dalam Riset 2023.pptx
Pemanfaatan Big Data Dalam Riset 2023.pptxPemanfaatan Big Data Dalam Riset 2023.pptx
Pemanfaatan Big Data Dalam Riset 2023.pptx
 
Research trends qualitative analysis in cscl
Research trends  qualitative analysis in csclResearch trends  qualitative analysis in cscl
Research trends qualitative analysis in cscl
 
What's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked DatasetsWhat's all the data about? - Linking and Profiling of Linked Datasets
What's all the data about? - Linking and Profiling of Linked Datasets
 
Lecture_1_Intro_toDS&AI.pptx
Lecture_1_Intro_toDS&AI.pptxLecture_1_Intro_toDS&AI.pptx
Lecture_1_Intro_toDS&AI.pptx
 
surveyofdnnlearning.pdf
surveyofdnnlearning.pdfsurveyofdnnlearning.pdf
surveyofdnnlearning.pdf
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Creating an Urban Legend: A System for Electrophysiology Data Management and ...
Creating an Urban Legend: A System for Electrophysiology Data Management and ...Creating an Urban Legend: A System for Electrophysiology Data Management and ...
Creating an Urban Legend: A System for Electrophysiology Data Management and ...
 
Metadata and Metrics to Support Open Access
Metadata and Metrics to Support Open AccessMetadata and Metrics to Support Open Access
Metadata and Metrics to Support Open Access
 
Project E: Citation
Project E: CitationProject E: Citation
Project E: Citation
 
Introduction to Data Mining
Introduction to Data MiningIntroduction to Data Mining
Introduction to Data Mining
 
Emerging Data Citation Infrastructure
Emerging Data Citation InfrastructureEmerging Data Citation Infrastructure
Emerging Data Citation Infrastructure
 
On distributed fuzzy decision trees for big data
On distributed fuzzy decision trees for big dataOn distributed fuzzy decision trees for big data
On distributed fuzzy decision trees for big data
 
A Reuse-based Lightweight Method for Developing Linked Data Ontologies and Vo...
A Reuse-based Lightweight Method for Developing Linked Data Ontologies and Vo...A Reuse-based Lightweight Method for Developing Linked Data Ontologies and Vo...
A Reuse-based Lightweight Method for Developing Linked Data Ontologies and Vo...
 
M045067275
M045067275M045067275
M045067275
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research Objects
 
Thoughts on Knowledge Graphs & Deeper Provenance
Thoughts on Knowledge Graphs  & Deeper ProvenanceThoughts on Knowledge Graphs  & Deeper Provenance
Thoughts on Knowledge Graphs & Deeper Provenance
 

Recently uploaded

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...
(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...
(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...ranjana rawat
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Dr.Costas Sachpazis
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxhumanexperienceaaa
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 

Recently uploaded (20)

Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...
(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...
(TARA) Talegaon Dabhade Call Girls Just Call 7001035870 [ Cash on Delivery ] ...
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptxthe ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
the ladakh protest in leh ladakh 2024 sonam wangchuk.pptx
 
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 

tianpei_research_summary

  • 1. Outline Introductions My research and contributions Additional information References Summary of research activities Tianpei Xie, Advisor: Alfred O. Hero 1 / 12
  • 2. Outline Introductions My research and contributions Additional information References 1 Introductions Backgrounds Robust learning from multiple sources 2 My research and contributions 3 Additional information 2 / 12
  • 3. Outline Introductions My research and contributions Additional information References Backgrounds • With the advent of Big Data era, we experienced a great explosion in terms of 1 the amount of data that is public available; 2 the diversity of multiple data sources that are accessible simultaneously; 3 the power of computational resource. Figure : One of the central neighborhood in Twitter Network. https://dhs.standford.edu/ gephi-workshop/twitter-network-gallery/ Figure : Multi-modality data source 3 / 12
  • 4. Outline Introductions My research and contributions Additional information References Beyond the conventional machine learning New challenges in machine learning area: • Robustness of model in terms of low quality training data; • Learning from multiple information sources; • Ability in handling data inconsistency and high dimensionality. • Interest: single-source learning with clean training set ⇒ robust learning from multiple sources using information theory. 4 / 12
  • 5. Outline Introductions My research and contributions Additional information References Previous work Robust learning • Robust learning via surrogate loss e.g. [Bartlett and Mendelson, 2003], [Bousquet and Elisseeff, 2002], [Tyler, 2008], ROD [Xu et al., 2006]. • Anomaly detection e.g. SVDD [Sch¨olkopf et al., 1999], GEM [Hero, 2006]. • Cons: sensitive to outlier in training sample, and solves a non-convex optimization. Learning from multiple source (Multi-view learning) • Co-regularization on Euclidean feature space e.g. CCA [Hardoon et al., 2004], SVM-2K [Farquhar et al., 2005], Neural Nets [Ngiam et al., 2011] • Cons: lack of ability to handle data with high uncertainty, high dimensionality and between-view inconsistency. 5 / 12
  • 6. Outline Introductions My research and contributions Additional information References Contributions Robust learning: • Proposed the GEM-MED algorithm [Xie et al., 2014] as a joint classification + anomaly detection on noisy training set. • Rely on the GEM estimator [Hero, 2006], a non-parametric entropy estimator. (a) 0 0.2 0.4 0.6 0.8 1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 recall precision ROD−0.02 ROD−0.1 ROD−0.2 ROD−0.3 ROD−0.6 GEM−MED (b) Figure : (a) The low-entropy region estimated by GEM [Hero, 2006] to separate outlier (red triangle) from the nominal (circle and square) (b) The Precision-Recall curve for anomaly detection under given corruption rate for GEM-MED and ROD. • It only needs to solve a convex problem. 6 / 12
  • 7. Outline Introductions My research and contributions Additional information References Our Contributions Multi-view learning on statistical manifold: • Assume data is given by parametric probability density function (p.d.f.) (data with uncertainty.) and lies in a statistical manifold (space of all parametric p.d.f.). • Proposed the CMV-MED algorithm [Xie et al., 2015] as the Co-regularization on Statistical manifold, i.e. learning multiple models from the p.d.f. data. • A robust consensus measure to quantify the between-view inconsistency using the information divergence between p.d.fs . 2 classifier 1 1 -2-2 classifier 2 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 2 distance (a) (b) Figure : (a) The proposed stochastic consensus constraint on statistical manifold as a robust inconsistency measure. (b) The interpretation of GEM-MED as averaging multiple statistical models on the manifold. 7 / 12
  • 8. Outline Introductions My research and contributions Additional information References Current research: Node prediction in network • Learning to predict node attributes by combining the network graph topology and node distribution ... ... ... ? ? personal info. friendship node attribute (meta data) edge structure ID ⇔ 8 / 12
  • 9. Outline Introductions My research and contributions Additional information References List of publications List of relevant publications: 1 Xie, Tianpei, Nasser M. Nasrabadi, and Alfred O. Hero. ”Semi-supervised Multi-view learning on statistical manifold via stochastic consensus constraints.” in preparation. 2 Xie, Tianpei, Nasser M. Nasrabadi, and Alfred O. Hero. ”Learning to classify with possible sensor failures.” submitted to IEEE Transaction on Signal Processing, 2016 3 Xie, Tianpei, Nasser M. Nasrabadi, and Alfred O. Hero. ”Semi-supervised multi-sensor classification via consensus-based Multi-View Maximum Entropy Discrimination.” In Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on, pp. 1936-1940. IEEE, 2015. 4 Xie, Tianpei, Nasser M. Nasrabadi, and Alfred O. Hero. ”Learning to classify with possible sensor failures.” In Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pp. 2395-2399. IEEE, 2014. 9 / 12
  • 10. Outline Introductions My research and contributions Additional information References Websites and detailed information • Contact: Tianpei (Luke) Xie Department of Electrical and Computing Engineering, University of Michigan, Ann Arbor, Office: 4313, EECS Building TEL : 734-546-8048 Email: tianpei@umich.edu • LinkedIn: personal webpage https://www.linkedin.com/in/tianpeiluke • Research: details for my research activities http://tbayes.eecs.umich.edu/tianpei/research_main • Github: my codes available https://github.com/TianpeiLuke 10 / 12
  • 11. Outline Introductions My research and contributions Additional information References Peter L Bartlett and Shahar Mendelson. Rademacher and Gaussian complexities: Risk bounds and structural results. The Journal of Machine Learning Research, 3:463–482, 2003. Olivier Bousquet and Andr´e Elisseeff. Stability and generalization. The Journal of Machine Learning Research, 2:499–526, 2002. Jason Farquhar, David Hardoon, Hongying Meng, John S Shawe-taylor, and Sandor Szedmak. Two view learning: SVM-2K, theory and practice. Advances in neural information processing systems, pages 355–362, 2005. David Hardoon, Sandor Szedmak, and John Shawe-Taylor. Canonical correlation analysis: An overview with application to learning methods. Neural computation, 16(12):2639–2664, 2004. Alfred O Hero. Geometric entropy minimization (GEM) for anomaly detection and localization. Advances in Neural Information Processing Systems, pages 585–592, 2006. Jiquan Ngiam, Aditya Khosla, Mingyu Kim, Juhan Nam, Honglak Lee, and Andrew Y Ng. Multimodal deep learning. Proceedings of the 28th International Conference on Machine Learning (ICML-11), pages 689–696, 2011. Bernhard Sch¨olkopf, Robert C Williamson, Alex J Smola, John Shawe-Taylor, and John C Platt. Support vector method for novelty detection. Advances In Neural Information Processing Systems, 12:582–588, 1999. David E Tyler. Robust statistics: Theory and methods. Journal of the American Statistical Association, 103(482):888–889, 2008. 11 / 12
  • 12. Outline Introductions My research and contributions Additional information References Thank you! 12 / 12