SlideShare a Scribd company logo
Research Statement
Kuan-ming Lin
January 2007
Since I took the Database course by Prof. Jun Yang at Duke Computer Science
Department, I have been attracted by the excitement of data mining. Broadly
speaking, I am interested in finding useful patterns in large stores of data which
can be handled only by computers. In earlier years, such data of large size were
collected by business or organizations and kept in proprietary data warehouses.
Therefore, many data mining techniques were specialized to tackle individual
cases, thus their generalizability remained questionable. Nowadays, as the
Internet has become the largest worldwide information storage, it is possible for
the public to perform data analysis on public databases, which also motivates my
career choice. In particular, I am curious in mining three categories of public data,
namely biological databases[3][5], Web documents[1], and the Semantic Web.
A number of repositories of biological data, particularly via high throughput
methods representing different aspects of biological processes, have been
publicized in the past decade. The data collectively are believed to provide more
information than what they were explained individually. For example, we could
find functions to unknown genes by integrative analysis on multiple microarrays,
genome sequences, NMR and X-ray data, etc. Then, how to extract
complementary information across heterogeneous datasets becomes appealing
for identifying new biological relations. Since there is little known underlying
theoretical model available for biological datasets, this is very challenging and
worth solving.
As the rapid growing of information on the Web, retrieving interesting
information from Web documents, such as what Google and Facebook have done,
has high commercial potential value. Among various multimedia mining
challenges, I believe Web text mining research, benefited from traditional
information retrieval, should provides new opportunities, in that Web documents
are by nature highly unstructured yet interconnected through hyperlinks.
Designing new techniques for mining Web document could lead to great
applications, so I will be happy to be involved in related groups.
The Semantic Web is a solution to extend the current Web content to provide
intelligent mining and analysis on the Web. Unlike the currently prevalent HTML
and XML, Semantic Web languages explicitly define associations between
semantics, or meaning, with the Web document content. The semantics are
formally specified in well-designed ontology; therefore, they can be either shared
via the Internet or refined for local needs. The current standard for the Semantic
Web exists: OWL, a W3C Recommendation, for example, yet integrations from
heterogeneous Semantic Web groups and ontology are still in its infantry. Besides,
the current semantic reasoning and ontology updating tools are still far from
satisfactory. Therefore, I would like to contribute to this new field with my
already cumulated knowledge on data mining and database integration.
Throughout the undergraduate and master studies, I have developed skills in the
data mining subfields, ranging from theoretical studies like approximation
algorithms for integrative data mining, to machine learning issues like feature
selection and discovery and SVM[2][4], to applications in bioinformatics and Web
mining. A few of my studies focused on bioinformatics and statistical learning,
and I have gained experiences on Web text mining through my research
internship and a summer school lecture series in Taiwan. Meanwhile, I have
published two papers at Duke[3][5].
To sum up, I hope to continue my career in data mining related software fields, in
Taiwan or other places where cloud computing and analysis are encouraged.
References
[1] C.-C. Huang, K.-M. Lin, and L.-F. Chien. Automatic Training Corpora
Acquisition through Web Mining. In Proceedings of The 2005 IEEE/WIC/ACM
International Conference on Web Intelligence
[2] K.-M. Lin. Reduction Techniques for Support Vector Machines. Master’s thesis,
National Taiwan University, Taipei, Taiwan, 2002. Adviser: Chih-Jen Lin.
[3] K.-M. Lin and J. Kang. Exploiting Inter-gene Information for Microarray Data
Integration. In Proceedings of The ACM Symposium on Applied Computing (SAC),
March 2007.
[4] K.-M. Lin and C.-J. Lin. A Study on Reduced Support Vector Machines. IEEE
Transactions on Neural Networks, 14(6): 1449–1559, 2003.
[5] K.-M. Lin, J. Zheng, and J. Zhang. Automatic Structure Prediction of HIV
Coreceptor CCR5. In Proceedings of Bioinformatics in Taiwan Symposium (BIT),
Taichung, Taiwan, September 2006.

More Related Content

What's hot

Research proposal writing 2013
Research proposal writing 2013Research proposal writing 2013
Research proposal writing 2013
Chukunoye Ochonogor
 
Research methodology
Research methodologyResearch methodology
Research methodology
Devika Manulal
 
Postdoc Reseacher CV
Postdoc Reseacher CVPostdoc Reseacher CV
Postdoc Reseacher CV
ArmanZhanbotin
 
Research Report Review | Business Communication | Walli Butt
Research Report Review | Business Communication | Walli Butt Research Report Review | Business Communication | Walli Butt
Research Report Review | Business Communication | Walli Butt
WalliTech
 
Research proposal writing
Research proposal  writingResearch proposal  writing
Research proposal writing
chelliah paramasivan
 
Self Introduction
Self IntroductionSelf Introduction
Self Introductionumekoumeda
 
Effective PhD research Progress
Effective PhD research ProgressEffective PhD research Progress
Effective PhD research Progress
Omer Mahfoodh
 
Research Statement
Research StatementResearch Statement
Research StatementYuan Tang
 
research process
 research process research process
research process
kpgandhi
 
steps in research proposal or aspect of research proposal
steps in research proposal or aspect of research proposalsteps in research proposal or aspect of research proposal
steps in research proposal or aspect of research proposal
ZaryabQureshi3
 
Research proposal
Research proposalResearch proposal
Research proposal
Dana Popescu
 
Writing a scientific research paper
Writing a scientific research paperWriting a scientific research paper
Writing a scientific research paper
EssayAcademy
 
How to (seriously) read a scientific paper
How to (seriously) read a scientific paperHow to (seriously) read a scientific paper
How to (seriously) read a scientific paper
Aboul Ella Hassanien
 
Nanotechnology20120918 19-26 lecture 4-5-6 - Nanomaterials
Nanotechnology20120918  19-26 lecture 4-5-6 - NanomaterialsNanotechnology20120918  19-26 lecture 4-5-6 - Nanomaterials
Nanotechnology20120918 19-26 lecture 4-5-6 - Nanomaterials
Chin Yung Jyi
 
Writing research proposal
Writing research proposal Writing research proposal
Writing research proposal
Kaimrc_Rss_Jd
 
Manuscript writing
Manuscript writing Manuscript writing
Manuscript writing
MitalPatani
 
Research gate
Research gateResearch gate
Research gate
Amal Al-Yasiri
 

What's hot (20)

Research proposal writing 2013
Research proposal writing 2013Research proposal writing 2013
Research proposal writing 2013
 
Research methodology
Research methodologyResearch methodology
Research methodology
 
Postdoc Reseacher CV
Postdoc Reseacher CVPostdoc Reseacher CV
Postdoc Reseacher CV
 
Research Report Review | Business Communication | Walli Butt
Research Report Review | Business Communication | Walli Butt Research Report Review | Business Communication | Walli Butt
Research Report Review | Business Communication | Walli Butt
 
Research proposal writing
Research proposal  writingResearch proposal  writing
Research proposal writing
 
Research question
Research questionResearch question
Research question
 
Self Introduction
Self IntroductionSelf Introduction
Self Introduction
 
Project Proposal
Project  ProposalProject  Proposal
Project Proposal
 
Effective PhD research Progress
Effective PhD research ProgressEffective PhD research Progress
Effective PhD research Progress
 
Research Statement
Research StatementResearch Statement
Research Statement
 
research process
 research process research process
research process
 
steps in research proposal or aspect of research proposal
steps in research proposal or aspect of research proposalsteps in research proposal or aspect of research proposal
steps in research proposal or aspect of research proposal
 
Research proposal
Research proposalResearch proposal
Research proposal
 
Writing a scientific research paper
Writing a scientific research paperWriting a scientific research paper
Writing a scientific research paper
 
PhD CV: Postdoctoral Research
PhD CV: Postdoctoral ResearchPhD CV: Postdoctoral Research
PhD CV: Postdoctoral Research
 
How to (seriously) read a scientific paper
How to (seriously) read a scientific paperHow to (seriously) read a scientific paper
How to (seriously) read a scientific paper
 
Nanotechnology20120918 19-26 lecture 4-5-6 - Nanomaterials
Nanotechnology20120918  19-26 lecture 4-5-6 - NanomaterialsNanotechnology20120918  19-26 lecture 4-5-6 - Nanomaterials
Nanotechnology20120918 19-26 lecture 4-5-6 - Nanomaterials
 
Writing research proposal
Writing research proposal Writing research proposal
Writing research proposal
 
Manuscript writing
Manuscript writing Manuscript writing
Manuscript writing
 
Research gate
Research gateResearch gate
Research gate
 

Similar to Research Statement

Web Mining
Web MiningWeb Mining
Web Mining
Shobha Rani
 
Web Mining Based Framework for Ontology Learning
Web Mining Based Framework for Ontology LearningWeb Mining Based Framework for Ontology Learning
Web Mining Based Framework for Ontology Learning
csandit
 
Linked Data Workshop Stanford University
Linked Data Workshop Stanford University Linked Data Workshop Stanford University
Linked Data Workshop Stanford University
Talis Consulting
 
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...IOSR Journals
 
Semantic Web-Linked Data and Libraries
Semantic Web-Linked Data and LibrariesSemantic Web-Linked Data and Libraries
Semantic Web-Linked Data and Libraries
Indian Institute of Management Ahmedabad
 
Researcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive AnalysisResearcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive Analysis
IJAEMSJORNAL
 
An efficient educational data mining approach to support e-learning
An efficient educational data mining approach to support e-learningAn efficient educational data mining approach to support e-learning
An efficient educational data mining approach to support e-learning
Venu Madhav
 
WEB MINING: PATTERN DISCOVERY ON THE WORLD WIDE WEB - 2011
WEB MINING: PATTERN DISCOVERY ON THE WORLD WIDE WEB - 2011WEB MINING: PATTERN DISCOVERY ON THE WORLD WIDE WEB - 2011
WEB MINING: PATTERN DISCOVERY ON THE WORLD WIDE WEB - 2011
Mustafa TURAN
 
Minning www
Minning wwwMinning www
Minning www
Sonali Parab
 
A Study Web Data Mining Challenges And Application For Information Extraction
A Study  Web Data Mining Challenges And Application For Information ExtractionA Study  Web Data Mining Challenges And Application For Information Extraction
A Study Web Data Mining Challenges And Application For Information Extraction
Scott Bou
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpages
csandit
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpages
csandit
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13
Kristi Holmes
 
Image retrieval from the world wide web issues, techniques, and systems
Image retrieval from the world wide web issues, techniques, and systemsImage retrieval from the world wide web issues, techniques, and systems
Image retrieval from the world wide web issues, techniques, and systemsunyil96
 
Image retrieval from the world wide web
Image retrieval from the world wide webImage retrieval from the world wide web
Image retrieval from the world wide webunyil96
 
The Revolution Of Cloud Computing
The Revolution Of Cloud ComputingThe Revolution Of Cloud Computing
The Revolution Of Cloud Computing
Carmen Sanborn
 

Similar to Research Statement (20)

Web Mining
Web MiningWeb Mining
Web Mining
 
Web Mining Based Framework for Ontology Learning
Web Mining Based Framework for Ontology LearningWeb Mining Based Framework for Ontology Learning
Web Mining Based Framework for Ontology Learning
 
5463 26 web mining
5463 26 web mining5463 26 web mining
5463 26 web mining
 
Linked Data Workshop Stanford University
Linked Data Workshop Stanford University Linked Data Workshop Stanford University
Linked Data Workshop Stanford University
 
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
Web Mining for an Academic Portal: The case of Al-Imam Muhammad Ibn Saud Isla...
 
Semantic Web-Linked Data and Libraries
Semantic Web-Linked Data and LibrariesSemantic Web-Linked Data and Libraries
Semantic Web-Linked Data and Libraries
 
Researcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive AnalysisResearcher Reliance on Digital Libraries: A Descriptive Analysis
Researcher Reliance on Digital Libraries: A Descriptive Analysis
 
01635156
0163515601635156
01635156
 
Introduction abstract
Introduction abstractIntroduction abstract
Introduction abstract
 
An efficient educational data mining approach to support e-learning
An efficient educational data mining approach to support e-learningAn efficient educational data mining approach to support e-learning
An efficient educational data mining approach to support e-learning
 
Ngdm09 han gao
Ngdm09 han gaoNgdm09 han gao
Ngdm09 han gao
 
WEB MINING: PATTERN DISCOVERY ON THE WORLD WIDE WEB - 2011
WEB MINING: PATTERN DISCOVERY ON THE WORLD WIDE WEB - 2011WEB MINING: PATTERN DISCOVERY ON THE WORLD WIDE WEB - 2011
WEB MINING: PATTERN DISCOVERY ON THE WORLD WIDE WEB - 2011
 
Minning www
Minning wwwMinning www
Minning www
 
A Study Web Data Mining Challenges And Application For Information Extraction
A Study  Web Data Mining Challenges And Application For Information ExtractionA Study  Web Data Mining Challenges And Application For Information Extraction
A Study Web Data Mining Challenges And Application For Information Extraction
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpages
 
Topic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep WebpagesTopic Modeling : Clustering of Deep Webpages
Topic Modeling : Clustering of Deep Webpages
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13
 
Image retrieval from the world wide web issues, techniques, and systems
Image retrieval from the world wide web issues, techniques, and systemsImage retrieval from the world wide web issues, techniques, and systems
Image retrieval from the world wide web issues, techniques, and systems
 
Image retrieval from the world wide web
Image retrieval from the world wide webImage retrieval from the world wide web
Image retrieval from the world wide web
 
The Revolution Of Cloud Computing
The Revolution Of Cloud ComputingThe Revolution Of Cloud Computing
The Revolution Of Cloud Computing
 

Research Statement

  • 1. Research Statement Kuan-ming Lin January 2007 Since I took the Database course by Prof. Jun Yang at Duke Computer Science Department, I have been attracted by the excitement of data mining. Broadly speaking, I am interested in finding useful patterns in large stores of data which can be handled only by computers. In earlier years, such data of large size were collected by business or organizations and kept in proprietary data warehouses. Therefore, many data mining techniques were specialized to tackle individual cases, thus their generalizability remained questionable. Nowadays, as the Internet has become the largest worldwide information storage, it is possible for the public to perform data analysis on public databases, which also motivates my career choice. In particular, I am curious in mining three categories of public data, namely biological databases[3][5], Web documents[1], and the Semantic Web. A number of repositories of biological data, particularly via high throughput methods representing different aspects of biological processes, have been publicized in the past decade. The data collectively are believed to provide more information than what they were explained individually. For example, we could find functions to unknown genes by integrative analysis on multiple microarrays, genome sequences, NMR and X-ray data, etc. Then, how to extract complementary information across heterogeneous datasets becomes appealing for identifying new biological relations. Since there is little known underlying theoretical model available for biological datasets, this is very challenging and worth solving. As the rapid growing of information on the Web, retrieving interesting information from Web documents, such as what Google and Facebook have done, has high commercial potential value. Among various multimedia mining challenges, I believe Web text mining research, benefited from traditional information retrieval, should provides new opportunities, in that Web documents are by nature highly unstructured yet interconnected through hyperlinks. Designing new techniques for mining Web document could lead to great applications, so I will be happy to be involved in related groups. The Semantic Web is a solution to extend the current Web content to provide intelligent mining and analysis on the Web. Unlike the currently prevalent HTML
  • 2. and XML, Semantic Web languages explicitly define associations between semantics, or meaning, with the Web document content. The semantics are formally specified in well-designed ontology; therefore, they can be either shared via the Internet or refined for local needs. The current standard for the Semantic Web exists: OWL, a W3C Recommendation, for example, yet integrations from heterogeneous Semantic Web groups and ontology are still in its infantry. Besides, the current semantic reasoning and ontology updating tools are still far from satisfactory. Therefore, I would like to contribute to this new field with my already cumulated knowledge on data mining and database integration. Throughout the undergraduate and master studies, I have developed skills in the data mining subfields, ranging from theoretical studies like approximation algorithms for integrative data mining, to machine learning issues like feature selection and discovery and SVM[2][4], to applications in bioinformatics and Web mining. A few of my studies focused on bioinformatics and statistical learning, and I have gained experiences on Web text mining through my research internship and a summer school lecture series in Taiwan. Meanwhile, I have published two papers at Duke[3][5]. To sum up, I hope to continue my career in data mining related software fields, in Taiwan or other places where cloud computing and analysis are encouraged. References [1] C.-C. Huang, K.-M. Lin, and L.-F. Chien. Automatic Training Corpora Acquisition through Web Mining. In Proceedings of The 2005 IEEE/WIC/ACM International Conference on Web Intelligence [2] K.-M. Lin. Reduction Techniques for Support Vector Machines. Master’s thesis, National Taiwan University, Taipei, Taiwan, 2002. Adviser: Chih-Jen Lin. [3] K.-M. Lin and J. Kang. Exploiting Inter-gene Information for Microarray Data Integration. In Proceedings of The ACM Symposium on Applied Computing (SAC), March 2007. [4] K.-M. Lin and C.-J. Lin. A Study on Reduced Support Vector Machines. IEEE Transactions on Neural Networks, 14(6): 1449–1559, 2003. [5] K.-M. Lin, J. Zheng, and J. Zhang. Automatic Structure Prediction of HIV Coreceptor CCR5. In Proceedings of Bioinformatics in Taiwan Symposium (BIT), Taichung, Taiwan, September 2006.