SlideShare a Scribd company logo
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 1, Ver. I (Jan – Feb. 2015), PP 65-71
www.iosrjournals.org
DOI: 10.9790/0661-17116571 www.iosrjournals.org 65 | Page
Research Paper Selection Based On an Ontology and Text Mining
Technique Using Clustering
1
Snehal Shivaji Patil 2
S.A.Uddin
Alhabeeb college of Engg. & Tech. Hyderabad, India.
Alhabeeb college of Engg. & Tech. Hyderabad, India.
Abstract: Research Paper Selection is important decision making task for the Government funding Agency,
Universities, research Institutes. Ontology is Knowledge Repository in which concepts and terms defined as well
as relationship between these concepts. In this paper Ontology is old research papers repository of keywords
and frequencies of that keywords of the research papers of funding agencies. Ontology makes the tasks of
searching similar patterns of text that is to be more effective, efficient and interactive. The current method of
grouping of papers for research paper selection based on similarities of Keywords and Frequencies of research
papers of ontology. Text mining is method for extracting unknown information from the large documents. The
Research Papers in each domain are clustered using Text mining Technique. Grouped Research papers are
assigned to appropriate reviewer or domain experts for peer review systematically. The Reviewer results are
collected and papers are get ranked based on experts review results.
Keywords: Ontology, Text Mining, Classification, Document Preprocessing, Clustering.
I. Introduction
Research Paper Selection is important decision making task for many Organizations such as
Government funding Agency, Universities, research Institutes.
Fig.1. Research Paper Selection Process.
Fig. 1 shows the processes of research project selection i.e. Call for papers (CFP), paper submission,
paper grouping, paper assignment to experts, peer review, aggregation of review results, panel evaluation, and
final awarding decision. These processes are very similar in other funding agencies, except that there are a very
large number of papers that need to be grouped for peer review. Four to five reviewers are assigned to review
each paper so as to assure accurate and reliable opinions on papers. To deal with the large volume, it is
necessary to group papers according to their similarities in research disciplines and then to assign the paper
groups to relevant reviewers.
In the first section we Call for Research Paper means uploading Research paper and submitting the
details of that paper. Classification of research papers is based on keywords of papers similar with ontology
keywords and frequencies of those keywords. Department members are classified into six groups according to
their decision making in research paper selection. Decision making cooperate with each other to accomplish
overall goal of selecting research paper as shown above in figure the output of one block is input to the next
block.
Department is responsible for selection of research papers for classification. Department members
classify research papers and assign them to external reviewer for evaluation and commentary. If department
member may not have knowledge about research paper in all research domain and contents of many papers were
not fully understood when papers were grouped and while assigning grouped papers to external reviewers.
Therefore, there was an effective approach to group the submitted research Papers and assign the papers to
external reviewers with computer supports. So we use ontology based text- mining approach is proposed to
solve the problem.
Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering
DOI: 10.9790/0661-17116571 www.iosrjournals.org 66 | Page
Section II reviews the literature on research paper selection, grouping of papers and assigning the
grouped papers to external reviewer systematically. The proposed method on an ontology based text mining
framework for Research Paper Selection is described in Section III. Section IV validates and evaluates the
method, and then discusses the potential application.
II. Literature Review
Selection of research projects is an important research topic in research and development (R&D)
project management. Previous re- search deals with specific topics, and several formal methods and models are
available for this purpose.
For example, Chen and Gorla [2] proposed a fuzzy-logic-based model as a decision tool for project
selection. Henriksen and Traynor [3] presented a scoring tool for project evaluation and selection. Ghasemzadeh
and Archer [4] offered a decision support approach to project portfolio selection. Machacha and Bhattacharya
[5] proposed a fuzzy logic approach to project selection. Butler et al. [6] used a multiple attribute utility theory
for project ranking and selection. Loch and Kavadias [7] established a dynamic programming model for project
selection, while Meade and Presley [8] developed an analytic network process model. Greiner et al. [9] proposed
a hybrid AHP and integer programming approach to support project selection, and Tian et al. [10] suggested an
organizational decision support approach for selecting R&D projects. Cook et al. [11] presented a method of
optimal allocation of papers to reviewers in order to facilitate the selection process. Arya and Mittendorf [12]
proposed a rotation program method for project assignment. For example Choi and Park [13] used text-mining
approach for R&D paper screening. Girotra et al. [14] offered an empirical study to value projects in a portfolio.
Sun et al. [15] developed a decision support system to evaluate reviewers for research project selection. Finally,
Sun et al. [16] proposed a hybrid knowledge-based and modeling approach to assign reviewers to papers for
research project selection.
Cheng and Wei [2008] proposed clustering-based category-hierarchy integration (CHI) technique,
which is an extension of the clustering-based category integration (CCI) technique. This method was improving
the effectiveness of category-hierarchy integration compared with that attained by nonhierarchical category-
integration techniques particularly homogeneous [16].
Methods have been developed to group papers for peer review tasks. For example, Hettich and Pazzani
[2006] proposed a text-mining approach to group papers, identify reviewers, and assign reviewers to papers.
Current methods group papers according to keywords. Unfortunately, papers with similar research areas might
be placed in wrong groups due to the following reasons: first, keywords are incomplete information about the
full content of the papers. Second, keywords are provided by applicants who may have subjective views and
misconceptions, and keywords are only a partial representation of the research papers. Third, manual grouping
is usually conducted by division managers or program directors in funding agencies. They may have different
understanding about the research disciplines and may not have adequate knowledge to assign papers into the
right groups. [17]
III. Existing System
The existing system is an Ontology-Based Text-Mining Method to cluster research papers based on
their similarities in research areas. It consists of three phases. Ontology is a knowledge repository in which
concepts and terms are defined as well as relationships between these concepts.
It consists of axioms, relationships and set of concepts that describe a domain of interests and
represents an agreed-upon conceptualization of the domain’s ―real-world‖ setting. Implicit knowledge for
humans is made explicit for computers by ontology. Thus, ontology can automate information processing and
can facilitate text mining in a specific domain (such as research project selection). An ontology based text
mining framework has been built for clustering the research papers according to their discipline areas.
Text mining refers generally to the process of extracting interesting information and knowledge from
unstructured text. The main difference between regular data mining and text mining is that text mining patterns
are extracted from natural language text rather than from structured databases of facts.
Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering
DOI: 10.9790/0661-17116571 www.iosrjournals.org 67 | Page
Fig.2 Main Framework of Proposed system.
1. Constructing Research Ontology: A research ontology containing the projects funded in latest five years
is constructed according to keywords, and it is updated annually. As domain ontology research ontology is a
public concept set of the research project management domain. The research topics of different disciplines
can be clearly expressed by research ontology.
2. Classifying New Research Papers: New research papers are classified according to the keyword stored in
ontology with the topic identified using Topic Identification Algorithm.
3. Clustering: Research Papers Based on Similarities Using Text Mining: After the research papers are
classified by the discipline areas, the papers in each discipline are clustered using the text- mining
technique. The main clustering process consists of five steps, text document collection, text document
preprocessing, text document encoding.
A. Graphs:
In the existing system we display the graph accoprding to domain, keywords and institute wise.
1. Domain wise graph:
Fig.3 Domain wise graph
Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering
DOI: 10.9790/0661-17116571 www.iosrjournals.org 68 | Page
This bar chart shows the domain wise graph. This graph contains the domain and the frequency. AS per
we submit the domain wise paper in the existing systsem after classifying & reviewing paper the graph will
display the selected paper domain and domains frequency. This graph shows the cloud computing and data
mining domains and freqency is higher than biometrics.
2. Keyword wise graph:
Fig.4 Keyword wise graph
This bar chart shows the keyword wise graph of selected papers.This graph contains the domain wise
paper and papers keyword and the frequency. AS per we submit the domain wise paper in the existing systsem
after classifying & reviewing paper the graph will display the selected paper domain and keywords of that
papers and domain frequency. This graph shows the keywords of cloud computing and data mining and
freqency is higher than biometrics.
3. Institute wise graph:
Fig.5 Institute wise graph
This bar chart shows the institute name wise graph. The graph shows the college name and value means
how many papers get submitted. In this graph MIT and PICT College’s student submitted lot of papers so value
increased as compared other colleges.
IV. Ontology Based Text Mining Framewok
In the R&D, after papers are submitted, the next important task is to group papers and assign them to
reviewers. The papers in each group should have similar research characteristics. For instance, if the papers in a
group fall into the same primary research discipline (e.g., supply chain management) and the number of papers
is small, manual grouping based on keywords listed in papers can be used and assign them to reviewer
manually. However, if the number of papers is large, it is very difficult to group papers and assign them to
reviewer manually. So the papers are classified using ontology and topic identification algorithm and then
papers are clustered using text mining and last it is submitted to reviewer systematically.
Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering
DOI: 10.9790/0661-17116571 www.iosrjournals.org 69 | Page
Fig.6 Structure of Research Ontology
As shown in Above Fig. It is Tree like Structure Containing Category, Sector, Domain and Topic
Name of Research Papers. Considering the example Scientific Research is the Category, under that there are
different sectors like Computer Science, E&TC. Sectors containing the Different Domains of the research area
.Each domain contain the various topic names of the Research papers. Each topic name contains keywords of
more than two domains then that topic name gets classified under domain which comes first. By using this
method we solve the problem of grouping of research papers under the proper domain area.
Module1: Research Ontology Building:
Step1) creating the research topics: The keywords of the supported research projects each year are collected,
and their frequencies are counted. The keyword frequency is the sum of the same keywords that appeared in the
discipline during the most recent five years. Creating the research topics of the discipline Ak, (k =1 ,2,...,K). The
keywords and their frequencies are denoted by the feature set (Nok,IDk,year,{ (keyword1,frequency1),
(keyword2, frequency2),..., (keywordk,frequencyk)}, where Nok is the sequence number of the kth record and
IDk is the corresponding discipline code. For instance, if discipline Ak has two keywords in 2007 (i.e., ―data
mining‖ and ―business intelligence‖) and the total number of counts for them are 30 and 50, respectively, the
discipline can be denoted by (Nok, IDk, 2007, {(data mining, 30), (business intelligence, 50)}). In this way, a
feature set of each discipline can be created. The keyword frequency in the feature set is the sum of the same
keywords that appeared in this discipline during the most recent five years (shown in Fig. 4), and then, the
feature set of Ak is denoted by (Nok,IDk,{(keyword1,frequency1)(keyword2, frequency2) (keywordk,
frequencyk)}).
Step2) Constructing the research ontology: First, the research ontology is categorized according to scientific
research areas introduced in the background. It is then developed on the basis of several specific research areas.
Sep3) Updating the research ontology: Once the project funding is completed each year, the research ontology
is updated according to agency’s policy and the change of the feature set.
Module 2: Classifying New Research Papers:
Research papers are classified by the domain area based on ontology keywords. This is done using the research
ontology as follows. Suppose that there are K discipline areas, and Ak denotes area k(k =1 ,2,...,K). Pi denotes
papers i(i =1 ,2,...,I), and Sk represents the set of papers which belongs to area k.
Module 3: Clustering Research Papers Based on Similarities Using Text Mining:
After the research papers are classified by the domain areas, the papers in each domain are clustered using the
text-mining technique. The main clustering process consists of four steps, as shown in Fig. text document
collection, text document preprocessing, text document encoding, and text vector clustering. The details of each
step are as follows.
Fig.7 Process of text mining
Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering
DOI: 10.9790/0661-17116571 www.iosrjournals.org 70 | Page
Step 1) Text document collection: After the research papers are classified according to the discipline areas, the
paper documents in each discipline Ak(k =1 ,2,...,K) are collected for text document preprocessing.
Step 2) Text document preprocessing: The contents of papers are usually nano structured. Because the texts
of the papers consist of characters which are difficult to segment, the research ontology is used to analyze,
extract, and identify the keywords in the full text of the papers.. Finally, a further reduction in the vocabulary
size can be achieved through the removal of all stop words that appeared only a few times in all paper
documents.
Step 3) Text document encoding: After text documents are segmented, they are converted into a feature vector
representation: V =( v1,v2,...,vM), where M is the number of features selected and vi(i =1 ,2,...,M) is the TF-
IDF encoding [18] of the keyword wi. TF-IDF encoding describes a weighted method based on inverse
document frequency (IDF) combined with the term frequency (TF) to produce the feature v, such that vi = tfi
∗log(N/dfi), where N is the total number of papers in the discipline, tfi is the term frequency of the feature word
wi, anddf i is the number of papers containing the word wi. Thus, research papers can be represented by
corresponding feature vectors.
Step 4) Text vector clustering: This step uses K-MEANS clustering algorithm to cluster the feature vectors
based on similarities of re- search areas. The K-MEANS clustering algorithm is a typical unsupervised learning
neural network model that clusters input data with similarities. Details of the K-MEANS algorithm.
V. Clustering Using K-Means Algorithm
After the construction of the document vector, the process of clustering is carried out. The K-MEANS clustering
algorithm is used to meet the purpose of this project. The basic algorithm of K-MEANS used for the project is
as following:
K-Means Algorithm Use:
For partitioning where each cluster’s center is represented by the mean value of the objects in the cluster.
Input:
k: the number of clusters,
Output:
A set of k clusters.
Method:
Step 1: Choose k numbers of clusters to be determined.
Step 2: Choose Ck centroids randomly as the initial centers of the clusters.
Step 3: Repeat
3.1: Assign each object to their closest cluster center using Euclidean distance.
3.2: Compute new cluster center by calculating mean points.
Step 4: Until
4.1: No change in cluster center OR
4.2: No object changes its clusters.
VI. Conclusion And Future Work
This paper has presented a framework on ontology based text mining for grouping research papers and
assigning the grouped paper to reviewers systematically. Research ontology is constructed to categorize the
concept terms in different discipline areas and to form relationships among them. It facilitates text-mining and
optimization techniques to cluster research papers based on their similarities and then to assign them to reviewer
according to their concerned research area. The papers are assigned to reviewer with the help of knowledge
based agent. Future work is needed to replace the work of reviewer by system. Also, there is a need to
empirically compare the results of manual classification to text-mining classification. Also there is need to
sending message on user’s mobile number and also further profile schedule.
References
[1]. Y. H. Sun, J. Ma, Z. P. Fan, and J. Wang, ―A group decision support approach to evaluate experts for R&D project selection,‖ IEEE
Trans Eng. Manag., vol. 55, no. 1, pp. 158–170, Feb.2008.
[2]. A. D. Henriksen and A. J. Traynor, ―A practical R&D project-selection scoring tool,‖ IEEE Trans. Eng. Manag., vol. 46, no. 2, pp.
158–170,May 1999.
[3]. S. Bechhofer et al., OWL Web Ontology Language Reference, W3C recommendation, vol.10, p. 2006-01, 2004.
Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering
DOI: 10.9790/0661-17116571 www.iosrjournals.org 71 | Page
[4]. B. Yildiz and S.Miksch, ―ontoX—A method for ontology-driven information extraction,‖ in Proc.ICCSA (3), vol. 4707, Lecture
Notes in Computer Science, O. Gervasi andM. L. Gavrilova, Eds., 2007, pp. 660–673, Berlin,Germany: Springer-Verlag.
[5]. Jian Ma, Wei Xu, Yong-hong Sun, Efraim Turban, Shouyang Wang‖An Ontology-Based Text- Mining Method to Cluster Papers
for Research Project Selection‖,IEEE Trans an systems and humans vol.42,no.3 May2012
[6]. A. Maedche and S. Staab, ―The Text-To-Onto ontology learning environment,‖in Proc. 8th Int.Conf. Conceptual Struct., Darmstadt,
Germany,2000, pp. 14–18.
[7]. E. Turban, D. Zhou, and J. Ma, ―A group decision support approach to evaluating journals,‖ Inf. Manage., vol. 42, no. 1, pp. 31–44,
Dec. 2004.
[8]. C. Choi and Y. Park, ―R&D paper screening system based on text mining approach,‖ Int. J.Technol. Intell. Plan., vol. 2, no. 1, pp.
61–72,2006.
[9]. D. Roussinov and H. Chen, ―Document clustering for electronic meetings:An experimental comparison of two techniques,‖ Decis.
Support Syst.,vol. 27, no. 1/2, pp. 67–79, Nov. 1999.
[10]. C. Wei, C. S. Yang, H. W. Hsiao, and T. H. Cheng, ―Combining preference- and content based approaches for improving document
clustering effectiveness,‖ Inf. process. Manage.,vol. 42, no. 2, pp. 350–372,Mar. 2006.
[11]. T. A. Runkler and J. C. Bezdek, ―Web mining with relational clustering,‖Int. J. Approx. Reason., vol. 32, no. 2/3, pp. 217–236, Feb.
2003
[12]. H. Li, K. Zhang, and T. Jiang, ―Minimum entropy clustering and applications to gene expression analysis,‖ in Proc. 3rd IEEE
Comput. Syst. Bioinform. Conf., Stanford, CA, 2004, pp. 142–151.
[13]. S. Gauch, J. Chaffee, and A. Pretschner, ―Ontology-based personalized search and browsing,‖ Web Intell. Agent Syst., vol. 1, no.
3/4,pp. 219– 234, Dec. 2003.
[14]. L. M. Meade and A. Presley, ―R&D project selection using the analytic network process,‖IEEE Trans. Eng. Manag., vol. 49, no. 1,
pp. 59– 66, Feb. 2002.
[15]. Hossein Shahsavand Baghdadi and Bali Ranaivo-Malançon ,―An Automatic Topic Identification Algorithm,‖ Journal of Computer
Science 7 (9): 1363-1367, 2011 ISSN 1549-3636
[16]. T. H. Cheng and C. P. Wei, ―A clustering-based approach for integrating document-category hierarchies,‖ IEEE Trans. Syst., Man,
Cybern.A,Syst., Humans, vol. 38, no. 2, pp. 410–424, Mar. 2008.
[17]. S. Hettich and M. Pazzani, ―Mining for paper reviewers: Lessons learned at the National Science Foundation,‖ in Proc. 12th Int.
Conf.Knowl. Discov. Data Mining, 2006, pp. 862–871.

More Related Content

What's hot

0739456x17723971
0739456x177239710739456x17723971
0739456x17723971
kamilHussain15
 
Viva
VivaViva
Novelty detection via topic modeling in research articles
Novelty detection via topic modeling in research articlesNovelty detection via topic modeling in research articles
Novelty detection via topic modeling in research articles
csandit
 
A Review on Text Mining in Data Mining
A Review on Text Mining in Data MiningA Review on Text Mining in Data Mining
A Review on Text Mining in Data Mining
ijsc
 
Systematic review on project actuality
Systematic review on project actualitySystematic review on project actuality
Systematic review on project actuality
ijcsit
 
Semantic tagging for documents using 'short text' information
Semantic tagging for documents using 'short text' informationSemantic tagging for documents using 'short text' information
Semantic tagging for documents using 'short text' information
csandit
 
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-MeansIRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET Journal
 
An Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text ClassificationAn Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text Classification
IJCSIS Research Publications
 
Wasana (2011) a systematic, tool-supported method for conducting lr in is
Wasana (2011)   a systematic, tool-supported method for conducting lr in isWasana (2011)   a systematic, tool-supported method for conducting lr in is
Wasana (2011) a systematic, tool-supported method for conducting lr in is
Researchworkshop
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
ijcsit
 
Enhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization TechniquesEnhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization Techniques
veningstonk
 
Survey on Existing Text Mining Frameworks and A Proposed Idealistic Framework...
Survey on Existing Text Mining Frameworks and A Proposed Idealistic Framework...Survey on Existing Text Mining Frameworks and A Proposed Idealistic Framework...
Survey on Existing Text Mining Frameworks and A Proposed Idealistic Framework...
ijceronline
 
Ontology Based PMSE with Manifold Preference
Ontology Based PMSE with Manifold PreferenceOntology Based PMSE with Manifold Preference
Ontology Based PMSE with Manifold Preference
IJCERT
 
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
TELKOMNIKA JOURNAL
 
IRJET- Text Document Clustering using K-Means Algorithm
IRJET-  	  Text Document Clustering using K-Means Algorithm IRJET-  	  Text Document Clustering using K-Means Algorithm
IRJET- Text Document Clustering using K-Means Algorithm
IRJET Journal
 
A BOOTSTRAPPING METHOD FOR AUTOMATIC CONSTRUCTING OF THE WEB ONTOLOGY INSTANC...
A BOOTSTRAPPING METHOD FOR AUTOMATIC CONSTRUCTING OF THE WEB ONTOLOGY INSTANC...A BOOTSTRAPPING METHOD FOR AUTOMATIC CONSTRUCTING OF THE WEB ONTOLOGY INSTANC...
A BOOTSTRAPPING METHOD FOR AUTOMATIC CONSTRUCTING OF THE WEB ONTOLOGY INSTANC...
IJwest
 

What's hot (16)

0739456x17723971
0739456x177239710739456x17723971
0739456x17723971
 
Viva
VivaViva
Viva
 
Novelty detection via topic modeling in research articles
Novelty detection via topic modeling in research articlesNovelty detection via topic modeling in research articles
Novelty detection via topic modeling in research articles
 
A Review on Text Mining in Data Mining
A Review on Text Mining in Data MiningA Review on Text Mining in Data Mining
A Review on Text Mining in Data Mining
 
Systematic review on project actuality
Systematic review on project actualitySystematic review on project actuality
Systematic review on project actuality
 
Semantic tagging for documents using 'short text' information
Semantic tagging for documents using 'short text' informationSemantic tagging for documents using 'short text' information
Semantic tagging for documents using 'short text' information
 
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-MeansIRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
 
An Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text ClassificationAn Evaluation of Preprocessing Techniques for Text Classification
An Evaluation of Preprocessing Techniques for Text Classification
 
Wasana (2011) a systematic, tool-supported method for conducting lr in is
Wasana (2011)   a systematic, tool-supported method for conducting lr in isWasana (2011)   a systematic, tool-supported method for conducting lr in is
Wasana (2011) a systematic, tool-supported method for conducting lr in is
 
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological CorpusA Semantic Retrieval System for Extracting Relationships from Biological Corpus
A Semantic Retrieval System for Extracting Relationships from Biological Corpus
 
Enhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization TechniquesEnhancing Information Retrieval by Personalization Techniques
Enhancing Information Retrieval by Personalization Techniques
 
Survey on Existing Text Mining Frameworks and A Proposed Idealistic Framework...
Survey on Existing Text Mining Frameworks and A Proposed Idealistic Framework...Survey on Existing Text Mining Frameworks and A Proposed Idealistic Framework...
Survey on Existing Text Mining Frameworks and A Proposed Idealistic Framework...
 
Ontology Based PMSE with Manifold Preference
Ontology Based PMSE with Manifold PreferenceOntology Based PMSE with Manifold Preference
Ontology Based PMSE with Manifold Preference
 
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
A Comprehensive Survey on Comparisons across Contextual Pre-Filtering, Contex...
 
IRJET- Text Document Clustering using K-Means Algorithm
IRJET-  	  Text Document Clustering using K-Means Algorithm IRJET-  	  Text Document Clustering using K-Means Algorithm
IRJET- Text Document Clustering using K-Means Algorithm
 
A BOOTSTRAPPING METHOD FOR AUTOMATIC CONSTRUCTING OF THE WEB ONTOLOGY INSTANC...
A BOOTSTRAPPING METHOD FOR AUTOMATIC CONSTRUCTING OF THE WEB ONTOLOGY INSTANC...A BOOTSTRAPPING METHOD FOR AUTOMATIC CONSTRUCTING OF THE WEB ONTOLOGY INSTANC...
A BOOTSTRAPPING METHOD FOR AUTOMATIC CONSTRUCTING OF THE WEB ONTOLOGY INSTANC...
 

Viewers also liked

Multiple Equilibria and Chemical Distribution of Some Bio Metals With β-Amide...
Multiple Equilibria and Chemical Distribution of Some Bio Metals With β-Amide...Multiple Equilibria and Chemical Distribution of Some Bio Metals With β-Amide...
Multiple Equilibria and Chemical Distribution of Some Bio Metals With β-Amide...
IOSR Journals
 
C012652430
C012652430C012652430
C012652430
IOSR Journals
 
Developing Web Browser-Jan
Developing Web Browser-JanDeveloping Web Browser-Jan
Developing Web Browser-Jan
IOSR Journals
 
Cloud Computing for hand-held Devices:Enhancing Smart phones viability with C...
Cloud Computing for hand-held Devices:Enhancing Smart phones viability with C...Cloud Computing for hand-held Devices:Enhancing Smart phones viability with C...
Cloud Computing for hand-held Devices:Enhancing Smart phones viability with C...
IOSR Journals
 
B012651523
B012651523B012651523
B012651523
IOSR Journals
 
C1302020913
C1302020913C1302020913
C1302020913
IOSR Journals
 
Securing Group Communication in Partially Distributed Systems
Securing Group Communication in Partially Distributed SystemsSecuring Group Communication in Partially Distributed Systems
Securing Group Communication in Partially Distributed Systems
IOSR Journals
 
N130305105111
N130305105111N130305105111
N130305105111
IOSR Journals
 
T130403141145
T130403141145T130403141145
T130403141145
IOSR Journals
 
I1303044552
I1303044552I1303044552
I1303044552
IOSR Journals
 
L012247886
L012247886L012247886
L012247886
IOSR Journals
 
C012521523
C012521523C012521523
C012521523
IOSR Journals
 
J012546068
J012546068J012546068
J012546068
IOSR Journals
 
F012533444
F012533444F012533444
F012533444
IOSR Journals
 
O0131492100
O0131492100O0131492100
O0131492100
IOSR Journals
 
F0623642
F0623642F0623642
F0623642
IOSR Journals
 
H012624853
H012624853H012624853
H012624853
IOSR Journals
 
Q01742112115
Q01742112115Q01742112115
Q01742112115
IOSR Journals
 
F011136467
F011136467F011136467
F011136467
IOSR Journals
 

Viewers also liked (20)

Multiple Equilibria and Chemical Distribution of Some Bio Metals With β-Amide...
Multiple Equilibria and Chemical Distribution of Some Bio Metals With β-Amide...Multiple Equilibria and Chemical Distribution of Some Bio Metals With β-Amide...
Multiple Equilibria and Chemical Distribution of Some Bio Metals With β-Amide...
 
C012652430
C012652430C012652430
C012652430
 
Developing Web Browser-Jan
Developing Web Browser-JanDeveloping Web Browser-Jan
Developing Web Browser-Jan
 
Cloud Computing for hand-held Devices:Enhancing Smart phones viability with C...
Cloud Computing for hand-held Devices:Enhancing Smart phones viability with C...Cloud Computing for hand-held Devices:Enhancing Smart phones viability with C...
Cloud Computing for hand-held Devices:Enhancing Smart phones viability with C...
 
B012651523
B012651523B012651523
B012651523
 
I0665256
I0665256I0665256
I0665256
 
C1302020913
C1302020913C1302020913
C1302020913
 
Securing Group Communication in Partially Distributed Systems
Securing Group Communication in Partially Distributed SystemsSecuring Group Communication in Partially Distributed Systems
Securing Group Communication in Partially Distributed Systems
 
N130305105111
N130305105111N130305105111
N130305105111
 
T130403141145
T130403141145T130403141145
T130403141145
 
I1303044552
I1303044552I1303044552
I1303044552
 
L012247886
L012247886L012247886
L012247886
 
C012521523
C012521523C012521523
C012521523
 
J012546068
J012546068J012546068
J012546068
 
F012533444
F012533444F012533444
F012533444
 
O0131492100
O0131492100O0131492100
O0131492100
 
F0623642
F0623642F0623642
F0623642
 
H012624853
H012624853H012624853
H012624853
 
Q01742112115
Q01742112115Q01742112115
Q01742112115
 
F011136467
F011136467F011136467
F011136467
 

Similar to Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering

Research on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesResearch on ontology based information retrieval techniques
Research on ontology based information retrieval techniques
Kausar Mukadam
 
Navigation through citation network based on content similarity using cosine ...
Navigation through citation network based on content similarity using cosine ...Navigation through citation network based on content similarity using cosine ...
Navigation through citation network based on content similarity using cosine ...
Salam Shah
 
A Centroid And Relationship Based Clustering For Organizing Research Papers
A Centroid And Relationship Based Clustering For Organizing Research PapersA Centroid And Relationship Based Clustering For Organizing Research Papers
A Centroid And Relationship Based Clustering For Organizing Research Papers
Daniel Wachtel
 
A Social Network-Empowered Research Analytics Framework For Project Selection
A Social Network-Empowered Research Analytics Framework For Project SelectionA Social Network-Empowered Research Analytics Framework For Project Selection
A Social Network-Empowered Research Analytics Framework For Project Selection
Nat Rice
 
A Citation-Based Recommender System For Scholarly Paper Recommendation
A Citation-Based Recommender System For Scholarly Paper RecommendationA Citation-Based Recommender System For Scholarly Paper Recommendation
A Citation-Based Recommender System For Scholarly Paper Recommendation
Daniel Wachtel
 
Ontology based clustering in research project
Ontology based clustering in research projectOntology based clustering in research project
Ontology based clustering in research project
eSAT Publishing House
 
An Effective Academic Research Papers Recommendation For Non-Profiled Users
An Effective Academic Research Papers Recommendation For Non-Profiled UsersAn Effective Academic Research Papers Recommendation For Non-Profiled Users
An Effective Academic Research Papers Recommendation For Non-Profiled Users
Nat Rice
 
Systematic literature review technique.pptx
Systematic literature review technique.pptxSystematic literature review technique.pptx
Systematic literature review technique.pptx
TANMAY DAS GUPTA
 
09 9241 it co-citation network investigation (edit ari)
09 9241 it  co-citation network investigation (edit ari)09 9241 it  co-citation network investigation (edit ari)
09 9241 it co-citation network investigation (edit ari)
IAESIJEECS
 
Assignment 2 Case StudyRead the following articleAgostinho
Assignment 2 Case StudyRead the following articleAgostinhoAssignment 2 Case StudyRead the following articleAgostinho
Assignment 2 Case StudyRead the following articleAgostinho
desteinbrook
 
82611944
8261194482611944
82611944
MelindaCahyani2
 
A Two-Stage Method For Scientific Papers Analysis
A Two-Stage Method For Scientific Papers AnalysisA Two-Stage Method For Scientific Papers Analysis
A Two-Stage Method For Scientific Papers Analysis
Justin Knight
 
A Two-Stage Method For Scientific Papers Analysis.Pdf
A Two-Stage Method For Scientific Papers Analysis.PdfA Two-Stage Method For Scientific Papers Analysis.Pdf
A Two-Stage Method For Scientific Papers Analysis.Pdf
Sophia Diaz
 
Clicking Past Google
Clicking Past GoogleClicking Past Google
Clicking Past Google
Douglas Joubert
 
Classification of News and Research Articles Using Text Pattern Mining
Classification of News and Research Articles Using Text Pattern MiningClassification of News and Research Articles Using Text Pattern Mining
Classification of News and Research Articles Using Text Pattern Mining
IOSR Journals
 
A Review Of Text Mining Techniques And Applications
A Review Of Text Mining Techniques And ApplicationsA Review Of Text Mining Techniques And Applications
A Review Of Text Mining Techniques And Applications
Lisa Graves
 
How to conduct a bibliometric analysis?
How to conduct a bibliometric analysis?How to conduct a bibliometric analysis?
How to conduct a bibliometric analysis?
Anandhan22
 
PDF-How to Conduct Bibliometric Analyses.pdf
PDF-How to Conduct Bibliometric Analyses.pdfPDF-How to Conduct Bibliometric Analyses.pdf
PDF-How to Conduct Bibliometric Analyses.pdf
Tutors India
 
A Collaborative Approach Toward Scientific Paper Recommendation Using Citatio...
A Collaborative Approach Toward Scientific Paper Recommendation Using Citatio...A Collaborative Approach Toward Scientific Paper Recommendation Using Citatio...
A Collaborative Approach Toward Scientific Paper Recommendation Using Citatio...
Melinda Watson
 

Similar to Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering (20)

Research on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesResearch on ontology based information retrieval techniques
Research on ontology based information retrieval techniques
 
Navigation through citation network based on content similarity using cosine ...
Navigation through citation network based on content similarity using cosine ...Navigation through citation network based on content similarity using cosine ...
Navigation through citation network based on content similarity using cosine ...
 
A Centroid And Relationship Based Clustering For Organizing Research Papers
A Centroid And Relationship Based Clustering For Organizing Research PapersA Centroid And Relationship Based Clustering For Organizing Research Papers
A Centroid And Relationship Based Clustering For Organizing Research Papers
 
A Social Network-Empowered Research Analytics Framework For Project Selection
A Social Network-Empowered Research Analytics Framework For Project SelectionA Social Network-Empowered Research Analytics Framework For Project Selection
A Social Network-Empowered Research Analytics Framework For Project Selection
 
A Citation-Based Recommender System For Scholarly Paper Recommendation
A Citation-Based Recommender System For Scholarly Paper RecommendationA Citation-Based Recommender System For Scholarly Paper Recommendation
A Citation-Based Recommender System For Scholarly Paper Recommendation
 
Ontology based clustering in research project
Ontology based clustering in research projectOntology based clustering in research project
Ontology based clustering in research project
 
An Effective Academic Research Papers Recommendation For Non-Profiled Users
An Effective Academic Research Papers Recommendation For Non-Profiled UsersAn Effective Academic Research Papers Recommendation For Non-Profiled Users
An Effective Academic Research Papers Recommendation For Non-Profiled Users
 
Systematic literature review technique.pptx
Systematic literature review technique.pptxSystematic literature review technique.pptx
Systematic literature review technique.pptx
 
09 9241 it co-citation network investigation (edit ari)
09 9241 it  co-citation network investigation (edit ari)09 9241 it  co-citation network investigation (edit ari)
09 9241 it co-citation network investigation (edit ari)
 
Assignment 2 Case StudyRead the following articleAgostinho
Assignment 2 Case StudyRead the following articleAgostinhoAssignment 2 Case StudyRead the following articleAgostinho
Assignment 2 Case StudyRead the following articleAgostinho
 
82611944
8261194482611944
82611944
 
Ijetcas14 446
Ijetcas14 446Ijetcas14 446
Ijetcas14 446
 
A Two-Stage Method For Scientific Papers Analysis
A Two-Stage Method For Scientific Papers AnalysisA Two-Stage Method For Scientific Papers Analysis
A Two-Stage Method For Scientific Papers Analysis
 
A Two-Stage Method For Scientific Papers Analysis.Pdf
A Two-Stage Method For Scientific Papers Analysis.PdfA Two-Stage Method For Scientific Papers Analysis.Pdf
A Two-Stage Method For Scientific Papers Analysis.Pdf
 
Clicking Past Google
Clicking Past GoogleClicking Past Google
Clicking Past Google
 
Classification of News and Research Articles Using Text Pattern Mining
Classification of News and Research Articles Using Text Pattern MiningClassification of News and Research Articles Using Text Pattern Mining
Classification of News and Research Articles Using Text Pattern Mining
 
A Review Of Text Mining Techniques And Applications
A Review Of Text Mining Techniques And ApplicationsA Review Of Text Mining Techniques And Applications
A Review Of Text Mining Techniques And Applications
 
How to conduct a bibliometric analysis?
How to conduct a bibliometric analysis?How to conduct a bibliometric analysis?
How to conduct a bibliometric analysis?
 
PDF-How to Conduct Bibliometric Analyses.pdf
PDF-How to Conduct Bibliometric Analyses.pdfPDF-How to Conduct Bibliometric Analyses.pdf
PDF-How to Conduct Bibliometric Analyses.pdf
 
A Collaborative Approach Toward Scientific Paper Recommendation Using Citatio...
A Collaborative Approach Toward Scientific Paper Recommendation Using Citatio...A Collaborative Approach Toward Scientific Paper Recommendation Using Citatio...
A Collaborative Approach Toward Scientific Paper Recommendation Using Citatio...
 

More from IOSR Journals

A011140104
A011140104A011140104
A011140104
IOSR Journals
 
M0111397100
M0111397100M0111397100
M0111397100
IOSR Journals
 
L011138596
L011138596L011138596
L011138596
IOSR Journals
 
K011138084
K011138084K011138084
K011138084
IOSR Journals
 
J011137479
J011137479J011137479
J011137479
IOSR Journals
 
I011136673
I011136673I011136673
I011136673
IOSR Journals
 
G011134454
G011134454G011134454
G011134454
IOSR Journals
 
H011135565
H011135565H011135565
H011135565
IOSR Journals
 
F011134043
F011134043F011134043
F011134043
IOSR Journals
 
E011133639
E011133639E011133639
E011133639
IOSR Journals
 
D011132635
D011132635D011132635
D011132635
IOSR Journals
 
C011131925
C011131925C011131925
C011131925
IOSR Journals
 
B011130918
B011130918B011130918
B011130918
IOSR Journals
 
A011130108
A011130108A011130108
A011130108
IOSR Journals
 
I011125160
I011125160I011125160
I011125160
IOSR Journals
 
H011124050
H011124050H011124050
H011124050
IOSR Journals
 
G011123539
G011123539G011123539
G011123539
IOSR Journals
 
F011123134
F011123134F011123134
F011123134
IOSR Journals
 
E011122530
E011122530E011122530
E011122530
IOSR Journals
 
D011121524
D011121524D011121524
D011121524
IOSR Journals
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
AhmedHussein950959
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Dr.Costas Sachpazis
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
BrazilAccount1
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
SupreethSP4
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
SamSarthak3
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
WENKENLI1
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Sreedhar Chowdam
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
ongomchris
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
manasideore6
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
AmarGB2
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
bakpo1
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
seandesed
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
MLILAB
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
ViniHema
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
Pipe Restoration Solutions
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
Pratik Pawar
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Teleport Manpower Consultant
 

Recently uploaded (20)

Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
Sachpazis:Terzaghi Bearing Capacity Estimation in simple terms with Calculati...
 
English lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdfEnglish lab ppt no titlespecENG PPTt.pdf
English lab ppt no titlespecENG PPTt.pdf
 
Runway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptxRunway Orientation Based on the Wind Rose Diagram.pptx
Runway Orientation Based on the Wind Rose Diagram.pptx
 
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdfAKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
AKS UNIVERSITY Satna Final Year Project By OM Hardaha.pdf
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdfGoverning Equations for Fundamental Aerodynamics_Anderson2010.pdf
Governing Equations for Fundamental Aerodynamics_Anderson2010.pdf
 
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&BDesign and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
Design and Analysis of Algorithms-DP,Backtracking,Graphs,B&B
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
 
Fundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptxFundamentals of Electric Drives and its applications.pptx
Fundamentals of Electric Drives and its applications.pptx
 
Investor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptxInvestor-Presentation-Q1FY2024 investor presentation document.pptx
Investor-Presentation-Q1FY2024 investor presentation document.pptx
 
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
一比一原版(SFU毕业证)西蒙菲莎大学毕业证成绩单如何办理
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
H.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdfH.Seo,  ICLR 2024, MLILAB,  KAIST AI.pdf
H.Seo, ICLR 2024, MLILAB, KAIST AI.pdf
 
power quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptxpower quality voltage fluctuation UNIT - I.pptx
power quality voltage fluctuation UNIT - I.pptx
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 
The Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdfThe Benefits and Techniques of Trenchless Pipe Repair.pdf
The Benefits and Techniques of Trenchless Pipe Repair.pdf
 
weather web application report.pdf
weather web application report.pdfweather web application report.pdf
weather web application report.pdf
 
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdfTop 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
Top 10 Oil and Gas Projects in Saudi Arabia 2024.pdf
 

Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 1, Ver. I (Jan – Feb. 2015), PP 65-71 www.iosrjournals.org DOI: 10.9790/0661-17116571 www.iosrjournals.org 65 | Page Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering 1 Snehal Shivaji Patil 2 S.A.Uddin Alhabeeb college of Engg. & Tech. Hyderabad, India. Alhabeeb college of Engg. & Tech. Hyderabad, India. Abstract: Research Paper Selection is important decision making task for the Government funding Agency, Universities, research Institutes. Ontology is Knowledge Repository in which concepts and terms defined as well as relationship between these concepts. In this paper Ontology is old research papers repository of keywords and frequencies of that keywords of the research papers of funding agencies. Ontology makes the tasks of searching similar patterns of text that is to be more effective, efficient and interactive. The current method of grouping of papers for research paper selection based on similarities of Keywords and Frequencies of research papers of ontology. Text mining is method for extracting unknown information from the large documents. The Research Papers in each domain are clustered using Text mining Technique. Grouped Research papers are assigned to appropriate reviewer or domain experts for peer review systematically. The Reviewer results are collected and papers are get ranked based on experts review results. Keywords: Ontology, Text Mining, Classification, Document Preprocessing, Clustering. I. Introduction Research Paper Selection is important decision making task for many Organizations such as Government funding Agency, Universities, research Institutes. Fig.1. Research Paper Selection Process. Fig. 1 shows the processes of research project selection i.e. Call for papers (CFP), paper submission, paper grouping, paper assignment to experts, peer review, aggregation of review results, panel evaluation, and final awarding decision. These processes are very similar in other funding agencies, except that there are a very large number of papers that need to be grouped for peer review. Four to five reviewers are assigned to review each paper so as to assure accurate and reliable opinions on papers. To deal with the large volume, it is necessary to group papers according to their similarities in research disciplines and then to assign the paper groups to relevant reviewers. In the first section we Call for Research Paper means uploading Research paper and submitting the details of that paper. Classification of research papers is based on keywords of papers similar with ontology keywords and frequencies of those keywords. Department members are classified into six groups according to their decision making in research paper selection. Decision making cooperate with each other to accomplish overall goal of selecting research paper as shown above in figure the output of one block is input to the next block. Department is responsible for selection of research papers for classification. Department members classify research papers and assign them to external reviewer for evaluation and commentary. If department member may not have knowledge about research paper in all research domain and contents of many papers were not fully understood when papers were grouped and while assigning grouped papers to external reviewers. Therefore, there was an effective approach to group the submitted research Papers and assign the papers to external reviewers with computer supports. So we use ontology based text- mining approach is proposed to solve the problem.
  • 2. Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering DOI: 10.9790/0661-17116571 www.iosrjournals.org 66 | Page Section II reviews the literature on research paper selection, grouping of papers and assigning the grouped papers to external reviewer systematically. The proposed method on an ontology based text mining framework for Research Paper Selection is described in Section III. Section IV validates and evaluates the method, and then discusses the potential application. II. Literature Review Selection of research projects is an important research topic in research and development (R&D) project management. Previous re- search deals with specific topics, and several formal methods and models are available for this purpose. For example, Chen and Gorla [2] proposed a fuzzy-logic-based model as a decision tool for project selection. Henriksen and Traynor [3] presented a scoring tool for project evaluation and selection. Ghasemzadeh and Archer [4] offered a decision support approach to project portfolio selection. Machacha and Bhattacharya [5] proposed a fuzzy logic approach to project selection. Butler et al. [6] used a multiple attribute utility theory for project ranking and selection. Loch and Kavadias [7] established a dynamic programming model for project selection, while Meade and Presley [8] developed an analytic network process model. Greiner et al. [9] proposed a hybrid AHP and integer programming approach to support project selection, and Tian et al. [10] suggested an organizational decision support approach for selecting R&D projects. Cook et al. [11] presented a method of optimal allocation of papers to reviewers in order to facilitate the selection process. Arya and Mittendorf [12] proposed a rotation program method for project assignment. For example Choi and Park [13] used text-mining approach for R&D paper screening. Girotra et al. [14] offered an empirical study to value projects in a portfolio. Sun et al. [15] developed a decision support system to evaluate reviewers for research project selection. Finally, Sun et al. [16] proposed a hybrid knowledge-based and modeling approach to assign reviewers to papers for research project selection. Cheng and Wei [2008] proposed clustering-based category-hierarchy integration (CHI) technique, which is an extension of the clustering-based category integration (CCI) technique. This method was improving the effectiveness of category-hierarchy integration compared with that attained by nonhierarchical category- integration techniques particularly homogeneous [16]. Methods have been developed to group papers for peer review tasks. For example, Hettich and Pazzani [2006] proposed a text-mining approach to group papers, identify reviewers, and assign reviewers to papers. Current methods group papers according to keywords. Unfortunately, papers with similar research areas might be placed in wrong groups due to the following reasons: first, keywords are incomplete information about the full content of the papers. Second, keywords are provided by applicants who may have subjective views and misconceptions, and keywords are only a partial representation of the research papers. Third, manual grouping is usually conducted by division managers or program directors in funding agencies. They may have different understanding about the research disciplines and may not have adequate knowledge to assign papers into the right groups. [17] III. Existing System The existing system is an Ontology-Based Text-Mining Method to cluster research papers based on their similarities in research areas. It consists of three phases. Ontology is a knowledge repository in which concepts and terms are defined as well as relationships between these concepts. It consists of axioms, relationships and set of concepts that describe a domain of interests and represents an agreed-upon conceptualization of the domain’s ―real-world‖ setting. Implicit knowledge for humans is made explicit for computers by ontology. Thus, ontology can automate information processing and can facilitate text mining in a specific domain (such as research project selection). An ontology based text mining framework has been built for clustering the research papers according to their discipline areas. Text mining refers generally to the process of extracting interesting information and knowledge from unstructured text. The main difference between regular data mining and text mining is that text mining patterns are extracted from natural language text rather than from structured databases of facts.
  • 3. Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering DOI: 10.9790/0661-17116571 www.iosrjournals.org 67 | Page Fig.2 Main Framework of Proposed system. 1. Constructing Research Ontology: A research ontology containing the projects funded in latest five years is constructed according to keywords, and it is updated annually. As domain ontology research ontology is a public concept set of the research project management domain. The research topics of different disciplines can be clearly expressed by research ontology. 2. Classifying New Research Papers: New research papers are classified according to the keyword stored in ontology with the topic identified using Topic Identification Algorithm. 3. Clustering: Research Papers Based on Similarities Using Text Mining: After the research papers are classified by the discipline areas, the papers in each discipline are clustered using the text- mining technique. The main clustering process consists of five steps, text document collection, text document preprocessing, text document encoding. A. Graphs: In the existing system we display the graph accoprding to domain, keywords and institute wise. 1. Domain wise graph: Fig.3 Domain wise graph
  • 4. Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering DOI: 10.9790/0661-17116571 www.iosrjournals.org 68 | Page This bar chart shows the domain wise graph. This graph contains the domain and the frequency. AS per we submit the domain wise paper in the existing systsem after classifying & reviewing paper the graph will display the selected paper domain and domains frequency. This graph shows the cloud computing and data mining domains and freqency is higher than biometrics. 2. Keyword wise graph: Fig.4 Keyword wise graph This bar chart shows the keyword wise graph of selected papers.This graph contains the domain wise paper and papers keyword and the frequency. AS per we submit the domain wise paper in the existing systsem after classifying & reviewing paper the graph will display the selected paper domain and keywords of that papers and domain frequency. This graph shows the keywords of cloud computing and data mining and freqency is higher than biometrics. 3. Institute wise graph: Fig.5 Institute wise graph This bar chart shows the institute name wise graph. The graph shows the college name and value means how many papers get submitted. In this graph MIT and PICT College’s student submitted lot of papers so value increased as compared other colleges. IV. Ontology Based Text Mining Framewok In the R&D, after papers are submitted, the next important task is to group papers and assign them to reviewers. The papers in each group should have similar research characteristics. For instance, if the papers in a group fall into the same primary research discipline (e.g., supply chain management) and the number of papers is small, manual grouping based on keywords listed in papers can be used and assign them to reviewer manually. However, if the number of papers is large, it is very difficult to group papers and assign them to reviewer manually. So the papers are classified using ontology and topic identification algorithm and then papers are clustered using text mining and last it is submitted to reviewer systematically.
  • 5. Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering DOI: 10.9790/0661-17116571 www.iosrjournals.org 69 | Page Fig.6 Structure of Research Ontology As shown in Above Fig. It is Tree like Structure Containing Category, Sector, Domain and Topic Name of Research Papers. Considering the example Scientific Research is the Category, under that there are different sectors like Computer Science, E&TC. Sectors containing the Different Domains of the research area .Each domain contain the various topic names of the Research papers. Each topic name contains keywords of more than two domains then that topic name gets classified under domain which comes first. By using this method we solve the problem of grouping of research papers under the proper domain area. Module1: Research Ontology Building: Step1) creating the research topics: The keywords of the supported research projects each year are collected, and their frequencies are counted. The keyword frequency is the sum of the same keywords that appeared in the discipline during the most recent five years. Creating the research topics of the discipline Ak, (k =1 ,2,...,K). The keywords and their frequencies are denoted by the feature set (Nok,IDk,year,{ (keyword1,frequency1), (keyword2, frequency2),..., (keywordk,frequencyk)}, where Nok is the sequence number of the kth record and IDk is the corresponding discipline code. For instance, if discipline Ak has two keywords in 2007 (i.e., ―data mining‖ and ―business intelligence‖) and the total number of counts for them are 30 and 50, respectively, the discipline can be denoted by (Nok, IDk, 2007, {(data mining, 30), (business intelligence, 50)}). In this way, a feature set of each discipline can be created. The keyword frequency in the feature set is the sum of the same keywords that appeared in this discipline during the most recent five years (shown in Fig. 4), and then, the feature set of Ak is denoted by (Nok,IDk,{(keyword1,frequency1)(keyword2, frequency2) (keywordk, frequencyk)}). Step2) Constructing the research ontology: First, the research ontology is categorized according to scientific research areas introduced in the background. It is then developed on the basis of several specific research areas. Sep3) Updating the research ontology: Once the project funding is completed each year, the research ontology is updated according to agency’s policy and the change of the feature set. Module 2: Classifying New Research Papers: Research papers are classified by the domain area based on ontology keywords. This is done using the research ontology as follows. Suppose that there are K discipline areas, and Ak denotes area k(k =1 ,2,...,K). Pi denotes papers i(i =1 ,2,...,I), and Sk represents the set of papers which belongs to area k. Module 3: Clustering Research Papers Based on Similarities Using Text Mining: After the research papers are classified by the domain areas, the papers in each domain are clustered using the text-mining technique. The main clustering process consists of four steps, as shown in Fig. text document collection, text document preprocessing, text document encoding, and text vector clustering. The details of each step are as follows. Fig.7 Process of text mining
  • 6. Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering DOI: 10.9790/0661-17116571 www.iosrjournals.org 70 | Page Step 1) Text document collection: After the research papers are classified according to the discipline areas, the paper documents in each discipline Ak(k =1 ,2,...,K) are collected for text document preprocessing. Step 2) Text document preprocessing: The contents of papers are usually nano structured. Because the texts of the papers consist of characters which are difficult to segment, the research ontology is used to analyze, extract, and identify the keywords in the full text of the papers.. Finally, a further reduction in the vocabulary size can be achieved through the removal of all stop words that appeared only a few times in all paper documents. Step 3) Text document encoding: After text documents are segmented, they are converted into a feature vector representation: V =( v1,v2,...,vM), where M is the number of features selected and vi(i =1 ,2,...,M) is the TF- IDF encoding [18] of the keyword wi. TF-IDF encoding describes a weighted method based on inverse document frequency (IDF) combined with the term frequency (TF) to produce the feature v, such that vi = tfi ∗log(N/dfi), where N is the total number of papers in the discipline, tfi is the term frequency of the feature word wi, anddf i is the number of papers containing the word wi. Thus, research papers can be represented by corresponding feature vectors. Step 4) Text vector clustering: This step uses K-MEANS clustering algorithm to cluster the feature vectors based on similarities of re- search areas. The K-MEANS clustering algorithm is a typical unsupervised learning neural network model that clusters input data with similarities. Details of the K-MEANS algorithm. V. Clustering Using K-Means Algorithm After the construction of the document vector, the process of clustering is carried out. The K-MEANS clustering algorithm is used to meet the purpose of this project. The basic algorithm of K-MEANS used for the project is as following: K-Means Algorithm Use: For partitioning where each cluster’s center is represented by the mean value of the objects in the cluster. Input: k: the number of clusters, Output: A set of k clusters. Method: Step 1: Choose k numbers of clusters to be determined. Step 2: Choose Ck centroids randomly as the initial centers of the clusters. Step 3: Repeat 3.1: Assign each object to their closest cluster center using Euclidean distance. 3.2: Compute new cluster center by calculating mean points. Step 4: Until 4.1: No change in cluster center OR 4.2: No object changes its clusters. VI. Conclusion And Future Work This paper has presented a framework on ontology based text mining for grouping research papers and assigning the grouped paper to reviewers systematically. Research ontology is constructed to categorize the concept terms in different discipline areas and to form relationships among them. It facilitates text-mining and optimization techniques to cluster research papers based on their similarities and then to assign them to reviewer according to their concerned research area. The papers are assigned to reviewer with the help of knowledge based agent. Future work is needed to replace the work of reviewer by system. Also, there is a need to empirically compare the results of manual classification to text-mining classification. Also there is need to sending message on user’s mobile number and also further profile schedule. References [1]. Y. H. Sun, J. Ma, Z. P. Fan, and J. Wang, ―A group decision support approach to evaluate experts for R&D project selection,‖ IEEE Trans Eng. Manag., vol. 55, no. 1, pp. 158–170, Feb.2008. [2]. A. D. Henriksen and A. J. Traynor, ―A practical R&D project-selection scoring tool,‖ IEEE Trans. Eng. Manag., vol. 46, no. 2, pp. 158–170,May 1999. [3]. S. Bechhofer et al., OWL Web Ontology Language Reference, W3C recommendation, vol.10, p. 2006-01, 2004.
  • 7. Research Paper Selection Based On an Ontology and Text Mining Technique Using Clustering DOI: 10.9790/0661-17116571 www.iosrjournals.org 71 | Page [4]. B. Yildiz and S.Miksch, ―ontoX—A method for ontology-driven information extraction,‖ in Proc.ICCSA (3), vol. 4707, Lecture Notes in Computer Science, O. Gervasi andM. L. Gavrilova, Eds., 2007, pp. 660–673, Berlin,Germany: Springer-Verlag. [5]. Jian Ma, Wei Xu, Yong-hong Sun, Efraim Turban, Shouyang Wang‖An Ontology-Based Text- Mining Method to Cluster Papers for Research Project Selection‖,IEEE Trans an systems and humans vol.42,no.3 May2012 [6]. A. Maedche and S. Staab, ―The Text-To-Onto ontology learning environment,‖in Proc. 8th Int.Conf. Conceptual Struct., Darmstadt, Germany,2000, pp. 14–18. [7]. E. Turban, D. Zhou, and J. Ma, ―A group decision support approach to evaluating journals,‖ Inf. Manage., vol. 42, no. 1, pp. 31–44, Dec. 2004. [8]. C. Choi and Y. Park, ―R&D paper screening system based on text mining approach,‖ Int. J.Technol. Intell. Plan., vol. 2, no. 1, pp. 61–72,2006. [9]. D. Roussinov and H. Chen, ―Document clustering for electronic meetings:An experimental comparison of two techniques,‖ Decis. Support Syst.,vol. 27, no. 1/2, pp. 67–79, Nov. 1999. [10]. C. Wei, C. S. Yang, H. W. Hsiao, and T. H. Cheng, ―Combining preference- and content based approaches for improving document clustering effectiveness,‖ Inf. process. Manage.,vol. 42, no. 2, pp. 350–372,Mar. 2006. [11]. T. A. Runkler and J. C. Bezdek, ―Web mining with relational clustering,‖Int. J. Approx. Reason., vol. 32, no. 2/3, pp. 217–236, Feb. 2003 [12]. H. Li, K. Zhang, and T. Jiang, ―Minimum entropy clustering and applications to gene expression analysis,‖ in Proc. 3rd IEEE Comput. Syst. Bioinform. Conf., Stanford, CA, 2004, pp. 142–151. [13]. S. Gauch, J. Chaffee, and A. Pretschner, ―Ontology-based personalized search and browsing,‖ Web Intell. Agent Syst., vol. 1, no. 3/4,pp. 219– 234, Dec. 2003. [14]. L. M. Meade and A. Presley, ―R&D project selection using the analytic network process,‖IEEE Trans. Eng. Manag., vol. 49, no. 1, pp. 59– 66, Feb. 2002. [15]. Hossein Shahsavand Baghdadi and Bali Ranaivo-Malançon ,―An Automatic Topic Identification Algorithm,‖ Journal of Computer Science 7 (9): 1363-1367, 2011 ISSN 1549-3636 [16]. T. H. Cheng and C. P. Wei, ―A clustering-based approach for integrating document-category hierarchies,‖ IEEE Trans. Syst., Man, Cybern.A,Syst., Humans, vol. 38, no. 2, pp. 410–424, Mar. 2008. [17]. S. Hettich and M. Pazzani, ―Mining for paper reviewers: Lessons learned at the National Science Foundation,‖ in Proc. 12th Int. Conf.Knowl. Discov. Data Mining, 2006, pp. 862–871.