SlideShare a Scribd company logo
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
DOI :10.5121/ijscai.2016.5106 53
A NAIVE METHOD FOR ONTOLOGY
CONSTRUCTION
Preeti Kathiria
1
and Sukraat Ahluwalia
2
1
Assistant Professor, Department of Computer Science and Engineering, Nirma
University, Gujarat, India
2
Student, Department of Computer Science and Engineering, Nirma University,
Gujarat, India
ABSTRACT
Ontologies are being used to organize information in many domains like artificial intelligence,
information science, semantic web, library science. Ontologies of an entity having different information
can be merged to create more knowledge of that particular entity. Ontologies today are powering more
accurate search and retrieval in websites like Wikipedia etc. As we move towards the future to Web 3.0,
also termed as the semantic web, ontologies will play a more important role.
Ontologies are represented in various forms like RDF, RDFS, XML, OWL etc. Querying ontologies can
yield basic information about an entity. This paper proposes an automated method for ontology creation,
using concepts from NLP (Natural Language Processing), Information Retrieval and Machine Learning.
Concepts drawn from these domains help in designing more accurate ontologies represented using the
XML format. This paper uses document classification using classification algorithms for assigning labels
to documents, document similarity to cluster similar documents to the input document, together, and
summarization to shorten the text and keep important terms essential in making the ontology. The module
is constructed using the Python programming language and NLTK (Natural Language Toolkit). The
ontologies created in XML will convey to a lay person the definition of the important term's and their
lexical relationships.
KEYWORDS
Ontology, Semantic Web, Domain Ontology, Ontology Construction, Classification, Document Similarity.
1. INTRODUCTION
Ontology is primarily a philosophical concept, the philosophical definition of ontology is
“The study of the nature of existence of an entity and it's relationships”. Computer Science
extends this definition of ontology by adding a taxonomy. The taxonomy can is based on the
entity. A good example of the ontology of an entity is given below from a discussion on
semantic web.
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
54
Figure 1: Ontological representation of the English version of the book.[1]
The above picture is an ontology constructed for the book “The Glass Palace” written by
Amitav Ghosh. The ontology is visualized as a graph where each of the parts of an entity, in this
case the book, such as the city (London), Authors' name (Amitav Ghosh), all form the nodes of
the graph. An RDF triple representation of the ontology as illustrated in [1] would be:
Figure 2: RDF representation of the ontology.[1]
Querying the information above can give vital information like the author of the book, the
publisher etc. With ontology as a method for organizing information, extraction of basic
information of an entity becomes much more easier. Now consider another ontology of the book
but with a different author for the translated version of the book. The graph as shown in [1]
would be:
Figure 3: Ontological representation of French version of the book.[1]
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
55
Since the two ontologies above have the same URI, they can be merged to form a singular
ontology of the entity, containing more information now. Querying the new ontology now yields
more information than before. A merged ontology would be diagrammatically represented as
illustrated in:
Figure 4: Merged Ontology.[1]
The advantages of using ontologies can be summarized from [2]:
Ontologies provide a specific vocabulary with which knowledge can be represented. This
vocabulary allows us to specify which entities will be represented, how they will be represented,
how they can be grouped, and what relationships connect them together. The vocabulary can be
seen as a sort of social contract between a data producer and a data consumer. The more accurate
the ontology, the greater the potential understanding of how data can be used, If an ontology is
complex, including relationships a data consumer doesn't require then it can become confusing,
complicated, and difficult to use and maintain.
Ontologies allow us to express the formal rules for inference. When software reads our ontology,
it should have all the information necessary to draw the same conclusions from our data that we
did. When actions of software are consistent with the rules expressed in ontology, we say that the
software has made an ontological commitment. In object-oriented programming (OOP), it is easy
to forget how many rules the system is handling for us. For example, a subclass in an OO system
contains its parent's members (methods and variables) because, somewhere in the system,
software asserted that having access to parent members is consistent with what it means to be an
OO subclass. By design, semantic systems using ontology, leave it up to you to build or invoke
the necessary software to make inferences based on that model.
2. RELATED WORK
Researchers have tried to construct ontologies through different means:
1) Manually using tools like Protege, OWL etc.
2) Using automated methods drawing techniques from areas like NLP, Machine Learning, and
Information Retrieval etc.
An example that follows the second technique is employed by Q.Song, J. Liu, X. Wang and
J.Wang [3] , employs techniques from data mining and information retrieval. It uses K Means
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
56
clustering to construct the feature vector using tf-idf of each page. They further use Bayesian
classifiers on sample data sets to construct domain ontologies. S.Hong Sie and J.Hua Yeh[4],
have proposed an automated construction method based on a hierarchy generation process with a
guided concept clustering method(GCC), to reduce human effort in maintaining large knowledge
networks.
Among the methods that propose creation of ontology using manual methods, the method
proposed by L.Seremati and A. Kameas[5], is targeted towards novice ontology developers with
little experience in developing ontologies. The ontology developer in their method, has
experience in the domain for which ontology is being made, the ontology construction is done by
re-using pre-existing ontologies. The second case in manual construction is domain based in the
area of biomedical ontologies and has been proposed by F.K Hussain,A.S Sidhu, T.S Dhillon, E.
Chang[6]. The method takes a case study of protein ontologies, constructing ontologies by taking
protein data from existing data sources. The design of the ontologies is distributed across sections
like SubClass,ATOMsequence of protein. The final method of manual construction proposed by
S. Jain[7] takes into account the relation of a topic to other topic, for eg. a topic of computer
science may also be related to domains of Mathematics, Statistics, Electrical Engineering etc. The
method uses OWL to construct domain ontologies of various topics and sub-topics covered in a
document.
3. ONTOLOGY CONSTRUCTION MODULE
3.1. Document Classification
The very first step in the construction phase is classifying the type of the document. To construct
a domain ontology, it is important that a label type be assigned to the document, so that the
correct relation can be assigned. To do classification in the construction phase Naive-Bayes
classification shall be used.
3.1.1. Naive-Bayes Classification
Naive Bayes classification per its name works on bayesian probability. Bayes formula can be
represented as:
Figure 5: Bayes Formula.[8]
P(A) and P(B) are the the probabilities of A and B without regard to each other. P(A|B) is the
probability of occurrence of A given that B is true. In Naive Bayes classification the principle
applied is that every label has a say. This means that out of the set of possible labels for an entity,
the probabilities of the entity belonging to all labels is calculated using the above stated formula.
Therefore if an entity could belong to N possible labels represented by a vector x = (x1,...,xn), ,
then the formula for classification using Naive Bayes can be represented as:
Figure 6: Formula for Naïve Bayes classification.[9]
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
57
The above formula is suitable when the data is discrete. If the data is continuous for example the
classifier might have to identify the gender based on real attributes like height, weight, foot size,
attributes which can be in floating point format, the above formula will not work. In cases like
that, values of each class are distributed according to a Gaussian distribution, given by the
formula:
Figure 7: Naive-Bayes for continuous data.[8]
Naive Bayes is a lazy learner among classifiers. Naive Bayes requires to be first run against a set
of training data where it calculates probabilities for different features that belong to particular
labels. Once that is done, the data to be classified is fed in and the probabilities for its features are
calculated for each label, using the results of the training set. The label which has the maximum
probability for occurrence, is assigned to the data.
3.1.2. Using Naive-Bayes to classify documents
The aim of the Ontology construction module is to classify a document as belonging to either
Mathematics or Computer Science. At first a training set of documents belonging to both domains
are input to the classifier, the label of the document is also input to the classifier. The classifier
uses a feature vector, W = (W1,....,Wn), to represent words of the document. The probability of
the word occurring is mapped to the label of the training set documents. Once this process has
been done, the document to which the label has been assigned is split into the same feature vector
as the training set documents and its probabilities are calculated based on the results of the
training set for both domains - Mathematics and Computer Science. The domain which has the
higher probability, is the label that is assigned to the document. Once classification of a document
is complete, the next step is to search the most similar documents.
3.2. Document Similarity
After classification, the next step is to search for documents that are similar to the document for
which the ontology will be constructed. The method to be used is the vector space model.
3.2.1. Vector Space Model
The vector space model begins by creating the bag of words of two or more documents.
Considering two documents A and B, the bag of words of both documents is a vector of the
collection of unique words in both the documents. A vector V = (V1,...,Vn), is constructed for
both documents, the vector constitutes of the count of occurrence of each word in the documents
that corresponds to the bag of words. The vector can also be called a term frequency vector,
abbreviated as tf, The next step is to calculate the inverse document frequency or idf , of the
documents. The inverse document frequency of a document is a measure of how important the
word is to the document, whether it is common or rare across a set of documents. The idf of a
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
58
document is useful in filtering out stop words, words that occur very frequently but do not add
much meaning to the document. The inverse document frequency can be calculated by:
Figure 8: Inverse Document Frequency. [10]
Where N is the number of documents in the dataset and D is the document in which the term t
belongs to. Once this value has been calculated the similarity is given by:
Figure 9: Cosine similarity for computing document similarity.[10]
The cosine similarity of the document is calculated with respect to all documents in the corpus
and the top five documents are selected. The text of all the five documents is merged along with
the original document and is followed by summarization.
3.3. Summarizing the text
The next step is that of summarization. Summarization has the benefits of reducing the text size to
be analyse and also returns the important keyword's of the document. The algorithm as proposed
in [11] is as described below:
1) Segment the corpus into individual sentences.
2) Compare the intersection of a sentence with all other sentences. An intersection here means the
count of the common words in the document divided by 2(total sentences being compared).
Assign an intersection score based on the formula:
Figure 10: Formula for computing intersection score.
3) The above result is stored for each sentence in a 2-D array. What one ends up with is a graph
showing the commonness of one sentence with other sentences in the text.
4) The intersection score for all sentences is added up.
5) Add the top sentence with highest score from each paragraph to the summarized text. To
increase the accuracy cosine similarity is used:
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
59
The above algorithm when used contracts the size of the text by almost 60% on an average
without much loss of information. The first task post summarization is to firstly remove all stop
words from sentences in the summarized text. Stop words constitute of words like a, an, and the
etc. Stop words are important in the context of deciphering the meaning of the sentence in text
mining[6]. In the problem stated here stop words are not very helpful. In a text corpus words like
and, the, have a very high frequency of occurrence and should not be considered while taking
term frequency of documents, as will be discussed in the paragraph below. Post summarization
the top N(where N = selected number of words for ontology creation) words are ranked using a
TF(Term Frequency) like index. The words are ranked based on the number of times they appear
in the text compared to other words. The ontologies of these N words (in this case five, for
testing) is constructed.
Ontology construction after summarization is based on lexical relationships of important
keywords. To derive the lexical relationships of an entity WordNet is used. WordNet developed
by the Cognitive Sciences Laboratory at Princeton, consists of words ordered by their lexical
relationships, into groups of synonyms called as synsets. WordNet classifies the words into is-a
relationships. WordNet is sometimes attributed as a lexical ontology due to the
hypernym/hyponym relationships present among synsets.
The following lexical relationships are taken into consideration when constructing our word
ontology:
1) Synonyms: A synonym is a word or phrase that has exactly the same meaning of another word
in the same or another language, For example the words defer and procrastinate, mean the same
thing in the English language. Synonyms are an important lexical relation to be used since it is
important to know the words that are similar to a word being analyzed and share the same
meaning with that word.
2) Hyponym: Hyponym is a word of more specific meaning than a general or superordinate term
applicable to it. For example, spoon is a hyponym of cutlery. Therefore if the idea of concept
hierarchies is used then hyponyms of a word can be labelled as going down the concept
hierarchy. Hyponyms are useful in the sense that they can give immediate examples of a word.
For eg: The hyponyms of the word “motor car” give the result set as various types of auto-mobile
like a convertible, electric car and so on, therefore using these hyponyms we are “drilling down”
to the subclasses of a word.
3) Hypernym: Hypernym is a word or phrase whose referents form a set including as a subset the
referents of a subordinate term. For example the hypernyms of motor car would be entity,
physical object, wheeled vehicle. Therefore hypernymy can be categorized as “rolling up” the
concept hierarchy and going to the superclass of the object or the origin of the word. Using the
relationships of hypernymy and hyponym we can see that the basic criteria of object and its
relationships is being maintained while giving detailed knowledge of the word at the same time.
The relationships of the word are then ordered in XML format giving the information regarding
relationships, definition of an entity as described in an ontology. Using these ontologies in
dictionary based applications will enhance the information obtained as not only the meaning of
the word is returned but also words similar to it, words that it can originate from and words that
originate from it are retrieved.
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
60
4. TESTING
To test the module documents from different domains were taken. Documents were tested based
on the number of different lexical relations generated for the top five words in each document.
The results have been described in the table below, in the table below, the numbers corresponding
to sections - Definitions, Synonyms, Hypernyms and Hyponyms, refer to the number of lexical
relation found for each word and added as part of the ontology:
Word Definitions Synonyms Hypernyms Hyponyms
Ontology 2 1 5 0
Document 6 4 15 48
Keyword 0 0 0 0
Concept 1 3 2 28
Relations 7 19 23 50
Table 1: Results of Ontology components after running ontology module.
The above results in the table signify the total number of lexical relations that could be derived
from WordNet for the top words after summarizing the text. The total number describes the size
of each component of the ontology that has been formed.
Figure 11: Result of the ontology created.
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
61
Figure 12: Format of the ontology created.
In fig.12 the tree structured skeleton of the ontology created is displayed, the diagram is an
abstract representation of the structure each ontology that is created follows. The structure is
based on the hierarchy of the lexical relation of each word in the ontology, starting with the
domain to which the word belongs to , moving to the meaning or the definition of the word, and
then to the lexical relations of synonyms, hypernyms and hyponyms. fig.11, in contrast displays
how the actual ontology will look like.
Figure 13: Tree structure of the ontology.
5. CONCLUSION
In this paper an automated method of creating ontologies has been proposed and tested. The
system proposed performs the tasks of document classification, searching for the most similar
documents, summarizing text and creating the ontology in XML format. Document classification
helps to label any new document that is input into the system, tagging the domain to which it
belongs to. Document similarity implemented using vector space model is used to find the most
similar documents to the input document, from within the data set of documents present in the
system, leading to a richer ontology being formed after getting closely related terms to the terms
International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016
62
in the input document. Summarizing performs the function of shortening the large text thus
formed and keeping mostly relevant, required terms of the document.
The ontologies thus formed are based on lexical relations of a word i.e synonyms, concepts of
hypernymy and hyponymy, meaning of the words. In the future applications of such systems can
be used in word processing software's, pdf readers, to predict a document the user is reading and
get various, accurate details of important terms and the related terms as well.
REFERENCES
[1] Herman I, “Introduction to the Semantic Web”, Johnson & Johnson, W3C Consortium, 2009.
[2] Segaran, Toby, “Programming the Semantic Web”, O'Reilly Publications, 2009.
[3] Song, Liu, Wang and Wang, “A Novel Automatic Ontology Construction Method based
on Web Data”, Tenth International Conference on Intelligent Information Hiding and
Multimedia Signal Processing, IEEE Computer Society, 2014.
[4] Hong Sie, S, and Hua Yueh, J. , “Automatic Ontology Creation Using Schema Information”,
Proceedings of the 2006 IEEE/ACM/WIC Conference on Web Intelligence, IEEE Computer Society,
2006.
[5] Seremeti, L. and Kameas, A, “A Task-Based Ontology Approach for Novice Ontology Developers”,
Fourth Balkan Conference in Informatics, IEEE Computer Society, 2009.
[6] Hussain, F.K, Sidhu, A.S, Dhillon, T.S and Chang, E. “Engineering Trustworthy Ontologies;
Case Study of Protein Ontology”, Proceedings of the 19th IEEE Symposium on Computer-Based
Medical Systems, IEEE Computer Society, 2006.
[7] Jain, Sonal and Pareek, J, “Automatic Topic Identification from Learning Material: An
Ontological Approach”, ICCCEA, 2010.
[8] “Bayes Theorem”, Wikipedia the Free Encyclopedia.
[9] “Naive Bayes Classifier”, Wikipedia the Free Encyclopedia.
[10] “Vector Space Model”, Wikipedia the Free Encyclopedia.
[11] Russel, M, “Mining the Social Web”, O'Reilly Publications, 2013.
AUTHORS
Preeti Kathiria is an Assistant Professor in the Computer Science and Engineering Department at Nirma
University in Ahmedabad.
Sukraat Ahluwalia is a final year student in the Computer Science and Engineering Department at Nirma
University in Ahmedabad.

More Related Content

What's hot

Dynamic & Attribute Weighted KNN for Document Classification Using Bootstrap ...
Dynamic & Attribute Weighted KNN for Document Classification Using Bootstrap ...Dynamic & Attribute Weighted KNN for Document Classification Using Bootstrap ...
Dynamic & Attribute Weighted KNN for Document Classification Using Bootstrap ...
IJERA Editor
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
researchinventy
 
Feature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsFeature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documents
IJECEIAES
 
Ck32985989
Ck32985989Ck32985989
Ck32985989
IJMER
 
Ontology-based Data Integration
Ontology-based Data IntegrationOntology-based Data Integration
Ontology-based Data Integration
Janna Hastings
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
butest
 
Text Mining: (Asynchronous Sequences)
Text Mining: (Asynchronous Sequences)Text Mining: (Asynchronous Sequences)
Text Mining: (Asynchronous Sequences)
IJERA Editor
 
Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...
Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...
Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...
kevig
 
DATA MINING.doc
DATA MINING.docDATA MINING.doc
DATA MINING.doc
butest
 
A Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsA Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie Reviews
Editor IJMTER
 
Review: Semi-Supervised Learning Methods for Word Sense Disambiguation
Review: Semi-Supervised Learning Methods for Word Sense DisambiguationReview: Semi-Supervised Learning Methods for Word Sense Disambiguation
Review: Semi-Supervised Learning Methods for Word Sense Disambiguation
IOSR Journals
 
Analysis of Classification Algorithm in Data Mining
Analysis of Classification Algorithm in Data MiningAnalysis of Classification Algorithm in Data Mining
Analysis of Classification Algorithm in Data Mining
ijdmtaiir
 
Named entity recognition using web document corpus
Named entity recognition using web document corpusNamed entity recognition using web document corpus
Named entity recognition using web document corpus
IJMIT JOURNAL
 
Named Entity Recognition Using Web Document Corpus
Named Entity Recognition Using Web Document CorpusNamed Entity Recognition Using Web Document Corpus
Named Entity Recognition Using Web Document Corpus
IJMIT JOURNAL
 
Dc32644652
Dc32644652Dc32644652
Dc32644652
IJERA Editor
 
Sentimental classification analysis of polarity multi-view textual data using...
Sentimental classification analysis of polarity multi-view textual data using...Sentimental classification analysis of polarity multi-view textual data using...
Sentimental classification analysis of polarity multi-view textual data using...
IJECEIAES
 
Concept integration using edit distance and n gram match
Concept integration using edit distance and n gram match Concept integration using edit distance and n gram match
Concept integration using edit distance and n gram match
ijdms
 

What's hot (17)

Dynamic & Attribute Weighted KNN for Document Classification Using Bootstrap ...
Dynamic & Attribute Weighted KNN for Document Classification Using Bootstrap ...Dynamic & Attribute Weighted KNN for Document Classification Using Bootstrap ...
Dynamic & Attribute Weighted KNN for Document Classification Using Bootstrap ...
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Feature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documentsFeature selection, optimization and clustering strategies of text documents
Feature selection, optimization and clustering strategies of text documents
 
Ck32985989
Ck32985989Ck32985989
Ck32985989
 
Ontology-based Data Integration
Ontology-based Data IntegrationOntology-based Data Integration
Ontology-based Data Integration
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Text Mining: (Asynchronous Sequences)
Text Mining: (Asynchronous Sequences)Text Mining: (Asynchronous Sequences)
Text Mining: (Asynchronous Sequences)
 
Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...
Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...
Top 5 MOST VIEWED LANGUAGE COMPUTING ARTICLE - International Journal on Natur...
 
DATA MINING.doc
DATA MINING.docDATA MINING.doc
DATA MINING.doc
 
A Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie ReviewsA Survey on Sentiment Categorization of Movie Reviews
A Survey on Sentiment Categorization of Movie Reviews
 
Review: Semi-Supervised Learning Methods for Word Sense Disambiguation
Review: Semi-Supervised Learning Methods for Word Sense DisambiguationReview: Semi-Supervised Learning Methods for Word Sense Disambiguation
Review: Semi-Supervised Learning Methods for Word Sense Disambiguation
 
Analysis of Classification Algorithm in Data Mining
Analysis of Classification Algorithm in Data MiningAnalysis of Classification Algorithm in Data Mining
Analysis of Classification Algorithm in Data Mining
 
Named entity recognition using web document corpus
Named entity recognition using web document corpusNamed entity recognition using web document corpus
Named entity recognition using web document corpus
 
Named Entity Recognition Using Web Document Corpus
Named Entity Recognition Using Web Document CorpusNamed Entity Recognition Using Web Document Corpus
Named Entity Recognition Using Web Document Corpus
 
Dc32644652
Dc32644652Dc32644652
Dc32644652
 
Sentimental classification analysis of polarity multi-view textual data using...
Sentimental classification analysis of polarity multi-view textual data using...Sentimental classification analysis of polarity multi-view textual data using...
Sentimental classification analysis of polarity multi-view textual data using...
 
Concept integration using edit distance and n gram match
Concept integration using edit distance and n gram match Concept integration using edit distance and n gram match
Concept integration using edit distance and n gram match
 

Similar to A Naive Method For Ontology Construction

A N E XTENSION OF P ROTÉGÉ FOR AN AUTOMA TIC F UZZY - O NTOLOGY BUILDING U...
A N  E XTENSION OF  P ROTÉGÉ FOR AN AUTOMA TIC  F UZZY - O NTOLOGY BUILDING U...A N  E XTENSION OF  P ROTÉGÉ FOR AN AUTOMA TIC  F UZZY - O NTOLOGY BUILDING U...
A N E XTENSION OF P ROTÉGÉ FOR AN AUTOMA TIC F UZZY - O NTOLOGY BUILDING U...
ijcsit
 
Towards Ontology Development Based on Relational Database
Towards Ontology Development Based on Relational DatabaseTowards Ontology Development Based on Relational Database
Towards Ontology Development Based on Relational Database
ijbuiiir1
 
IRJET - Deep Collaborrative Filtering with Aspect Information
IRJET - Deep Collaborrative Filtering with Aspect InformationIRJET - Deep Collaborrative Filtering with Aspect Information
IRJET - Deep Collaborrative Filtering with Aspect Information
IRJET Journal
 
Enhancing Semantic Mining
Enhancing Semantic MiningEnhancing Semantic Mining
Enhancing Semantic Mining
Santhosh Kumar
 
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
cscpconf
 
SEARCH OF INFORMATION BASED CONTENT IN SEMI-STRUCTURED DOCUMENTS USING INTERF...
SEARCH OF INFORMATION BASED CONTENT IN SEMI-STRUCTURED DOCUMENTS USING INTERF...SEARCH OF INFORMATION BASED CONTENT IN SEMI-STRUCTURED DOCUMENTS USING INTERF...
SEARCH OF INFORMATION BASED CONTENT IN SEMI-STRUCTURED DOCUMENTS USING INTERF...
ijcsitcejournal
 
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATIONONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
IJwest
 
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR  ONTOLOGY APPLICATION ONTOLOGY SERVICE CENTER: A DATAHUB FOR  ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
dannyijwest
 
Recruitment Based On Ontology with Enhanced Security Features
Recruitment Based On Ontology with Enhanced Security FeaturesRecruitment Based On Ontology with Enhanced Security Features
Recruitment Based On Ontology with Enhanced Security Features
theijes
 
Adaptive named entity recognition for social network analysis and domain onto...
Adaptive named entity recognition for social network analysis and domain onto...Adaptive named entity recognition for social network analysis and domain onto...
Adaptive named entity recognition for social network analysis and domain onto...
Cuong Tran Van
 
A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...
ijcnes
 
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
IJwest
 
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTSA PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
ijsc
 
A Proposed Multi-Domain Approach for Automatic Classification of Text Documents
A Proposed Multi-Domain Approach for Automatic Classification of Text Documents A Proposed Multi-Domain Approach for Automatic Classification of Text Documents
A Proposed Multi-Domain Approach for Automatic Classification of Text Documents
ijsc
 
Swoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic SearchSwoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic Search
IDES Editor
 
Bibliography (Microsoft Word, 61k)
Bibliography (Microsoft Word, 61k)Bibliography (Microsoft Word, 61k)
Bibliography (Microsoft Word, 61k)
butest
 
A category theoretic model of rdf ontology
A category theoretic model of rdf ontologyA category theoretic model of rdf ontology
A category theoretic model of rdf ontology
IJwest
 
The Revolution Of Cloud Computing
The Revolution Of Cloud ComputingThe Revolution Of Cloud Computing
The Revolution Of Cloud Computing
Carmen Sanborn
 
SWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalSWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professional
gowthamnaidu0986
 
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONSONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
sipij
 

Similar to A Naive Method For Ontology Construction (20)

A N E XTENSION OF P ROTÉGÉ FOR AN AUTOMA TIC F UZZY - O NTOLOGY BUILDING U...
A N  E XTENSION OF  P ROTÉGÉ FOR AN AUTOMA TIC  F UZZY - O NTOLOGY BUILDING U...A N  E XTENSION OF  P ROTÉGÉ FOR AN AUTOMA TIC  F UZZY - O NTOLOGY BUILDING U...
A N E XTENSION OF P ROTÉGÉ FOR AN AUTOMA TIC F UZZY - O NTOLOGY BUILDING U...
 
Towards Ontology Development Based on Relational Database
Towards Ontology Development Based on Relational DatabaseTowards Ontology Development Based on Relational Database
Towards Ontology Development Based on Relational Database
 
IRJET - Deep Collaborrative Filtering with Aspect Information
IRJET - Deep Collaborrative Filtering with Aspect InformationIRJET - Deep Collaborrative Filtering with Aspect Information
IRJET - Deep Collaborrative Filtering with Aspect Information
 
Enhancing Semantic Mining
Enhancing Semantic MiningEnhancing Semantic Mining
Enhancing Semantic Mining
 
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
 
SEARCH OF INFORMATION BASED CONTENT IN SEMI-STRUCTURED DOCUMENTS USING INTERF...
SEARCH OF INFORMATION BASED CONTENT IN SEMI-STRUCTURED DOCUMENTS USING INTERF...SEARCH OF INFORMATION BASED CONTENT IN SEMI-STRUCTURED DOCUMENTS USING INTERF...
SEARCH OF INFORMATION BASED CONTENT IN SEMI-STRUCTURED DOCUMENTS USING INTERF...
 
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATIONONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
 
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR  ONTOLOGY APPLICATION ONTOLOGY SERVICE CENTER: A DATAHUB FOR  ONTOLOGY APPLICATION
ONTOLOGY SERVICE CENTER: A DATAHUB FOR ONTOLOGY APPLICATION
 
Recruitment Based On Ontology with Enhanced Security Features
Recruitment Based On Ontology with Enhanced Security FeaturesRecruitment Based On Ontology with Enhanced Security Features
Recruitment Based On Ontology with Enhanced Security Features
 
Adaptive named entity recognition for social network analysis and domain onto...
Adaptive named entity recognition for social network analysis and domain onto...Adaptive named entity recognition for social network analysis and domain onto...
Adaptive named entity recognition for social network analysis and domain onto...
 
A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...
 
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
Towards From Manual to Automatic Semantic Annotation: Based on Ontology Eleme...
 
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTSA PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
A PROPOSED MULTI-DOMAIN APPROACH FOR AUTOMATIC CLASSIFICATION OF TEXT DOCUMENTS
 
A Proposed Multi-Domain Approach for Automatic Classification of Text Documents
A Proposed Multi-Domain Approach for Automatic Classification of Text Documents A Proposed Multi-Domain Approach for Automatic Classification of Text Documents
A Proposed Multi-Domain Approach for Automatic Classification of Text Documents
 
Swoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic SearchSwoogle: Showcasing the Significance of Semantic Search
Swoogle: Showcasing the Significance of Semantic Search
 
Bibliography (Microsoft Word, 61k)
Bibliography (Microsoft Word, 61k)Bibliography (Microsoft Word, 61k)
Bibliography (Microsoft Word, 61k)
 
A category theoretic model of rdf ontology
A category theoretic model of rdf ontologyA category theoretic model of rdf ontology
A category theoretic model of rdf ontology
 
The Revolution Of Cloud Computing
The Revolution Of Cloud ComputingThe Revolution Of Cloud Computing
The Revolution Of Cloud Computing
 
SWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professionalSWSN UNIT-3.pptx we can information about swsn professional
SWSN UNIT-3.pptx we can information about swsn professional
 
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONSONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
 

More from IJSCAI Journal

Design and Implementation of Smart Cooking Based on Amazon Echo
Design and Implementation of Smart Cooking Based on Amazon EchoDesign and Implementation of Smart Cooking Based on Amazon Echo
Design and Implementation of Smart Cooking Based on Amazon Echo
IJSCAI Journal
 
Forecasting Macroeconomical Indices with Machine Learning : Impartial Analysi...
Forecasting Macroeconomical Indices with Machine Learning : Impartial Analysi...Forecasting Macroeconomical Indices with Machine Learning : Impartial Analysi...
Forecasting Macroeconomical Indices with Machine Learning : Impartial Analysi...
IJSCAI Journal
 
Intelligent Electrical Multi Outlets Controlled and Activated by a Data Minin...
Intelligent Electrical Multi Outlets Controlled and Activated by a Data Minin...Intelligent Electrical Multi Outlets Controlled and Activated by a Data Minin...
Intelligent Electrical Multi Outlets Controlled and Activated by a Data Minin...
IJSCAI Journal
 
Nov 2018 Table of contents; current issue -International Journal on Soft Comp...
Nov 2018 Table of contents; current issue -International Journal on Soft Comp...Nov 2018 Table of contents; current issue -International Journal on Soft Comp...
Nov 2018 Table of contents; current issue -International Journal on Soft Comp...
IJSCAI Journal
 
6th international conference on artificial intelligence and applications (aia...
6th international conference on artificial intelligence and applications (aia...6th international conference on artificial intelligence and applications (aia...
6th international conference on artificial intelligence and applications (aia...
IJSCAI Journal
 
Image Generation from Caption
Image Generation from Caption Image Generation from Caption
Image Generation from Caption
IJSCAI Journal
 
Temporally Extended Actions For Reinforcement Learning Based Schedulers
Temporally Extended Actions For Reinforcement Learning Based Schedulers Temporally Extended Actions For Reinforcement Learning Based Schedulers
Temporally Extended Actions For Reinforcement Learning Based Schedulers
IJSCAI Journal
 
Knowledgebase Systems in Neuro Science - A Study
Knowledgebase Systems in Neuro Science - A StudyKnowledgebase Systems in Neuro Science - A Study
Knowledgebase Systems in Neuro Science - A Study
IJSCAI Journal
 
An Iranian Cash Recognition Assistance System For Visually Impaireds
An Iranian Cash Recognition Assistance System For Visually Impaireds An Iranian Cash Recognition Assistance System For Visually Impaireds
An Iranian Cash Recognition Assistance System For Visually Impaireds
IJSCAI Journal
 
An Experimental Study of Feature Extraction Techniques in Opinion Mining
An Experimental Study of Feature Extraction Techniques in Opinion MiningAn Experimental Study of Feature Extraction Techniques in Opinion Mining
An Experimental Study of Feature Extraction Techniques in Opinion Mining
IJSCAI Journal
 
Monte-Carlo Tree Search For The "Mr Jack" Board Game
Monte-Carlo Tree Search For The "Mr Jack" Board Game Monte-Carlo Tree Search For The "Mr Jack" Board Game
Monte-Carlo Tree Search For The "Mr Jack" Board Game
IJSCAI Journal
 
Unsupervised learning models of invariant features in images: Recent developm...
Unsupervised learning models of invariant features in images: Recent developm...Unsupervised learning models of invariant features in images: Recent developm...
Unsupervised learning models of invariant features in images: Recent developm...
IJSCAI Journal
 
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
IJSCAI Journal
 
A Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQLA Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQL
IJSCAI Journal
 
A Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQLA Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQL
IJSCAI Journal
 
Estimation Of The Parameters Of Solar Cells From Current-Voltage Characterist...
Estimation Of The Parameters Of Solar Cells From Current-Voltage Characterist...Estimation Of The Parameters Of Solar Cells From Current-Voltage Characterist...
Estimation Of The Parameters Of Solar Cells From Current-Voltage Characterist...
IJSCAI Journal
 
Implementation of Folksonomy Based Tag Cloud Model for Information Retrieval ...
Implementation of Folksonomy Based Tag Cloud Model for Information Retrieval ...Implementation of Folksonomy Based Tag Cloud Model for Information Retrieval ...
Implementation of Folksonomy Based Tag Cloud Model for Information Retrieval ...
IJSCAI Journal
 
Study of Distance Measurement Techniques in Context to Prediction Model of We...
Study of Distance Measurement Techniques in Context to Prediction Model of We...Study of Distance Measurement Techniques in Context to Prediction Model of We...
Study of Distance Measurement Techniques in Context to Prediction Model of We...
IJSCAI Journal
 
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATAA BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
IJSCAI Journal
 
Design of Dual Axis Solar Tracker System Based on Fuzzy Inference Systems
Design of Dual Axis Solar Tracker System Based on Fuzzy Inference SystemsDesign of Dual Axis Solar Tracker System Based on Fuzzy Inference Systems
Design of Dual Axis Solar Tracker System Based on Fuzzy Inference Systems
IJSCAI Journal
 

More from IJSCAI Journal (20)

Design and Implementation of Smart Cooking Based on Amazon Echo
Design and Implementation of Smart Cooking Based on Amazon EchoDesign and Implementation of Smart Cooking Based on Amazon Echo
Design and Implementation of Smart Cooking Based on Amazon Echo
 
Forecasting Macroeconomical Indices with Machine Learning : Impartial Analysi...
Forecasting Macroeconomical Indices with Machine Learning : Impartial Analysi...Forecasting Macroeconomical Indices with Machine Learning : Impartial Analysi...
Forecasting Macroeconomical Indices with Machine Learning : Impartial Analysi...
 
Intelligent Electrical Multi Outlets Controlled and Activated by a Data Minin...
Intelligent Electrical Multi Outlets Controlled and Activated by a Data Minin...Intelligent Electrical Multi Outlets Controlled and Activated by a Data Minin...
Intelligent Electrical Multi Outlets Controlled and Activated by a Data Minin...
 
Nov 2018 Table of contents; current issue -International Journal on Soft Comp...
Nov 2018 Table of contents; current issue -International Journal on Soft Comp...Nov 2018 Table of contents; current issue -International Journal on Soft Comp...
Nov 2018 Table of contents; current issue -International Journal on Soft Comp...
 
6th international conference on artificial intelligence and applications (aia...
6th international conference on artificial intelligence and applications (aia...6th international conference on artificial intelligence and applications (aia...
6th international conference on artificial intelligence and applications (aia...
 
Image Generation from Caption
Image Generation from Caption Image Generation from Caption
Image Generation from Caption
 
Temporally Extended Actions For Reinforcement Learning Based Schedulers
Temporally Extended Actions For Reinforcement Learning Based Schedulers Temporally Extended Actions For Reinforcement Learning Based Schedulers
Temporally Extended Actions For Reinforcement Learning Based Schedulers
 
Knowledgebase Systems in Neuro Science - A Study
Knowledgebase Systems in Neuro Science - A StudyKnowledgebase Systems in Neuro Science - A Study
Knowledgebase Systems in Neuro Science - A Study
 
An Iranian Cash Recognition Assistance System For Visually Impaireds
An Iranian Cash Recognition Assistance System For Visually Impaireds An Iranian Cash Recognition Assistance System For Visually Impaireds
An Iranian Cash Recognition Assistance System For Visually Impaireds
 
An Experimental Study of Feature Extraction Techniques in Opinion Mining
An Experimental Study of Feature Extraction Techniques in Opinion MiningAn Experimental Study of Feature Extraction Techniques in Opinion Mining
An Experimental Study of Feature Extraction Techniques in Opinion Mining
 
Monte-Carlo Tree Search For The "Mr Jack" Board Game
Monte-Carlo Tree Search For The "Mr Jack" Board Game Monte-Carlo Tree Search For The "Mr Jack" Board Game
Monte-Carlo Tree Search For The "Mr Jack" Board Game
 
Unsupervised learning models of invariant features in images: Recent developm...
Unsupervised learning models of invariant features in images: Recent developm...Unsupervised learning models of invariant features in images: Recent developm...
Unsupervised learning models of invariant features in images: Recent developm...
 
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
Big Data Analytics: Challenges And Applications For Text, Audio, Video, And S...
 
A Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQLA Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQL
 
A Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQLA Study on Graph Storage Database of NOSQL
A Study on Graph Storage Database of NOSQL
 
Estimation Of The Parameters Of Solar Cells From Current-Voltage Characterist...
Estimation Of The Parameters Of Solar Cells From Current-Voltage Characterist...Estimation Of The Parameters Of Solar Cells From Current-Voltage Characterist...
Estimation Of The Parameters Of Solar Cells From Current-Voltage Characterist...
 
Implementation of Folksonomy Based Tag Cloud Model for Information Retrieval ...
Implementation of Folksonomy Based Tag Cloud Model for Information Retrieval ...Implementation of Folksonomy Based Tag Cloud Model for Information Retrieval ...
Implementation of Folksonomy Based Tag Cloud Model for Information Retrieval ...
 
Study of Distance Measurement Techniques in Context to Prediction Model of We...
Study of Distance Measurement Techniques in Context to Prediction Model of We...Study of Distance Measurement Techniques in Context to Prediction Model of We...
Study of Distance Measurement Techniques in Context to Prediction Model of We...
 
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATAA BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
A BINARY BAT INSPIRED ALGORITHM FOR THE CLASSIFICATION OF BREAST CANCER DATA
 
Design of Dual Axis Solar Tracker System Based on Fuzzy Inference Systems
Design of Dual Axis Solar Tracker System Based on Fuzzy Inference SystemsDesign of Dual Axis Solar Tracker System Based on Fuzzy Inference Systems
Design of Dual Axis Solar Tracker System Based on Fuzzy Inference Systems
 

Recently uploaded

OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
David Brossard
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
jpupo2018
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
Wouter Lemaire
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
DianaGray10
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 

Recently uploaded (20)

OpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - AuthorizationOpenID AuthZEN Interop Read Out - Authorization
OpenID AuthZEN Interop Read Out - Authorization
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
 
UI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentationUI5 Controls simplified - UI5con2024 presentation
UI5 Controls simplified - UI5con2024 presentation
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6UiPath Test Automation using UiPath Test Suite series, part 6
UiPath Test Automation using UiPath Test Suite series, part 6
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 

A Naive Method For Ontology Construction

  • 1. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 DOI :10.5121/ijscai.2016.5106 53 A NAIVE METHOD FOR ONTOLOGY CONSTRUCTION Preeti Kathiria 1 and Sukraat Ahluwalia 2 1 Assistant Professor, Department of Computer Science and Engineering, Nirma University, Gujarat, India 2 Student, Department of Computer Science and Engineering, Nirma University, Gujarat, India ABSTRACT Ontologies are being used to organize information in many domains like artificial intelligence, information science, semantic web, library science. Ontologies of an entity having different information can be merged to create more knowledge of that particular entity. Ontologies today are powering more accurate search and retrieval in websites like Wikipedia etc. As we move towards the future to Web 3.0, also termed as the semantic web, ontologies will play a more important role. Ontologies are represented in various forms like RDF, RDFS, XML, OWL etc. Querying ontologies can yield basic information about an entity. This paper proposes an automated method for ontology creation, using concepts from NLP (Natural Language Processing), Information Retrieval and Machine Learning. Concepts drawn from these domains help in designing more accurate ontologies represented using the XML format. This paper uses document classification using classification algorithms for assigning labels to documents, document similarity to cluster similar documents to the input document, together, and summarization to shorten the text and keep important terms essential in making the ontology. The module is constructed using the Python programming language and NLTK (Natural Language Toolkit). The ontologies created in XML will convey to a lay person the definition of the important term's and their lexical relationships. KEYWORDS Ontology, Semantic Web, Domain Ontology, Ontology Construction, Classification, Document Similarity. 1. INTRODUCTION Ontology is primarily a philosophical concept, the philosophical definition of ontology is “The study of the nature of existence of an entity and it's relationships”. Computer Science extends this definition of ontology by adding a taxonomy. The taxonomy can is based on the entity. A good example of the ontology of an entity is given below from a discussion on semantic web.
  • 2. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 54 Figure 1: Ontological representation of the English version of the book.[1] The above picture is an ontology constructed for the book “The Glass Palace” written by Amitav Ghosh. The ontology is visualized as a graph where each of the parts of an entity, in this case the book, such as the city (London), Authors' name (Amitav Ghosh), all form the nodes of the graph. An RDF triple representation of the ontology as illustrated in [1] would be: Figure 2: RDF representation of the ontology.[1] Querying the information above can give vital information like the author of the book, the publisher etc. With ontology as a method for organizing information, extraction of basic information of an entity becomes much more easier. Now consider another ontology of the book but with a different author for the translated version of the book. The graph as shown in [1] would be: Figure 3: Ontological representation of French version of the book.[1]
  • 3. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 55 Since the two ontologies above have the same URI, they can be merged to form a singular ontology of the entity, containing more information now. Querying the new ontology now yields more information than before. A merged ontology would be diagrammatically represented as illustrated in: Figure 4: Merged Ontology.[1] The advantages of using ontologies can be summarized from [2]: Ontologies provide a specific vocabulary with which knowledge can be represented. This vocabulary allows us to specify which entities will be represented, how they will be represented, how they can be grouped, and what relationships connect them together. The vocabulary can be seen as a sort of social contract between a data producer and a data consumer. The more accurate the ontology, the greater the potential understanding of how data can be used, If an ontology is complex, including relationships a data consumer doesn't require then it can become confusing, complicated, and difficult to use and maintain. Ontologies allow us to express the formal rules for inference. When software reads our ontology, it should have all the information necessary to draw the same conclusions from our data that we did. When actions of software are consistent with the rules expressed in ontology, we say that the software has made an ontological commitment. In object-oriented programming (OOP), it is easy to forget how many rules the system is handling for us. For example, a subclass in an OO system contains its parent's members (methods and variables) because, somewhere in the system, software asserted that having access to parent members is consistent with what it means to be an OO subclass. By design, semantic systems using ontology, leave it up to you to build or invoke the necessary software to make inferences based on that model. 2. RELATED WORK Researchers have tried to construct ontologies through different means: 1) Manually using tools like Protege, OWL etc. 2) Using automated methods drawing techniques from areas like NLP, Machine Learning, and Information Retrieval etc. An example that follows the second technique is employed by Q.Song, J. Liu, X. Wang and J.Wang [3] , employs techniques from data mining and information retrieval. It uses K Means
  • 4. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 56 clustering to construct the feature vector using tf-idf of each page. They further use Bayesian classifiers on sample data sets to construct domain ontologies. S.Hong Sie and J.Hua Yeh[4], have proposed an automated construction method based on a hierarchy generation process with a guided concept clustering method(GCC), to reduce human effort in maintaining large knowledge networks. Among the methods that propose creation of ontology using manual methods, the method proposed by L.Seremati and A. Kameas[5], is targeted towards novice ontology developers with little experience in developing ontologies. The ontology developer in their method, has experience in the domain for which ontology is being made, the ontology construction is done by re-using pre-existing ontologies. The second case in manual construction is domain based in the area of biomedical ontologies and has been proposed by F.K Hussain,A.S Sidhu, T.S Dhillon, E. Chang[6]. The method takes a case study of protein ontologies, constructing ontologies by taking protein data from existing data sources. The design of the ontologies is distributed across sections like SubClass,ATOMsequence of protein. The final method of manual construction proposed by S. Jain[7] takes into account the relation of a topic to other topic, for eg. a topic of computer science may also be related to domains of Mathematics, Statistics, Electrical Engineering etc. The method uses OWL to construct domain ontologies of various topics and sub-topics covered in a document. 3. ONTOLOGY CONSTRUCTION MODULE 3.1. Document Classification The very first step in the construction phase is classifying the type of the document. To construct a domain ontology, it is important that a label type be assigned to the document, so that the correct relation can be assigned. To do classification in the construction phase Naive-Bayes classification shall be used. 3.1.1. Naive-Bayes Classification Naive Bayes classification per its name works on bayesian probability. Bayes formula can be represented as: Figure 5: Bayes Formula.[8] P(A) and P(B) are the the probabilities of A and B without regard to each other. P(A|B) is the probability of occurrence of A given that B is true. In Naive Bayes classification the principle applied is that every label has a say. This means that out of the set of possible labels for an entity, the probabilities of the entity belonging to all labels is calculated using the above stated formula. Therefore if an entity could belong to N possible labels represented by a vector x = (x1,...,xn), , then the formula for classification using Naive Bayes can be represented as: Figure 6: Formula for Naïve Bayes classification.[9]
  • 5. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 57 The above formula is suitable when the data is discrete. If the data is continuous for example the classifier might have to identify the gender based on real attributes like height, weight, foot size, attributes which can be in floating point format, the above formula will not work. In cases like that, values of each class are distributed according to a Gaussian distribution, given by the formula: Figure 7: Naive-Bayes for continuous data.[8] Naive Bayes is a lazy learner among classifiers. Naive Bayes requires to be first run against a set of training data where it calculates probabilities for different features that belong to particular labels. Once that is done, the data to be classified is fed in and the probabilities for its features are calculated for each label, using the results of the training set. The label which has the maximum probability for occurrence, is assigned to the data. 3.1.2. Using Naive-Bayes to classify documents The aim of the Ontology construction module is to classify a document as belonging to either Mathematics or Computer Science. At first a training set of documents belonging to both domains are input to the classifier, the label of the document is also input to the classifier. The classifier uses a feature vector, W = (W1,....,Wn), to represent words of the document. The probability of the word occurring is mapped to the label of the training set documents. Once this process has been done, the document to which the label has been assigned is split into the same feature vector as the training set documents and its probabilities are calculated based on the results of the training set for both domains - Mathematics and Computer Science. The domain which has the higher probability, is the label that is assigned to the document. Once classification of a document is complete, the next step is to search the most similar documents. 3.2. Document Similarity After classification, the next step is to search for documents that are similar to the document for which the ontology will be constructed. The method to be used is the vector space model. 3.2.1. Vector Space Model The vector space model begins by creating the bag of words of two or more documents. Considering two documents A and B, the bag of words of both documents is a vector of the collection of unique words in both the documents. A vector V = (V1,...,Vn), is constructed for both documents, the vector constitutes of the count of occurrence of each word in the documents that corresponds to the bag of words. The vector can also be called a term frequency vector, abbreviated as tf, The next step is to calculate the inverse document frequency or idf , of the documents. The inverse document frequency of a document is a measure of how important the word is to the document, whether it is common or rare across a set of documents. The idf of a
  • 6. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 58 document is useful in filtering out stop words, words that occur very frequently but do not add much meaning to the document. The inverse document frequency can be calculated by: Figure 8: Inverse Document Frequency. [10] Where N is the number of documents in the dataset and D is the document in which the term t belongs to. Once this value has been calculated the similarity is given by: Figure 9: Cosine similarity for computing document similarity.[10] The cosine similarity of the document is calculated with respect to all documents in the corpus and the top five documents are selected. The text of all the five documents is merged along with the original document and is followed by summarization. 3.3. Summarizing the text The next step is that of summarization. Summarization has the benefits of reducing the text size to be analyse and also returns the important keyword's of the document. The algorithm as proposed in [11] is as described below: 1) Segment the corpus into individual sentences. 2) Compare the intersection of a sentence with all other sentences. An intersection here means the count of the common words in the document divided by 2(total sentences being compared). Assign an intersection score based on the formula: Figure 10: Formula for computing intersection score. 3) The above result is stored for each sentence in a 2-D array. What one ends up with is a graph showing the commonness of one sentence with other sentences in the text. 4) The intersection score for all sentences is added up. 5) Add the top sentence with highest score from each paragraph to the summarized text. To increase the accuracy cosine similarity is used:
  • 7. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 59 The above algorithm when used contracts the size of the text by almost 60% on an average without much loss of information. The first task post summarization is to firstly remove all stop words from sentences in the summarized text. Stop words constitute of words like a, an, and the etc. Stop words are important in the context of deciphering the meaning of the sentence in text mining[6]. In the problem stated here stop words are not very helpful. In a text corpus words like and, the, have a very high frequency of occurrence and should not be considered while taking term frequency of documents, as will be discussed in the paragraph below. Post summarization the top N(where N = selected number of words for ontology creation) words are ranked using a TF(Term Frequency) like index. The words are ranked based on the number of times they appear in the text compared to other words. The ontologies of these N words (in this case five, for testing) is constructed. Ontology construction after summarization is based on lexical relationships of important keywords. To derive the lexical relationships of an entity WordNet is used. WordNet developed by the Cognitive Sciences Laboratory at Princeton, consists of words ordered by their lexical relationships, into groups of synonyms called as synsets. WordNet classifies the words into is-a relationships. WordNet is sometimes attributed as a lexical ontology due to the hypernym/hyponym relationships present among synsets. The following lexical relationships are taken into consideration when constructing our word ontology: 1) Synonyms: A synonym is a word or phrase that has exactly the same meaning of another word in the same or another language, For example the words defer and procrastinate, mean the same thing in the English language. Synonyms are an important lexical relation to be used since it is important to know the words that are similar to a word being analyzed and share the same meaning with that word. 2) Hyponym: Hyponym is a word of more specific meaning than a general or superordinate term applicable to it. For example, spoon is a hyponym of cutlery. Therefore if the idea of concept hierarchies is used then hyponyms of a word can be labelled as going down the concept hierarchy. Hyponyms are useful in the sense that they can give immediate examples of a word. For eg: The hyponyms of the word “motor car” give the result set as various types of auto-mobile like a convertible, electric car and so on, therefore using these hyponyms we are “drilling down” to the subclasses of a word. 3) Hypernym: Hypernym is a word or phrase whose referents form a set including as a subset the referents of a subordinate term. For example the hypernyms of motor car would be entity, physical object, wheeled vehicle. Therefore hypernymy can be categorized as “rolling up” the concept hierarchy and going to the superclass of the object or the origin of the word. Using the relationships of hypernymy and hyponym we can see that the basic criteria of object and its relationships is being maintained while giving detailed knowledge of the word at the same time. The relationships of the word are then ordered in XML format giving the information regarding relationships, definition of an entity as described in an ontology. Using these ontologies in dictionary based applications will enhance the information obtained as not only the meaning of the word is returned but also words similar to it, words that it can originate from and words that originate from it are retrieved.
  • 8. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 60 4. TESTING To test the module documents from different domains were taken. Documents were tested based on the number of different lexical relations generated for the top five words in each document. The results have been described in the table below, in the table below, the numbers corresponding to sections - Definitions, Synonyms, Hypernyms and Hyponyms, refer to the number of lexical relation found for each word and added as part of the ontology: Word Definitions Synonyms Hypernyms Hyponyms Ontology 2 1 5 0 Document 6 4 15 48 Keyword 0 0 0 0 Concept 1 3 2 28 Relations 7 19 23 50 Table 1: Results of Ontology components after running ontology module. The above results in the table signify the total number of lexical relations that could be derived from WordNet for the top words after summarizing the text. The total number describes the size of each component of the ontology that has been formed. Figure 11: Result of the ontology created.
  • 9. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 61 Figure 12: Format of the ontology created. In fig.12 the tree structured skeleton of the ontology created is displayed, the diagram is an abstract representation of the structure each ontology that is created follows. The structure is based on the hierarchy of the lexical relation of each word in the ontology, starting with the domain to which the word belongs to , moving to the meaning or the definition of the word, and then to the lexical relations of synonyms, hypernyms and hyponyms. fig.11, in contrast displays how the actual ontology will look like. Figure 13: Tree structure of the ontology. 5. CONCLUSION In this paper an automated method of creating ontologies has been proposed and tested. The system proposed performs the tasks of document classification, searching for the most similar documents, summarizing text and creating the ontology in XML format. Document classification helps to label any new document that is input into the system, tagging the domain to which it belongs to. Document similarity implemented using vector space model is used to find the most similar documents to the input document, from within the data set of documents present in the system, leading to a richer ontology being formed after getting closely related terms to the terms
  • 10. International Journal on Soft Computing, Artificial Intelligence and Applications (IJSCAI), Vol.5, No.1, February 2016 62 in the input document. Summarizing performs the function of shortening the large text thus formed and keeping mostly relevant, required terms of the document. The ontologies thus formed are based on lexical relations of a word i.e synonyms, concepts of hypernymy and hyponymy, meaning of the words. In the future applications of such systems can be used in word processing software's, pdf readers, to predict a document the user is reading and get various, accurate details of important terms and the related terms as well. REFERENCES [1] Herman I, “Introduction to the Semantic Web”, Johnson & Johnson, W3C Consortium, 2009. [2] Segaran, Toby, “Programming the Semantic Web”, O'Reilly Publications, 2009. [3] Song, Liu, Wang and Wang, “A Novel Automatic Ontology Construction Method based on Web Data”, Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IEEE Computer Society, 2014. [4] Hong Sie, S, and Hua Yueh, J. , “Automatic Ontology Creation Using Schema Information”, Proceedings of the 2006 IEEE/ACM/WIC Conference on Web Intelligence, IEEE Computer Society, 2006. [5] Seremeti, L. and Kameas, A, “A Task-Based Ontology Approach for Novice Ontology Developers”, Fourth Balkan Conference in Informatics, IEEE Computer Society, 2009. [6] Hussain, F.K, Sidhu, A.S, Dhillon, T.S and Chang, E. “Engineering Trustworthy Ontologies; Case Study of Protein Ontology”, Proceedings of the 19th IEEE Symposium on Computer-Based Medical Systems, IEEE Computer Society, 2006. [7] Jain, Sonal and Pareek, J, “Automatic Topic Identification from Learning Material: An Ontological Approach”, ICCCEA, 2010. [8] “Bayes Theorem”, Wikipedia the Free Encyclopedia. [9] “Naive Bayes Classifier”, Wikipedia the Free Encyclopedia. [10] “Vector Space Model”, Wikipedia the Free Encyclopedia. [11] Russel, M, “Mining the Social Web”, O'Reilly Publications, 2013. AUTHORS Preeti Kathiria is an Assistant Professor in the Computer Science and Engineering Department at Nirma University in Ahmedabad. Sukraat Ahluwalia is a final year student in the Computer Science and Engineering Department at Nirma University in Ahmedabad.