SlideShare a Scribd company logo
1 of 10
Download to read offline
International Journal of Electrical and Computer Engineering (IJECE)
Vol. 8, No. 1, February 2018, pp. 1~10
ISSN: 2088-8708, DOI: 10.11591/ijece.v8i1.pp1-10  1
Journal homepage: http://iaescore.com/journals/index.php/IJECE
MalayIK: An Ontological Approach to Knowledge
Transformation in Malay Unstructured Documents
Fatimah Sidi1
, Iskandar Ishak2
, Marzanah A. Jabar3
1,2
Department of Computer Science, Faculty of Computer Science and Information Technology,
Universiti Putra Malaysia, Malaysia
3
Department of Software Engineering and Information System,
Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Malaysia
Article Info ABSTRACT
Article history:
Received Feb 17, 2017
Revised Jul 13, 2017
Accepted Nov 29, 2017
The number of unstructured documents written in Malay language is
enormously available on the web and intranets. However, unstructured
documents cannot be queried in simple ways, hence the knowledge contained
in such documents can neither be used by automatic systems nor could be
understood easily and clearly by humans. This paper proposes a new
approach to transform extracted knowledge in Malay unstructured document
using ontology by identifying, organizing, and structuring the documents into
an interrogative structured form. A Malay knowledge base, the MalayIK
corpus is developed and used to test the MalayIK-Ontology against Ontos, an
existing data extraction engine. The experimental results from MalayIK-
Ontology have shown a significant improvement of knowledge extraction
over Ontos implementation. This shows that clear knowledge organization
and structuring concept is able to increase understanding, which leads to
potential increase in sharable and reusable of concepts among the
community.
Keyword:
Interrogative knowledge
Knowledge transformation
Ontology
Unstructured documents
Copyright © 2018 Institute of Advanced Engineering and Science.
All rights reserved.
Corresponding Author:
Fatimah Sidi,
Department of Computer Science,
Faculty of Computer Science and Information Technology,
Universiti Putra Malaysia,
43400, UPM, Selangor, Malaysia.
Email: fatimah@upm.edu.my
1. INTRODUCTION
The difficulty of defining knowledge in unstructured documents is due to the paradox that
knowledge resides in a person‟s mind and at the same time, it has to be captured, stored, and reported. For
that, philosophers classify knowledge into knowing-that and knowing-how. Knowing-that is factual where
data are stored in databases and facts can be recalled, processed, and disseminated. While knowing-how is
actionable to do something, turning data into information and in turn into knowledge [1].
However, structured data represent only a little part of the overall organization of knowledge; in
fact, the major part of this knowledge is incorporated in textual documents. For example, available business
data are captured in text files that are not structured, e.g. memoranda and journal articles that are available
electronically [2-4]. A large portion of the available information does not appear in structured databases but
rather in collections of text articles drawn from various sources [5]. Thus, the main concern here is to dig
knowledge from the available vast amount of textual documents.
The proposed ontological approach to knowledge transformation is based on interrogative structure
[6] and conceptual modeling [7-11] approach. In transforming the extracted knowledge in unstructured
document, “deep-level understanding” of complete sentences is extracted by identifying, organizing, and
structuring the information into interrogative structured form. The “deep-level understanding” of complete
 ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10
2
sentence refers to the understanding of a group of words in a complete sentence which, when they are written
down, begin with a capital letter and end with a full stop, question mark, or exclamation mark.
The interrogative approach to knowledge extraction relies on data and conceptual modeling, as well
as context and knowledge representation. Knowledge extraction supports the creation of: (1) knowledge;
(2) relationship; (3) contextual information; and (4) representation of common languages. It gives aid in the
transformation of extracted knowledge in an unstructured document into an interrogative structured form.
The first issue to address corresponds to the need for a mechanism to identify knowledge from the sourced
unstructured document in order to extract the knowledge. This is essentially in the interrogative knowledge
identification, which identifies the type of document by separating text into knowledge, information or data
and unifying it with personal components of values and beliefs. To identify knowledge, the approach of
answering interrogatively is proposed to answer the question within the text in unstructured document.
The interrogative contextual information is derived from the incorporation of context and additional
information annotation with context key facility. Context is an abstraction of the context factors, which are
represented as concepts [12]. It is further exploited by [13] as contextual information, where information
entered into the computer is tagged with context keys facilitating future retrieval using those keys. It may be
any information that could be used to characterize the situation of an entity i.e. person, place, object [14]. For
that, the interrogative contextual information is utilized to understand the process of making sense of
information into knowledge and maintain the meaning of the information. This is to gain the interpretation of
the identical knowledge by classifying the main point of the unstructured document interrogatively.
The rational to incorporate personal components towards the interrogative knowledge identification
is as follows. According to [15], personal components have a powerful impact on organizational knowledge
[16]. Assert that knowledge is a fluid mix of frame experience, values, contextual information, and expert
insight. It originates in the mind of the knower to determine a large part of what the knower sees, absorbs,
and concludes from his observations [17]. Stated that knowledge is a private and personal thing, which is
intuitive and strongly linked to the user‟s values and beliefs. By manually transforming documents, values
are embedded because humans read documents, extract the values of existing fields, and then enter the values
into a user interface [18].
2. RESEARCH METHOD
This research proposes the MalayIK-Ontology model that is designed to transform extracted
knowledge in Malay unstructured documents into an interrogative structured form based on interrogative
knowledge identification as well as interrogative knowledge organization and structuring. The first step is to
prepare the unstructured documents into an extension of plain text. The second step is to invoke the lexicon
identifier that uses lexicon interrogative analysis matching rules of a specific corpus, which in this research,
the MalayIK-Corpus. The lexicon identifier is used to identify and to extract knowledge in each of the
complete sentences written in the Malay unstructured document. It is also used to extract interrogative lexical
constructs from the individual unstructured document.
Next, the third step is to invoke the object recognizer that uses matching rules of object interrogative
analysis in order to extract the ontological constructs from the interrogative lexical constructs. The object
recognizer populates and maps the objects using ontology engineering, which is a mechanism of a knowledge
structure to represent the concept and relationship of the abstract model on how people think about things in
the world. Finally, the fourth step is to populate the database scheme by transforming the ontological
constructs through connection between the ontology model and the object-relationship model.
The MalayIK-Ontology architecture (Figure 1) consists of three main components, which are IKL-
Identifier, IKO-Recognizer, and IKS-OntologyDB. IKL-Identifier attempts to answer the question within the
text interrogatively, and IKS-OntologyDB connects the ontology and object-relationship model to be
populated into database.
The Protégé-Frames editor [19] is adopted in this research to structure and capture knowledge. It
provides a full-fledged user interface and knowledge server to support users in constructing and storing
frame-based domain ontologies, customizing data entry forms, and entering instance data. An object-based
recognizer using interrogative knowledge approach or IKO-Recognizer [20], [21] is also adopted. In this
research, the IKO-Recognizer maps the object interrogative analysis rule with ontology.
Int J Elec & Comp Eng ISSN: 2088-8708 
MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured …. (Fatimah Sidi)
3
Construct Ontology
Text Preprocessing
Unstructured
Document Tokenization
Part-of-Speech Construction
Interrogative Analysis
Convert Ontological Constructs into Structured Form
Domain
knowledge
Instance
knowledge
Structured
Form
Object-Relationship Model
Java classes
……{
…( ..)…{
……..} }
Objects
Protégé
Knowledge-base
System
Concepts, Properties,
Attributes and Facets
of ontological constructs
as in Protégé Knowledge-base System Database System
Interrogative lexical
constructs
Tokenization
Convert into
extension of
plain text file
Mapping
Objects
ó
Protégé (Classes, Slots)
Object Recognizer
Unstructured
Document
List of
Concept/Class
Ontological
constructs
Basic Design
Structured
Document
(Populated
SQL
Database)
Database Description
Mapping
Ontological constructs
ó
Object-Relationship Model
Lexicon interrogative
analysis matching rules
List of Interrogative Question:
‘apa’ (what), ‘siapa’ (who),
‘bila’ (when), ‘di mana’ (where),
‘mengapa’ (why)
and ‘bagaimana’ (how)
List of
Keyword, Constant, Term and Phrase
Definition / Learning of Mapping
Interrogative
lexical
constructs
IKL-Identifier:
Interrogative Knowledge Lexicon Identifier answering
the question within the text interrogatively.
IKO-Recognizer:
Interrogative Knowledge Object Recognizer
mapping object interrogative analysis rule with
ontology.
MalayIK-
Corpus
Classes/Concepts, Properties, Attributes
and Facets (role restrictions)
IKS-OntologyDB:
Interrogative
Knowledge
Structure connecting
ontology and
object-relationship
model
to populate
into database.
Record-Level
Objects,
Relationships
and
Constraints
Preparation:
Conversion of
file.
List of
Object
Figure 1. The MalayIK-Ontology Model(left) and System Architecture of MalayIK-Ontology(right)
2.1. IKL-Identifier
The Interrogative Knowledge Lexicon- (IKL-) Identifier is a lexicon identifier that uses lexicon
interrogative analysis of ‘apa’ (what), ‘siapa’ (who), ‘bila’ (when), ‘di mana’ (where), ‘mengapa’ (why), and
‘bagaimana’ (how) in answering interrogative questions within the text in an unstructured document. The
mechanism for the IKL-Identifier is to convert sentences into interrogative lexical constructs in the form of
interrogative annotation.
Basically, the IKL-Identifier identifies the type of interrogative lexical constructs in each complete
sentence within the Malay unstructured document by separating the text into knowledge, information or data.
It is also responsible to tag the interrogative lexical constructs with interrogative contextual information,
which is important to interpret the information into knowledge and maintain the meaning of the information
in the Malay unstructured document. The processes of the IKL-Identifier are illustrated in Figure 2, which are
tokenization, lexicon interrogative analysis, interrogative contextual information tagging, and phrases
constructor tagging.
Lexicon exists No
Yes
Insert into
MalayIK-Corpus as
extended lexicon
Insert into
interrogative
annotation
Construct and
capture phrases
and proper nouns
MalayIK-
Corpus
Lexicons
Interrogative
lexical
constructs
Tokenization
Lexicon
Interrogative
Analysis
Interrogative
Contextual
Information
Tagging
Phrases
Constructor
Tagging
Figure 2. IKL-Identifier Processes
 ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10
4
During tokenization, the text of unstructured document is segmented into sentences and tokenized
into lexicons. Subsequently, the case format is defined, either the lexicon will hold digits, lower, upper, title
or toggle cases. Each lexicon will then be assigned with automated serial number by lines, sentences, and
token numbers. Next, during the Lexicon Interrogative Analysis, each lexicon is analyzed with lexicon
interrogative analysis matching rules of the MalayIK-Corpus using the standard Data Manipulation Language
(DML). DML is used to analyze, to check, and to insert the lexicon into interrogative annotation as
interrogative lexical construct, should it exists. Any new lexicon that is analyzed will be inserted and defined
in the MalayIK-Corpus.
Finally, the interrogative lexical constructs are used during Phrases Constructor Tagging. In this
step, a phrase is constructed by putting together words based on interrogative annotation of the word. A
phrase is a group of words, which contains an idea that forms a unit in which writing is part rather than a
whole of a sentence. The words are divided depending on their use in a part of speech.
Malay sentence:
Brian Fielding Frost yang tercinta, umur 41, meninggal dunia pagi Selasa, September 30, 1998, disebabkan
oleh kecederaan dialami dalam satu kemalangan kereta.
English sentence:
Our beloved Brian Fielding Frost, age 41, passed away Tuesday morning, September 30, 1998, due to
injuries sustained in an automobile accident.
2.2. IKO-Recognizer
The IKO-Recognizer (Interrogative Knowledge Object Recognizer) performs matching and
mapping object interrogative analysis rule of what/who/when/where/ why/how to extract ontological
constructs [20], [21]. There are two major processes, object recognizer and mapping process. First, the object
recognizer uses object interrogative analysis rules by utilizing Object-Oriented Programming (OOP) in order
to conceptually organize the program around its data (objects/concepts). In this process, a number of object
interrogative analysis rules and precondition language is pre-defined but users may manually define
additional rules. Second, the following mapping process uses an ontology engineering approach, whereby
objects that have been created by the object recognizer are accessible as plug-ins in the ontology system.
The main process in the IKO-Recognizer is the Object Interrogative Analysis Rules and the
Precondition Language. Object interrogative analysis rules capitalize on Java OOP class encapsulation
approach. For this, the object interrogative analysis rules use interrogative elements as the most upper class
of the object. The structure and behaviors of the objects are implemented through (a) Struktur Kata Nama Am
(Noun Structure) and (b) Struktur Leksikon Semantik (Semantic Lexicon Structure) in order to construct
objects.
The first structure, which is the Struktur Kata Nama Am (Noun Structure), the object interrogative
analysis rule is defined by combining the structure and behavior of an object with its inheritance and its
conceptual modifiers of one or more subclasses in a hierarchical structure. The structure and behaviour of the
object are defined by ‘kata_masuk’ as tagged during the interrogative lexical construct earlier. The
„kata_masuk’ for ‘penyelidik’ (investigator) is the grammatical information of „kata nama am‟. It is a noun of
‘kata nama am orang’, which refers to as a conceptual of ‘Orang’ (People), and has the interrogative element
of ‘siapa’ (who). Hence, it inherits the general behaviour or properties of its parent ‘siapa’ (who).
In the second structure, the Struktur Leksikon Semantik (Lexicon Semantic Structure), the object
interrogative analysis rule uses the corresponding structure and behaviour of semantic lexicon that defines
interrogative elements of ‘bila’ (when), ‘di mana’ (where), ‘mengapa’ (why), and ‘bagaimana’ (how). The
semantic lexicons of ‘bila’ (when) and ‘di mana’ (where) correspond to the phrase or proper noun
constructed after the semantic lexicon of the interrogative elements. The structure and behaviour of semantic
lexicon ‘bila’ (when) shows about the time at which an event take place. Whereas, the semantic lexicon of ‘di
mana’ (where) shows about the place something is in, or is coming from or going to.
However, the semantic lexicons of ‘mengapa’ (why) and ‘bagaimana’ (how) correspond to the
predicate after the semantic lexicons of ‘mengapa’ (why) and ‘bagaimana’ (how). Reason being is to
describe the meaning of the semantic lexicons and to give information about the sentence. The semantic
lexicon of ‘mengapa’ (why) talks about the reasons for something which introduces a relative. Whereas, the
semantic lexicon ‘bagaimana’ (how) explains the way in which something happens or is done and introduces
a statement or fact. The objects of ‘mengapa’ (why) and ‘bagaimana’ (how) correspond accordingly to their
definitions of interrogative element.
Int J Elec & Comp Eng ISSN: 2088-8708 
MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured …. (Fatimah Sidi)
5
2.3. IKS-OntologyDB
The IKS-OntologyDB is a process of exporting the ontology structure into a database. The metadata
of the information regarding the relationships, properties, attributes, and facets of the class structure are
created in the ontology system and are exported into Microsoft Access. The exportation is done by using the
facility provided by Protégé by selecting the option of Export to HTML format. The transformation of the
knowledge-based system created via the Protégé to the database management system by using HTML format.
The HTML information is used to create attributes and constraints in the Microsoft Access. The table is
created according to the definition and declarations of the SQL schema and ontology declaration of the
Protégé knowledge-based system. This is shown in Table 1.
Table 1. Ontology versus SQL Declaration Properties
Ontology
Declaration
SQL
Declaration
Class Table
Slot Field Name
Property Data Type
Facet Constraint
Cardinality Cardinality
The components of ontology and conceptual model are basically equivalent in terms of concepts and
entities, relationships, and attributes. The ontological constructs generated are mapped with the Object-
oriented System Model (OSM) established by [7-11]. It is used by the object-relationship model to describe
the data interest which includes relationships, lexical appearances and context keywords. Besides, it is used
to structure the data identified and extracted and populate them into database scheme.
In general, the relevant knowledge about an object set is represented by a colon (:) after an object-
set name which denotes that the object set is a specialization. For example, the lexical object set of Death
Date: Date, where date describes the string patterns of interrogative element of ‘bila’ (when). For the lexical
object set of Deceased Name: Name and Relative Name: Name, name is matched by recognizing the string
patterns of proper nouns interrogative element of ‘siapa’ (who). The context keywords indicate the presence
of an object in an object set. For example, ‘kematian’ (died) and ‘meninggal dunia’ (passed away) are the
context keywords for Death Date; ‘pengebumian’ (buried) is a context keyword for Interment.
2.4. MalayIK Corpus
This research uses the MalayIK-Ontology based on interrogative approach. While most approaches
of text processing as discussed in [22] use NLP or information extraction to select the set of keywords or
phrases to be analyzed, ontology approach is able to avoid mislead in the “vocabulary problem” which leads
to spurious results. By establishing a fixed set of general concepts (“People”, “Location”, “Things”) with the
entry of word answering the question interrogatively (“People”, “Location”, “Things” refer to „who‟,
„where‟, ‟what‟ respectively), the vocabulary used in the rule mapping phase may be controlled.
The most important attribute is the grammatical information of lexicon entry to answer the question
of the lexicon grammatical information interrogatively besides the root word (lexicon). The MalayIK-Corpus
is a Malay language corpus where the Malay dictionary of Kamus Dewan [23, 24] and the dictionary of root
words act as important secondary controls of the lexicon entries. It also refers to the dictionary of Kamus
Imbuhan Bahasa Melayu [25], Kamus Dwibahasa Oxford Fajar [26], and Kamus Komprehensif Bahasa
Melayu [27]. The lexicons entries are manually inserted in the database using standard DML of the related
database.
In order to create a general purpose corpus for Malay language, the Ahmad‟s and Abdullah‟s stop
words [23], [25] are included which indicate pronoun, auxiliary verb, adverb, predicate, preposition,
negative, conjunction, relative and determinant.
Table 2 presents examples of words entry extracted from MalayIK-Corpus in a table format (by
columns and rows). The header row of Table 2 refers the attributes of corpus by columns. The rest of the
rows are examples of words entries for ‘rumah’ (house), ‘sejak’ (since), ‘penyelidik’ (researcher), ‘di’ (at),
‘kerana’ (because) and ‘dengan’ (with). It answers the question of interrogative of ‘apa’ (what), ‘bila’
(when), ‘siapa’ (who), ‘di mana’ (where), ‘mengapa’ (why), and ‘bagaimana’ (how) respectively.
Basically, the grammatical information of ‘rumah’ and ‘penyelidik’ is noun (‘kata nama am’) but
are classified as different category. The word ‘rumah’ (house) falls under categorization of „Things‟ which
answers the interrogative question of „what‟. While ‘penyelidik’ (researcher) falls under categorization of
„People‟ which answers the interrogative question of „who‟.
 ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10
6
Table 2. Examples of MalayIK-Corpus
kata
dasar
Perkataan kata masuk
elemen
interogatif
Status
rumah rumah kata nama am benda apa 1
sejak sejak kata sendi nama masa bila 2
selidik penyelidik kata nama am orang siapa 1
di di kata sendi nama tempat dan arah di mana 2
kerana kerana kata hubung pancangan mengapa 2
dengan dengan kata sendi nama bersama-sama bagaimana 2
3. RESULTS AND ANALYSIS
The first question that arises in designing the MalayIK-Ontology is to check whether the
constant/keyword recognizer to extract and structure data of Ontos can be applied to Malay unstructured
documents. The next question is to check whether the MalayIK-Ontology can identify knowledge as well as
data to be extracted and structured are equivalent or better than Ontos. Furthermore, the knowledge or data
obtained needs to be checked for its validity. This is to prove that the MalayIK-Ontology works effectively in
extracting and structuring data as compared with Ontos. Therefore, following are the steps taken to perform
the experiment.
The accuracy of Ontos is measured by the numbers of data extracted between the English and Malay
obituaries. The accuracy of MalayIK-Ontology is measured by numbers of knowledge or data extracted.
When applying Ontos on English and Malay obituaries, three tables are created based on the obituaries
ontology, which are DeceasedPerson, Viewing, and DeceasedPersonRelationshipRelativeName. For
MalayIK-Ontology, the table DeceasedPersonRelationshipRelativeName is used to compare the translation of
Malay language for Relationship. An example of data extracted for both Ontos and MalayIK-Ontology are
listed in Table 3 for DeceasedPerson.
Table 3. List of Data Extracted for Table DeceasedPerson
Extraction
Manual
Extraction
Ontos
MalayIK-
Ontology
Language of Obituary English English Malay Malay
Facts/Attributes
DeceasedPerson 1002 1002 1002 1002
DeceasedName
Lemar K.
Adamson
Lemar K.
Adamson
Lemar K.
Adamson
Lemar K.
Adamson
Age 84 84 199 84
DeathDate 9/30/1998 9/30/1998 9/30/1998 9/30/1998
BirthDate 6/12/1914 6/12/1914 6/12/1914 6/12/1914
Funeral 5002 5002 5002 5002
FuneralDate 10/5/1998 10/5/1998 - 10/5/1998
FuneralAddress
Silverbell
Ward, 1540
E. Linden
1540 E.
Linden
1540 E.
Linden
Silverbell
Ward, 1540
E. Linden
FuneralTime 10:00 AM 10:00 10:00 10:00 AM
Interment 7002 7002 7002 7002
IntermentAddress
City
Cemetery
236 S. Scott 236 S. Scott
City
Cemetery
IntermentDate - - - -
(-) No data extracted
3.1. Analysis of Ontos on English and Malay Obituaries
The results of Ontos being applied on English and Malay obituaries are shown in Table 4 and
Table 5. This table shows the counted number of facts (attributes values) in the test-set documents of English
and Malay obituaries. They [7-11] are consistent with their implementation, which only extracts explicit
constants. A string is counted as correct, if the constant extracted occurs in the text. With this understanding,
counting is basically straightforward. Due to their name lexicon is incomplete and because of their name-
extraction expressions are not rich; sometimes parts of a name are missed or a single name were split into
two. For these cases, they list the count after + in the Declared Correctly column.
Int J Elec & Comp Eng ISSN: 2088-8708 
MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured …. (Fatimah Sidi)
7
Partial names also caused most of the problems for the large number of incorrectly identified
relatives. With a more accurate and complete lexicon coupled with richer name-extraction expression, they
believe they can achieve much higher precision.
As anticipated, the experiment to check the constant/keyword recognizer of Ontos on Malay
obituaries does not produce the same results as English obituaries for numbers of facts generated. However,
the results show that the DeceasedPerson, DeceasedName, BirthDate and DeathDate generate 100% recall
and precision. Facts generated are classified as nonlexical and lexical objects set. The nonlexical and lexical
objects set are described in what they defined as data frames. A data frame describes the string patterns for its
constants.
For that, results of lexical objects sets such as counted number of facts for the IntermentDate,
IntermentAddress, ViewingDate, ViewingAddress, Relationship and RelativeName generate 0% recall and
precision listed in Table 4. This is due to the context keywords in Malay obituaries being translated.
However, non-lexical object sets such as DeceasedPerson are always generated for an obituary record and
consequently the results for that sets are 100% recall and precision. They represent non-lexical object sets by
surrogate identifiers which are generally easier to identify correctly. This shows Ontos can be applied on
Malay obituaries for data frame of non-lexical object sets by surrogate identifiers.
Table 4. Results of Ontos on English and Malay Obituaries
Facts
Number
of
Facts
in
Source
(N)
Number of Facts Declared Correctly
+ Partially Correct
(C)
Number of Facts Declared
Incorrectly
(I)
Recall
Ratio
(%)
(C/N)
Precision
Ratio
(%)
(C/C+I)
E M E M E M E M
DeceasedPerson 3 3 3 0 0 100 100 100 100
DeceasedName 3 3 3 0 0 100 100 100 100
Age 3 3 3 0 3 100 0 100 50
BirthDate 3 3 3 0 0 100 100 100 100
DeathDate 3 3 3 0 0 100 100 100 100
Funeral
FuneralDate 3 3 0 0 0 100 0 100 0
FuneralAddress 3 0 0 3 2 0 0 0 0
FuneralTime 3 2 1 1 1 67 33 67 50
Interment
IntermentDate 0 0 0 2 0 0 0 0 0
IntermentAddress 3 0 0 3 3 0 0 0 0
Viewing
ViewingDate 1 0 0 0 0 0 0 0 0
ViewingAddress 3 0 0 0 1 0 0 0 0
BeginingTime 3 2 1 0 1 67 33 100 50
EndingTime 3 3 1 0 0 100 33 100 100
RelationshipRelativeName
Relationship 8 8 0 0 0 100 0 100 0
RelativeName 28 6 0 9 0 22 0 40 0
(E) English, (M) Malay
3.2. Analysis of MalayIK-Ontology on Malay Obituaries
The results of MalayIK-Ontology being applied with Malay obituaries to determine its ability to
identify and extract data as compared with Ontos are listed in Table 5. Table 5 shows the results obtained are
compared with the results of the first experiment on Ontos. The purpose of comparison is to check the
validity of data identified and generated by the MalayIK-Ontology. In this second experiment, results of both
nonlexical (DeceasedPerson) and lexical objects (Funeral, Interment, Viewing and
RelationshipRelativeName) of MalayIK-Ontology generate 100% recall and precision except for
ViewingAddress which generates 67% recall and precision. This is due to the location keyword indicating
 ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10
8
address for the lexical object of ViewingAddress in Malay obituaries being translated as something done by a
person. Whereas, the lexical object for IntermentDate definitely generates 0%, this is because there is no
interment date specified in the obituaries. Hence, this shows that there is an improvement in extracting and
structuring data of Ontos lexical objects by the MalayIK-Ontology.
Table 5. Results of Ontos and MalayIK-Ontology
Facts N
Number of
Facts
Declared
Correctly +
Partially
Correct (C)
Number of
Facts
Declared
Incorrectly
(I)
Recall
Ratio
(%)
(C/N)
Precision
Ratio
(%)
(C/C+I)
Ontos
M
O
Ontos
M
O
Ontos MO Ontos MO
E M M E M M E M M E M M
DeceasedPerson 3 3 3 3 0 0 0 100 100 100 100 100 100
DeceasedName 3 3 3 3 0 0 0 100 100 100 100 100 100
Age 3 3 3 3 0 3 0 100 0 100 100 50 100
BirthDate 3 3 3 3 0 0 0 100 100 100 100 100 100
DeathDate 3 3 3 3 0 0 0 100 100 100 100 100 100
Funeral
FuneralDate 3 3 0 3 0 0 0 100 0 100 100 0 100
FuneralAddress 3 0 0 3 3 2 0 0 0 100 0 0 100
FuneralTime 3 2 1 3 1 1 0 67 33 100 67 50 100
Interment
IntermentDate 0 0 0 0 2 0 0 0 0 0 0 0 0
IntermentAddress 3 0 0 3 3 3 0 0 0 100 0 0 100
Viewing
ViewingDate 1 0 0 1 0 0 0 0 0 100 0 0 100
ViewingAddress 3 0 0 2 0 1 1 0 0 67 0 0 67
BeginingTime 3 2 1 3 0 1 0 67 33 100 100 50 100
EndingTime 3 3 1 3 0 0 0 100 33 100 100 100 100
RelationshipRelativeName
Relationship 8 8 0 8 0 0 0 100 0 100 100 0 100
RelativeName
2
8
6 0 28 9 0 0 22 0 100 40 0 100
(N) Number of Facts in Source, (E) English, (M) Malay, (MO) MalayIK-Ontology
4. CONCLUSION
The main objective of this research is to propose a new approach to transform extracted knowledge
in Malay unstructured document by identifying, organizing, and structuring them into interrogative structured
form. In order to achieve this objective, an approach is established through the MalayIK-Ontology approach.
Based on the results, the annotation of interrogative contextual information tagged in interrogative lexical
constructs improves the data extraction. The annotation of interrogative contextual information is annotated
with interrogative and grammatical information of the lexical constructs. For example, the lexical object set
of BirthDate, DeathDate, FuneralDate, and ViewingDate generate precision of 100% which also generate
100% precision on Ontos.
Int J Elec & Comp Eng ISSN: 2088-8708 
MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured …. (Fatimah Sidi)
9
This improvement is due to the implementation of annotation interrogative contextual information
which is tagged with interrogative element of ‘bila’ (when) which describes about the string patterns of time
at which things happened. For lexical object set of DeceasedName and RelativeName which also generate
precision of 100%, as the name is matched by recognizing the string patterns of proper nouns which is tagged
with interrogative element of ‘siapa’ (who). Besides, phrases or proper nouns based on interrogative
annotation of the lexicon are also annotated in the lexical constructs.
5. ACKNOWLEDGEMENT
The work was supported by the Ministry of Higher Education Malaysia under the Fundamental Research
Grant Scheme (FRGS 03-12-10-999FR).
REFERENCES
[1] Spiegler, I.: Technology and Knowledge: Bridging a “Generating” Gap, Information & Management, vol. 40(6),
pp. 533-539, 2003.
[2] T. K. Das and P. M. Kumar, "Big Data Analytics: A Framework for Unstructured Data Analysis", International
Journal of Engineering and Technology, Vol. 5, pp. 153-156, 2013
[3] K. Cukier, “The Economist, Data, data everywhere: A special report on managing information”, 2010,
February 25, retrieved from http://www.economist. com/node/15557443
[4] N. Khan, et al., "Big Data: Survey, Technologies, Opportunities, and Challenges", The Scientific World Journal,
vol. 2014, pp. 1-18, 2014.
[5] R. Feldman, "Mining Unstructured Data", in International Conference on Knowledge Discovery and Data
Mining 5th ACM SIGKDD, August, 1999.
[6] E. J. Quigley and A. Debons, "Interrogative theory of information and knowledge", in ACM SIGCPR conference
on Computer personnel research, pp. 4-10, 1999, pp. 4-10.
[7] D. W. Embley, et al., "A Conceptual-Modeling Approach to Extracting Data from the Web", in 17th International
Conference on Conceptual Modeling, 1998, pp. 78-91.
[8] D. W. Embley, "Ontology-based Extraction and Structuring of Information from Data-rich Unstructured
Documents", in 7th International Conference on Information and Knowledge Management, 1998, pp. 52-59.
[9] D. W. Embley, et al., "Conceptual-Model-based Data Extraction from Multiple-Record Web Pages, Data and
Knowledge Engineering", Data & Knowledge Engineering Journal, vol. 31(3), pp. 227-251, 1999.
[10] D. W. Embley, et al., "Record-Boundary Discovery in Web Documents", in Proceeding of the 1999 ACM
SIGMOD International Conference on Management of Data, 1999, pp. 467-478.
[11] D. W. Embley, "Toward Semantic Understanding: An Approach Based on Information Extraction Ontologies" in
15th Australasian Database Conference, 2004, pp. 3-12.
[12] B. N. Schilit and M. M. Theimer, "Disseminating Active Map Information to Mobile Hosts", IEEE Network,
vol. 8(5), pp. 22-32, 1994.
[13] Lamming, M.G. and Newman, W.M., "Activity-based Information Retrieval: Technology in Support of Personal
Memory", in 12th World Computer Congress on Personal Computers and Intelligent Systems - Information
Processing, 1992, pp. 68-81.
[14] G. D. Abowd, et al., "Towards a Better Understanding of Context and Context-Awareness", in 1st International
Symposium on Handheld and Ubiquitous Computing 1999, pp. 304-307.
[15] I. Nonaka, "A Dynamic Theory of Organizational Knowledge Creation". Organization Science, vol. 5(1),
pp. 14-37, 1994.
[16] T. H. Davenport, et al., "Working Knowledge: How Organizations Manage What They Know", Harvard
Business School Press, 2000.
[17] F. J. Varela, et al., "The Embodied Mind", MIT Press, 1992.
[18] R. Virk, "Transforming Unstructured Content into “Meaningful” XML", 2004,
http://www.dmreview.com/whitepaper/WID413.pdf.
[19] N. F. Noy, and D. L. McGuinness, "Ontology Development 101: A Guide to Creating Your First Ontology",
Knowledge Systems Laboratory, Technical Report KSL-01-05, 2001.
[20] I. Ishak, et al., "Object Recognizer For Organizing And Structuring Unstructured Documents Using Interrogative
Knowledge", Journal of Theoretical & Applied Information Technology, vol. 83, pp. 442-450, 2016.
[21] F. Sidi, F. and M. A. Jabar, "Interrogative Knowledge Organization and Structuring from Unstructured
Documents". Knowledge Management and Innovation: A Business Competitive Edge Perspective, pp. 1028-1032,
2010.
[22] M. Shamsfard and A. A. Barforoush, "The State of the Art in Ontology Learning: A Framework for Comparison",
Knowledge Engineering Review, vol. 18(4), pp. 293-316, 2003.
[23] Dewan Bahasa Perpustakaan, "Kamus Dewan Edisi Ketiga", Dewan Bahasa dan Pustaka, 2002.
[24] Dewan Bahasa Perpustakaan, "Kamus Dewan Edisi Keempat", Dewan Bahasa dan Pustaka, 2005.
[25] H. M. Ali, "Kamus Imbuhan Bahasa Melayu Edisi Kedua", Penerbit Fajar Bakti Sdn. Bhd., 1993.
[26] J. M. Hawkins, "Kamus Dwibahasa Oxford Fajar Edisi Ketiga", Penerbit Fajar Bakti Sdn. Bhd., 2001.
[27] A. Othman, "Kamus Komprehensif Bahasa Melayu", Penerbit Fajar Bakti Sdn. Bhd., a subsidiary of Oxford
University Press, 2005.
 ISSN: 2088-8708
Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10
10
BIOGRAPHIES OF AUTHORS
Fatimah Sidi holds a PhD in Management Information Systems from Universiti Putra Malaysia. She
received her Bachelor and Masters degree in Computer Science from the same university. She has
twenty years experience as System Analyst in System Development and Enterprise Database
Management. Her research interest spans around Knowledge and Information Management Systems,
Data and Knowledge Engineering, as well as Database and Data Warehousing
Iskandar Ishak received his BIT (Hons) from Universiti Tenaga Nasional in 2002. He received
Master of Technology in Information Technology in the year 2003 from the Royal Melbourne Institute
of Technology University and completed his PhD in Computer Science in 2011 at the Universiti
Teknologi Malaysia. He is now a Senior Lecturer at the Department of Computer Science, Faculty of
Computer Science and Information Technology, Universiti Putra Malaysia since 2011. His research
interest lies in the area of Database System, Information Systems, Big Data, Peer-to-Peer and Mobile
Computing and Information Retrieval
Marzanah A. Jabar graduated from Universiti Kebangsaan Malaysia with BSc. in Quantitative
Studies in 1983. She received her MSc. in Computer Science in the year 2001 and her PhD in
Management Information System in the year 2007, both from Universiti Putra Malaysia. She has over
twenty years experience in leading information systems development. Her research interest lies in the
area of Software and Knowledge Engineering, Knowledge Management Systems, and strategic use of
Information Systems and Technologies

More Related Content

What's hot

Data and Information Integration: Information Extraction
Data and Information Integration: Information ExtractionData and Information Integration: Information Extraction
Data and Information Integration: Information ExtractionIJMER
 
Structured and Unstructured Information Extraction Using Text Mining and Natu...
Structured and Unstructured Information Extraction Using Text Mining and Natu...Structured and Unstructured Information Extraction Using Text Mining and Natu...
Structured and Unstructured Information Extraction Using Text Mining and Natu...rahulmonikasharma
 
Research on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesResearch on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesKausar Mukadam
 
AUTOMATED DISCOVERY OF LOGICAL FALLACIES IN LEGAL ARGUMENTATION
AUTOMATED DISCOVERY OF LOGICAL FALLACIES IN LEGAL ARGUMENTATIONAUTOMATED DISCOVERY OF LOGICAL FALLACIES IN LEGAL ARGUMENTATION
AUTOMATED DISCOVERY OF LOGICAL FALLACIES IN LEGAL ARGUMENTATIONijaia
 
AN ELABORATION OF TEXT CATEGORIZATION AND AUTOMATIC TEXT CLASSIFICATION THROU...
AN ELABORATION OF TEXT CATEGORIZATION AND AUTOMATIC TEXT CLASSIFICATION THROU...AN ELABORATION OF TEXT CATEGORIZATION AND AUTOMATIC TEXT CLASSIFICATION THROU...
AN ELABORATION OF TEXT CATEGORIZATION AND AUTOMATIC TEXT CLASSIFICATION THROU...cseij
 
Clustering of Deep WebPages: A Comparative Study
Clustering of Deep WebPages: A Comparative StudyClustering of Deep WebPages: A Comparative Study
Clustering of Deep WebPages: A Comparative Studyijcsit
 
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-MeansIRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-MeansIRJET Journal
 
A Domain Based Approach to Information Retrieval in Digital Libraries
A Domain Based Approach to Information Retrieval in Digital LibrariesA Domain Based Approach to Information Retrieval in Digital Libraries
A Domain Based Approach to Information Retrieval in Digital LibrariesFulvio Rotella
 
Survey of Machine Learning Techniques in Textual Document Classification
Survey of Machine Learning Techniques in Textual Document ClassificationSurvey of Machine Learning Techniques in Textual Document Classification
Survey of Machine Learning Techniques in Textual Document ClassificationIOSR Journals
 
Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)Michele Pasin
 
INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP ijnlc
 
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...ijaia
 
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...University of Bari (Italy)
 
Design and Implementation of Meetings Document Management and Retrieval System
Design and Implementation of Meetings Document Management and Retrieval SystemDesign and Implementation of Meetings Document Management and Retrieval System
Design and Implementation of Meetings Document Management and Retrieval SystemCSCJournals
 
Blended intelligence of FCA with FLC for knowledge representation from cluste...
Blended intelligence of FCA with FLC for knowledge representation from cluste...Blended intelligence of FCA with FLC for knowledge representation from cluste...
Blended intelligence of FCA with FLC for knowledge representation from cluste...IJECEIAES
 
A Survey on Text Mining-techniques and application
A Survey on Text Mining-techniques and applicationA Survey on Text Mining-techniques and application
A Survey on Text Mining-techniques and applicationRyota Eisaki
 

What's hot (17)

Data and Information Integration: Information Extraction
Data and Information Integration: Information ExtractionData and Information Integration: Information Extraction
Data and Information Integration: Information Extraction
 
Structured and Unstructured Information Extraction Using Text Mining and Natu...
Structured and Unstructured Information Extraction Using Text Mining and Natu...Structured and Unstructured Information Extraction Using Text Mining and Natu...
Structured and Unstructured Information Extraction Using Text Mining and Natu...
 
Research on ontology based information retrieval techniques
Research on ontology based information retrieval techniquesResearch on ontology based information retrieval techniques
Research on ontology based information retrieval techniques
 
AUTOMATED DISCOVERY OF LOGICAL FALLACIES IN LEGAL ARGUMENTATION
AUTOMATED DISCOVERY OF LOGICAL FALLACIES IN LEGAL ARGUMENTATIONAUTOMATED DISCOVERY OF LOGICAL FALLACIES IN LEGAL ARGUMENTATION
AUTOMATED DISCOVERY OF LOGICAL FALLACIES IN LEGAL ARGUMENTATION
 
Ijetcas14 409
Ijetcas14 409Ijetcas14 409
Ijetcas14 409
 
AN ELABORATION OF TEXT CATEGORIZATION AND AUTOMATIC TEXT CLASSIFICATION THROU...
AN ELABORATION OF TEXT CATEGORIZATION AND AUTOMATIC TEXT CLASSIFICATION THROU...AN ELABORATION OF TEXT CATEGORIZATION AND AUTOMATIC TEXT CLASSIFICATION THROU...
AN ELABORATION OF TEXT CATEGORIZATION AND AUTOMATIC TEXT CLASSIFICATION THROU...
 
Clustering of Deep WebPages: A Comparative Study
Clustering of Deep WebPages: A Comparative StudyClustering of Deep WebPages: A Comparative Study
Clustering of Deep WebPages: A Comparative Study
 
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-MeansIRJET- Concept Extraction from Ambiguous Text Document using K-Means
IRJET- Concept Extraction from Ambiguous Text Document using K-Means
 
A Domain Based Approach to Information Retrieval in Digital Libraries
A Domain Based Approach to Information Retrieval in Digital LibrariesA Domain Based Approach to Information Retrieval in Digital Libraries
A Domain Based Approach to Information Retrieval in Digital Libraries
 
Survey of Machine Learning Techniques in Textual Document Classification
Survey of Machine Learning Techniques in Textual Document ClassificationSurvey of Machine Learning Techniques in Textual Document Classification
Survey of Machine Learning Techniques in Textual Document Classification
 
Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)
 
INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
INFORMATION RETRIEVAL TECHNIQUE FOR WEB USING NLP
 
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
ESTIMATION OF REGRESSION COEFFICIENTS USING GEOMETRIC MEAN OF SQUARED ERROR F...
 
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
 
Design and Implementation of Meetings Document Management and Retrieval System
Design and Implementation of Meetings Document Management and Retrieval SystemDesign and Implementation of Meetings Document Management and Retrieval System
Design and Implementation of Meetings Document Management and Retrieval System
 
Blended intelligence of FCA with FLC for knowledge representation from cluste...
Blended intelligence of FCA with FLC for knowledge representation from cluste...Blended intelligence of FCA with FLC for knowledge representation from cluste...
Blended intelligence of FCA with FLC for knowledge representation from cluste...
 
A Survey on Text Mining-techniques and application
A Survey on Text Mining-techniques and applicationA Survey on Text Mining-techniques and application
A Survey on Text Mining-techniques and application
 

Similar to MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured Documents

"Analysis of Different Text Classification Algorithms: An Assessment "
"Analysis of Different Text Classification Algorithms: An Assessment ""Analysis of Different Text Classification Algorithms: An Assessment "
"Analysis of Different Text Classification Algorithms: An Assessment "ijtsrd
 
Collaborative Learning of Organisational Knolwedge
Collaborative Learning of Organisational KnolwedgeCollaborative Learning of Organisational Knolwedge
Collaborative Learning of Organisational KnolwedgeWaqas Tariq
 
Mapping the Intellectual Structure of Contemporary Technology Management Rese...
Mapping the Intellectual Structure of Contemporary Technology Management Rese...Mapping the Intellectual Structure of Contemporary Technology Management Rese...
Mapping the Intellectual Structure of Contemporary Technology Management Rese...Che-Wei Lee
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Scienceresearchinventy
 
Integrated Social Media Knowledge Capture in Medical Domain of Indonesia
Integrated Social Media Knowledge Capture in Medical Domain of IndonesiaIntegrated Social Media Knowledge Capture in Medical Domain of Indonesia
Integrated Social Media Knowledge Capture in Medical Domain of IndonesiaTELKOMNIKA JOURNAL
 
An efficient educational data mining approach to support e-learning
An efficient educational data mining approach to support e-learningAn efficient educational data mining approach to support e-learning
An efficient educational data mining approach to support e-learningVenu Madhav
 
Building_a_Personal_Knowledge_Recommenda
Building_a_Personal_Knowledge_RecommendaBuilding_a_Personal_Knowledge_Recommenda
Building_a_Personal_Knowledge_RecommendaVinícios Pereira
 
Bsim0004 Assignment1 Copy Part1
Bsim0004 Assignment1 Copy Part1Bsim0004 Assignment1 Copy Part1
Bsim0004 Assignment1 Copy Part1Svensson Leung
 
A Survey on Knowledge Transfer between Knowledge-based Systems
A Survey on Knowledge Transfer between Knowledge-based SystemsA Survey on Knowledge Transfer between Knowledge-based Systems
A Survey on Knowledge Transfer between Knowledge-based SystemsTELKOMNIKA JOURNAL
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Knowledge Management in Software Enterprise
Knowledge Management in Software EnterpriseKnowledge Management in Software Enterprise
Knowledge Management in Software EnterpriseIOSR Journals
 
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...ijistjournal
 
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...ijistjournal
 
Information Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachInformation Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachAIRCC Publishing Corporation
 

Similar to MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured Documents (20)

"Analysis of Different Text Classification Algorithms: An Assessment "
"Analysis of Different Text Classification Algorithms: An Assessment ""Analysis of Different Text Classification Algorithms: An Assessment "
"Analysis of Different Text Classification Algorithms: An Assessment "
 
Collaborative Learning of Organisational Knolwedge
Collaborative Learning of Organisational KnolwedgeCollaborative Learning of Organisational Knolwedge
Collaborative Learning of Organisational Knolwedge
 
Mapping the Intellectual Structure of Contemporary Technology Management Rese...
Mapping the Intellectual Structure of Contemporary Technology Management Rese...Mapping the Intellectual Structure of Contemporary Technology Management Rese...
Mapping the Intellectual Structure of Contemporary Technology Management Rese...
 
Dest 2010
Dest 2010Dest 2010
Dest 2010
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Integrated Social Media Knowledge Capture in Medical Domain of Indonesia
Integrated Social Media Knowledge Capture in Medical Domain of IndonesiaIntegrated Social Media Knowledge Capture in Medical Domain of Indonesia
Integrated Social Media Knowledge Capture in Medical Domain of Indonesia
 
An efficient educational data mining approach to support e-learning
An efficient educational data mining approach to support e-learningAn efficient educational data mining approach to support e-learning
An efficient educational data mining approach to support e-learning
 
Building_a_Personal_Knowledge_Recommenda
Building_a_Personal_Knowledge_RecommendaBuilding_a_Personal_Knowledge_Recommenda
Building_a_Personal_Knowledge_Recommenda
 
Bsim0004 Assignment1 Copy Part1
Bsim0004 Assignment1 Copy Part1Bsim0004 Assignment1 Copy Part1
Bsim0004 Assignment1 Copy Part1
 
A Survey on Knowledge Transfer between Knowledge-based Systems
A Survey on Knowledge Transfer between Knowledge-based SystemsA Survey on Knowledge Transfer between Knowledge-based Systems
A Survey on Knowledge Transfer between Knowledge-based Systems
 
H0362049060
H0362049060H0362049060
H0362049060
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Knowledge Management in Software Enterprise
Knowledge Management in Software EnterpriseKnowledge Management in Software Enterprise
Knowledge Management in Software Enterprise
 
Document
DocumentDocument
Document
 
Shifting from librarian to data manager
Shifting from librarian to data managerShifting from librarian to data manager
Shifting from librarian to data manager
 
20304050607083
2030405060708320304050607083
20304050607083
 
20304050607083 2
20304050607083 220304050607083 2
20304050607083 2
 
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
 
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
Assessing The Tangible And Intangible Impacts Of The Convergence Of E-Learnin...
 
Information Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachInformation Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis Approach
 

More from IJECEIAES

Cloud service ranking with an integration of k-means algorithm and decision-m...
Cloud service ranking with an integration of k-means algorithm and decision-m...Cloud service ranking with an integration of k-means algorithm and decision-m...
Cloud service ranking with an integration of k-means algorithm and decision-m...IJECEIAES
 
Prediction of the risk of developing heart disease using logistic regression
Prediction of the risk of developing heart disease using logistic regressionPrediction of the risk of developing heart disease using logistic regression
Prediction of the risk of developing heart disease using logistic regressionIJECEIAES
 
Predictive analysis of terrorist activities in Thailand's Southern provinces:...
Predictive analysis of terrorist activities in Thailand's Southern provinces:...Predictive analysis of terrorist activities in Thailand's Southern provinces:...
Predictive analysis of terrorist activities in Thailand's Southern provinces:...IJECEIAES
 
Optimal model of vehicular ad-hoc network assisted by unmanned aerial vehicl...
Optimal model of vehicular ad-hoc network assisted by  unmanned aerial vehicl...Optimal model of vehicular ad-hoc network assisted by  unmanned aerial vehicl...
Optimal model of vehicular ad-hoc network assisted by unmanned aerial vehicl...IJECEIAES
 
Improving cyberbullying detection through multi-level machine learning
Improving cyberbullying detection through multi-level machine learningImproving cyberbullying detection through multi-level machine learning
Improving cyberbullying detection through multi-level machine learningIJECEIAES
 
Comparison of time series temperature prediction with autoregressive integrat...
Comparison of time series temperature prediction with autoregressive integrat...Comparison of time series temperature prediction with autoregressive integrat...
Comparison of time series temperature prediction with autoregressive integrat...IJECEIAES
 
Strengthening data integrity in academic document recording with blockchain a...
Strengthening data integrity in academic document recording with blockchain a...Strengthening data integrity in academic document recording with blockchain a...
Strengthening data integrity in academic document recording with blockchain a...IJECEIAES
 
Design of storage benchmark kit framework for supporting the file storage ret...
Design of storage benchmark kit framework for supporting the file storage ret...Design of storage benchmark kit framework for supporting the file storage ret...
Design of storage benchmark kit framework for supporting the file storage ret...IJECEIAES
 
Detection of diseases in rice leaf using convolutional neural network with tr...
Detection of diseases in rice leaf using convolutional neural network with tr...Detection of diseases in rice leaf using convolutional neural network with tr...
Detection of diseases in rice leaf using convolutional neural network with tr...IJECEIAES
 
A systematic review of in-memory database over multi-tenancy
A systematic review of in-memory database over multi-tenancyA systematic review of in-memory database over multi-tenancy
A systematic review of in-memory database over multi-tenancyIJECEIAES
 
Agriculture crop yield prediction using inertia based cat swarm optimization
Agriculture crop yield prediction using inertia based cat swarm optimizationAgriculture crop yield prediction using inertia based cat swarm optimization
Agriculture crop yield prediction using inertia based cat swarm optimizationIJECEIAES
 
Three layer hybrid learning to improve intrusion detection system performance
Three layer hybrid learning to improve intrusion detection system performanceThree layer hybrid learning to improve intrusion detection system performance
Three layer hybrid learning to improve intrusion detection system performanceIJECEIAES
 
Non-binary codes approach on the performance of short-packet full-duplex tran...
Non-binary codes approach on the performance of short-packet full-duplex tran...Non-binary codes approach on the performance of short-packet full-duplex tran...
Non-binary codes approach on the performance of short-packet full-duplex tran...IJECEIAES
 
Improved design and performance of the global rectenna system for wireless po...
Improved design and performance of the global rectenna system for wireless po...Improved design and performance of the global rectenna system for wireless po...
Improved design and performance of the global rectenna system for wireless po...IJECEIAES
 
Advanced hybrid algorithms for precise multipath channel estimation in next-g...
Advanced hybrid algorithms for precise multipath channel estimation in next-g...Advanced hybrid algorithms for precise multipath channel estimation in next-g...
Advanced hybrid algorithms for precise multipath channel estimation in next-g...IJECEIAES
 
Performance analysis of 2D optical code division multiple access through unde...
Performance analysis of 2D optical code division multiple access through unde...Performance analysis of 2D optical code division multiple access through unde...
Performance analysis of 2D optical code division multiple access through unde...IJECEIAES
 
On performance analysis of non-orthogonal multiple access downlink for cellul...
On performance analysis of non-orthogonal multiple access downlink for cellul...On performance analysis of non-orthogonal multiple access downlink for cellul...
On performance analysis of non-orthogonal multiple access downlink for cellul...IJECEIAES
 
Phase delay through slot-line beam switching microstrip patch array antenna d...
Phase delay through slot-line beam switching microstrip patch array antenna d...Phase delay through slot-line beam switching microstrip patch array antenna d...
Phase delay through slot-line beam switching microstrip patch array antenna d...IJECEIAES
 
A simple feed orthogonal excitation X-band dual circular polarized microstrip...
A simple feed orthogonal excitation X-band dual circular polarized microstrip...A simple feed orthogonal excitation X-band dual circular polarized microstrip...
A simple feed orthogonal excitation X-band dual circular polarized microstrip...IJECEIAES
 
A taxonomy on power optimization techniques for fifthgeneration heterogenous ...
A taxonomy on power optimization techniques for fifthgeneration heterogenous ...A taxonomy on power optimization techniques for fifthgeneration heterogenous ...
A taxonomy on power optimization techniques for fifthgeneration heterogenous ...IJECEIAES
 

More from IJECEIAES (20)

Cloud service ranking with an integration of k-means algorithm and decision-m...
Cloud service ranking with an integration of k-means algorithm and decision-m...Cloud service ranking with an integration of k-means algorithm and decision-m...
Cloud service ranking with an integration of k-means algorithm and decision-m...
 
Prediction of the risk of developing heart disease using logistic regression
Prediction of the risk of developing heart disease using logistic regressionPrediction of the risk of developing heart disease using logistic regression
Prediction of the risk of developing heart disease using logistic regression
 
Predictive analysis of terrorist activities in Thailand's Southern provinces:...
Predictive analysis of terrorist activities in Thailand's Southern provinces:...Predictive analysis of terrorist activities in Thailand's Southern provinces:...
Predictive analysis of terrorist activities in Thailand's Southern provinces:...
 
Optimal model of vehicular ad-hoc network assisted by unmanned aerial vehicl...
Optimal model of vehicular ad-hoc network assisted by  unmanned aerial vehicl...Optimal model of vehicular ad-hoc network assisted by  unmanned aerial vehicl...
Optimal model of vehicular ad-hoc network assisted by unmanned aerial vehicl...
 
Improving cyberbullying detection through multi-level machine learning
Improving cyberbullying detection through multi-level machine learningImproving cyberbullying detection through multi-level machine learning
Improving cyberbullying detection through multi-level machine learning
 
Comparison of time series temperature prediction with autoregressive integrat...
Comparison of time series temperature prediction with autoregressive integrat...Comparison of time series temperature prediction with autoregressive integrat...
Comparison of time series temperature prediction with autoregressive integrat...
 
Strengthening data integrity in academic document recording with blockchain a...
Strengthening data integrity in academic document recording with blockchain a...Strengthening data integrity in academic document recording with blockchain a...
Strengthening data integrity in academic document recording with blockchain a...
 
Design of storage benchmark kit framework for supporting the file storage ret...
Design of storage benchmark kit framework for supporting the file storage ret...Design of storage benchmark kit framework for supporting the file storage ret...
Design of storage benchmark kit framework for supporting the file storage ret...
 
Detection of diseases in rice leaf using convolutional neural network with tr...
Detection of diseases in rice leaf using convolutional neural network with tr...Detection of diseases in rice leaf using convolutional neural network with tr...
Detection of diseases in rice leaf using convolutional neural network with tr...
 
A systematic review of in-memory database over multi-tenancy
A systematic review of in-memory database over multi-tenancyA systematic review of in-memory database over multi-tenancy
A systematic review of in-memory database over multi-tenancy
 
Agriculture crop yield prediction using inertia based cat swarm optimization
Agriculture crop yield prediction using inertia based cat swarm optimizationAgriculture crop yield prediction using inertia based cat swarm optimization
Agriculture crop yield prediction using inertia based cat swarm optimization
 
Three layer hybrid learning to improve intrusion detection system performance
Three layer hybrid learning to improve intrusion detection system performanceThree layer hybrid learning to improve intrusion detection system performance
Three layer hybrid learning to improve intrusion detection system performance
 
Non-binary codes approach on the performance of short-packet full-duplex tran...
Non-binary codes approach on the performance of short-packet full-duplex tran...Non-binary codes approach on the performance of short-packet full-duplex tran...
Non-binary codes approach on the performance of short-packet full-duplex tran...
 
Improved design and performance of the global rectenna system for wireless po...
Improved design and performance of the global rectenna system for wireless po...Improved design and performance of the global rectenna system for wireless po...
Improved design and performance of the global rectenna system for wireless po...
 
Advanced hybrid algorithms for precise multipath channel estimation in next-g...
Advanced hybrid algorithms for precise multipath channel estimation in next-g...Advanced hybrid algorithms for precise multipath channel estimation in next-g...
Advanced hybrid algorithms for precise multipath channel estimation in next-g...
 
Performance analysis of 2D optical code division multiple access through unde...
Performance analysis of 2D optical code division multiple access through unde...Performance analysis of 2D optical code division multiple access through unde...
Performance analysis of 2D optical code division multiple access through unde...
 
On performance analysis of non-orthogonal multiple access downlink for cellul...
On performance analysis of non-orthogonal multiple access downlink for cellul...On performance analysis of non-orthogonal multiple access downlink for cellul...
On performance analysis of non-orthogonal multiple access downlink for cellul...
 
Phase delay through slot-line beam switching microstrip patch array antenna d...
Phase delay through slot-line beam switching microstrip patch array antenna d...Phase delay through slot-line beam switching microstrip patch array antenna d...
Phase delay through slot-line beam switching microstrip patch array antenna d...
 
A simple feed orthogonal excitation X-band dual circular polarized microstrip...
A simple feed orthogonal excitation X-band dual circular polarized microstrip...A simple feed orthogonal excitation X-band dual circular polarized microstrip...
A simple feed orthogonal excitation X-band dual circular polarized microstrip...
 
A taxonomy on power optimization techniques for fifthgeneration heterogenous ...
A taxonomy on power optimization techniques for fifthgeneration heterogenous ...A taxonomy on power optimization techniques for fifthgeneration heterogenous ...
A taxonomy on power optimization techniques for fifthgeneration heterogenous ...
 

Recently uploaded

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSSIVASHANKAR N
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130Suhani Kapoor
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...RajaP95
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxupamatechverse
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 

Recently uploaded (20)

MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLSMANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
MANUFACTURING PROCESS-II UNIT-5 NC MACHINE TOOLS
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
VIP Call Girls Service Kondapur Hyderabad Call +91-8250192130
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
IMPLICATIONS OF THE ABOVE HOLISTIC UNDERSTANDING OF HARMONY ON PROFESSIONAL E...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Introduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptxIntroduction and different types of Ethernet.pptx
Introduction and different types of Ethernet.pptx
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 

MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured Documents

  • 1. International Journal of Electrical and Computer Engineering (IJECE) Vol. 8, No. 1, February 2018, pp. 1~10 ISSN: 2088-8708, DOI: 10.11591/ijece.v8i1.pp1-10  1 Journal homepage: http://iaescore.com/journals/index.php/IJECE MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured Documents Fatimah Sidi1 , Iskandar Ishak2 , Marzanah A. Jabar3 1,2 Department of Computer Science, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Malaysia 3 Department of Software Engineering and Information System, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Malaysia Article Info ABSTRACT Article history: Received Feb 17, 2017 Revised Jul 13, 2017 Accepted Nov 29, 2017 The number of unstructured documents written in Malay language is enormously available on the web and intranets. However, unstructured documents cannot be queried in simple ways, hence the knowledge contained in such documents can neither be used by automatic systems nor could be understood easily and clearly by humans. This paper proposes a new approach to transform extracted knowledge in Malay unstructured document using ontology by identifying, organizing, and structuring the documents into an interrogative structured form. A Malay knowledge base, the MalayIK corpus is developed and used to test the MalayIK-Ontology against Ontos, an existing data extraction engine. The experimental results from MalayIK- Ontology have shown a significant improvement of knowledge extraction over Ontos implementation. This shows that clear knowledge organization and structuring concept is able to increase understanding, which leads to potential increase in sharable and reusable of concepts among the community. Keyword: Interrogative knowledge Knowledge transformation Ontology Unstructured documents Copyright © 2018 Institute of Advanced Engineering and Science. All rights reserved. Corresponding Author: Fatimah Sidi, Department of Computer Science, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, 43400, UPM, Selangor, Malaysia. Email: fatimah@upm.edu.my 1. INTRODUCTION The difficulty of defining knowledge in unstructured documents is due to the paradox that knowledge resides in a person‟s mind and at the same time, it has to be captured, stored, and reported. For that, philosophers classify knowledge into knowing-that and knowing-how. Knowing-that is factual where data are stored in databases and facts can be recalled, processed, and disseminated. While knowing-how is actionable to do something, turning data into information and in turn into knowledge [1]. However, structured data represent only a little part of the overall organization of knowledge; in fact, the major part of this knowledge is incorporated in textual documents. For example, available business data are captured in text files that are not structured, e.g. memoranda and journal articles that are available electronically [2-4]. A large portion of the available information does not appear in structured databases but rather in collections of text articles drawn from various sources [5]. Thus, the main concern here is to dig knowledge from the available vast amount of textual documents. The proposed ontological approach to knowledge transformation is based on interrogative structure [6] and conceptual modeling [7-11] approach. In transforming the extracted knowledge in unstructured document, “deep-level understanding” of complete sentences is extracted by identifying, organizing, and structuring the information into interrogative structured form. The “deep-level understanding” of complete
  • 2.  ISSN: 2088-8708 Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10 2 sentence refers to the understanding of a group of words in a complete sentence which, when they are written down, begin with a capital letter and end with a full stop, question mark, or exclamation mark. The interrogative approach to knowledge extraction relies on data and conceptual modeling, as well as context and knowledge representation. Knowledge extraction supports the creation of: (1) knowledge; (2) relationship; (3) contextual information; and (4) representation of common languages. It gives aid in the transformation of extracted knowledge in an unstructured document into an interrogative structured form. The first issue to address corresponds to the need for a mechanism to identify knowledge from the sourced unstructured document in order to extract the knowledge. This is essentially in the interrogative knowledge identification, which identifies the type of document by separating text into knowledge, information or data and unifying it with personal components of values and beliefs. To identify knowledge, the approach of answering interrogatively is proposed to answer the question within the text in unstructured document. The interrogative contextual information is derived from the incorporation of context and additional information annotation with context key facility. Context is an abstraction of the context factors, which are represented as concepts [12]. It is further exploited by [13] as contextual information, where information entered into the computer is tagged with context keys facilitating future retrieval using those keys. It may be any information that could be used to characterize the situation of an entity i.e. person, place, object [14]. For that, the interrogative contextual information is utilized to understand the process of making sense of information into knowledge and maintain the meaning of the information. This is to gain the interpretation of the identical knowledge by classifying the main point of the unstructured document interrogatively. The rational to incorporate personal components towards the interrogative knowledge identification is as follows. According to [15], personal components have a powerful impact on organizational knowledge [16]. Assert that knowledge is a fluid mix of frame experience, values, contextual information, and expert insight. It originates in the mind of the knower to determine a large part of what the knower sees, absorbs, and concludes from his observations [17]. Stated that knowledge is a private and personal thing, which is intuitive and strongly linked to the user‟s values and beliefs. By manually transforming documents, values are embedded because humans read documents, extract the values of existing fields, and then enter the values into a user interface [18]. 2. RESEARCH METHOD This research proposes the MalayIK-Ontology model that is designed to transform extracted knowledge in Malay unstructured documents into an interrogative structured form based on interrogative knowledge identification as well as interrogative knowledge organization and structuring. The first step is to prepare the unstructured documents into an extension of plain text. The second step is to invoke the lexicon identifier that uses lexicon interrogative analysis matching rules of a specific corpus, which in this research, the MalayIK-Corpus. The lexicon identifier is used to identify and to extract knowledge in each of the complete sentences written in the Malay unstructured document. It is also used to extract interrogative lexical constructs from the individual unstructured document. Next, the third step is to invoke the object recognizer that uses matching rules of object interrogative analysis in order to extract the ontological constructs from the interrogative lexical constructs. The object recognizer populates and maps the objects using ontology engineering, which is a mechanism of a knowledge structure to represent the concept and relationship of the abstract model on how people think about things in the world. Finally, the fourth step is to populate the database scheme by transforming the ontological constructs through connection between the ontology model and the object-relationship model. The MalayIK-Ontology architecture (Figure 1) consists of three main components, which are IKL- Identifier, IKO-Recognizer, and IKS-OntologyDB. IKL-Identifier attempts to answer the question within the text interrogatively, and IKS-OntologyDB connects the ontology and object-relationship model to be populated into database. The Protégé-Frames editor [19] is adopted in this research to structure and capture knowledge. It provides a full-fledged user interface and knowledge server to support users in constructing and storing frame-based domain ontologies, customizing data entry forms, and entering instance data. An object-based recognizer using interrogative knowledge approach or IKO-Recognizer [20], [21] is also adopted. In this research, the IKO-Recognizer maps the object interrogative analysis rule with ontology.
  • 3. Int J Elec & Comp Eng ISSN: 2088-8708  MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured …. (Fatimah Sidi) 3 Construct Ontology Text Preprocessing Unstructured Document Tokenization Part-of-Speech Construction Interrogative Analysis Convert Ontological Constructs into Structured Form Domain knowledge Instance knowledge Structured Form Object-Relationship Model Java classes ……{ …( ..)…{ ……..} } Objects Protégé Knowledge-base System Concepts, Properties, Attributes and Facets of ontological constructs as in Protégé Knowledge-base System Database System Interrogative lexical constructs Tokenization Convert into extension of plain text file Mapping Objects ó Protégé (Classes, Slots) Object Recognizer Unstructured Document List of Concept/Class Ontological constructs Basic Design Structured Document (Populated SQL Database) Database Description Mapping Ontological constructs ó Object-Relationship Model Lexicon interrogative analysis matching rules List of Interrogative Question: ‘apa’ (what), ‘siapa’ (who), ‘bila’ (when), ‘di mana’ (where), ‘mengapa’ (why) and ‘bagaimana’ (how) List of Keyword, Constant, Term and Phrase Definition / Learning of Mapping Interrogative lexical constructs IKL-Identifier: Interrogative Knowledge Lexicon Identifier answering the question within the text interrogatively. IKO-Recognizer: Interrogative Knowledge Object Recognizer mapping object interrogative analysis rule with ontology. MalayIK- Corpus Classes/Concepts, Properties, Attributes and Facets (role restrictions) IKS-OntologyDB: Interrogative Knowledge Structure connecting ontology and object-relationship model to populate into database. Record-Level Objects, Relationships and Constraints Preparation: Conversion of file. List of Object Figure 1. The MalayIK-Ontology Model(left) and System Architecture of MalayIK-Ontology(right) 2.1. IKL-Identifier The Interrogative Knowledge Lexicon- (IKL-) Identifier is a lexicon identifier that uses lexicon interrogative analysis of ‘apa’ (what), ‘siapa’ (who), ‘bila’ (when), ‘di mana’ (where), ‘mengapa’ (why), and ‘bagaimana’ (how) in answering interrogative questions within the text in an unstructured document. The mechanism for the IKL-Identifier is to convert sentences into interrogative lexical constructs in the form of interrogative annotation. Basically, the IKL-Identifier identifies the type of interrogative lexical constructs in each complete sentence within the Malay unstructured document by separating the text into knowledge, information or data. It is also responsible to tag the interrogative lexical constructs with interrogative contextual information, which is important to interpret the information into knowledge and maintain the meaning of the information in the Malay unstructured document. The processes of the IKL-Identifier are illustrated in Figure 2, which are tokenization, lexicon interrogative analysis, interrogative contextual information tagging, and phrases constructor tagging. Lexicon exists No Yes Insert into MalayIK-Corpus as extended lexicon Insert into interrogative annotation Construct and capture phrases and proper nouns MalayIK- Corpus Lexicons Interrogative lexical constructs Tokenization Lexicon Interrogative Analysis Interrogative Contextual Information Tagging Phrases Constructor Tagging Figure 2. IKL-Identifier Processes
  • 4.  ISSN: 2088-8708 Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10 4 During tokenization, the text of unstructured document is segmented into sentences and tokenized into lexicons. Subsequently, the case format is defined, either the lexicon will hold digits, lower, upper, title or toggle cases. Each lexicon will then be assigned with automated serial number by lines, sentences, and token numbers. Next, during the Lexicon Interrogative Analysis, each lexicon is analyzed with lexicon interrogative analysis matching rules of the MalayIK-Corpus using the standard Data Manipulation Language (DML). DML is used to analyze, to check, and to insert the lexicon into interrogative annotation as interrogative lexical construct, should it exists. Any new lexicon that is analyzed will be inserted and defined in the MalayIK-Corpus. Finally, the interrogative lexical constructs are used during Phrases Constructor Tagging. In this step, a phrase is constructed by putting together words based on interrogative annotation of the word. A phrase is a group of words, which contains an idea that forms a unit in which writing is part rather than a whole of a sentence. The words are divided depending on their use in a part of speech. Malay sentence: Brian Fielding Frost yang tercinta, umur 41, meninggal dunia pagi Selasa, September 30, 1998, disebabkan oleh kecederaan dialami dalam satu kemalangan kereta. English sentence: Our beloved Brian Fielding Frost, age 41, passed away Tuesday morning, September 30, 1998, due to injuries sustained in an automobile accident. 2.2. IKO-Recognizer The IKO-Recognizer (Interrogative Knowledge Object Recognizer) performs matching and mapping object interrogative analysis rule of what/who/when/where/ why/how to extract ontological constructs [20], [21]. There are two major processes, object recognizer and mapping process. First, the object recognizer uses object interrogative analysis rules by utilizing Object-Oriented Programming (OOP) in order to conceptually organize the program around its data (objects/concepts). In this process, a number of object interrogative analysis rules and precondition language is pre-defined but users may manually define additional rules. Second, the following mapping process uses an ontology engineering approach, whereby objects that have been created by the object recognizer are accessible as plug-ins in the ontology system. The main process in the IKO-Recognizer is the Object Interrogative Analysis Rules and the Precondition Language. Object interrogative analysis rules capitalize on Java OOP class encapsulation approach. For this, the object interrogative analysis rules use interrogative elements as the most upper class of the object. The structure and behaviors of the objects are implemented through (a) Struktur Kata Nama Am (Noun Structure) and (b) Struktur Leksikon Semantik (Semantic Lexicon Structure) in order to construct objects. The first structure, which is the Struktur Kata Nama Am (Noun Structure), the object interrogative analysis rule is defined by combining the structure and behavior of an object with its inheritance and its conceptual modifiers of one or more subclasses in a hierarchical structure. The structure and behaviour of the object are defined by ‘kata_masuk’ as tagged during the interrogative lexical construct earlier. The „kata_masuk’ for ‘penyelidik’ (investigator) is the grammatical information of „kata nama am‟. It is a noun of ‘kata nama am orang’, which refers to as a conceptual of ‘Orang’ (People), and has the interrogative element of ‘siapa’ (who). Hence, it inherits the general behaviour or properties of its parent ‘siapa’ (who). In the second structure, the Struktur Leksikon Semantik (Lexicon Semantic Structure), the object interrogative analysis rule uses the corresponding structure and behaviour of semantic lexicon that defines interrogative elements of ‘bila’ (when), ‘di mana’ (where), ‘mengapa’ (why), and ‘bagaimana’ (how). The semantic lexicons of ‘bila’ (when) and ‘di mana’ (where) correspond to the phrase or proper noun constructed after the semantic lexicon of the interrogative elements. The structure and behaviour of semantic lexicon ‘bila’ (when) shows about the time at which an event take place. Whereas, the semantic lexicon of ‘di mana’ (where) shows about the place something is in, or is coming from or going to. However, the semantic lexicons of ‘mengapa’ (why) and ‘bagaimana’ (how) correspond to the predicate after the semantic lexicons of ‘mengapa’ (why) and ‘bagaimana’ (how). Reason being is to describe the meaning of the semantic lexicons and to give information about the sentence. The semantic lexicon of ‘mengapa’ (why) talks about the reasons for something which introduces a relative. Whereas, the semantic lexicon ‘bagaimana’ (how) explains the way in which something happens or is done and introduces a statement or fact. The objects of ‘mengapa’ (why) and ‘bagaimana’ (how) correspond accordingly to their definitions of interrogative element.
  • 5. Int J Elec & Comp Eng ISSN: 2088-8708  MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured …. (Fatimah Sidi) 5 2.3. IKS-OntologyDB The IKS-OntologyDB is a process of exporting the ontology structure into a database. The metadata of the information regarding the relationships, properties, attributes, and facets of the class structure are created in the ontology system and are exported into Microsoft Access. The exportation is done by using the facility provided by Protégé by selecting the option of Export to HTML format. The transformation of the knowledge-based system created via the Protégé to the database management system by using HTML format. The HTML information is used to create attributes and constraints in the Microsoft Access. The table is created according to the definition and declarations of the SQL schema and ontology declaration of the Protégé knowledge-based system. This is shown in Table 1. Table 1. Ontology versus SQL Declaration Properties Ontology Declaration SQL Declaration Class Table Slot Field Name Property Data Type Facet Constraint Cardinality Cardinality The components of ontology and conceptual model are basically equivalent in terms of concepts and entities, relationships, and attributes. The ontological constructs generated are mapped with the Object- oriented System Model (OSM) established by [7-11]. It is used by the object-relationship model to describe the data interest which includes relationships, lexical appearances and context keywords. Besides, it is used to structure the data identified and extracted and populate them into database scheme. In general, the relevant knowledge about an object set is represented by a colon (:) after an object- set name which denotes that the object set is a specialization. For example, the lexical object set of Death Date: Date, where date describes the string patterns of interrogative element of ‘bila’ (when). For the lexical object set of Deceased Name: Name and Relative Name: Name, name is matched by recognizing the string patterns of proper nouns interrogative element of ‘siapa’ (who). The context keywords indicate the presence of an object in an object set. For example, ‘kematian’ (died) and ‘meninggal dunia’ (passed away) are the context keywords for Death Date; ‘pengebumian’ (buried) is a context keyword for Interment. 2.4. MalayIK Corpus This research uses the MalayIK-Ontology based on interrogative approach. While most approaches of text processing as discussed in [22] use NLP or information extraction to select the set of keywords or phrases to be analyzed, ontology approach is able to avoid mislead in the “vocabulary problem” which leads to spurious results. By establishing a fixed set of general concepts (“People”, “Location”, “Things”) with the entry of word answering the question interrogatively (“People”, “Location”, “Things” refer to „who‟, „where‟, ‟what‟ respectively), the vocabulary used in the rule mapping phase may be controlled. The most important attribute is the grammatical information of lexicon entry to answer the question of the lexicon grammatical information interrogatively besides the root word (lexicon). The MalayIK-Corpus is a Malay language corpus where the Malay dictionary of Kamus Dewan [23, 24] and the dictionary of root words act as important secondary controls of the lexicon entries. It also refers to the dictionary of Kamus Imbuhan Bahasa Melayu [25], Kamus Dwibahasa Oxford Fajar [26], and Kamus Komprehensif Bahasa Melayu [27]. The lexicons entries are manually inserted in the database using standard DML of the related database. In order to create a general purpose corpus for Malay language, the Ahmad‟s and Abdullah‟s stop words [23], [25] are included which indicate pronoun, auxiliary verb, adverb, predicate, preposition, negative, conjunction, relative and determinant. Table 2 presents examples of words entry extracted from MalayIK-Corpus in a table format (by columns and rows). The header row of Table 2 refers the attributes of corpus by columns. The rest of the rows are examples of words entries for ‘rumah’ (house), ‘sejak’ (since), ‘penyelidik’ (researcher), ‘di’ (at), ‘kerana’ (because) and ‘dengan’ (with). It answers the question of interrogative of ‘apa’ (what), ‘bila’ (when), ‘siapa’ (who), ‘di mana’ (where), ‘mengapa’ (why), and ‘bagaimana’ (how) respectively. Basically, the grammatical information of ‘rumah’ and ‘penyelidik’ is noun (‘kata nama am’) but are classified as different category. The word ‘rumah’ (house) falls under categorization of „Things‟ which answers the interrogative question of „what‟. While ‘penyelidik’ (researcher) falls under categorization of „People‟ which answers the interrogative question of „who‟.
  • 6.  ISSN: 2088-8708 Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10 6 Table 2. Examples of MalayIK-Corpus kata dasar Perkataan kata masuk elemen interogatif Status rumah rumah kata nama am benda apa 1 sejak sejak kata sendi nama masa bila 2 selidik penyelidik kata nama am orang siapa 1 di di kata sendi nama tempat dan arah di mana 2 kerana kerana kata hubung pancangan mengapa 2 dengan dengan kata sendi nama bersama-sama bagaimana 2 3. RESULTS AND ANALYSIS The first question that arises in designing the MalayIK-Ontology is to check whether the constant/keyword recognizer to extract and structure data of Ontos can be applied to Malay unstructured documents. The next question is to check whether the MalayIK-Ontology can identify knowledge as well as data to be extracted and structured are equivalent or better than Ontos. Furthermore, the knowledge or data obtained needs to be checked for its validity. This is to prove that the MalayIK-Ontology works effectively in extracting and structuring data as compared with Ontos. Therefore, following are the steps taken to perform the experiment. The accuracy of Ontos is measured by the numbers of data extracted between the English and Malay obituaries. The accuracy of MalayIK-Ontology is measured by numbers of knowledge or data extracted. When applying Ontos on English and Malay obituaries, three tables are created based on the obituaries ontology, which are DeceasedPerson, Viewing, and DeceasedPersonRelationshipRelativeName. For MalayIK-Ontology, the table DeceasedPersonRelationshipRelativeName is used to compare the translation of Malay language for Relationship. An example of data extracted for both Ontos and MalayIK-Ontology are listed in Table 3 for DeceasedPerson. Table 3. List of Data Extracted for Table DeceasedPerson Extraction Manual Extraction Ontos MalayIK- Ontology Language of Obituary English English Malay Malay Facts/Attributes DeceasedPerson 1002 1002 1002 1002 DeceasedName Lemar K. Adamson Lemar K. Adamson Lemar K. Adamson Lemar K. Adamson Age 84 84 199 84 DeathDate 9/30/1998 9/30/1998 9/30/1998 9/30/1998 BirthDate 6/12/1914 6/12/1914 6/12/1914 6/12/1914 Funeral 5002 5002 5002 5002 FuneralDate 10/5/1998 10/5/1998 - 10/5/1998 FuneralAddress Silverbell Ward, 1540 E. Linden 1540 E. Linden 1540 E. Linden Silverbell Ward, 1540 E. Linden FuneralTime 10:00 AM 10:00 10:00 10:00 AM Interment 7002 7002 7002 7002 IntermentAddress City Cemetery 236 S. Scott 236 S. Scott City Cemetery IntermentDate - - - - (-) No data extracted 3.1. Analysis of Ontos on English and Malay Obituaries The results of Ontos being applied on English and Malay obituaries are shown in Table 4 and Table 5. This table shows the counted number of facts (attributes values) in the test-set documents of English and Malay obituaries. They [7-11] are consistent with their implementation, which only extracts explicit constants. A string is counted as correct, if the constant extracted occurs in the text. With this understanding, counting is basically straightforward. Due to their name lexicon is incomplete and because of their name- extraction expressions are not rich; sometimes parts of a name are missed or a single name were split into two. For these cases, they list the count after + in the Declared Correctly column.
  • 7. Int J Elec & Comp Eng ISSN: 2088-8708  MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured …. (Fatimah Sidi) 7 Partial names also caused most of the problems for the large number of incorrectly identified relatives. With a more accurate and complete lexicon coupled with richer name-extraction expression, they believe they can achieve much higher precision. As anticipated, the experiment to check the constant/keyword recognizer of Ontos on Malay obituaries does not produce the same results as English obituaries for numbers of facts generated. However, the results show that the DeceasedPerson, DeceasedName, BirthDate and DeathDate generate 100% recall and precision. Facts generated are classified as nonlexical and lexical objects set. The nonlexical and lexical objects set are described in what they defined as data frames. A data frame describes the string patterns for its constants. For that, results of lexical objects sets such as counted number of facts for the IntermentDate, IntermentAddress, ViewingDate, ViewingAddress, Relationship and RelativeName generate 0% recall and precision listed in Table 4. This is due to the context keywords in Malay obituaries being translated. However, non-lexical object sets such as DeceasedPerson are always generated for an obituary record and consequently the results for that sets are 100% recall and precision. They represent non-lexical object sets by surrogate identifiers which are generally easier to identify correctly. This shows Ontos can be applied on Malay obituaries for data frame of non-lexical object sets by surrogate identifiers. Table 4. Results of Ontos on English and Malay Obituaries Facts Number of Facts in Source (N) Number of Facts Declared Correctly + Partially Correct (C) Number of Facts Declared Incorrectly (I) Recall Ratio (%) (C/N) Precision Ratio (%) (C/C+I) E M E M E M E M DeceasedPerson 3 3 3 0 0 100 100 100 100 DeceasedName 3 3 3 0 0 100 100 100 100 Age 3 3 3 0 3 100 0 100 50 BirthDate 3 3 3 0 0 100 100 100 100 DeathDate 3 3 3 0 0 100 100 100 100 Funeral FuneralDate 3 3 0 0 0 100 0 100 0 FuneralAddress 3 0 0 3 2 0 0 0 0 FuneralTime 3 2 1 1 1 67 33 67 50 Interment IntermentDate 0 0 0 2 0 0 0 0 0 IntermentAddress 3 0 0 3 3 0 0 0 0 Viewing ViewingDate 1 0 0 0 0 0 0 0 0 ViewingAddress 3 0 0 0 1 0 0 0 0 BeginingTime 3 2 1 0 1 67 33 100 50 EndingTime 3 3 1 0 0 100 33 100 100 RelationshipRelativeName Relationship 8 8 0 0 0 100 0 100 0 RelativeName 28 6 0 9 0 22 0 40 0 (E) English, (M) Malay 3.2. Analysis of MalayIK-Ontology on Malay Obituaries The results of MalayIK-Ontology being applied with Malay obituaries to determine its ability to identify and extract data as compared with Ontos are listed in Table 5. Table 5 shows the results obtained are compared with the results of the first experiment on Ontos. The purpose of comparison is to check the validity of data identified and generated by the MalayIK-Ontology. In this second experiment, results of both nonlexical (DeceasedPerson) and lexical objects (Funeral, Interment, Viewing and RelationshipRelativeName) of MalayIK-Ontology generate 100% recall and precision except for ViewingAddress which generates 67% recall and precision. This is due to the location keyword indicating
  • 8.  ISSN: 2088-8708 Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10 8 address for the lexical object of ViewingAddress in Malay obituaries being translated as something done by a person. Whereas, the lexical object for IntermentDate definitely generates 0%, this is because there is no interment date specified in the obituaries. Hence, this shows that there is an improvement in extracting and structuring data of Ontos lexical objects by the MalayIK-Ontology. Table 5. Results of Ontos and MalayIK-Ontology Facts N Number of Facts Declared Correctly + Partially Correct (C) Number of Facts Declared Incorrectly (I) Recall Ratio (%) (C/N) Precision Ratio (%) (C/C+I) Ontos M O Ontos M O Ontos MO Ontos MO E M M E M M E M M E M M DeceasedPerson 3 3 3 3 0 0 0 100 100 100 100 100 100 DeceasedName 3 3 3 3 0 0 0 100 100 100 100 100 100 Age 3 3 3 3 0 3 0 100 0 100 100 50 100 BirthDate 3 3 3 3 0 0 0 100 100 100 100 100 100 DeathDate 3 3 3 3 0 0 0 100 100 100 100 100 100 Funeral FuneralDate 3 3 0 3 0 0 0 100 0 100 100 0 100 FuneralAddress 3 0 0 3 3 2 0 0 0 100 0 0 100 FuneralTime 3 2 1 3 1 1 0 67 33 100 67 50 100 Interment IntermentDate 0 0 0 0 2 0 0 0 0 0 0 0 0 IntermentAddress 3 0 0 3 3 3 0 0 0 100 0 0 100 Viewing ViewingDate 1 0 0 1 0 0 0 0 0 100 0 0 100 ViewingAddress 3 0 0 2 0 1 1 0 0 67 0 0 67 BeginingTime 3 2 1 3 0 1 0 67 33 100 100 50 100 EndingTime 3 3 1 3 0 0 0 100 33 100 100 100 100 RelationshipRelativeName Relationship 8 8 0 8 0 0 0 100 0 100 100 0 100 RelativeName 2 8 6 0 28 9 0 0 22 0 100 40 0 100 (N) Number of Facts in Source, (E) English, (M) Malay, (MO) MalayIK-Ontology 4. CONCLUSION The main objective of this research is to propose a new approach to transform extracted knowledge in Malay unstructured document by identifying, organizing, and structuring them into interrogative structured form. In order to achieve this objective, an approach is established through the MalayIK-Ontology approach. Based on the results, the annotation of interrogative contextual information tagged in interrogative lexical constructs improves the data extraction. The annotation of interrogative contextual information is annotated with interrogative and grammatical information of the lexical constructs. For example, the lexical object set of BirthDate, DeathDate, FuneralDate, and ViewingDate generate precision of 100% which also generate 100% precision on Ontos.
  • 9. Int J Elec & Comp Eng ISSN: 2088-8708  MalayIK: An Ontological Approach to Knowledge Transformation in Malay Unstructured …. (Fatimah Sidi) 9 This improvement is due to the implementation of annotation interrogative contextual information which is tagged with interrogative element of ‘bila’ (when) which describes about the string patterns of time at which things happened. For lexical object set of DeceasedName and RelativeName which also generate precision of 100%, as the name is matched by recognizing the string patterns of proper nouns which is tagged with interrogative element of ‘siapa’ (who). Besides, phrases or proper nouns based on interrogative annotation of the lexicon are also annotated in the lexical constructs. 5. ACKNOWLEDGEMENT The work was supported by the Ministry of Higher Education Malaysia under the Fundamental Research Grant Scheme (FRGS 03-12-10-999FR). REFERENCES [1] Spiegler, I.: Technology and Knowledge: Bridging a “Generating” Gap, Information & Management, vol. 40(6), pp. 533-539, 2003. [2] T. K. Das and P. M. Kumar, "Big Data Analytics: A Framework for Unstructured Data Analysis", International Journal of Engineering and Technology, Vol. 5, pp. 153-156, 2013 [3] K. Cukier, “The Economist, Data, data everywhere: A special report on managing information”, 2010, February 25, retrieved from http://www.economist. com/node/15557443 [4] N. Khan, et al., "Big Data: Survey, Technologies, Opportunities, and Challenges", The Scientific World Journal, vol. 2014, pp. 1-18, 2014. [5] R. Feldman, "Mining Unstructured Data", in International Conference on Knowledge Discovery and Data Mining 5th ACM SIGKDD, August, 1999. [6] E. J. Quigley and A. Debons, "Interrogative theory of information and knowledge", in ACM SIGCPR conference on Computer personnel research, pp. 4-10, 1999, pp. 4-10. [7] D. W. Embley, et al., "A Conceptual-Modeling Approach to Extracting Data from the Web", in 17th International Conference on Conceptual Modeling, 1998, pp. 78-91. [8] D. W. Embley, "Ontology-based Extraction and Structuring of Information from Data-rich Unstructured Documents", in 7th International Conference on Information and Knowledge Management, 1998, pp. 52-59. [9] D. W. Embley, et al., "Conceptual-Model-based Data Extraction from Multiple-Record Web Pages, Data and Knowledge Engineering", Data & Knowledge Engineering Journal, vol. 31(3), pp. 227-251, 1999. [10] D. W. Embley, et al., "Record-Boundary Discovery in Web Documents", in Proceeding of the 1999 ACM SIGMOD International Conference on Management of Data, 1999, pp. 467-478. [11] D. W. Embley, "Toward Semantic Understanding: An Approach Based on Information Extraction Ontologies" in 15th Australasian Database Conference, 2004, pp. 3-12. [12] B. N. Schilit and M. M. Theimer, "Disseminating Active Map Information to Mobile Hosts", IEEE Network, vol. 8(5), pp. 22-32, 1994. [13] Lamming, M.G. and Newman, W.M., "Activity-based Information Retrieval: Technology in Support of Personal Memory", in 12th World Computer Congress on Personal Computers and Intelligent Systems - Information Processing, 1992, pp. 68-81. [14] G. D. Abowd, et al., "Towards a Better Understanding of Context and Context-Awareness", in 1st International Symposium on Handheld and Ubiquitous Computing 1999, pp. 304-307. [15] I. Nonaka, "A Dynamic Theory of Organizational Knowledge Creation". Organization Science, vol. 5(1), pp. 14-37, 1994. [16] T. H. Davenport, et al., "Working Knowledge: How Organizations Manage What They Know", Harvard Business School Press, 2000. [17] F. J. Varela, et al., "The Embodied Mind", MIT Press, 1992. [18] R. Virk, "Transforming Unstructured Content into “Meaningful” XML", 2004, http://www.dmreview.com/whitepaper/WID413.pdf. [19] N. F. Noy, and D. L. McGuinness, "Ontology Development 101: A Guide to Creating Your First Ontology", Knowledge Systems Laboratory, Technical Report KSL-01-05, 2001. [20] I. Ishak, et al., "Object Recognizer For Organizing And Structuring Unstructured Documents Using Interrogative Knowledge", Journal of Theoretical & Applied Information Technology, vol. 83, pp. 442-450, 2016. [21] F. Sidi, F. and M. A. Jabar, "Interrogative Knowledge Organization and Structuring from Unstructured Documents". Knowledge Management and Innovation: A Business Competitive Edge Perspective, pp. 1028-1032, 2010. [22] M. Shamsfard and A. A. Barforoush, "The State of the Art in Ontology Learning: A Framework for Comparison", Knowledge Engineering Review, vol. 18(4), pp. 293-316, 2003. [23] Dewan Bahasa Perpustakaan, "Kamus Dewan Edisi Ketiga", Dewan Bahasa dan Pustaka, 2002. [24] Dewan Bahasa Perpustakaan, "Kamus Dewan Edisi Keempat", Dewan Bahasa dan Pustaka, 2005. [25] H. M. Ali, "Kamus Imbuhan Bahasa Melayu Edisi Kedua", Penerbit Fajar Bakti Sdn. Bhd., 1993. [26] J. M. Hawkins, "Kamus Dwibahasa Oxford Fajar Edisi Ketiga", Penerbit Fajar Bakti Sdn. Bhd., 2001. [27] A. Othman, "Kamus Komprehensif Bahasa Melayu", Penerbit Fajar Bakti Sdn. Bhd., a subsidiary of Oxford University Press, 2005.
  • 10.  ISSN: 2088-8708 Int J Elec & Comp Eng, Vol. 8, No. 1, February 2018 : 1 – 10 10 BIOGRAPHIES OF AUTHORS Fatimah Sidi holds a PhD in Management Information Systems from Universiti Putra Malaysia. She received her Bachelor and Masters degree in Computer Science from the same university. She has twenty years experience as System Analyst in System Development and Enterprise Database Management. Her research interest spans around Knowledge and Information Management Systems, Data and Knowledge Engineering, as well as Database and Data Warehousing Iskandar Ishak received his BIT (Hons) from Universiti Tenaga Nasional in 2002. He received Master of Technology in Information Technology in the year 2003 from the Royal Melbourne Institute of Technology University and completed his PhD in Computer Science in 2011 at the Universiti Teknologi Malaysia. He is now a Senior Lecturer at the Department of Computer Science, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia since 2011. His research interest lies in the area of Database System, Information Systems, Big Data, Peer-to-Peer and Mobile Computing and Information Retrieval Marzanah A. Jabar graduated from Universiti Kebangsaan Malaysia with BSc. in Quantitative Studies in 1983. She received her MSc. in Computer Science in the year 2001 and her PhD in Management Information System in the year 2007, both from Universiti Putra Malaysia. She has over twenty years experience in leading information systems development. Her research interest lies in the area of Software and Knowledge Engineering, Knowledge Management Systems, and strategic use of Information Systems and Technologies