SlideShare a Scribd company logo
NAME:- DEBOLINA BISWAS
R O L L N O : - 0 0 2 0 0 0 8 0 2 0 1 0
S E S S I O N : 2 0 2 0 - 2 0 2 1
PAPER: I N F O R M E T R I C S . ( M L - 1 1 )
CO-WORD ANALYSIS
Introduction
Since World War II, the scope and volume of scientific research have increased dramatically. This is well
reflected in the growth of the literature. Many decades later, along with developments in information
technology, especially in the area of data storage, the amount of information in the world is overwhelming.
In formulating and executing broad spectrum research policy, it is important to understand how
research thrusts have interrelated and evolved over time, how they are projected to evolve, and
how different types of interventions from sponsors and policymakers can affect the evolution and
impact of research.
The traditional way to map the relationships among concepts, ideas, and problems in science is to
seek the views of a relatively small number of experts. But identification of the connectivity of a
broad range of areas is well beyond the expertise of any panel of experts.
Much recent effort has been focused on development of more objective quantitative approaches for
analyzing and integrating written and survey information to supplement analysts or groups of
peers in understanding research trends ; they include co-citation analysis, co-nomination analysis,
and co-word analysis.
So, we can see that modern quantitative techniques utilise computer technologies extensively. One
of techniques exploits the use of co-occurrence phenomena, and it is the method of co-word
analysis.
What is Co-Word Analysis?
 Co-word analyses study the co-occurrence of pairs of items (for example, keywords) that
are representative in a document, to identify relations between the ideas presented in the
texts (He, 1999).
 Co-word analysis is related to co-citation analysis (Small, 1973; Small & Griffith, 1974).
Co-citation analysis provides a method of mapping the structure of a research field
through pairs of documents jointly cited. But co-word analysis deals directly with sets of
terms shared by documents instead of shared citations. Therefore, it maps the pertinent
literature directly from the interaction of key terms instead of the interaction of citations.
 So, in co-word analysis, it is assumed that keywords extracted from papers could
represent a specific research direction, research topic or subject of a field. If two
keywords co-occur within one paper, the two research topics they represent are related.
Higher co-word frequency means stronger correlation in keywords pairs, which can
further suggest that two keywords are related to a specific research topic (Cambrosio et
al. 1993).
 Co-word analysis has the potential of effectively revealing patterns and trends in a
specific discipline (Ding et al. 2001). Co-word analysis has been utilized to reveal
research advances in many fields.
Origins of Co-word Analysis
The origins of co-word phenomena can be traced back at least five decades to the pioneering
work in: 1) lexicography of Homby (1942) to account for co-occurrence knowledge, and 2)
linguistics of De Saussure (1949) to describe how affinity of two language units correlates with
their appearance in the language. (Kostoff, n.d)
But the technique of co-word analysis was first developed in collaboration between the Centre
de Sociologie de I’Innovation of the Ecole Nationale Superieure des Mines of Paris and the
CNRS (Centre National de la Re-cherche Scientifique) of France during the 198Os, and their
system was called “LEXIMAPPE.”
This method, originally dedicated to scientific fields, measures the association strength
between terms in documents to reveal and visualize the evolution of scientific fields through
the construction of clusters and strategic diagram.
For about twenty years, this technique has been employed to map the dynamic development of
several research fields. One of the early studies was carried out by Serge Bauin (1986) to map
the dynamics of aquaculture from 1979 to 1981.
Theoretical Foundation of Co-word
Analysis-Actor Network
The co-word analysis technique was first proposed to map the dynamics of science. The most
feasible way to understand the dynamics of science is to take the force of science in present-
day societies into account.
“Actor network is the theoretical foundation for co-word analysis to map the dynamics of
science”. (Callon, 1986)
Even though science cannot be reduced to texts only, texts are still a prime source for studies
on how worlds are created and transformed in the laboratory. Therefore, instead of
following the actors to see how they change the world, following the texts is another way to
map the dynamics of science.
Based on the co-occurrence of pairs of words, co-word analysis seeks to extract the themes of
science and detect the linkages among these themes directly from the subject content of
texts. (Callon et al., 1986b).
Overall, co-word analysis considers the dynamics of science as a result of actor strategies.
(Callon et al., 1991).
Co-word analysis method
Co-word analysis is a content analysis technique that uses patterns of co-occurrence of pairs
of items (i.e., words or noun phrases) in a corpus of texts to identify the relationships
between ideas within the subject areas as presented in these texts.
Indexes based on the co-occurrence frequency of items, such as an inclusion index and a
proximity index, are used to measure the strength of relationships between items. Based on
these indexes, items are clustered into groups and displayed in network maps.
For example, an inclusion map is used to highlight the central themes in a domain, and a
proximity map is used to reveal the connections between minor areas hidden behind the
central ones. Some other indexes, such as those based on density and centrality, are
employed to evaluate the shape of each map, showing the degree to which each area is
centrally structured and the extent to which each area is central to the others. By comparing
the network maps for different time periods, the scientific dynamic can be detected.
Metrics for co-word analysis
 Metrics for co-word analysis have been studied extensively (Grivel & François, 1995). Two
terms, i and j, co-occur if they are used together in a single document. Take a corpus consisting
of N documents. Each document is indexed by a set of unique terms that can occur in multiple
documents. Let Ck be the number of occurrences of term k; i.e., the number of times k is used
for indexing documents in the corpus. Let Cij be the number of co-occurrences of terms i and j
(the number of documents indexed by both terms).
 In later co-word studies a index is employed to calculate the association values between word
pairs. This coefficient is called the equivalence index (e-coefficient) (Callon et al., 1991) or
strength (Coulter et al., 1998). The basic metric used for this study is the Association Indice
Eij. The strength of association between terms i and j is given by
the expression:
 Two terms that appear many times in isolation but only a few times together will yield a lower E
value than two terms that appear relatively less often alone but have a higher ratio of co-
occurrences. Terms with relatively high E values form the networks’ links. A term network
consists of nodes (terms) connected by links. Each node must be linked to at least one other
node in a network.
Continued
 This metric provides an intuitive measure of the strength of association between terms,
and only indicates that there is some semantic relationship. This metric is easier to
understand and utilize in the production and interpretation of term association maps. It
allows associations of both major and minor terms and is symmetrical in their
relationships (Callon, Courtial, & Turner, 1991). E can be used as the basis for devising
several complementary measures of term interactions and term networks in a unified
manner.
 The co-word algorithm then proceeds in two steps to produce the paired connection of
terms. The first step builds networks that can identify areas of strong focus. The second
step can identify terms that associate in more than one network and thereby indicate
overlapping issues.
 Without some minimum constraints, terms that appear infrequently but almost always
together could dominate clusters; hence a minimum co-occurrence Cij value is required to
generate a link. At the same time, some maps can become cluttered due to an excessive
number of legitimate links (generally of decreasing E values); hence, restrictions on
numbers of nodes and sometimes links are required to help discover the major partitions
of concepts.
Inclusion Index
 In the early studies of co- word analysis, two indexes were introduced: 1. Inclusion
Index and 2. Proximity Index.
 The hierarchies of subject areas in a research problem can be detected by calculating an
index, called the inclusion index. The Inclusion Index offers access to a tangible metric
that can help identify gaps in our works and understand what needs to be addressed.
 Inclusion Index (Iij):
where,
 Cij the number of documents in which the keyword pair (M, and M,) appears;
 Ci is the occurrence frequency of keyword Mi in the set of articles;
 Cj is the occurrence frequency of keyword M. in the set of articles;
 min (Ci,Cj) is the minimum of the two freqdencies Ci,and Cj.
 Iij has a value between 0 and 1, and it can be interpreted as a conditional probability.
Proximity Index
 To detect the minor but potentially growing areas we need the proximity index.
 If a situation occurs that implies that there are some mediator keywords, which
have a relatively low occurrence frequency but still have significant
relationships with some of the peripheral key-words. To bring out such
patterns,
a proximity index Pij is defined:
where,
 Ci, Cj, and Cij have the same meaning as in formula of inclusion index.
 N is the number of articles in the collection.
 The mediator and peripheral keywords pulled out by Pij represent minor but
potentially growing areas.
Inclusion Map
 After the inclusion and proximity indexes are calculated, inclusion and
proximity maps are created.
 The inclusion maps are designed to discover the central themes in a domain and
depict their relationship to keywords that occur less frequently. The inclusion map
involves getting more information about a certain theme.
 To create inclusion maps, the link that has the highest inclusion index value is
selected first. These linked nodes become the starting points for the first inclusion
map (subnetwork).
 Other links and their corresponding nodes are then added into the map in the
decreasing order of their inclusion index until the threshold I0 is reached.
 Other links and their corresponding nodes are then added into the map in the
decreasing order of their inclusion index until the threshold I, is reached.
 Keywords that appear on the top level of inclusion maps are called “central poles” of
the domain of research. Keywords that are included in the central poles, and
themselves include some other words at lower levels, are called “mediator words”
(Callon et al., 1986).
Proximity Map
 The proximity maps are designed to discover connections between minor
ideas hidden behind the central themes. The proximity maps concerns the
analysis of the links between themes.
 The process to create proximity maps is similar to that for inclusion maps. The
difference is that the proximity index is used instead of the inclusion index.
 If the threshold P, is lowered enough, more proximity connections between
keywords will appear in the map and, eventually, the mediators and central
poles found in inclusion maps will reappear. In this way, the relationship
between minor issues and central poles can be studied (Callon et al., 1986).
Density and Centrality
 Density: Density is used to measure the strength of the links that tie together the
words making up the cluster; that is the internal strength of a cluster. It provides a good
representation of the cluster’s capacity to maintain itself and to develop over the course
of time in the field under consideration (Callon et al., 1991).
 The value of the density of a given cluster can be measured in several ways. Generally, the
index value for links between each word pair is calculated first. Then, the density value
can be the average value (mean) of internal links. the median value of internal links or
the sum of the squares of the value of internal links.
 Centrality: Centrality is used to measure the strength of a subject area’s interaction
with other subject areas. Ranking subject areas (clusters) with respect to their centrality
shows the extent to which each area is central within a global research network. The
greater the number and strength of a subject area’s connections with other subject areas,
the more central this subject area will be in the research network (Bauin et al., 1991).
 For a given cluster (area), its centrality can be the sum of all external link values or the
square root of the sum of the squares of all external link values (e.g., Coulter et al., 1998).
An external link is a link that goes from a word belonging to a cluster to a word external
to the cluster.
Strategic diagram
 A strategic diagram that offers a global representation of the structure of
any field or subfield can be created by plotting centrality and density into a
two-dimensional diagram (Law et al., 1988).
 Typically, the horizontal axis represents centrality, the vertical axis
represents density, and the origin of the graph is at the median of the
respective axis values. This map situates each subject area within a two-
dimensional space divided into four quadrants.
 The strategic diagram is used in many co-word analysis studies and the
analysis based on it is similar among these studies.
Strategic diagram
Quadrant 3 : Peripheral
and developed Words
Quadrant 1: Central and
developed Words
Quadrant 4: Peripheral
and undeveloped Words
Quadrant 2: Central and
undeveloped Words
Density
Centrality
Thematic Network
 In a theme, the keywords and their interconnections draw a network graph, called a
thematic network. Each thematic network is labeled using the name of the most
significant keyword in the associated theme (usually identified by the most central
keyword of the theme).
 In this figure, several keywords are
interconnected, where the volume of the
spheres is proportional to the number of
Documents corresponding to each keyword,
the thickness of the link between two
spheres i and j is proportional to the
equivalence index Iij .
 Network is built, based on the documents linked to each thematic network. A
document is linked to a theme if it has at least two keywords presented in the
thematic network.
Dynamics of Networks
 A striking feature of some strategic diagrams is the radical change in
the configuration of the network at two periods. This reflects the
dynamics of science. Based on the strategic diagram, we can analyze the
stability of the networks and foresee their changes in the future. This
issue is addressed in many studies, and the methods used in these
studies fall into two categories: The study of strategic diagrams, and the
ratio of centrality to density.
 The comparison of two networks, N1 and N2, which might be two net-
works at different times or two distinct networks at the same time, can
be done by the method network comparison.
Database Tomography
 Database Tomography is a patented system for analyzing large amounts of textual
computerized material (Kostoff et al., 1995).
 It can be considered as another generation of co-word analysis. Algorithms for extracting
multi-word phrase frequencies and performing phrase proximity analysis are included in
this system. Phrase frequency analysis can be used to discover the pervasive themes of a
database while the phrase proximity analysis can be used to detect the relationships
among these themes and between these themes and subthemes (Kostoff et al., 1997a).
 Similar to co-word analysis, Database Tomography can identify the main intellectual
thrust areas and the relationship among these thrust areas. It provides a comprehensive
overview of a research network and allows specific starting points to be chosen rationally
for more detailed investigations into a topic of interest.
 Based on the term co-occurrence information, Database Tomography can also be used to
expand the initial query in information retrieval (IR) systems and, in turn, allow the
retrieval of relevant documents that would not have been retrieved with the initial query
(Kostoff et al., 1997b).
Advantages of Co-Word Analysis
 Quantitative Over Qualitative: The advantages of co-word analysis over
qualitative analysis were recognized by researchers. Using the quantitative in pursuit of
the qualitative, researchers are also able to highlight features of scientific fields that have
not always been recognized. The heterogeneity of scientific world-building is preserved in
co-word analysis, where experimental findings, research methods, concepts, social
problems, material artifacts and locations may appear together on the maps.
 Flexibility: Compared with other methods of analysis that focus on texts, co-word
analysis is much more flexible in that it shows the research net- work with graphs. On the
one hand, these graphs can be simplified to show the overall structure of the network. On
the other hand, one can zoom in on certain areas and trace the co-word patterns in as
much detail as one wishes.
Limitations of Co-word Analysis
 Indexer effect: Solely making use of keywords and index words was the biggest
problem of early co- word analysis. It was called “indexer effect” and was addressed by
many researchers.
 The representations of the results given by co-word analysis are too difficult to read.
(Whittaker, 1989)
 The coverage of database is incomplete. Certain types of literature, such as patents arid
the “grey literature,” lie outside of the publication circuit and are not indexed in the
database. This means that the results from co-word analysis cannot reflect the whole
picture of the research field in question.
 The delay between the writing of a document and the moment when it is indexed and
entered into a database causes the co-word analysis to fail to detect emerging research
themes at an early stage. (Callon et al., 1986)
Some key Issues in Co-word Analysis
 The maps obtained by co-word analysis are generally considered very
difficult to understand in isolation. They have to be interpreted with
caution. It is suggested that the interpretation must be active and based
on the comparison of maps (Callon and al. 1986).
 The choice of the words (keywords, descriptors) is another issue to be
carefully considered and has led to many discussions (Leydesdorff,
1997, Whittaker, 1989). The "normalization" of words must be
considered as well .
Assumptions of Co-word Analysis
 The assumptions of co-word analysis are presented by Whittaker (1989) as follows:
“It [co-word analysis] relies upon the argument that (i) authors of scientific articles choose
their technical terms carefully; (ii) when different terms are used in the same articles it is
therefore because the author is either recognizing or postulating some non-trivial
relationship between their referents; and (iii) if enough different authors appear to recognize
the same relationship, then that relation- ship may be assumed to have some significance
within the area of science concerned.”
 Later, Law and Whittaker (1992) have restated two of the assumptions. First. co-word
analysis assumes that the keywords used by indexers to index a paper reflect the present
stages of the scientific re- search in question. Then, co-word analysis assumes that
arguments received by other scientists will lead to the publication of further scientific
papers that are indexed by similar sets of keywords.
Applications of Co-word analysis
 Co-word analysis has been used to study conceptual work in different domains
by many researchers. full text articles included in the journal Scientrometrics to
mapping the intellectual structure of scientrometrics using text-mining and co-
word analysis.
 Co-word analysis is the bibliometric technique used to identify the main
themes. All this allows us to quantify and visualize the thematic evolution of
many research fields. It also helps to both experts and novices to understand
the current state of the field and to predict where future research could lead.
Conclusion
The appearance of co-word analysis added another choice in the area of
bibliometrics and provided another way to discover knowledge in databases.
Similar to co-citation analysis, co-word analysis is also a kind of relational study
based on the idea that publications should not be considered as discrete units.
Instead, each is built upon others.
Previous studies have shown co-word analysis to be a powerful tool to discover
knowledge in databases. It has been used to detect the themes in a given research
area, the relationship between these themes, the extent to which these themes are
central to the whole area, and the degree to which these themes are internally
structured. In the last twenty years, co-word analysis has been improved in many
aspects. The main progress ran be found in two fields: 1. Source of words and 2.
Measurements.
So lastly we can say that co-word analysis is the one of the most common methods
used for constructing the thematic and strategic map of a field.
References
 Bauin, S. (1986). Aquaculture: A field by bureaucratic fiat. In M. Callon, J. Law, & A. Rip
(Eds.),Mapping the dynamics of sriencr and technology: Sociology ofscience in the real
world (pp. 124-141). London: The Macmillan Press Ltd.
 Chllon, M. (1986). Pinpointing industrial invention: An exploration of quantitative meth-
ods for the analysis of patents. In M. Callon, J. Law, & A. Rip (Eds.), Mapping thr
dynamics of science and techno1 ociology of science in the real world (pp. 163-188). Lon-
don: The Macmillan Press Ltd.
 Callon, M.; Courtid1,J-P.; & Ladle, F. (1991). Co-word analysis as a tool for describing the
nrtwork of interactions between basic and technological research: The case of poly- mer
chemistry. Scienlomrtrirs, 22(1).
 He, Q., 1999, Knowledge Discovery Through Co-Word Analysis, Library Trends, 48(1).
 Whittaker, J. (1989). Creativity and conformity in science: Titles, keywords and co-word
analysis. Social Studies of Science, 19, 473-496.
Thank You

More Related Content

What's hot

Desidoc
DesidocDesidoc
Desidoc
Manish Aarya
 
Sendoc
SendocSendoc
Sendoc
Akshay Kumar
 
Indexing popsi....
Indexing popsi....Indexing popsi....
Indexing popsi....
PAQUIAAIZEL
 
Informetrics final
Informetrics finalInformetrics final
Informetrics finalAamir Abbas
 
Chain indexing
Chain indexingChain indexing
Chain indexingsilambu111
 
DELNET by Gaurav Boudh
DELNET by Gaurav BoudhDELNET by Gaurav Boudh
DELNET by Gaurav Boudh
Library and Information Science Blog
 
Management of Library and information Centres
Management of Library and information CentresManagement of Library and information Centres
Management of Library and information Centres
Sundar B N
 
Webometrics
WebometricsWebometrics
Webometrics
SrikantaSahu10
 
Library networks and consortium
Library networks and consortiumLibrary networks and consortium
Library networks and consortium
SunilKumar5028
 
National social science documentation centre (nassdoc )
National social science documentation centre (nassdoc )National social science documentation centre (nassdoc )
National social science documentation centre (nassdoc )
GordonAmidu
 
NISCAIR by Jaya Singh
NISCAIR by Jaya SinghNISCAIR by Jaya Singh
NISCAIR by Jaya Singh
AMAN KUMAR KUSHWAHA
 
E-granthalaya ILMS
E-granthalaya ILMSE-granthalaya ILMS
E-granthalaya ILMS
Dr. Rupak Chakravarty
 
DDS of INSDOC
DDS of INSDOC DDS of INSDOC
DDS of INSDOC
samhati27
 
Introduction to WebDewey
Introduction to WebDeweyIntroduction to WebDewey
Introduction to WebDewey
awilso02
 
Lotka’s law a study with reference to the literature by
Lotka’s law a study with reference to the literature byLotka’s law a study with reference to the literature by
Lotka’s law a study with reference to the literature byIAEME Publication
 

What's hot (20)

Desidoc
DesidocDesidoc
Desidoc
 
Uniterm indexing
Uniterm indexing Uniterm indexing
Uniterm indexing
 
Sendoc
SendocSendoc
Sendoc
 
Indexing popsi....
Indexing popsi....Indexing popsi....
Indexing popsi....
 
Informetrics final
Informetrics finalInformetrics final
Informetrics final
 
Bibliometrics
BibliometricsBibliometrics
Bibliometrics
 
Chain indexing
Chain indexingChain indexing
Chain indexing
 
DELNET by Gaurav Boudh
DELNET by Gaurav BoudhDELNET by Gaurav Boudh
DELNET by Gaurav Boudh
 
Management of Library and information Centres
Management of Library and information CentresManagement of Library and information Centres
Management of Library and information Centres
 
Webometrics
WebometricsWebometrics
Webometrics
 
UNISIST
UNISISTUNISIST
UNISIST
 
Precis
PrecisPrecis
Precis
 
Library networks and consortium
Library networks and consortiumLibrary networks and consortium
Library networks and consortium
 
National social science documentation centre (nassdoc )
National social science documentation centre (nassdoc )National social science documentation centre (nassdoc )
National social science documentation centre (nassdoc )
 
NISCAIR by Jaya Singh
NISCAIR by Jaya SinghNISCAIR by Jaya Singh
NISCAIR by Jaya Singh
 
E-granthalaya ILMS
E-granthalaya ILMSE-granthalaya ILMS
E-granthalaya ILMS
 
DDS of INSDOC
DDS of INSDOC DDS of INSDOC
DDS of INSDOC
 
Introduction to WebDewey
Introduction to WebDeweyIntroduction to WebDewey
Introduction to WebDewey
 
Lotka’s law a study with reference to the literature by
Lotka’s law a study with reference to the literature byLotka’s law a study with reference to the literature by
Lotka’s law a study with reference to the literature by
 
Soul
Soul Soul
Soul
 

Similar to Co word analysis

ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITYASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
IJwest
 
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITYASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
dannyijwest
 
Adapting Measures of Clumping Strength to Assess Term-Term Similarity
Adapting Measures of Clumping Strength to Assess Term-Term SimilarityAdapting Measures of Clumping Strength to Assess Term-Term Similarity
Adapting Measures of Clumping Strength to Assess Term-Term Similarity
Vladimir Kulyukin
 
A Science Mapping Analysis Of Blood Donation Behaviour
A Science Mapping Analysis Of Blood Donation BehaviourA Science Mapping Analysis Of Blood Donation Behaviour
A Science Mapping Analysis Of Blood Donation Behaviour
Bria Davis
 
Visual mapping sentence_a_methodological (1)
Visual mapping sentence_a_methodological (1)Visual mapping sentence_a_methodological (1)
Visual mapping sentence_a_methodological (1)
Natalia Djahi
 
Knowledge codification and abstraction
Knowledge codification and abstractionKnowledge codification and abstraction
Knowledge codification and abstractionEva Ortoll
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextFulvio Rotella
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextUniversity of Bari (Italy)
 
Afl 521 interpretive
Afl 521 interpretiveAfl 521 interpretive
Afl 521 interpretive
Randy Nobleza
 
A Review And Evaluation Of Prominent Theories Of Writing
A Review And Evaluation Of Prominent Theories Of WritingA Review And Evaluation Of Prominent Theories Of Writing
A Review And Evaluation Of Prominent Theories Of Writing
Lisa Riley
 
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONSONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
sipij
 
Automatize Document Topic And Subtopic Detection With Support Of A Corpus
Automatize Document Topic And Subtopic Detection With Support Of A CorpusAutomatize Document Topic And Subtopic Detection With Support Of A Corpus
Automatize Document Topic And Subtopic Detection With Support Of A Corpus
Richard Hogue
 
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
ijdms
 
Supporting scientific discovery through linkages of literature and data
Supporting scientific discovery through linkages of literature and dataSupporting scientific discovery through linkages of literature and data
Supporting scientific discovery through linkages of literature and data
Don Pellegrino
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD Editor
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Towards a Relational Paradigm in Sustainability Research, Practice, and Educa...
Towards a Relational Paradigm in Sustainability Research, Practice, and Educa...Towards a Relational Paradigm in Sustainability Research, Practice, and Educa...
Towards a Relational Paradigm in Sustainability Research, Practice, and Educa...
Zack Walsh
 
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
cscpconf
 

Similar to Co word analysis (20)

ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITYASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
 
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITYASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
ASSESSING SIMILARITY BETWEEN ONTOLOGIES: THE CASE OF THE CONCEPTUAL SIMILARITY
 
Adapting Measures of Clumping Strength to Assess Term-Term Similarity
Adapting Measures of Clumping Strength to Assess Term-Term SimilarityAdapting Measures of Clumping Strength to Assess Term-Term Similarity
Adapting Measures of Clumping Strength to Assess Term-Term Similarity
 
A Science Mapping Analysis Of Blood Donation Behaviour
A Science Mapping Analysis Of Blood Donation BehaviourA Science Mapping Analysis Of Blood Donation Behaviour
A Science Mapping Analysis Of Blood Donation Behaviour
 
Visual mapping sentence_a_methodological (1)
Visual mapping sentence_a_methodological (1)Visual mapping sentence_a_methodological (1)
Visual mapping sentence_a_methodological (1)
 
Knowledge codification and abstraction
Knowledge codification and abstractionKnowledge codification and abstraction
Knowledge codification and abstraction
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
 
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from TextCooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
 
Afl 521 interpretive
Afl 521 interpretiveAfl 521 interpretive
Afl 521 interpretive
 
A Review And Evaluation Of Prominent Theories Of Writing
A Review And Evaluation Of Prominent Theories Of WritingA Review And Evaluation Of Prominent Theories Of Writing
A Review And Evaluation Of Prominent Theories Of Writing
 
Drijvers et al_two_lenses
Drijvers et al_two_lensesDrijvers et al_two_lenses
Drijvers et al_two_lenses
 
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONSONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
 
Automatize Document Topic And Subtopic Detection With Support Of A Corpus
Automatize Document Topic And Subtopic Detection With Support Of A CorpusAutomatize Document Topic And Subtopic Detection With Support Of A Corpus
Automatize Document Topic And Subtopic Detection With Support Of A Corpus
 
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
Interpreting the Semantics of Anomalies Based on Mutual Information in Link M...
 
Supporting scientific discovery through linkages of literature and data
Supporting scientific discovery through linkages of literature and dataSupporting scientific discovery through linkages of literature and data
Supporting scientific discovery through linkages of literature and data
 
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Towards a Relational Paradigm in Sustainability Research, Practice, and Educa...
Towards a Relational Paradigm in Sustainability Research, Practice, and Educa...Towards a Relational Paradigm in Sustainability Research, Practice, and Educa...
Towards a Relational Paradigm in Sustainability Research, Practice, and Educa...
 
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
SEMANTIC INTEGRATION FOR AUTOMATIC ONTOLOGY MAPPING
 
Final libre(1)
Final libre(1)Final libre(1)
Final libre(1)
 

Recently uploaded

Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
ewymefz
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
ewymefz
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
enxupq
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
nscud
 

Recently uploaded (20)

Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
一比一原版(UMich毕业证)密歇根大学|安娜堡分校毕业证成绩单
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单一比一原版(YU毕业证)约克大学毕业证成绩单
一比一原版(YU毕业证)约克大学毕业证成绩单
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 

Co word analysis

  • 1. NAME:- DEBOLINA BISWAS R O L L N O : - 0 0 2 0 0 0 8 0 2 0 1 0 S E S S I O N : 2 0 2 0 - 2 0 2 1 PAPER: I N F O R M E T R I C S . ( M L - 1 1 ) CO-WORD ANALYSIS
  • 2. Introduction Since World War II, the scope and volume of scientific research have increased dramatically. This is well reflected in the growth of the literature. Many decades later, along with developments in information technology, especially in the area of data storage, the amount of information in the world is overwhelming. In formulating and executing broad spectrum research policy, it is important to understand how research thrusts have interrelated and evolved over time, how they are projected to evolve, and how different types of interventions from sponsors and policymakers can affect the evolution and impact of research. The traditional way to map the relationships among concepts, ideas, and problems in science is to seek the views of a relatively small number of experts. But identification of the connectivity of a broad range of areas is well beyond the expertise of any panel of experts. Much recent effort has been focused on development of more objective quantitative approaches for analyzing and integrating written and survey information to supplement analysts or groups of peers in understanding research trends ; they include co-citation analysis, co-nomination analysis, and co-word analysis. So, we can see that modern quantitative techniques utilise computer technologies extensively. One of techniques exploits the use of co-occurrence phenomena, and it is the method of co-word analysis.
  • 3. What is Co-Word Analysis?  Co-word analyses study the co-occurrence of pairs of items (for example, keywords) that are representative in a document, to identify relations between the ideas presented in the texts (He, 1999).  Co-word analysis is related to co-citation analysis (Small, 1973; Small & Griffith, 1974). Co-citation analysis provides a method of mapping the structure of a research field through pairs of documents jointly cited. But co-word analysis deals directly with sets of terms shared by documents instead of shared citations. Therefore, it maps the pertinent literature directly from the interaction of key terms instead of the interaction of citations.  So, in co-word analysis, it is assumed that keywords extracted from papers could represent a specific research direction, research topic or subject of a field. If two keywords co-occur within one paper, the two research topics they represent are related. Higher co-word frequency means stronger correlation in keywords pairs, which can further suggest that two keywords are related to a specific research topic (Cambrosio et al. 1993).  Co-word analysis has the potential of effectively revealing patterns and trends in a specific discipline (Ding et al. 2001). Co-word analysis has been utilized to reveal research advances in many fields.
  • 4. Origins of Co-word Analysis The origins of co-word phenomena can be traced back at least five decades to the pioneering work in: 1) lexicography of Homby (1942) to account for co-occurrence knowledge, and 2) linguistics of De Saussure (1949) to describe how affinity of two language units correlates with their appearance in the language. (Kostoff, n.d) But the technique of co-word analysis was first developed in collaboration between the Centre de Sociologie de I’Innovation of the Ecole Nationale Superieure des Mines of Paris and the CNRS (Centre National de la Re-cherche Scientifique) of France during the 198Os, and their system was called “LEXIMAPPE.” This method, originally dedicated to scientific fields, measures the association strength between terms in documents to reveal and visualize the evolution of scientific fields through the construction of clusters and strategic diagram. For about twenty years, this technique has been employed to map the dynamic development of several research fields. One of the early studies was carried out by Serge Bauin (1986) to map the dynamics of aquaculture from 1979 to 1981.
  • 5. Theoretical Foundation of Co-word Analysis-Actor Network The co-word analysis technique was first proposed to map the dynamics of science. The most feasible way to understand the dynamics of science is to take the force of science in present- day societies into account. “Actor network is the theoretical foundation for co-word analysis to map the dynamics of science”. (Callon, 1986) Even though science cannot be reduced to texts only, texts are still a prime source for studies on how worlds are created and transformed in the laboratory. Therefore, instead of following the actors to see how they change the world, following the texts is another way to map the dynamics of science. Based on the co-occurrence of pairs of words, co-word analysis seeks to extract the themes of science and detect the linkages among these themes directly from the subject content of texts. (Callon et al., 1986b). Overall, co-word analysis considers the dynamics of science as a result of actor strategies. (Callon et al., 1991).
  • 6. Co-word analysis method Co-word analysis is a content analysis technique that uses patterns of co-occurrence of pairs of items (i.e., words or noun phrases) in a corpus of texts to identify the relationships between ideas within the subject areas as presented in these texts. Indexes based on the co-occurrence frequency of items, such as an inclusion index and a proximity index, are used to measure the strength of relationships between items. Based on these indexes, items are clustered into groups and displayed in network maps. For example, an inclusion map is used to highlight the central themes in a domain, and a proximity map is used to reveal the connections between minor areas hidden behind the central ones. Some other indexes, such as those based on density and centrality, are employed to evaluate the shape of each map, showing the degree to which each area is centrally structured and the extent to which each area is central to the others. By comparing the network maps for different time periods, the scientific dynamic can be detected.
  • 7. Metrics for co-word analysis  Metrics for co-word analysis have been studied extensively (Grivel & François, 1995). Two terms, i and j, co-occur if they are used together in a single document. Take a corpus consisting of N documents. Each document is indexed by a set of unique terms that can occur in multiple documents. Let Ck be the number of occurrences of term k; i.e., the number of times k is used for indexing documents in the corpus. Let Cij be the number of co-occurrences of terms i and j (the number of documents indexed by both terms).  In later co-word studies a index is employed to calculate the association values between word pairs. This coefficient is called the equivalence index (e-coefficient) (Callon et al., 1991) or strength (Coulter et al., 1998). The basic metric used for this study is the Association Indice Eij. The strength of association between terms i and j is given by the expression:  Two terms that appear many times in isolation but only a few times together will yield a lower E value than two terms that appear relatively less often alone but have a higher ratio of co- occurrences. Terms with relatively high E values form the networks’ links. A term network consists of nodes (terms) connected by links. Each node must be linked to at least one other node in a network.
  • 8. Continued  This metric provides an intuitive measure of the strength of association between terms, and only indicates that there is some semantic relationship. This metric is easier to understand and utilize in the production and interpretation of term association maps. It allows associations of both major and minor terms and is symmetrical in their relationships (Callon, Courtial, & Turner, 1991). E can be used as the basis for devising several complementary measures of term interactions and term networks in a unified manner.  The co-word algorithm then proceeds in two steps to produce the paired connection of terms. The first step builds networks that can identify areas of strong focus. The second step can identify terms that associate in more than one network and thereby indicate overlapping issues.  Without some minimum constraints, terms that appear infrequently but almost always together could dominate clusters; hence a minimum co-occurrence Cij value is required to generate a link. At the same time, some maps can become cluttered due to an excessive number of legitimate links (generally of decreasing E values); hence, restrictions on numbers of nodes and sometimes links are required to help discover the major partitions of concepts.
  • 9. Inclusion Index  In the early studies of co- word analysis, two indexes were introduced: 1. Inclusion Index and 2. Proximity Index.  The hierarchies of subject areas in a research problem can be detected by calculating an index, called the inclusion index. The Inclusion Index offers access to a tangible metric that can help identify gaps in our works and understand what needs to be addressed.  Inclusion Index (Iij): where,  Cij the number of documents in which the keyword pair (M, and M,) appears;  Ci is the occurrence frequency of keyword Mi in the set of articles;  Cj is the occurrence frequency of keyword M. in the set of articles;  min (Ci,Cj) is the minimum of the two freqdencies Ci,and Cj.  Iij has a value between 0 and 1, and it can be interpreted as a conditional probability.
  • 10. Proximity Index  To detect the minor but potentially growing areas we need the proximity index.  If a situation occurs that implies that there are some mediator keywords, which have a relatively low occurrence frequency but still have significant relationships with some of the peripheral key-words. To bring out such patterns, a proximity index Pij is defined: where,  Ci, Cj, and Cij have the same meaning as in formula of inclusion index.  N is the number of articles in the collection.  The mediator and peripheral keywords pulled out by Pij represent minor but potentially growing areas.
  • 11. Inclusion Map  After the inclusion and proximity indexes are calculated, inclusion and proximity maps are created.  The inclusion maps are designed to discover the central themes in a domain and depict their relationship to keywords that occur less frequently. The inclusion map involves getting more information about a certain theme.  To create inclusion maps, the link that has the highest inclusion index value is selected first. These linked nodes become the starting points for the first inclusion map (subnetwork).  Other links and their corresponding nodes are then added into the map in the decreasing order of their inclusion index until the threshold I0 is reached.  Other links and their corresponding nodes are then added into the map in the decreasing order of their inclusion index until the threshold I, is reached.  Keywords that appear on the top level of inclusion maps are called “central poles” of the domain of research. Keywords that are included in the central poles, and themselves include some other words at lower levels, are called “mediator words” (Callon et al., 1986).
  • 12. Proximity Map  The proximity maps are designed to discover connections between minor ideas hidden behind the central themes. The proximity maps concerns the analysis of the links between themes.  The process to create proximity maps is similar to that for inclusion maps. The difference is that the proximity index is used instead of the inclusion index.  If the threshold P, is lowered enough, more proximity connections between keywords will appear in the map and, eventually, the mediators and central poles found in inclusion maps will reappear. In this way, the relationship between minor issues and central poles can be studied (Callon et al., 1986).
  • 13. Density and Centrality  Density: Density is used to measure the strength of the links that tie together the words making up the cluster; that is the internal strength of a cluster. It provides a good representation of the cluster’s capacity to maintain itself and to develop over the course of time in the field under consideration (Callon et al., 1991).  The value of the density of a given cluster can be measured in several ways. Generally, the index value for links between each word pair is calculated first. Then, the density value can be the average value (mean) of internal links. the median value of internal links or the sum of the squares of the value of internal links.  Centrality: Centrality is used to measure the strength of a subject area’s interaction with other subject areas. Ranking subject areas (clusters) with respect to their centrality shows the extent to which each area is central within a global research network. The greater the number and strength of a subject area’s connections with other subject areas, the more central this subject area will be in the research network (Bauin et al., 1991).  For a given cluster (area), its centrality can be the sum of all external link values or the square root of the sum of the squares of all external link values (e.g., Coulter et al., 1998). An external link is a link that goes from a word belonging to a cluster to a word external to the cluster.
  • 14. Strategic diagram  A strategic diagram that offers a global representation of the structure of any field or subfield can be created by plotting centrality and density into a two-dimensional diagram (Law et al., 1988).  Typically, the horizontal axis represents centrality, the vertical axis represents density, and the origin of the graph is at the median of the respective axis values. This map situates each subject area within a two- dimensional space divided into four quadrants.  The strategic diagram is used in many co-word analysis studies and the analysis based on it is similar among these studies.
  • 15. Strategic diagram Quadrant 3 : Peripheral and developed Words Quadrant 1: Central and developed Words Quadrant 4: Peripheral and undeveloped Words Quadrant 2: Central and undeveloped Words Density Centrality
  • 16. Thematic Network  In a theme, the keywords and their interconnections draw a network graph, called a thematic network. Each thematic network is labeled using the name of the most significant keyword in the associated theme (usually identified by the most central keyword of the theme).  In this figure, several keywords are interconnected, where the volume of the spheres is proportional to the number of Documents corresponding to each keyword, the thickness of the link between two spheres i and j is proportional to the equivalence index Iij .  Network is built, based on the documents linked to each thematic network. A document is linked to a theme if it has at least two keywords presented in the thematic network.
  • 17. Dynamics of Networks  A striking feature of some strategic diagrams is the radical change in the configuration of the network at two periods. This reflects the dynamics of science. Based on the strategic diagram, we can analyze the stability of the networks and foresee their changes in the future. This issue is addressed in many studies, and the methods used in these studies fall into two categories: The study of strategic diagrams, and the ratio of centrality to density.  The comparison of two networks, N1 and N2, which might be two net- works at different times or two distinct networks at the same time, can be done by the method network comparison.
  • 18. Database Tomography  Database Tomography is a patented system for analyzing large amounts of textual computerized material (Kostoff et al., 1995).  It can be considered as another generation of co-word analysis. Algorithms for extracting multi-word phrase frequencies and performing phrase proximity analysis are included in this system. Phrase frequency analysis can be used to discover the pervasive themes of a database while the phrase proximity analysis can be used to detect the relationships among these themes and between these themes and subthemes (Kostoff et al., 1997a).  Similar to co-word analysis, Database Tomography can identify the main intellectual thrust areas and the relationship among these thrust areas. It provides a comprehensive overview of a research network and allows specific starting points to be chosen rationally for more detailed investigations into a topic of interest.  Based on the term co-occurrence information, Database Tomography can also be used to expand the initial query in information retrieval (IR) systems and, in turn, allow the retrieval of relevant documents that would not have been retrieved with the initial query (Kostoff et al., 1997b).
  • 19. Advantages of Co-Word Analysis  Quantitative Over Qualitative: The advantages of co-word analysis over qualitative analysis were recognized by researchers. Using the quantitative in pursuit of the qualitative, researchers are also able to highlight features of scientific fields that have not always been recognized. The heterogeneity of scientific world-building is preserved in co-word analysis, where experimental findings, research methods, concepts, social problems, material artifacts and locations may appear together on the maps.  Flexibility: Compared with other methods of analysis that focus on texts, co-word analysis is much more flexible in that it shows the research net- work with graphs. On the one hand, these graphs can be simplified to show the overall structure of the network. On the other hand, one can zoom in on certain areas and trace the co-word patterns in as much detail as one wishes.
  • 20. Limitations of Co-word Analysis  Indexer effect: Solely making use of keywords and index words was the biggest problem of early co- word analysis. It was called “indexer effect” and was addressed by many researchers.  The representations of the results given by co-word analysis are too difficult to read. (Whittaker, 1989)  The coverage of database is incomplete. Certain types of literature, such as patents arid the “grey literature,” lie outside of the publication circuit and are not indexed in the database. This means that the results from co-word analysis cannot reflect the whole picture of the research field in question.  The delay between the writing of a document and the moment when it is indexed and entered into a database causes the co-word analysis to fail to detect emerging research themes at an early stage. (Callon et al., 1986)
  • 21. Some key Issues in Co-word Analysis  The maps obtained by co-word analysis are generally considered very difficult to understand in isolation. They have to be interpreted with caution. It is suggested that the interpretation must be active and based on the comparison of maps (Callon and al. 1986).  The choice of the words (keywords, descriptors) is another issue to be carefully considered and has led to many discussions (Leydesdorff, 1997, Whittaker, 1989). The "normalization" of words must be considered as well .
  • 22. Assumptions of Co-word Analysis  The assumptions of co-word analysis are presented by Whittaker (1989) as follows: “It [co-word analysis] relies upon the argument that (i) authors of scientific articles choose their technical terms carefully; (ii) when different terms are used in the same articles it is therefore because the author is either recognizing or postulating some non-trivial relationship between their referents; and (iii) if enough different authors appear to recognize the same relationship, then that relation- ship may be assumed to have some significance within the area of science concerned.”  Later, Law and Whittaker (1992) have restated two of the assumptions. First. co-word analysis assumes that the keywords used by indexers to index a paper reflect the present stages of the scientific re- search in question. Then, co-word analysis assumes that arguments received by other scientists will lead to the publication of further scientific papers that are indexed by similar sets of keywords.
  • 23. Applications of Co-word analysis  Co-word analysis has been used to study conceptual work in different domains by many researchers. full text articles included in the journal Scientrometrics to mapping the intellectual structure of scientrometrics using text-mining and co- word analysis.  Co-word analysis is the bibliometric technique used to identify the main themes. All this allows us to quantify and visualize the thematic evolution of many research fields. It also helps to both experts and novices to understand the current state of the field and to predict where future research could lead.
  • 24. Conclusion The appearance of co-word analysis added another choice in the area of bibliometrics and provided another way to discover knowledge in databases. Similar to co-citation analysis, co-word analysis is also a kind of relational study based on the idea that publications should not be considered as discrete units. Instead, each is built upon others. Previous studies have shown co-word analysis to be a powerful tool to discover knowledge in databases. It has been used to detect the themes in a given research area, the relationship between these themes, the extent to which these themes are central to the whole area, and the degree to which these themes are internally structured. In the last twenty years, co-word analysis has been improved in many aspects. The main progress ran be found in two fields: 1. Source of words and 2. Measurements. So lastly we can say that co-word analysis is the one of the most common methods used for constructing the thematic and strategic map of a field.
  • 25. References  Bauin, S. (1986). Aquaculture: A field by bureaucratic fiat. In M. Callon, J. Law, & A. Rip (Eds.),Mapping the dynamics of sriencr and technology: Sociology ofscience in the real world (pp. 124-141). London: The Macmillan Press Ltd.  Chllon, M. (1986). Pinpointing industrial invention: An exploration of quantitative meth- ods for the analysis of patents. In M. Callon, J. Law, & A. Rip (Eds.), Mapping thr dynamics of science and techno1 ociology of science in the real world (pp. 163-188). Lon- don: The Macmillan Press Ltd.  Callon, M.; Courtid1,J-P.; & Ladle, F. (1991). Co-word analysis as a tool for describing the nrtwork of interactions between basic and technological research: The case of poly- mer chemistry. Scienlomrtrirs, 22(1).  He, Q., 1999, Knowledge Discovery Through Co-Word Analysis, Library Trends, 48(1).  Whittaker, J. (1989). Creativity and conformity in science: Titles, keywords and co-word analysis. Social Studies of Science, 19, 473-496.