SlideShare a Scribd company logo
1 of 1
Theoretical foundations and applications: a study of normalized 
indicators Salton's Cosine and Jaccard Index in Author Co-citation 
Analysis 
Bruno Henrique Alves1 & Ely Francina Tannuri de Oliveira2 
1 bruninkmkt@hotmail.com 
2 etannuri@gmail.com 
UNESP – Univ Estadual Paulista, 737 Hygino Muzzi Filho Avenue, 17525-900 Marília (Brazil) 
1 Introduction 
Studies on Author Co-citation Analysis (ACA) aim to identify influential authors and show 
their interrelations from citations (White; Griffith, 1981; White; McCain, 1998). 
When comparative studies are intended, given the specificities of each area, the 
importance of normalized indicators, which standardize the units of measure and reveal 
aspects not explained in absolutes, are emphasized. 
According to the studies of Luukkonen et al. (1993), absolute and normalized measures 
carry different types of information: the first shows the central "actors" of the networks, 
while the latter shows the intensity of relations and reveal aspects that are not 
identifiable in the absolute frequencies. Among relative indices, Pearson's Correlation 
Coefficient Salton's Cosine, and Jaccard Index are cited (Leydersdoff; Vaughan, 2006). 
Pearson's r was the standard measure before the studies of Ahlgren, Jarneving & 
Rousseau (2003), who criticized its use, showing that it does not satisfy as similarity and 
proximity measures. 
This research aims to deepen the study on normalized indicators of ACA. Specifically, it 
presents and analyzes normalized indicators such as Salton's Cosine (Ss) and Jaccard Index 
(JI), and compares the similarities between them via identification of normalized relations 
applied to Information Science. 
Salton's Cosine (Ss) and Jaccard Index (JI) are stressed. These two normalized indices are 
calculated from the co-occurrence matrix of absolute data, according to Luukkonen et al. 
(1993). 
In the studies by Hamers et al. (1989), co-occurrences represent co-citations, Ss is then 
expressed (Equation 1): 
Where: 
coc(a,b)= total of co-occurences of authors a and b 
cit(a) = total of citations received by author a 
cit(b)= total of citations received by author b 
Luukkonen et al (1993), express JI by (Equation 2): 
Both Ss as IJ vary between zero and one: the closer to one, more similar are the two 
authors (with theoretical-methodological proximity, similarity, complementarity, overlap, 
or opposed ideas or even co-authorship); the closer to zero, the farther is the association 
between the two authors. 
2 Methodological procedures 
Data was extracted from 110 articles published in the 2007-2011 period, from ENANCIBs 
proceedings, in Brazil. We identified 1242 cited researchers, 2003 references, composing a 
target group of 20 researchers cited at least 12 times. 
A 20x20 square matrix was built, from the most cited authors, with absolute co-citation 
frequency. Ss and JI was applied. We used Microsoft Excel macros built in "Visual Basic for 
Applications“ (VBA). We comparatively analyzed the results of the two normalized matrices 
using Ss and JI, evidencing the proximities, similarities and differences between the present 
values and the intensities of connections in the networks. 
3 Presentation and analysis of data 
The two normalized matrices using Ss (Equation 1) and JI (Equation 2) are presented in Tables 
1 and 2. In the analysis of Tables 1 and 2, we initially highlighted Meneghini and Packer with 
the highest value for Ss equal to 0.84 and JI equal to 0.71, observing that in the absolute 
matrix (not presented here) the co-citation between these two authors is 10, with 13 
citations made to Meneghini and 11 to Packer. The number of co-citations is relativized by 
citations made to the two authors. In Figures 1 and 2, the links between these authors are 
strongly highlighted. 
Research Group for 
Metric Studies of 
Table 1. Ss Normalized Matrix 
Information 
Meadows and Mueller present Ss equal to 0.54, JI equal to 0.37 and the absolute 
number of co-citations equal to 19 with 31 citations made to Meadows and 40 
citations to Mueller, which justifies the relativized median value for Ss, and lower 
to JI. In Figure 1, the link between these two researchers is much more 
highlighted than in Figure 2. 
Table 2. JI Normalized Matrix 
Researchers Leta and Spinak present 0.08 for Ss and 0.04 for JI with absolute co-citation 
value equal to 1, with 12 citations to Leta and 12 citations to Spinak, 
which explains the low relativized values. In Figures 1 and 2 the connections for 
both Ss and JI present their links slightly differentiated. 
The highlights r atify Hamers et.al. (1989), when they claim that Ss formula often 
produces a relative similarity measure which is twice the number obtained by JI. 
Extending this analysis to other values, it is observed that the higher the Ss, the 
closer JI will be to it, and above half of Ss (the example of Meneghini and Packer; 
Meadows and Mueller), and the lower the Ss, the JI will be closer or will be the very 
half (the example of Leta and Spinak). 
4 Final considerations 
This study has validated the analyzes already made b y other scholars and advanced 
on existing analyzes between Ss and JI, showing when there is a tendency of 
proximity. They exhibit similar behavior and the choice of using either index does 
not present a conclusive position on the pointed question, and consequently, the 
appropriate methodology to establish ACA is not fully consolidated. 
References 
Algren, P., Jarneving, B. & Rousseau, R. (2003). Requirements for a Cocitation Similarity Measure, with Special Reference to Pearson’s Correlation Coefficient. Journal of the American 
Society for Information Science and Technology, 54, 6, 550-560. 
Hamers, L. et al. (1989). Similarity measures in scientometric research: the Jaccard index versus Salton’s cosine formula. Information Processing & Management, 25, 3, 315-318. 
Leydesdorff, L. & Vaughan, L. (2006). Co-occurrence Matrices and their applications in Information Science: Extending ACA to the Web environment. Journal of the American Society for 
Information Science and Technology, 57, 12, 1616-1628. 
Luukkonen, T. et al. (1993). The measurement of international scientific collaboration. Scientometrics, Amsterdam, 28, 1, 15-36. 
White, H.D., & Griffith, B. (1981). Author cocitation: A literature measure of intellectual structure. Journal of the American Society for Information Science, 32, 163–171. 
White, H.D. & Mccain, K.W. (1998). Visualizing a discipline: an author co-citation analysis of Information Science, 1972-1995. Journal of the American Society for Information Science, 49, 4, 
327-355.

More Related Content

What's hot

A Two-Step Self-Evaluation Algorithm On Imputation Approaches For Missing Cat...
A Two-Step Self-Evaluation Algorithm On Imputation Approaches For Missing Cat...A Two-Step Self-Evaluation Algorithm On Imputation Approaches For Missing Cat...
A Two-Step Self-Evaluation Algorithm On Imputation Approaches For Missing Cat...CSCJournals
 
COMPARISON OF THE RATIO ESTIMATE TO THE LOCAL LINEAR POLYNOMIAL ESTIMATE OF F...
COMPARISON OF THE RATIO ESTIMATE TO THE LOCAL LINEAR POLYNOMIAL ESTIMATE OF F...COMPARISON OF THE RATIO ESTIMATE TO THE LOCAL LINEAR POLYNOMIAL ESTIMATE OF F...
COMPARISON OF THE RATIO ESTIMATE TO THE LOCAL LINEAR POLYNOMIAL ESTIMATE OF F...IJESM JOURNAL
 
Class24 chi squaretestofindependenceposthoc
Class24 chi squaretestofindependenceposthocClass24 chi squaretestofindependenceposthoc
Class24 chi squaretestofindependenceposthocBetynatha Kb
 
Chi squared test
Chi squared testChi squared test
Chi squared testDhruv Patel
 
Unit 1 bp801 t e range24022022
Unit 1 bp801 t e range24022022Unit 1 bp801 t e range24022022
Unit 1 bp801 t e range24022022ashish7sattee
 
STATISTICAL TOOLS USED IN ANALYTICAL CHEMISTRY
STATISTICAL TOOLS USED IN ANALYTICAL CHEMISTRYSTATISTICAL TOOLS USED IN ANALYTICAL CHEMISTRY
STATISTICAL TOOLS USED IN ANALYTICAL CHEMISTRYkeerthana151
 
What is a parametric method?
What is a parametric method?What is a parametric method?
What is a parametric method?Ken Plummer
 
Correlation and Path analysis in breeding experiments
Correlation and Path analysis in breeding experimentsCorrelation and Path analysis in breeding experiments
Correlation and Path analysis in breeding experimentsankit dhillon
 
Statistics And Correlation
Statistics And CorrelationStatistics And Correlation
Statistics And Correlationpankaj prabhakar
 
What is a Wilcoxon Sign-Ranked Test (pair t non para)?
What is a Wilcoxon Sign-Ranked Test (pair t non para)?What is a Wilcoxon Sign-Ranked Test (pair t non para)?
What is a Wilcoxon Sign-Ranked Test (pair t non para)?Ken Plummer
 
what is Correlations
what is Correlationswhat is Correlations
what is Correlationsderiliumboy
 
Graduate Paper--Hierarchical clustring and topology for psychometrics paper
Graduate Paper--Hierarchical clustring and topology for psychometrics paperGraduate Paper--Hierarchical clustring and topology for psychometrics paper
Graduate Paper--Hierarchical clustring and topology for psychometrics paperColleen Farrelly
 
s.analysis
s.analysiss.analysis
s.analysiskavi ...
 
Antonio Gasparrini: Open access: a researcher's perspective
Antonio Gasparrini: Open access: a researcher's perspectiveAntonio Gasparrini: Open access: a researcher's perspective
Antonio Gasparrini: Open access: a researcher's perspectiveNeilStewartCity
 
2013jsm,Proceedings,DSweitzer,26sep
2013jsm,Proceedings,DSweitzer,26sep2013jsm,Proceedings,DSweitzer,26sep
2013jsm,Proceedings,DSweitzer,26sepDennis Sweitzer
 
Unit 1 bp801 t f standard deviation 24022022
Unit 1 bp801 t f standard deviation 24022022Unit 1 bp801 t f standard deviation 24022022
Unit 1 bp801 t f standard deviation 24022022ashish7sattee
 

What's hot (19)

Chi square test
Chi square testChi square test
Chi square test
 
A Two-Step Self-Evaluation Algorithm On Imputation Approaches For Missing Cat...
A Two-Step Self-Evaluation Algorithm On Imputation Approaches For Missing Cat...A Two-Step Self-Evaluation Algorithm On Imputation Approaches For Missing Cat...
A Two-Step Self-Evaluation Algorithm On Imputation Approaches For Missing Cat...
 
COMPARISON OF THE RATIO ESTIMATE TO THE LOCAL LINEAR POLYNOMIAL ESTIMATE OF F...
COMPARISON OF THE RATIO ESTIMATE TO THE LOCAL LINEAR POLYNOMIAL ESTIMATE OF F...COMPARISON OF THE RATIO ESTIMATE TO THE LOCAL LINEAR POLYNOMIAL ESTIMATE OF F...
COMPARISON OF THE RATIO ESTIMATE TO THE LOCAL LINEAR POLYNOMIAL ESTIMATE OF F...
 
Class24 chi squaretestofindependenceposthoc
Class24 chi squaretestofindependenceposthocClass24 chi squaretestofindependenceposthoc
Class24 chi squaretestofindependenceposthoc
 
Chi squared test
Chi squared testChi squared test
Chi squared test
 
Unit 1 bp801 t e range24022022
Unit 1 bp801 t e range24022022Unit 1 bp801 t e range24022022
Unit 1 bp801 t e range24022022
 
Biva riate analysis pdf
Biva riate analysis pdfBiva riate analysis pdf
Biva riate analysis pdf
 
STATISTICAL TOOLS USED IN ANALYTICAL CHEMISTRY
STATISTICAL TOOLS USED IN ANALYTICAL CHEMISTRYSTATISTICAL TOOLS USED IN ANALYTICAL CHEMISTRY
STATISTICAL TOOLS USED IN ANALYTICAL CHEMISTRY
 
What is a parametric method?
What is a parametric method?What is a parametric method?
What is a parametric method?
 
Correlation and Path analysis in breeding experiments
Correlation and Path analysis in breeding experimentsCorrelation and Path analysis in breeding experiments
Correlation and Path analysis in breeding experiments
 
Statistics And Correlation
Statistics And CorrelationStatistics And Correlation
Statistics And Correlation
 
What is a Wilcoxon Sign-Ranked Test (pair t non para)?
What is a Wilcoxon Sign-Ranked Test (pair t non para)?What is a Wilcoxon Sign-Ranked Test (pair t non para)?
What is a Wilcoxon Sign-Ranked Test (pair t non para)?
 
what is Correlations
what is Correlationswhat is Correlations
what is Correlations
 
powerpoint making
powerpoint makingpowerpoint making
powerpoint making
 
Graduate Paper--Hierarchical clustring and topology for psychometrics paper
Graduate Paper--Hierarchical clustring and topology for psychometrics paperGraduate Paper--Hierarchical clustring and topology for psychometrics paper
Graduate Paper--Hierarchical clustring and topology for psychometrics paper
 
s.analysis
s.analysiss.analysis
s.analysis
 
Antonio Gasparrini: Open access: a researcher's perspective
Antonio Gasparrini: Open access: a researcher's perspectiveAntonio Gasparrini: Open access: a researcher's perspective
Antonio Gasparrini: Open access: a researcher's perspective
 
2013jsm,Proceedings,DSweitzer,26sep
2013jsm,Proceedings,DSweitzer,26sep2013jsm,Proceedings,DSweitzer,26sep
2013jsm,Proceedings,DSweitzer,26sep
 
Unit 1 bp801 t f standard deviation 24022022
Unit 1 bp801 t f standard deviation 24022022Unit 1 bp801 t f standard deviation 24022022
Unit 1 bp801 t f standard deviation 24022022
 

Similar to Theoretical foundations and applications: a study of normalized indicators Salton's Cosine and Jaccard Index in Author Co-citation Analysis

Co word analysis
Co word analysisCo word analysis
Co word analysisdebolina73
 
Meta-Analysis of Interaction in Distance Education
Meta-Analysis of Interaction in Distance EducationMeta-Analysis of Interaction in Distance Education
Meta-Analysis of Interaction in Distance EducationSu-Tuan Lulee
 
Evaluation Of A Correlation Analysis Essay
Evaluation Of A Correlation Analysis EssayEvaluation Of A Correlation Analysis Essay
Evaluation Of A Correlation Analysis EssayCrystal Alvarez
 
Katagorisel veri analizi
Katagorisel veri analiziKatagorisel veri analizi
Katagorisel veri analiziBurak Kocak
 
Reflexiones de baron y kenny
Reflexiones de baron y kennyReflexiones de baron y kenny
Reflexiones de baron y kennyLuis Manosalvas
 
Reading Material: Qualitative Interview
Reading Material: Qualitative InterviewReading Material: Qualitative Interview
Reading Material: Qualitative Interviewfirdausabdmunir85
 
Hi Dr. Kyzar and ClassWhen looking at the evidence, one notes .docx
Hi Dr. Kyzar and ClassWhen looking at the evidence, one notes .docxHi Dr. Kyzar and ClassWhen looking at the evidence, one notes .docx
Hi Dr. Kyzar and ClassWhen looking at the evidence, one notes .docxsimonithomas47935
 
Running head RESEARCH DISCUSSION .docx
Running head RESEARCH DISCUSSION                             .docxRunning head RESEARCH DISCUSSION                             .docx
Running head RESEARCH DISCUSSION .docxjeanettehully
 
Correlational research
Correlational researchCorrelational research
Correlational researchJijo G John
 
DESIGN METHODOLOGY FOR RELATIONAL DATABASES: ISSUES RELATED TO TERNARY RELATI...
DESIGN METHODOLOGY FOR RELATIONAL DATABASES: ISSUES RELATED TO TERNARY RELATI...DESIGN METHODOLOGY FOR RELATIONAL DATABASES: ISSUES RELATED TO TERNARY RELATI...
DESIGN METHODOLOGY FOR RELATIONAL DATABASES: ISSUES RELATED TO TERNARY RELATI...ijdms
 
ANYTIME YOU COMPLETE A PAPERESSAY in this course you must follow .docx
ANYTIME YOU COMPLETE A PAPERESSAY in this course you must follow .docxANYTIME YOU COMPLETE A PAPERESSAY in this course you must follow .docx
ANYTIME YOU COMPLETE A PAPERESSAY in this course you must follow .docxfestockton
 
NAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURES
NAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURESNAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURES
NAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURESacijjournal
 
Lanning ICPS: Community structure of JPSP
Lanning ICPS: Community structure of JPSPLanning ICPS: Community structure of JPSP
Lanning ICPS: Community structure of JPSPKevin Lanning
 
1609182308Mediation.pdf
1609182308Mediation.pdf1609182308Mediation.pdf
1609182308Mediation.pdfJoshuaLau29
 
Ecological study design multiple group study and statistical analysis
Ecological study design multiple group study and statistical analysisEcological study design multiple group study and statistical analysis
Ecological study design multiple group study and statistical analysissirjana Tiwari
 
Adapting Measures of Clumping Strength to Assess Term-Term Similarity
Adapting Measures of Clumping Strength to Assess Term-Term SimilarityAdapting Measures of Clumping Strength to Assess Term-Term Similarity
Adapting Measures of Clumping Strength to Assess Term-Term SimilarityVladimir Kulyukin
 
Text Matching based on Document Embeddings
Text Matching based on Document EmbeddingsText Matching based on Document Embeddings
Text Matching based on Document EmbeddingsKojin Oshiba
 
A Computer-Based Approach For Deriving And Measuring Individual And Team Know...
A Computer-Based Approach For Deriving And Measuring Individual And Team Know...A Computer-Based Approach For Deriving And Measuring Individual And Team Know...
A Computer-Based Approach For Deriving And Measuring Individual And Team Know...Angie Miller
 

Similar to Theoretical foundations and applications: a study of normalized indicators Salton's Cosine and Jaccard Index in Author Co-citation Analysis (20)

Co word analysis
Co word analysisCo word analysis
Co word analysis
 
Meta-Analysis of Interaction in Distance Education
Meta-Analysis of Interaction in Distance EducationMeta-Analysis of Interaction in Distance Education
Meta-Analysis of Interaction in Distance Education
 
Evaluation Of A Correlation Analysis Essay
Evaluation Of A Correlation Analysis EssayEvaluation Of A Correlation Analysis Essay
Evaluation Of A Correlation Analysis Essay
 
Katagorisel veri analizi
Katagorisel veri analiziKatagorisel veri analizi
Katagorisel veri analizi
 
STDEV . I3.pdf
STDEV . I3.pdfSTDEV . I3.pdf
STDEV . I3.pdf
 
Reflexiones de baron y kenny
Reflexiones de baron y kennyReflexiones de baron y kenny
Reflexiones de baron y kenny
 
Reading Material: Qualitative Interview
Reading Material: Qualitative InterviewReading Material: Qualitative Interview
Reading Material: Qualitative Interview
 
Hi Dr. Kyzar and ClassWhen looking at the evidence, one notes .docx
Hi Dr. Kyzar and ClassWhen looking at the evidence, one notes .docxHi Dr. Kyzar and ClassWhen looking at the evidence, one notes .docx
Hi Dr. Kyzar and ClassWhen looking at the evidence, one notes .docx
 
Running head RESEARCH DISCUSSION .docx
Running head RESEARCH DISCUSSION                             .docxRunning head RESEARCH DISCUSSION                             .docx
Running head RESEARCH DISCUSSION .docx
 
Correlational research
Correlational researchCorrelational research
Correlational research
 
PSY 223 Short Paper
PSY 223 Short PaperPSY 223 Short Paper
PSY 223 Short Paper
 
DESIGN METHODOLOGY FOR RELATIONAL DATABASES: ISSUES RELATED TO TERNARY RELATI...
DESIGN METHODOLOGY FOR RELATIONAL DATABASES: ISSUES RELATED TO TERNARY RELATI...DESIGN METHODOLOGY FOR RELATIONAL DATABASES: ISSUES RELATED TO TERNARY RELATI...
DESIGN METHODOLOGY FOR RELATIONAL DATABASES: ISSUES RELATED TO TERNARY RELATI...
 
ANYTIME YOU COMPLETE A PAPERESSAY in this course you must follow .docx
ANYTIME YOU COMPLETE A PAPERESSAY in this course you must follow .docxANYTIME YOU COMPLETE A PAPERESSAY in this course you must follow .docx
ANYTIME YOU COMPLETE A PAPERESSAY in this course you must follow .docx
 
NAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURES
NAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURESNAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURES
NAMED ENTITY RECOGNITION IN TURKISH USING ASSOCIATION MEASURES
 
Lanning ICPS: Community structure of JPSP
Lanning ICPS: Community structure of JPSPLanning ICPS: Community structure of JPSP
Lanning ICPS: Community structure of JPSP
 
1609182308Mediation.pdf
1609182308Mediation.pdf1609182308Mediation.pdf
1609182308Mediation.pdf
 
Ecological study design multiple group study and statistical analysis
Ecological study design multiple group study and statistical analysisEcological study design multiple group study and statistical analysis
Ecological study design multiple group study and statistical analysis
 
Adapting Measures of Clumping Strength to Assess Term-Term Similarity
Adapting Measures of Clumping Strength to Assess Term-Term SimilarityAdapting Measures of Clumping Strength to Assess Term-Term Similarity
Adapting Measures of Clumping Strength to Assess Term-Term Similarity
 
Text Matching based on Document Embeddings
Text Matching based on Document EmbeddingsText Matching based on Document Embeddings
Text Matching based on Document Embeddings
 
A Computer-Based Approach For Deriving And Measuring Individual And Team Know...
A Computer-Based Approach For Deriving And Measuring Individual And Team Know...A Computer-Based Approach For Deriving And Measuring Individual And Team Know...
A Computer-Based Approach For Deriving And Measuring Individual And Team Know...
 

Recently uploaded

REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxJorenAcuavera1
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...navyadasi1992
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationColumbia Weather Systems
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptJoemSTuliba
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationColumbia Weather Systems
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxkumarsanjai28051
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 

Recently uploaded (20)

REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 
Topic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptxTopic 9- General Principles of International Law.pptx
Topic 9- General Principles of International Law.pptx
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...Radiation physics in Dental Radiology...
Radiation physics in Dental Radiology...
 
User Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather StationUser Guide: Magellan MX™ Weather Station
User Guide: Magellan MX™ Weather Station
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
Four Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.pptFour Spheres of the Earth Presentation.ppt
Four Spheres of the Earth Presentation.ppt
 
User Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather StationUser Guide: Capricorn FLX™ Weather Station
User Guide: Capricorn FLX™ Weather Station
 
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Forensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptxForensic limnology of diatoms by Sanjai.pptx
Forensic limnology of diatoms by Sanjai.pptx
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 

Theoretical foundations and applications: a study of normalized indicators Salton's Cosine and Jaccard Index in Author Co-citation Analysis

  • 1. Theoretical foundations and applications: a study of normalized indicators Salton's Cosine and Jaccard Index in Author Co-citation Analysis Bruno Henrique Alves1 & Ely Francina Tannuri de Oliveira2 1 bruninkmkt@hotmail.com 2 etannuri@gmail.com UNESP – Univ Estadual Paulista, 737 Hygino Muzzi Filho Avenue, 17525-900 Marília (Brazil) 1 Introduction Studies on Author Co-citation Analysis (ACA) aim to identify influential authors and show their interrelations from citations (White; Griffith, 1981; White; McCain, 1998). When comparative studies are intended, given the specificities of each area, the importance of normalized indicators, which standardize the units of measure and reveal aspects not explained in absolutes, are emphasized. According to the studies of Luukkonen et al. (1993), absolute and normalized measures carry different types of information: the first shows the central "actors" of the networks, while the latter shows the intensity of relations and reveal aspects that are not identifiable in the absolute frequencies. Among relative indices, Pearson's Correlation Coefficient Salton's Cosine, and Jaccard Index are cited (Leydersdoff; Vaughan, 2006). Pearson's r was the standard measure before the studies of Ahlgren, Jarneving & Rousseau (2003), who criticized its use, showing that it does not satisfy as similarity and proximity measures. This research aims to deepen the study on normalized indicators of ACA. Specifically, it presents and analyzes normalized indicators such as Salton's Cosine (Ss) and Jaccard Index (JI), and compares the similarities between them via identification of normalized relations applied to Information Science. Salton's Cosine (Ss) and Jaccard Index (JI) are stressed. These two normalized indices are calculated from the co-occurrence matrix of absolute data, according to Luukkonen et al. (1993). In the studies by Hamers et al. (1989), co-occurrences represent co-citations, Ss is then expressed (Equation 1): Where: coc(a,b)= total of co-occurences of authors a and b cit(a) = total of citations received by author a cit(b)= total of citations received by author b Luukkonen et al (1993), express JI by (Equation 2): Both Ss as IJ vary between zero and one: the closer to one, more similar are the two authors (with theoretical-methodological proximity, similarity, complementarity, overlap, or opposed ideas or even co-authorship); the closer to zero, the farther is the association between the two authors. 2 Methodological procedures Data was extracted from 110 articles published in the 2007-2011 period, from ENANCIBs proceedings, in Brazil. We identified 1242 cited researchers, 2003 references, composing a target group of 20 researchers cited at least 12 times. A 20x20 square matrix was built, from the most cited authors, with absolute co-citation frequency. Ss and JI was applied. We used Microsoft Excel macros built in "Visual Basic for Applications“ (VBA). We comparatively analyzed the results of the two normalized matrices using Ss and JI, evidencing the proximities, similarities and differences between the present values and the intensities of connections in the networks. 3 Presentation and analysis of data The two normalized matrices using Ss (Equation 1) and JI (Equation 2) are presented in Tables 1 and 2. In the analysis of Tables 1 and 2, we initially highlighted Meneghini and Packer with the highest value for Ss equal to 0.84 and JI equal to 0.71, observing that in the absolute matrix (not presented here) the co-citation between these two authors is 10, with 13 citations made to Meneghini and 11 to Packer. The number of co-citations is relativized by citations made to the two authors. In Figures 1 and 2, the links between these authors are strongly highlighted. Research Group for Metric Studies of Table 1. Ss Normalized Matrix Information Meadows and Mueller present Ss equal to 0.54, JI equal to 0.37 and the absolute number of co-citations equal to 19 with 31 citations made to Meadows and 40 citations to Mueller, which justifies the relativized median value for Ss, and lower to JI. In Figure 1, the link between these two researchers is much more highlighted than in Figure 2. Table 2. JI Normalized Matrix Researchers Leta and Spinak present 0.08 for Ss and 0.04 for JI with absolute co-citation value equal to 1, with 12 citations to Leta and 12 citations to Spinak, which explains the low relativized values. In Figures 1 and 2 the connections for both Ss and JI present their links slightly differentiated. The highlights r atify Hamers et.al. (1989), when they claim that Ss formula often produces a relative similarity measure which is twice the number obtained by JI. Extending this analysis to other values, it is observed that the higher the Ss, the closer JI will be to it, and above half of Ss (the example of Meneghini and Packer; Meadows and Mueller), and the lower the Ss, the JI will be closer or will be the very half (the example of Leta and Spinak). 4 Final considerations This study has validated the analyzes already made b y other scholars and advanced on existing analyzes between Ss and JI, showing when there is a tendency of proximity. They exhibit similar behavior and the choice of using either index does not present a conclusive position on the pointed question, and consequently, the appropriate methodology to establish ACA is not fully consolidated. References Algren, P., Jarneving, B. & Rousseau, R. (2003). Requirements for a Cocitation Similarity Measure, with Special Reference to Pearson’s Correlation Coefficient. Journal of the American Society for Information Science and Technology, 54, 6, 550-560. Hamers, L. et al. (1989). Similarity measures in scientometric research: the Jaccard index versus Salton’s cosine formula. Information Processing & Management, 25, 3, 315-318. Leydesdorff, L. & Vaughan, L. (2006). Co-occurrence Matrices and their applications in Information Science: Extending ACA to the Web environment. Journal of the American Society for Information Science and Technology, 57, 12, 1616-1628. Luukkonen, T. et al. (1993). The measurement of international scientific collaboration. Scientometrics, Amsterdam, 28, 1, 15-36. White, H.D., & Griffith, B. (1981). Author cocitation: A literature measure of intellectual structure. Journal of the American Society for Information Science, 32, 163–171. White, H.D. & Mccain, K.W. (1998). Visualizing a discipline: an author co-citation analysis of Information Science, 1972-1995. Journal of the American Society for Information Science, 49, 4, 327-355.