Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient
2003 (English). In: Journal of the American Society for Information Science and Technology, ISSN 1532-2882, E-ISSN 1532-2890, Vol. 54, no. 6, pp. 550-560. Article in journal (Refereed). Published.
Author cocitation analysis (ACA), a special type of cocitation analysis, was introduced by White and Griffith in 1981. This technique is used to analyze the intellectual structure of a given scientific field. In 1990, McCain published a technical overview that has been largely adopted as a standard. In that overview, McCain notes that Pearson's correlation coefficient (Pearson's r) is often used as a similarity measure in ACA and presents some advantages of its use. The present article criticizes the use of Pearson's r in ACA and sets forth two natural requirements that a similarity measure applied in ACA should satisfy. It is shown that Pearson's r does not satisfy these requirements: real and hypothetical data are used to construct counterexamples to both. It is concluded that Pearson's r is probably not an optimal choice of similarity measure in ACA. Still, further empirical research is needed to show whether, and if so to what extent, the use of similarity measures that fulfill these requirements would lead to objectively better results in full-scale ACA studies. Further, problems related to incomplete cocitation matrices are discussed.
Identifiers
URN: urn:nbn:se:kth:diva-171395; DOI: 10.1002/asi.10242; ISI: 000181925300007; OAI: oai:DiVA.org:kth-171395; DiVA: diva2:843562
NR 20150817
2015-07-29, 2015-07-29, 2015-08-17. Bibliographically approved.