Структура и параметры невзвешенной сети соавторства на основе данных БД REPEC
Автор: Бредихин Сергей Всеволодович, Ляпунов Виктор Михайлович, Щербакова Наталья Григорьевна
Журнал: Проблемы информатики @problem-info
Рубрика: Прикладные информационные технологии
Статья в выпуске: 3 (52), 2021 года.
Бесплатный доступ
Исследуется структура сотрудничества на основе соавторства в научных публикациях. На основе информации, извлеченной из библиографической базы данных, построена сеть соавторства и измерены ее основные параметры. Показано, что сеть является масштабно-инвариантной и обладает структурой, свойственной малому миру. Вычислены индексы сотрудничества, отражающие вовлеченность ученых в совместную деятельность.
Библиометрия, сеть соавторства, индексы сотрудничества
Короткий адрес: https://sciup.org/143178345
IDR: 143178345 | УДК: 001.12+303.2 | DOI: 10.24412/2073-0667-2021-3-56-67
The structure and parameters of the unweighted co-authorship network based on DB REPEC data
In this paper we investigate the structure of scientific collaboration based on co-authorship in publications retrieved from the RcPEc database. The main attention is paid to the co-authorship network: nodes correspond to authors, and two authors arc considered connected if they arc coauthors of at least one publication. The network is represented by the undirected unweighted collaboration graph. The co-authorship in articles provides a window on patterns of collaboration within the academic community. The study of scientific collaboration networks is carried out in two main directions. Empirical measurements provide detailed characteristics of the network: statistical properties of the distribution of degrees of nodes, global network parameters, nodes centralities (Newman 2001a; 2001b; 200lc). The study of the dynamic properties of real networks and network models makes it possible to identify the structural principles that govern the evolution of networks; dynamic properties, in turn, can explain static ones (Barabasi et al., 2001; Savic et al., 2017). Co-authorship networks arc used to identify research groups and the most important researchers and to predict their scientific success; to classify journals by degree of co-authorship and to make maps of science. Co-authorship networks can be constructed for different components of analysis, such as researchers, institutions, and countries. We use a researcher as the unit of analysis. Let P be the set of publications under consideration and V - the set of the authors of these publications. We assume that each publication in P has at least two authors. For i,j e V let us define the binary relation of co-authorship Rca: i Rcaj = (3p e P) the author i is one of the authors of p and the author j is one of the authors of p. The co-authorship network is the weighted undirected network Nca = (V, E,w), V - the set of the authors, E C V x V - the set of weighted edges, e = (i, j) e E, if iRcaj. In the simplest case, w(e) = 1, regardless of how often these authors are coauthors and how much each contributed to coauthored publications, that is, the network can be considered unweighted. Then the graph corresponding to the unweighted network is represented bv the (0,1) adjacency matrix U = (uij). The articles used in this study were retrieved from the RcPEc database (REPEC). In order to identify authors uniquely and to infer actual author identity we use the author „ profile“ that authors create basing on the Author Service provided by the RcPEc database (similar to Google Scholar). So we have |P| = 91113 publications with more than one author and |V| = 32 434 authors of these publications. In total, these authors have published 364 979 articles. The publications refer to the period from 1954 to 2019. We presented the base statistics such as papers per author, authors per paper, maximal authors per paper and collaboration indices Cl, DC, CC, MCC. We also measured the network parameters such as the size of the largest component, network density, average distance, maximal distance and clustering coefficient. It was found that the distribution of numbers of coauthors for authors follows power law with exponent 7 = 1.3. The study showed that in the collection of publications under consideration the fraction of coauthored publications is small (25%) and the prevailing trend is the presence of two coauthors in a publication (77%). The most authors are indirectly connected to each other - the maximum component includes 90 % of authors. The co-authorship network under consideration is scale-free and shows the „small world“ effect.
Список литературы Структура и параметры невзвешенной сети соавторства на основе данных БД REPEC
- Beaver D. D. Reflections on scientific collaboration (and its study): past, present and future // Scientometrics. 2001. V. 52. R 365-377.
- Laudel G. Who do we measure by co-authorship? // Research Evaluation. 2002. V. 11. P. 3-15.
- Stephan P. How economics shapes science // Cambridge, Mass.: Harvard University Press, 2012. 384 pp.
- Ding Y. Scientific collaboration and endorsement: Network analysis of coauthorship and citation networks // J. of Informetrics. 2011. V. 5. P. 187-203.
- Wallace M.L., Larivire V., Gingras Y. A SmallWorld of citations? The influence of collaboration networks on citation practices // PLoS ONE. 2012. V. 7(3), e33339. [Electron, resource]. URL: https://doi.org/10.1371/journal.pone.0033339.
- Martin Т., Ball В., Karrer В., Newman M.E.J. Coauthorship and citation patterns in the Physical Review // Phvs. Rev. 2013. E 88, 012814. [Electron, resource]. URL: https://doi.org/ 10.1103/PhysRevE.88.012814.
- Newman M. E. J. The structure of scientific collaboration networks // Proc. of the Nat. Acad, of Sci. of the USA. 2001. V.98(2). P. 404-409.
- Newman M.E.J. Scientific collaboration networks. I. Network construction and fundamental results // Phvs. Rev. E, 64(1), 016131.
- Newman M. E. J. Scientific collaboration networks. II. Shortest paths, weighted networks, and centralitv // Phvs. Rev. E, 64(1), 016132.
- Barabasi A.L., Jeong H., Neda, Z. Ravasz E., Schubert A., V.csek T. Evolution of the social network of scientific collaborations // arXiv: cond-mat/0104162 (2001).
- Savic M., Ivanovic M., Radovanovic M., Ognjanovic Z., Pejovic A., Jaksic KrugerT. The structure and evolution of scientific collaboration in Serbian mathematical journals // Scientometrics. 2014. V. 101(3). P. 1805-1830.[Electron, resource], URL: https://doi.org/10.1007/slll92-014-1295-6.
- RePEc. General principles. [Electron, resource]. URL: http://repec.org/.
- Yoshikane F., Nozawa Т., Tsuji K. Comparative analysis of co-authorship networks considering authors' roles in collaboration: Differences between the theoretical and application areas // Scientometrics. 2006. V. 68, iss. 3. P. 643-655.
- Bouros N., Sotiropoulos D.N., Pournarakis D., & Giaglis G.M. Social Network Analysis WTithin the ICMB Community: Co-Authorship Networks // The 13th Intern, conf. on Mobile Business, ICMB 2014, London (UK), June 4-5, 2014. P. 8. [Electron, resource]. URL: http://aisel. aisnet.org/icmb2014/8.
- Lawani S. M. Quality, collaboration and citations in cancer research: A bibliometric study // Ph.D. Dissertation. Florida State Univ., 1980.
- subramanyam К. Bibliometric studies of research collaboration: A review //J. Inform. Sci. 1983. V.6, iss. 1. P. 33-38.
- Ajiferuke I., Burell Q., and Tague J. Collaborative coefficient: A single measure of the degree of collaboration in research // Scientometrics. 1988. V. 14, iss. 5-6. P. 421-433.
- Egghe L. Theory of collaboration and collaborative measures // Inform. Proc. and Manag. 1991. V. 27, iss. 2-3. P. 177-202.
- Sarvanur K. and Srikanth R. Modified collaboration coefficient: A new measure for quantifying the degree of research collaboration // Scientometrics. 2010. V. 84, iss. 2. P. 365-376.
- Дистель P. Теория графов. Новосибирск: Ии-т математики, 2002. 336 с.
- Харари Ф. Теория графов. М.: Мир, 1973. 297 с.
- Watts D.J. Networks, dynamics and the small-world phenomenon // Amer. J. of Sociol. 1999. V. 105, iss. 2. P. 493-527.
- Watts D.J., Strogatz S.H. Collective dynamics of 'small-world' networks // Nature. 1998. V. 393. P. 440-442.
- Milgram S. The small world problem // Psychol. Today. 1967. V.2. P. 60-67.