Ranking nodes of a weighted co-authorship network: analysis of RePEc database data
Authors: Bredikhin Sergey Vsevolodovich, Lyapunov Viktor Mikhailovich, Shcherbakova Natalia
Journal: Problems of Informatics (Проблемы информатики)
Section: Applied information technologies
Published in issue: 4 (53), 2021.
Free access
We study weighted co-authorship networks built from information extracted from a bibliographic database. The nodes of the networks are authors of scientific publications, and links are established on the basis of the binary co-authorship relation. Two methods of assigning edge weights are considered; they reflect the contribution of each coauthor when this information is not stated explicitly. Degree, closeness, betweenness, and eigenvector centrality measures are computed. The resulting author rankings are presented, and their dependence on the choice of weighting method is investigated. It is shown that the centrality measures are highly correlated, with the highest correlation for betweenness centrality.
Bibliometrics, co-authorship network, methods for determining author contributions, centrality measures
Short URL: https://sciup.org/143178551
IDR: 143178551 | UDC: 519.177
Ranking authors of the weighted co-authorship network: analysis of DB RePEc data
In the previous paper [12] we investigated the co-authorship network ($N_{ca}$) represented by an unweighted graph: nodes correspond to authors, and two authors are connected if they are coauthors of at least one publication. Its basic properties are the existence of a giant component (which includes 90% of authors), "small-worldness" [24], and a power-law fit of the distribution of the number of coauthors. In this paper we focus on centrality measures in order to identify key authors on the basis of the weighted co-authorship network. Using co-authorship data from the distributed database RePEc [13], we construct two weighted networks that differ in the way edge weights are computed.

Let $P$ ($|P| = l$) be the set of publications, and assume that each publication in $P$ has at least two authors. Let $V$ ($|V| = n$) be the set of authors of these publications, and let $a_{ij} = 1$ if $i$ is an author of publication $j$ (and $a_{ij} = 0$ otherwise). For the network $N^{T}$ the strength of the collaborative tie (the edge weight) between authors $i$ and $j$ is equal to the number of their joint papers (T-method):

$$w(i,j) = \sum_{k=1}^{l} a_{ik} a_{jk}.$$

For the network $N^{F}$ the edge weight between authors $i$ and $j$ depends not only on the number of coauthored papers, but also on the number of other coauthors of these papers (F-method [7]):

$$w(i,j) = \sum_{k=1}^{l} \frac{a_{ik} a_{jk}}{n_k - 1},$$

where $n_k$ is the number of authors of publication $k$.

The raw data processing procedure is presented in [12]; as a result, the number of authors is $|V| = 32\,434$ and the number of coauthored publications is $|P| = 91\,113$. For each of the networks $N_{ca}$, $N^{T}$, $N^{F}$, four centrality measures (degree, closeness, betweenness, and eigenvector) have been calculated, and tables (tabs. 2-4) listing the names of the authors with the highest ranks are provided. It should be noted that these authors have high h-index values (according to the Google Scholar search engine or the IDEAS ranking system [25], based on all publications of the authors).
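The two weighting schemes can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `edge_weights` and `strength`, the toy publication list, and the use of weighted degree (the sum of incident edge weights) as a simple centrality score are all assumptions made for illustration only.

```python
from collections import defaultdict
from itertools import combinations


def edge_weights(publications):
    """Return (w_t, w_f): dicts mapping an author pair (i, j), i < j,
    to its edge weight under the T-method and the F-method.

    `publications` is a list of author lists (hypothetical input format).
    """
    w_t = defaultdict(float)
    w_f = defaultdict(float)
    for authors in publications:
        authors = sorted(set(authors))   # drop duplicate names, fix pair order
        n_k = len(authors)               # number of authors of publication k
        if n_k < 2:
            continue                     # single-author papers create no edges
        for i, j in combinations(authors, 2):
            w_t[(i, j)] += 1.0               # T-method: count joint papers
            w_f[(i, j)] += 1.0 / (n_k - 1)   # F-method: divide by n_k - 1
    return dict(w_t), dict(w_f)


def strength(weights):
    """Weighted degree of each node: the sum of weights of incident edges."""
    s = defaultdict(float)
    for (i, j), w in weights.items():
        s[i] += w
        s[j] += w
    return dict(s)


# Toy data (illustrative only, not taken from RePEc).
publications = [["A", "B"], ["A", "B", "C"], ["B", "C", "D"]]
w_t, w_f = edge_weights(publications)
s_t, s_f = strength(w_t), strength(w_f)
```

In this toy example the pair (A, B) has weight 2 under the T-method and 1 + 1/2 = 1.5 under the F-method. Under the F-method the weighted degree of each author equals the number of that author's coauthored papers (here 3 for B), because each paper contributes 1/(n_k - 1) to each of the author's n_k - 1 ties.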
In order to study the dependence of author ranks on the method of calculating the contributions of authors to publications, we calculated Pearson's correlation coefficients and Spearman's rank correlation coefficients for the same centrality measures across the networks under consideration. It was shown that, regardless of how the edge weights are calculated, the same centrality measures are significantly correlated with each other. According to both coefficients, the strongest correlation is observed for betweenness centrality and the weakest for eigenvector centrality, which determines the "prestige" of a network actor. To illustrate the studied ways of calculating edge weights and the dependence of node ranks on the method and on node location, we considered a 12-node component of $N_{ca}$ and applied four centrality measures to its weighted representations. We see that the ranks of authors differ depending on the method of calculating edge weights. On the basis of node ranks we calculated node weights and presented new ranks of authors (tab. 10) within any component representation and centrality measure used. It is noted that the highly ranked authors are influential persons with a large number of citations. The purpose of further research is to identify the relationship between key authors and the number of citations of coauthored publications. The question of interest is whether collaborative publications receive more citations than single-author publications.

This work was carried out under state contract with ICMMG SB RAS (0251-2021-0005).
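The comparison of one centrality measure across two weightings can be sketched in plain Python as follows. This is an illustrative sketch only: the helper names `pearson`, `average_ranks`, and `spearman` are hypothetical, ties are handled with average ranks (an assumption, since the abstract does not say how ties were treated), and the score vectors are toy values rather than results from the paper.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def average_ranks(values):
    """Ranks starting at 1; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the block while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        avg = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = avg
        pos = end + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy centrality scores of four authors under two weightings (illustrative).
scores_t = [3.0, 5.0, 4.0, 2.0]
scores_f = [2.0, 3.0, 2.0, 1.0]
r_pearson = pearson(scores_t, scores_f)
r_spearman = spearman(scores_t, scores_f)
```

A value close to 1 for both coefficients would indicate that the two weightings order the authors in nearly the same way for the given measure.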
References
- Everett M. G., Borgatti S. P. The centrality of groups and classes // J. of Math. Sociology. 1999. V. 23, iss. 3. P. 181-201.
- Borgatti S. P. Identifying sets of key players in a social network // Comput. Math. Organiz. Theory. 2006. V. 12. P. 21-34.
- Bollen J., Rodriguez M. A., Van De Sompel H. Journal status // Scientometrics. 2006. V. 69, iss. 3. P. 669-687.
- Leydesdorff L. Betweenness centrality as an indicator of the interdisciplinarity of scientific journals // J. of the Amer. Soc. for Inform. Sci. and Technol. 2007. V. 58, iss. 9. P. 1303-1319.
- Peng T-Q. Assortative mixing, preferential attachment, and triadic closure: A longitudinal study of tie-generative mechanisms in journal citation networks // J. of Informetrics. 2015. V. 9, iss. 2. P. 250-262.
- Newman M. E. J. Scientific collaboration networks. I. Network construction and fundamental results // Phys. Rev. E. 2001. V. 64, iss. 1. 016131.
- Newman M. E. J. Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality // Phys. Rev. E. 2001. V. 64, iss. 1. 016132.
- Newman M. E. J. Who is the best connected scientist? A study of scientific coauthorship networks // Complex networks. Lect. notes in Phys. 2004. V. 650. P. 337-390.
- Yan E., Ding Y. Applying centrality measures to impact analysis // J. of Amer. Soc. for Inform. Sci. and Technol. 2009. V. 60, iss. 10. P. 2107-2118.
- Uddin S., Hossain L., Abbasi A., Rasmussen K. Trend and efficiency analysis of co-authorship network // Scientometrics. 2012. V. 90. P. 687-699.
- Youngblood M., Lahti D. A bibliometric analysis of the interdisciplinary field of cultural evolution // Palgrave Communications. 2018. Art. 120.
- Bredikhin S. V., Lyapunov V. M., Shcherbakova N. G. Structure and parameters of the unweighted co-authorship network based on RePEc DB data // Probl. Inform. 2021 (in press) (in Russian).
- RePEc. General principles. [Electron. res.]. http://repec.org/.
- Perianes-Rodriguez A., Waltman L., van Eck N. J. Constructing bibliometric networks: A comparison between full and fractional counting // J. of Informetrics. 2016. V. 10, iss. 4. P. 1178-1195.
- Bredikhin S. V., Lyapunov V. M., Shcherbakova N. G. Bibliometric networks of scientific articles and journals. Novosibirsk, ICMMG SB RAS, 2021, 334 p. (in Russian). [Electron. res.]. https://www.elibrary.ru/item.asp?id=45606936.
- Nieminen J. On centrality in a graph // Scandinav. J. of Psych. 1974. V. 15. P. 322-336.
- Barrat A., Barthelemy M., Pastor-Satorras R., Vespignani A. The architecture of complex weighted networks // Proc. of the Nation. Acad. of Sci. 2004. V. 101, iss. 11. P. 3747-3752.
- Bavelas A. Communication patterns in task-oriented groups // J. of the Acoustical Soc. of Amer. 1950. V. 22. P. 271-288.
- Beauchamp M. A. An improved index of centrality // Behav. Sci. 1965. V. 10. P. 161-163.
- Anthonisse J. M. The rush in a directed graph // Technic. rep. BN 9/71. Amsterdam: Stichting Mathematisch Centrum, 1971.
- Freeman L. C. A set of measures of centrality based upon betweenness // Sociometry. 1977. V. 40. P. 35-41.
- Bonacich P. Factoring and weighting approaches to status scores and clique identification // J. of Math. Sociol. 1972. V. 2. P. 113-120.
- Watts D. J. Networks, dynamics and the small-world phenomenon // Amer. J. of Sociol. 1999. V. 105, iss. 2. P. 493-527.
- Watts D. J., Strogatz S. H. Collective dynamics of 'small-world' networks // Nature. 1998. V. 393. P. 440-442.
- IDEAS. [Electron. res.]. https://ideas.repec.org/top/top.person.hindex.html.