首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 64 毫秒
1.
We study an agent-based model for generating citation distributions in complex networks of scientific papers, where a fraction of citations is allotted according to the preferential attachment rule (rich get richer) and the remainder is allocated accidentally (purely at random, uniformly). Previously, we derived and analysed such a process in the context of describing individual authors, but now we apply it to scientific journals in computer and information sciences. Based on the large DBLP dataset as well as the CORE (Computing Research and Education Association of Australasia) journal ranking, we find that the impact of journals is correlated with the degree of accidentality of their citation distribution. Citations to impactful journals tend to be more preferential, while citations to lower-ranked journals are distributed in a more accidental manner. Further, applied fields of research such as artificial intelligence seem to be driven by a stronger preferential component – and hence have a higher degree of inequality – than the more theoretical ones, e.g., mathematics and computation theory.  相似文献   

2.
The number of clinical citations received from clinical guidelines or clinical trials has been considered as one of the most appropriate indicators for quantifying the clinical impact of biomedical papers. Therefore, the early prediction of clinical citation count of biomedical papers is critical to scientific activities in biomedicine, such as research evaluation, resource allocation, and clinical translation. In this study, we designed a four-layer multilayer perceptron neural network (MPNN) model to predict the clinical citation count of biomedical papers in the future by using 9,822,620 biomedical papers published from 1985 to 2005. We extracted ninety-one paper features from three dimensions as the input of the model, including twenty-one features in the paper dimension, thirty-five in the reference dimension, and thirty-five in the citing paper dimension. In each dimension, the features can be classified into three categories, i.e., the citation-related features, the clinical translation-related features, and the topic-related features. Besides, in the paper dimension, we also considered the features that have previously been demonstrated to be related to the citation counts of research papers. The results showed that the proposed MPNN model outperformed the other five baseline models, and the features in the reference dimension were the most important. In all the three dimensions, the citation-related and topic-related features were more important than the clinical translation-related features for the prediction. It also turned out that the features helpful in predicting the citation count of papers are not important for predicting the clinical citation count of biomedical papers. Furthermore, we explored the MPNN model based on different categories of biomedical papers. The results showed that the clinical translation-related features were more important for the prediction of clinical citation count of basic papers rather than those papers closer to clinical science. This study provided a novel dimension (i.e., the reference dimension) for the research community and could be applied to other related research tasks, such as the research assessment for translational programs. In addition, the findings in this study could be useful for biomedical authors (especially for those in basic science) to get more attention from clinical research.  相似文献   

3.
The paper presents an in-depth study of uncited papers. For that, we explore the documents indexed in the INSPIRE database from 1970 to 2015. Uncited articles represent a complex bibliometric environment in which references are generated. The reference lists of uncited papers form a dynamic system partially responsible for the redistribution of scientific impact. Our task is to quantify the detailed structure of citations and references directly in terms of quantiles. We also study the entropy and the statistical complexity measure of references and citations of papers’ quantiles. We introduce a theoretical framework in which citation distribution is considered an asymptotic distribution of the largest values in a sequence of independent identically distributed random variables. We show that this asymptotic distribution is the generalized extreme-value distribution. Furthermore, we empirically demonstrate that the asymptotic behavior of citation distribution is close to (but not quite) the generalized extreme-value distribution.  相似文献   

4.
In this study, we investigate the extent to which patent citations to papers can serve as early signs for predicting delayed recognized knowledge in science using a comparative study with a control group, i.e., instant recognition papers. We identify the two opposite groups of papers by the Bcp measure, a parameter-free index for identifying papers which were recognized with delay. We provide a macro (Science/Nature papers dataset) and micro (a case chosen from the dataset) evidence on paper-patent citation linkages as early signs for predicting delayed recognized knowledge in science. It appears that papers with delayed recognition show a stronger and longer technical impact than instant recognition papers. We provide indication that in the more recent years papers with delayed recognition are awakened more often and earlier by a patent rather than by a scientific paper (also called “prince”). We also found that patent citations seem to play an important role to avoid instant recognition papers to level off or to become a so called “flash in the pan”, i.e., instant recognition. It also appears that the sleeping beauties may firstly encounter negative citations and then patent citations and finally get widely recognized. In contrast to the two focused fields (biology and chemistry) for instant recognition papers, delayed recognition papers are rather evenly distributed in biology, chemistry, psychology, geology, materials science, and physics. We discovered several pairs of “science sleeping”-“technology inducing”, such as “biology-biotechnology/pharmaceuticals”, “chemistry-chemical engineering”, as well as some trans-fields science-technology interactions, such as “psychology - computer technology/control technology/audio-visual technology”, “physics - computer technology”, and “mathematics-computer technology”. We propose in further research to discover the potential ahead of time and transformative research by using citation delay analysis, patent & NPL analysis, and citation context analysis.  相似文献   

5.
图书馆图书借阅系统与单标度二元网络模型   总被引:8,自引:0,他引:8  
本文从网络的角度 ,研究了图书馆这样一种有趣的复杂系统。读者和图书之间通过借阅建立联系 ,可以在两个层次上用网络语言来描述 ,即二元 (读者—图书 )和单元 (读者—读者 ,图书—图书 )网络。我们以研究配位数分布为工具 ,研究了北京师范大学图书馆外借处图书在 14个月内的借阅情况所构成的网络 ,发现其体现了很好的单标度性质 ,即配位数分布体现为一指数衰减的形式。随后提出了一个单标度二元网络模型 ,对此进行解释 ,定性地重现了这一实测结果。  相似文献   

6.
References formulate the foundation and endorsements of scientific research. Using articles published in 2005 covered by Microsoft Academic Graph, the current paper defines and calculates five indicators of references, i.e., the number of references, the number of citations of references, the age of references, the number of nodes in a reference cascade (a multi-generation reference network), and the density of bibliographic coupling networks by references, and investigates their relations with the citation impact of the focal publication. A non-linear relationship is shown in all the five indicators; specifically, we observe two types of patterns, namely inverted-L and -U relations, both presenting the existence of critical points. We further explore the discipline-level differences of such a relationship and how it relates to the characteristics of the discipline itself. Among all five indicators, the effect of disciplinary academic “environments” is universally identified. We believe that the current paper provides insightful views to the discussion regarding the significance of a reference list.  相似文献   

7.
Predicting the citation counts of academic papers is of considerable significance to scientific evaluation. This study used a four-layer Back Propagation (BP) neural network model to predict the five-year citations of 49,834 papers in the library, information and documentation field indexed by the CSSCI database and published from 2000 to 2013. We extracted six paper features, two journal features, nine author features, eight reference features, and five early citation features to make the prediction. The empirical experiments showed that the performance of the BP neural network is significantly better than those of the six baseline models. In terms of the prediction effect, the accuracy of the model at predicting infrequently cited papers was higher than that for frequently cited ones. We determined that five essential features have significant effects on the prediction performance of the model, i.e., ‘citations in the first two years’, ‘first-cited age’, ‘paper length’, ‘month of publication’, and ‘self-citations of journals’, and the other features contribute only slightly to the prediction.  相似文献   

8.
9.
Traditionally, citation count has served as the main evaluation measure for a paper's importance and influence. In turn, many evaluations of authors, institutions and journals are based on aggregations upon papers (e.g. h-index). In this work, we explore measures defined on the citation graph that offer a more intuitive insight into the impact of a paper than the superficial count of citations. Our main argument is focused on the identification of influence as an expression of the citation density in the subgraph of citations built for each paper. We propose two measures that capitalize on the notion of density providing researchers alternative evaluations of their work. While the general idea of impact for a paper can be viewed as how many researchers have shown interest to a piece of work, the proposed measures are based on the hypothesis that a piece of work may have influenced some papers even if they do not contain references to that piece of work. The proposed measures are also extended to researchers and journals.  相似文献   

10.
Recently, two new indicators (Equalized Mean-based Normalized Proportion Cited, EMNPC; Mean-based Normalized Proportion Cited, MNPC) were proposed which are intended for sparse scientometrics data, e.g., alternative metrics (altmetrics). The indicators compare the proportion of mentioned papers (e.g. on Facebook) of a unit (e.g., a researcher or institution) with the proportion of mentioned papers in the corresponding fields and publication years (the expected values). In this study, we propose a third indicator (Mantel-Haenszel quotient, MHq) belonging to the same indicator family. The MHq is based on the MH analysis – an established method in statistics for the comparison of proportions. We test (using citations and assessments by peers, i.e. F1000Prime recommendations) if the three indicators can distinguish between different quality levels as defined on the basis of the assessments by peers. Thus, we test their convergent validity. We find that the indicator MHq is able to distinguish between the quality levels in most cases while MNPC and EMNPC are not. Since the MHq is shown in this study to be a valid indicator, we apply it to six types of zero-inflated altmetrics data and test whether different altmetrics sources are related to quality. The results for the various altmetrics demonstrate that the relationship between altmetrics (Wikipedia, Facebook, blogs, and news data) and assessments by peers is not as strong as the relationship between citations and assessments by peers. Actually, the relationship between citations and peer assessments is about two to three times stronger than the association between altmetrics and assessments by peers.  相似文献   

11.
Articles are cited for different purposes and differentiating between reasons when counting citations may therefore give finer-grained citation count information. Although identifying and aggregating the individual reasons for each citation may be impractical, recording the number of citations that originate from different article sections might illuminate the general reasons behind a citation count (e.g., 110 citations = 10 Introduction citations + 100 Methods citations). To help investigate whether this could be a practical and universal solution, this article compares 19 million citations with DOIs from six different standard sections in 799,055 PubMed Central open access articles across 21 out of 22 fields. There are apparently non-systematic differences between fields in the most citing sections and the extent to which citations from one section overlap with citations from another, with some degree of overlap in most cases. Thus, at a science-wide level, section headings are partly unreliable indicators of citation context, even if they are more standard within individual fields. They may still be used within fields to help identify individual highly cited articles that have had one type of impact, especially methodological (Methods) or context setting (Introduction), but expert judgement is needed to validate the results.  相似文献   

12.
Incorporating fresh members into teams is considered a pathway to team creativity. However, whether freshness improves team performance or not remains unclear, as well as the optimal involvement of fresh members for team performance. Focusing on team impact, one important dimension of team performance, this study uses a group of authors on the byline of a publication as a proxy for a scientific team and quantifies team impact by citations of a paper authored by this team, i.e., article team impact. We extend an indicator, i.e., article team freshness, to measure the extent to which a scientific team incorporates new members, by calculating the fraction of new collaboration relations established within the team. Based on more than 43 million scientific publications covering more than a half-century of research from Microsoft Academic Graph, this study provides a holistic picture of the current development of article team freshness by outlining the temporal evolution of freshness, and its disciplinary distribution. Subsequently, using a multivariable regression approach, we examine the association between article team freshness and papers’ short-term and long-term citations. The major findings are as follows: (1) article team freshness in scientific teams has been increasing in the past half-century; (2) there exists an inverted-U-shaped association between article team freshness and papers’ citations in all the disciplines and different periods; (3) article team impact is hampered by article team freshness in small-sized teams, while medium-sized and large-sized teams can benefit more from article team freshness before the fraction of new collaboration reaches its turning point. The findings of this study provide implications for the practice of team formation and team management in science.  相似文献   

13.
The non-citation rate refers to the proportion of papers that do not attract any citation over a period of time following their publication. After reviewing all the related papers in Web of Science, Google Scholar and Scopus database, we find the current literature on citation distribution gives more focus on the distribution of the percentages and citations of papers receiving at least one citation, while there are fewer studies on the time-dependent patterns of the percentage of never-cited papers, on what distribution model can fit their time-dependent patterns, as well as on the factors influencing the non-citation rate. Here, we perform an empirical pilot analysis to the time-dependent distribution of the percentages of never-cited papers in a series of different, consecutive citation time windows following their publication in our selected six sample journals, and study the influence of paper length on the chance of papers’ getting cited. Through the above analysis, the following general conclusions are drawn: (1) a three-parameter negative exponential model can well fit time-dependent distribution curve of the percentages of never-cited papers; (2) in the initial citation time window, the percentage of never-cited papers in each journal is very high. However, as the citation time window becomes wider and wider, the percentage of never-cited papers begins to drop rapidly at first, and then drop more slowly, and the total degree of decline for most of journals is very large; (3) when applying the wider citation time windows, the percentage of never-cited papers for each journal begins to approach a stable value, and after that value, there will be very few changes in these stable percentages, unless we meet a large amount of “Sleeping Beauties” type papers; (4) the length of an paper has a great influence on whether it will be cited or not.  相似文献   

14.
《Journal of Informetrics》2019,13(2):485-499
With the growing number of published scientific papers world-wide, the need to evaluation and quality assessment methods for research papers is increasing. Scientific fields such as scientometrics, informetrics, and bibliometrics establish quantified analysis methods and measurements for evaluating scientific papers. In this area, an important problem is to predict the future influence of a published paper. Particularly, early discrimination between influential papers and insignificant papers may find important applications. In this regard, one of the most important metrics is the number of citations to the paper, since this metric is widely utilized in the evaluation of scientific publications and moreover, it serves as the basis for many other metrics such as h-index. In this paper, we propose a novel method for predicting long-term citations of a paper based on the number of its citations in the first few years after publication. In order to train a citation count prediction model, we employed artificial neural network which is a powerful machine learning tool with recently growing applications in many domains including image and text processing. The empirical experiments show that our proposed method outperforms state-of-the-art methods with respect to the prediction accuracy in both yearly and total prediction of the number of citations.  相似文献   

15.
The ready availability of on-line information resources may have resulted in a change in biomedical researchers' patterns of use of resources which are not available on-line. To investigate the occurrence and extent of these changes, the distribution patterns by date of citations from every major article published in ten high-impact biomedical journals in 1959, 1969, and 1979 were analyzed. The analysis showed that in all three time periods approximately 89% of the references fell within fifteen years of the journal's publication date. In 1979, however, references were more evenly distributed over the fifteen years available on-line. In addition, a marked increase was noted in the number of cited references per article. To verify the citation analysis results, a random selection of senior authors of 1979 papers was surveyed. Major findings showed that: (1) approximately 29% of the respondents had used a computer bibliography to prepare the paper which was analyzed; (2) approximately 73% of those using a computer search used it in combination with a manual search; and (3) less than 22% of all respondents had never used a computer-generated bibliography.  相似文献   

16.
睡美人与王子文献的识别方法研究   总被引:1,自引:0,他引:1  
[目的/意义] 研究睡美人与王子文献的识别方法。分析唤醒机制,为未来在学术交流体系中发现"王子"作者,发掘、唤醒低被引和零被引文献的潜在价值提供理论依据。[方法/过程] 采用被引速率指标和睡美人指数两种客观指标识别1970-2005年临床医学四大名刊上发表的睡美人文献;基于以下4个原则寻找唤醒睡美人的王子文献:①发表于被引突增的附近年份;②本身被引次数较高;③与睡美人文献的同被引次数高;④在年度被引次数曲线上,王子文献对睡美人文献的"牵引或拉动"作用非常显著,即至少在睡美人文献引用突增的附近年份,王子文献的年度被引次数应高于睡美人文献。[结果/结论] 由于考虑了全部引文窗的引文曲线,被引速率指标能够识别出那些被引生命周期长、至今仍持续不断高频被引的论文;睡美人指数能够快速识别出睡美人文献,但却无法反映年度被引次数达到峰值之后的引文曲线;将被引速率+发表最初5年年均被引次数两个指标结合起来能够更好地识别睡美人文献。分析发现,综述、指南、著作等"共识型"的文献对于引发那些提出了新思想但尚未被认可的睡美人文献的被引突增起到了关键作用。建议事后识别睡美人文献可采用客观指标与主观界定相结合的方法,事前预测睡美人文献要注意追踪其是否被"共识性"文献推荐和引用,学术评价要特别关注被引速率低的论文。  相似文献   

17.
Various factors are believed to govern the selection of references in citation networks, but a precise, quantitative determination of their importance has remained elusive. In this paper, we show that three factors can account for the referencing pattern of citation networks for two topics, namely “graphenes” and “complex networks”, thus allowing one to reproduce the topological features of the networks built with papers being the nodes and the edges established by citations. The most relevant factor was content similarity, while the other two – in-degree (i.e. citation counts) and age of publication – had varying importance depending on the topic studied. This dependence indicates that additional factors could play a role. Indeed, by intuition one should expect the reputation (or visibility) of authors and/or institutions to affect the referencing pattern, and this is only indirectly considered via the in-degree that should correlate with such reputation. Because information on reputation is not readily available, we simulated its effect on artificial citation networks considering two communities with distinct fitness (visibility) parameters. One community was assumed to have twice the fitness value of the other, which amounts to a double probability for a paper being cited. While the h-index for authors in the community with larger fitness evolved with time with slightly higher values than for the control network (no fitness considered), a drastic effect was noted for the community with smaller fitness.  相似文献   

18.
Research on the evaluation of the quality of academic papers is attracting more attention from scholars in scientometrics. However, most previous researches have assessed paper quality based on external indicators, such as citations, which failed to account for the content of the research. To that end, this paper proposed a new method for measuring a paper's originality. The method was based on knowledge units in semantic networks, focusing on the relationship and semantic similarity of different knowledge units. Connectivity and path similarity between different content elements were used in particular networks as indicators of originality. This study used papers published between 2014 and 2018 in three categories (i.e. Library & Information Science, Educational Psychology, and Carbon Nanotubes) and divided their content into three parts (i.e. research topics, research methods and research results). It was found that the originality in all categories increase each year. Furthermore, a comparison of our new method with previous models of citation network analysis and knowledge combination analysis showed that our new method is better than those previous methods when used in measuring originality.  相似文献   

19.
In citation network analysis, complex behavior is reduced to a simple edge, namely, node A cites node B. The implicit assumption is that A is giving credit to, or acknowledging, B. It is also the case that the contributions of all citations are treated equally, even though some citations appear multiply in a text and others appear only once. In this study, we apply text-mining algorithms to a relatively large dataset (866 information science articles containing 32,496 bibliographic references) to demonstrate the differential contributions made by references. We (1) look at the placement of citations across the different sections of a journal article, and (2) identify highly cited works using two different counting methods (CountOne and CountX). We find that (1) the most highly cited works appear in the Introduction and Literature Review sections of citing papers, and (2) the citation rankings produced by CountOne and CountX differ. That is to say, counting the number of times a bibliographic reference is cited in a paper rather than treating all references the same no matter how many times they are invoked in the citing article reveals the differential contributions made by the cited works to the citing paper.  相似文献   

20.
p 指数运用于人才评价的有效性实证研究   总被引:2,自引:0,他引:2  
h指数用于高发文、高引用的学者评价是有效的,但对低发文、高引用的学者进行评价存在缺陷,且数值易于雷同,不易区分。p指数在学者研究绩效评价方面具有同h指数相一致的维度,它不仅考虑学者的被引次数(C),而且考虑学者的研究质量指标——平均被引率(C/N)。以图书情报与文献学科领域49位专家为例,对比分析专家的发文量(N)、被引次数(C)、平均被引率、专家h指标、g指数、p指数,并进行相关性分析。结论:p指数优于现有的h指数、g指数,更具有评价的合理性,应在更大范围内进一步使用。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号