Similar literature
 Found 20 similar documents; search took 31 ms
1.
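Citation analysis based on citation counts cannot directly reveal a paper's research content, and keywords or subject terms extracted from titles, abstracts, and full text can hardly reflect objectively why a paper is cited. Taking highly cited papers in carbon nanotube fiber research as the object of study, this paper extracts citation content and identifies topics. Manual review verifies that the core topics identified through citation content analysis reveal well why highly cited papers are cited (citation motivation) and are consistent with the papers' research content. Compared with topic identification based on full text or on titles and abstracts, topics identified through citation content analysis are more representative, effectively reveal the research content of the cited literature, and are an important supplement to the information in the original text. The experiments show that topic identification of highly cited papers based on citation content analysis is feasible and effective. 4 figures. 4 tables. 31 references.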

2.
Predicting the citation counts of academic papers is of considerable significance to scientific evaluation. This study used a four-layer Back Propagation (BP) neural network model to predict the five-year citations of 49,834 papers in the library, information and documentation field indexed by the CSSCI database and published from 2000 to 2013. We extracted six paper features, two journal features, nine author features, eight reference features, and five early citation features to make the prediction. The empirical experiments showed that the performance of the BP neural network is significantly better than those of the six baseline models. In terms of the prediction effect, the accuracy of the model at predicting infrequently cited papers was higher than that for frequently cited ones. We determined that five essential features have significant effects on the prediction performance of the model, i.e., ‘citations in the first two years’, ‘first-cited age’, ‘paper length’, ‘month of publication’, and ‘self-citations of journals’, and the other features contribute only slightly to the prediction.

3.
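[Purpose/Significance] Analyzing the "research front" of a specific field with citation analysis and other bibliometric methods is an important topic in current information science research. However, the Chinese term "研究前沿" actually corresponds to two English terms with different connotations, Research Front and Research Frontier; clarifying the relationship between them helps scholars improve and correctly apply "research front" analysis methods. [Method/Process] Building on an analysis of an intersection model of the two concepts Research Frontier and Research Front, and using the journals Scientometrics and JASIST as samples, the study tests the reliability of common citation-analysis indicators of the Research Front. [Result/Conclusion] The empirical results show that although highly cited papers have a relatively significant advantage over zero-cited papers in the novelty, topicality, and coupling relatedness of their references, these feature indicators cannot efficiently map potential highly cited papers. Research Front and Research Frontier are therefore distinct concepts with significant differences, and scholars should discuss the two concepts in greater depth.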

4.
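Academic papers are the main carrier of research output and of knowledge and information exchange, and being cited is one of the main ways academic results gain recognition. Studying the distribution of the papers cited by highly cited papers is important for understanding the composition of their knowledge base. Taking the ISI category Information Science & Library Science as an example, this empirical study samples highly cited papers in library and information science proportionally and looks up the citation counts of their references in order to characterize the citing behavior of highly cited papers. The study finds that highly cited papers in library and information science cite more highly cited references; as a paper's citation frequency declines, the proportion of highly cited papers among its references also declines.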

5.
Analysis of highly cited papers in 《编辑学报》   Total citations: 5, self-citations: 1, citations by others: 4
张建合 《编辑学报》2010,22(6):562-564
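Using CNKI's 《中国学术文献网络出版总库》 (China Academic Literature Network Publishing Database) as the statistical source, this paper analyzes the distribution of highly cited papers in 《编辑学报》 from the perspective of citation. The results show that the database includes 3,508 original articles published in 《编辑学报》 in 1989-2009, of which 2,545 have been cited, a cited rate of 73%; the total citation count is 15,863, and the highest citation count for a single paper is 71. A small number of papers account for a large share of the citations, broadly consistent with the "80/20 rule". Among the top 100 highly cited papers, the column with the most highly cited papers is 《理论研究》 (Theoretical Research; 46 papers), and the individual with the most highly cited papers is 游苏宁 (6 papers). The top 10 highly cited papers have been cited every year, showing sustained vitality.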

6.
刘洋  崔雷 《图书情报工作》2014,58(6):101-104
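Taking citation context as the object of study, this paper examines the degree of agreement among citation context, the abstract of the target paper, and the Medical Subject Headings (hereafter, subject headings) assigned to the target paper, and quantitatively analyzes the role of citation context in representing the content of the target paper. Five research papers frequently cited by the journal Circulation were taken as target papers; all citation contexts were extracted from their citing papers, segmented into words, and matched to subject headings. The results were compared pairwise with the subject headings extracted from the target papers' abstracts and with the subject headings assigned to the papers. The results show that citation context agrees closely with the target paper's abstract and has a clear advantage in representing the content of the cited paper.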

7.
Methods for identifying Sleeping Beauty and Prince papers   Total citations: 1, self-citations: 0, citations by others: 1
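[Purpose/Significance] To study methods for identifying Sleeping Beauty and Prince papers and to analyze the awakening mechanism, providing a theoretical basis for finding "Prince" authors in the scholarly communication system and for uncovering and awakening the latent value of low-cited and zero-cited literature. [Method/Process] Two objective indicators, the citation rate indicator and the Sleeping Beauty index, are used to identify Sleeping Beauty papers published in 1970-2005 in the four leading clinical medicine journals. Prince papers that awaken a Sleeping Beauty are sought using four principles: (1) the Prince is published in the years near the citation burst; (2) the Prince is itself relatively highly cited; (3) the Prince has a high co-citation count with the Sleeping Beauty; (4) on the annual citation curve, the Prince's "pulling" effect on the Sleeping Beauty is marked, that is, at least in the years near the Sleeping Beauty's citation burst, the Prince's annual citations should exceed the Sleeping Beauty's. [Result/Conclusion] Because it considers the citation curve over the whole citation window, the citation rate indicator can identify papers with a long citation life cycle that are still cited frequently today. The Sleeping Beauty index identifies Sleeping Beauty papers quickly but cannot reflect the citation curve after annual citations peak. Combining the citation rate indicator with the average annual citations in the first five years after publication identifies Sleeping Beauty papers better. The analysis finds that "consensus-type" literature such as reviews, guidelines, and monographs plays a key role in triggering the citation burst of Sleeping Beauty papers that proposed new ideas but were not yet recognized. The authors recommend that retrospective identification combine objective indicators with subjective judgment, that prospective prediction track whether a paper is recommended and cited by "consensus-type" literature, and that research evaluation pay particular attention to papers with a low citation rate.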
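
The abstract does not define its Sleeping Beauty index. As a rough illustration only, the sketch below assumes it is similar to the Beauty coefficient B of Ke et al. (2015), which sums, for each year up to the citation peak, the gap between a straight reference line (from the first year's citations to the peak) and the actual annual citations, scaled by max(1, c_t). The function name and the sample series are hypothetical.

```python
def beauty_coefficient(annual_citations):
    """Beauty coefficient B for one paper (assumed definition, Ke et al. 2015).

    annual_citations[t] is the number of citations received t years after
    publication. Only years up to the citation peak contribute, which is why
    an index of this kind says nothing about the curve after the peak.
    """
    if not annual_citations:
        return 0.0
    peak = max(annual_citations)
    t_m = annual_citations.index(peak)      # first year the peak is reached
    if t_m == 0:
        return 0.0                          # peak in publication year: no sleep
    c_0 = annual_citations[0]
    slope = (peak - c_0) / t_m              # reference line from c_0 to the peak
    total = 0.0
    for t in range(t_m + 1):
        c_t = annual_citations[t]
        line_t = slope * t + c_0
        total += (line_t - c_t) / max(1, c_t)
    return total


# Hypothetical series: four quiet years followed by a burst.
sleeper = [0, 0, 0, 0, 10]
# Hypothetical series: steady linear growth, which never "sleeps".
steady = [0, 1, 2, 3, 4]
print(beauty_coefficient(sleeper), beauty_coefficient(steady))
```

A long flat period followed by a late burst yields a large B, while a curve that rises linearly to its peak yields B = 0.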

8.
Review papers tend to be cited more frequently than regular research articles. This fact, together with the continuous increase of the share of reviews in scientific literature, can have important consequences for the measurement of individuals’ research output, usually based on citation analysis. However, studies evaluating the differences in citations of review papers compared to original research articles are almost non-existent in the literature. This paper presents a thorough analysis of the overcitation and overrepresentation of review papers in the most cited papers of the 35 largest subject categories in Science Citation Index-Expanded. Results indicate that the average citations received by reviews depend largely on the research area considered, varying from 1.34 to 6.74 times the citations received by original research articles (average value is 2.95). Correlated with this overcitation, there is an important overrepresentation of reviews in the most cited papers, this overrepresentation being greater when the most highly cited papers are considered, i.e. the 0.05% and 0.1% most cited papers, where the share of reviews has increased from 16-18% in 1990 to around 40% in 2010. Interestingly, the overcitation and overrepresentation in the most cited papers is more important in the areas with the lowest shares of reviews in total publications.

9.
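This paper proposes a method for measuring the degree of international recognition of scientific papers. Experimental results show that weighted international recognition has certain advantages over the international recognition proposed by Marek Kosmulski, the h-index, and citations per paper; it can compensate for the insensitivity of h-type indicators to changes in citation counts and for their unfairness to young researchers. It also helps avoid the interference in citation analysis caused by mutual citation among parties with shared interests.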

10.
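This paper constructs a novel feature space of factors influencing the citation of academic papers and, taking the SCI and SSCI papers of the authors' university as an example, verifies the effectiveness and accuracy of machine learning models in predicting citation counts; the conclusions can serve as a reference for university libraries offering decision-support services. The paper reviews research on the factors influencing citation counts and on prediction methods, builds a feature space combining traditional bibliometric and Altmetrics indicators, and experimentally compares three machine learning models (linear regression, neural networks, and support vector machines) for effectiveness and accuracy in predicting citation counts. The results show that prediction accuracy improves substantially with the feature space built from the Altmetrics perspective, and that the support vector machine model performs very well in the empirical study of predicting paper impact.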

11.
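Using the 中国学术期刊网络出版总库 (China Academic Journal Network Publishing Database) as the information source, this paper compiles statistics on the highly cited papers among all papers published by university libraries in Shanxi Province from 2000 to the present, and analyzes them in depth in terms of citation frequency, annual distribution, cited authors, institutions, source journals, and research hotspots. It aims to reveal the development patterns and characteristics of highly cited literature from Shanxi university libraries over the past 10 years and to provide a reference for university library staff in conducting research and improving their papers.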

12.
Modeling a century of citation distributions   Total citations: 1, self-citations: 1, citations by others: 0
The prevalence of uncited papers or of highly cited papers, with respect to the bulk of publications, provides important clues as to the dynamics of scientific research. Using 25 million papers and 600 million references from the Web of Science over the 1900–2006 period, this paper proposes a simple model based on a random selection process to explain the “uncitedness” phenomenon and its decline over the years. We show that the proportion of cited papers is a function of (1) the number of articles available (the competing papers), (2) the number of citing papers and (3) the number of references they contain. Using uncitedness as a departure point, we demonstrate the utility of the stretched-exponential function and a form of the Tsallis q-exponential function to fit complete citation distributions over the 20th century. As opposed to simple power-law fits, for instance, both these approaches are shown to be empirically well-grounded and robust enough to better understand citation dynamics at the aggregate level. On the basis of these models, we provide quantitative evidence and provisional explanations for an important shift in citation practices around 1960. We also propose a revision of the “citation classic” category as a set of articles which is clearly distinguishable from the rest of the field.

13.
俞立平 《图书情报工作》2016,60(12):109-114
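[Purpose/Significance] From the perspective of multi-attribute evaluation, the time axes of academic journal impact indicators are not uniform. There is temporal heterogeneity: the indicators variously cover citations to all papers since a journal's founding, citations to papers from the past five years, citations to papers from the past two years, citations in the evaluation year, and so on. This problem urgently needs solving. [Method/Process] Taking CSSCI science and technology management journals as an example, the paper proposes two new indicators, the next-year impact factor (隔年影响因子) and the next-year h-index (隔年h指数), and verifies their soundness using grey relational analysis and journal citation statistics. [Result/Conclusion] The design of journal citation indicators must respect principles such as timeliness, robustness, and within-year evaluation. In multi-attribute evaluation, journal citation indicators should share the same time axis as far as possible. Source indicators and next-year citation indicators both derive from data on the papers a journal published in the given year, so the multi-attribute evaluation essentially has a single time axis.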

14.
A bibliometric study of downloads and citations of open access papers   Total citations: 2, self-citations: 0, citations by others: 2
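Based on citation and download data from "中国科技论文在线" (Sciencepaper Online), this study examines the distribution over time of citations to open access papers and the correlation between citation counts and download counts. The results show that: (1) the time open access papers take to reach their citation peak is lengthening; the Internet has not accelerated the aging of open access papers and instead tends to slow it; (2) the correlation between a single paper's download count and its citation count is not significant; (3) applied papers and review papers are more prone to "high download, low citation"; (4) viewed over the long term and in aggregate, download counts and citation counts of open access papers show a positive correlation; (5) citation counts mainly reflect the intrinsic quality of open access papers, while download counts reflect external evaluation indicators (identifiability, accessibility, speed of dissemination, etc.). 3 figures. 2 tables. 39 references.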

15.
Characteristic scores and scales (CSS) – a well-established scientometric tool for the study of citation counts – have been used to document a striking phenomenon that characterizes citation distributions at high levels of aggregation: irrespective of scientific field and citation window empirical studies find a persistent pattern whereby about 70% of scientific papers belong to the class of poorly cited papers, about 21% belong to the class of fairly cited papers, 6% to that of remarkably cited papers and 3% to the class of outstandingly cited papers. This article aims to advance the understanding of this remarkable result by examining it in the context of the lognormal distribution, a popular model used to describe citation counts across scientific fields. The article shows that the application of the CSS method to lognormal distributions provides a very good fit to the 70–21–6–3% empirical pattern provided these distributions are characterized by a standard deviation parameter in the range of about 0.8–1.3. The CSS pattern is essentially explainable as an epiphenomenon of the lognormal functional form and, more generally, as a consequence of the skewness of science which is manifest in heavy-tailed citation distributions.
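
The abstract names the CSS method without spelling it out. The sketch below assumes the usual construction: the first threshold is the mean of all papers, and each further threshold is the mean of the papers at or above the previous one. It applies this to a synthetic lognormal sample; the parameters (mu = 0, sigma = 1), the sample size, and the function names are illustrative choices, not values from the article.

```python
import random


def css_thresholds(citations, levels=3):
    """Iterated means: the mean of all papers, then of those at or above it, etc."""
    thresholds = []
    subset = list(citations)
    for _ in range(levels):
        if not subset:
            break
        mean = sum(subset) / len(subset)
        thresholds.append(mean)
        subset = [c for c in subset if c >= mean]
    return thresholds


def css_shares(citations, levels=3):
    """Share of papers in each CSS class, from poorly to outstandingly cited."""
    thresholds = css_thresholds(citations, levels)
    counts = [0] * (len(thresholds) + 1)
    for c in citations:
        # Thresholds are non-decreasing, so the class index is the number
        # of thresholds the paper reaches.
        counts[sum(1 for b in thresholds if c >= b)] += 1
    return [k / len(citations) for k in counts]


# Synthetic "citation" sample: lognormal with an assumed sigma of 1.0.
rng = random.Random(0)
sample = [rng.lognormvariate(0.0, 1.0) for _ in range(50000)]
shares = css_shares(sample)
print([round(s, 3) for s in shares])
```

For a lognormal with sigma = 1, the probability of falling below the mean is Φ(0.5) ≈ 0.69, so the first class should hold roughly 69% of the sample, in line with the pattern the abstract describes.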

16.
The findings of Bornmann, Leydesdorff, and Wang (2013b) revealed that the consideration of journal impact improves the prediction of long-term citation impact. This paper further explores the possibility of improving citation impact measurements on the basis of a short citation window by the consideration of journal impact and other variables, such as the number of authors, the number of cited references, and the number of pages. The dataset contains 475,391 journal papers published in 1980 and indexed in Web of Science (WoS, Thomson Reuters), and all annual citation counts (from 1980 to 2010) for these papers. As an indicator of citation impact, we used percentiles of citations calculated using the approach of Hazen (1914). Our results show that citation impact measurement can really be improved: If factors generally influencing citation impact are considered in the statistical analysis, the explained variance in the long-term citation impact can be much increased. However, this increase is only visible when using the years shortly after publication but not when using later years.
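
Hazen's plotting position assigns the paper with rank i among n papers the percentile 100 · (i − 0.5) / n. The minimal sketch below applies that formula to a list of citation counts. Averaging ranks for tied counts is an assumption made here for illustration; the abstract does not say how ties were handled.

```python
def hazen_percentiles(citations):
    """Hazen percentiles, 100 * (rank - 0.5) / n, with ascending ranks.

    Tied citation counts share the average of their ranks (an assumption).
    The result is in the same order as the input list.
    """
    n = len(citations)
    order = sorted(range(n), key=lambda i: citations[i])
    percentiles = [0.0] * n
    pos = 0
    while pos < n:
        # Extend `end` over the run of papers tied with the one at `pos`.
        end = pos
        while end + 1 < n and citations[order[end + 1]] == citations[order[pos]]:
            end += 1
        avg_rank = (pos + 1 + end + 1) / 2  # ranks are 1-based
        for k in range(pos, end + 1):
            percentiles[order[k]] = 100 * (avg_rank - 0.5) / n
        pos = end + 1
    return percentiles


# Hypothetical citation counts for four papers.
print(hazen_percentiles([0, 5, 10, 20]))
```

Unlike rank / n, this form never yields 0 or 100, so the least and most cited papers are treated symmetrically.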

17.
The non-citation rate refers to the proportion of papers that do not attract any citation over a period of time following their publication. After reviewing all the related papers in the Web of Science, Google Scholar and Scopus databases, we find that the current literature on citation distribution focuses more on the distribution of the percentages and citations of papers receiving at least one citation, while there are fewer studies on the time-dependent patterns of the percentage of never-cited papers, on what distribution model can fit their time-dependent patterns, and on the factors influencing the non-citation rate. Here, we perform an empirical pilot analysis of the time-dependent distribution of the percentages of never-cited papers in a series of different, consecutive citation time windows following their publication in our six selected sample journals, and study the influence of paper length on the chance of papers’ getting cited. Through the above analysis, the following general conclusions are drawn: (1) a three-parameter negative exponential model fits the time-dependent distribution curve of the percentages of never-cited papers well; (2) in the initial citation time window, the percentage of never-cited papers in each journal is very high. However, as the citation time window becomes wider, the percentage of never-cited papers drops rapidly at first and then more slowly, and the total degree of decline for most journals is very large; (3) when wider citation time windows are applied, the percentage of never-cited papers for each journal approaches a stable value, after which there will be very few changes in these stable percentages, unless we meet a large number of “Sleeping Beauties” type papers; (4) the length of a paper has a great influence on whether it will be cited or not.

18.
In citation network analysis, complex behavior is reduced to a simple edge, namely, node A cites node B. The implicit assumption is that A is giving credit to, or acknowledging, B. It is also the case that the contributions of all citations are treated equally, even though some citations appear multiply in a text and others appear only once. In this study, we apply text-mining algorithms to a relatively large dataset (866 information science articles containing 32,496 bibliographic references) to demonstrate the differential contributions made by references. We (1) look at the placement of citations across the different sections of a journal article, and (2) identify highly cited works using two different counting methods (CountOne and CountX). We find that (1) the most highly cited works appear in the Introduction and Literature Review sections of citing papers, and (2) the citation rankings produced by CountOne and CountX differ. That is to say, counting the number of times a bibliographic reference is cited in a paper rather than treating all references the same no matter how many times they are invoked in the citing article reveals the differential contributions made by the cited works to the citing paper.
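
The difference between the two counting methods can be sketched as follows, assuming CountOne credits a reference once per citing paper and CountX credits it once per in-text mention. The function names, the data layout (one list of mentioned reference IDs per citing paper), and the sample data are hypothetical.

```python
from collections import Counter


def count_one(papers):
    """Credit each reference at most once per citing paper."""
    counts = Counter()
    for mentions in papers:
        counts.update(set(mentions))
    return counts


def count_x(papers):
    """Credit each reference once for every in-text mention."""
    counts = Counter()
    for mentions in papers:
        counts.update(mentions)
    return counts


# Hypothetical data: each inner list holds the in-text mentions in one paper.
papers = [
    ["A", "A", "A", "B"],
    ["B", "C"],
    ["B", "A", "A"],
]
print(count_one(papers).most_common())
print(count_x(papers).most_common())
```

In this toy data, reference B is cited by the most papers while reference A is mentioned most often, so the two methods rank them differently.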

19.
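An empirical study of the validity of the p-index for evaluating researchers   Total citations: 2, self-citations: 0, citations by others: 2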
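The h-index is effective for evaluating scholars with many publications and many citations, but it is flawed for scholars with few publications and many citations, and its values tend to coincide, which makes scholars hard to tell apart. The p-index has dimensions consistent with the h-index for evaluating research performance; it considers not only a scholar's citation count (C) but also a quality indicator, the average citation rate (C/N). Taking 49 experts in library, information and documentation science as an example, the study compares the experts' publication counts (N), citation counts (C), average citation rates, h-index, g-index, and p-index, and performs a correlation analysis. Conclusion: the p-index is superior to the existing h-index and g-index, is a more reasonable evaluation measure, and should be used more widely.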
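
The abstract gives the ingredients of the p-index (C and C/N) but not the formula. The sketch below assumes the commonly cited form p = (C · C/N)^(1/3), alongside the standard definitions of the h-index and g-index (with g capped at the number of papers). The two scholar profiles are made-up numbers chosen to show the few-papers, many-citations case.

```python
def h_index(citations):
    """Largest h such that h papers have at least h citations each."""
    h = 0
    for rank, c in enumerate(sorted(citations, reverse=True), start=1):
        if c < rank:
            break
        h = rank
    return h


def g_index(citations):
    """Largest g such that the top g papers together have at least g^2 citations."""
    g, total = 0, 0
    for rank, c in enumerate(sorted(citations, reverse=True), start=1):
        total += c
        if total >= rank * rank:
            g = rank
    return g


def p_index(citations):
    """Assumed form: cube root of C * (C / N), with C citations over N papers."""
    n = len(citations)
    if n == 0:
        return 0.0
    c = sum(citations)
    return (c * c / n) ** (1 / 3)


# Hypothetical profiles with the same total citations (C = 100).
prolific = [10] * 10      # many papers, moderately cited
selective = [50, 50]      # few papers, highly cited
for name, cites in (("prolific", prolific), ("selective", selective)):
    print(name, h_index(cites), g_index(cites), round(p_index(cites), 2))
```

With equal C, the h-index of the second profile cannot exceed its two papers, whereas the assumed p-index rises with the higher average citation rate.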

20.
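Using citation analysis, this article compiles statistics on bibliography studies (目录学) papers published in 2005-2010 and analyzes the current state of the field in four respects: number of references, disciplinary categories of references, number and citation counts of highly cited authors and highly cited papers, and co-citation counts. Finally, it applies multivariate statistical methods of citation analysis (cluster analysis and multidimensional scaling) to explore the shortcomings of research in bibliography studies.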
