Similar literature
 20 similar documents found
1.
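To investigate the relationship between the download counts of Chinese academic papers and their linguistic features, this study takes as its sample 6,257 academic papers published in 2014-2017 in seven CSSCI-indexed journals in library and information science, and uses eight linguistic feature indicators to measure the linguistic features of highly downloaded papers (Top 20%), rarely downloaded papers (Bottom 20%), and all papers. In terms of medians and means, the titles of highly downloaded papers in each journal are almost always shorter than those of all papers and of rarely downloaded papers, while abstract lexical diversity, body length, body sentence length, and body lexical diversity are on the whole greater than those of all papers and of rarely downloaded papers. In terms of significance tests, the differences overall did not pass, although specific linguistic feature indicators for specific journals on specific platforms did. Based on the sample data, linguistic features overall have very little effect on the download counts of Chinese academic papers, but do have some effect for particular journals on particular platforms.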

2.
In competitive research environments, scholars have a natural interest in maximizing the prestige associated with their scientific work. In order to identify factors that might help them address this goal more effectively, the scientometric literature has tried to link linguistic and meta characteristics of academic papers to the associated degree of scientific prestige, conceptualized as cumulative citation counts. In this paper, we take an alternative approach that instead understands scientific prestige in terms of the rankings of the journals that the articles appeared in, as such rankings are routinely used as surrogate research quality indicators. For the purpose of determining the most important drivers of such prestige, we use state-of-the-art text mining tools to extract 344 interpretable features from a large corpus of over 200,000 journal articles in economics. We then estimate beta regression models to investigate the relationship between these predictors and a cross-sectionally standardized version of SCImago Journal Rank (SJR) in multiple topically homogeneous clusters. In so doing, we also reinvestigate the bafflegab theory, according to which more prestigious research papers tend to be less readable, in a methodologically novel way. Our results show the consistently most informative predictors to be associated with the length of the paper, the span of coreference chains in its full text, the deployment of a personal and moderately informal writing style, the “density” of the article in terms of sentences per page, international and institutional collaboration in research teams and the references cited in the paper. Moreover, we identify various linguistic intricacies that matter in the association between readability and scientific prestige, which suggest this relationship to be more complicated than previously assumed.

3.
4.
Document clustering of scientific texts using citation contexts   Total citations: 3, self-citations: 0, citations by others: 3
Document clustering has many important applications in the area of data mining and information retrieval. Many existing document clustering techniques use the “bag-of-words” model to represent the content of a document. However, this representation is only effective for grouping related documents when these documents share a large proportion of lexically equivalent terms. In other words, instances of synonymy between related documents are ignored, which can reduce the effectiveness of applications using a standard full-text document representation. To address this problem, we present a new approach for clustering scientific documents, based on the utilization of citation contexts. A citation context is essentially the text surrounding the reference markers used to refer to other scientific works. We hypothesize that citation contexts will provide relevant synonymous and related vocabulary which will help increase the effectiveness of the bag-of-words representation. In this paper, we investigate the power of these citation-specific word features, and compare them with the original document’s textual representation in a document clustering task on two collections of labeled scientific journal papers from two distinct domains: High Energy Physics and Genomics. We also compare these text-based clustering techniques with a link-based clustering algorithm which determines the similarity between documents based on the number of co-citations, that is, in-links represented by citing documents and out-links represented by cited documents. Our experimental results indicate that the use of citation contexts, when combined with the vocabulary in the full text of the document, is a promising alternative means of capturing critical topics covered by journal articles. More specifically, this document representation strategy, when used by the clustering algorithm investigated in this paper, outperforms both the full-text clustering approach and the link-based clustering technique on both scientific journal datasets.

5.
6.
To some extent, written academic discourse represents the knowledge and practices of the academic community. Studies investigating writing styles in various disciplines have flourished, but fewer studies have leveraged multi-perspective linguistic indices to analyze academic writings, especially in the information science and library science (IS-LS) domain. This study attempts to provide an overview of how writing styles have evolved over the past 30 years across various subfields in IS-LS from multiple perspectives, that is, lexical complexity, cohesion, syntactic complexity, and readability. Based on a large set of abstracts of academic papers published in IS-LS, the empirical findings showed that the readability, cohesion, and lexical sophistication of abstracts in the IS-LS domain have increased over time, indicating that abstracts tend to contain more information but become less accessible. The gradual improvement in cohesion suggests that academic writing logic has increased, and the rigor of knowledge construction of scientific papers has improved. Furthermore, considerable linguistic variations emerged between subfields in the IS-LS domain, particularly at the lexical level. This study suggested that different subfields had various writing styles due to their research topics, methodologies, orientations, etc. The study also found that papers published in top quartile journals and those that gained more citations typically had higher lexical density, lexical sophistication, cohesion, and readability. This suggests that influential papers tend to carry more information, address more complex scientific issues, and exercise caution in knowledge construction and presentation.

7.
In the second half of the 20th century, scientific research in physics, chemistry, and engineering began to focus on the use of large government-funded laboratories. This shift toward so-called big science also brought about a concomitant change in scientific work itself, with a sustained trend toward the use of highly specialized scientific teams, elevating the role of team characteristics in scientific outputs. The actual impact of scientific knowledge is commonly measured by how often peer-reviewed publications are, in turn, cited by other researchers. Therefore, how characteristics such as author team seniority, affiliation diversity, and size affect the overall impact of team publications was examined. Citation information and author demographics were reviewed for 123 articles published in Physical Review Letters from 2004 to 2006 and written by 476 scientists who used the National High Magnetic Field Laboratory's facilities. Correlation analysis indicated that author teams that were more multi-institutional and had homogeneous seniority tended to have more senior scientists. In addition, the analysis suggests that more mixed seniority author teams were likely to be less institutionally dispersed. Quantile regression was used to examine the relationships between author-team characteristics and publication impact. The analysis indicated that both weighted average seniority and average seniority had a negative relationship with the number of citations the publication received. Furthermore, the analysis also showed a positive relationship between first-author seniority and the number of citations, and a negative relationship between the number of authors and the number of citations.

8.
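Which factors affect the citation counts of academic papers is a classic research question in bibliometrics. Current research mainly examines the relationship between citation counts and the content and formal features of papers; few studies approach the question from the perspective of text readability. Text readability affects how readers understand a text and absorb its knowledge, and is an important factor bearing on the efficiency of knowledge dissemination and the recognition of research results. Controlling for the knowledge quality and authority of papers, this study uses five variables, including the text readability R value, to examine the effect of text readability on citation counts. Taking papers published from 2016 to 2020 in well-known Chinese library and information science journals as the sample, the study finds that the text readability R value, the use of a compound title, and the use of formulas and tables have significant effects on citation counts, whereas the use of figures does not. The study confirms that text readability plays a substantive role in paper impact in the Chinese context, and its results offer a useful reference for researchers seeking to improve their Chinese academic writing and increase the impact of their research.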

9.
Predatory publishing has become a much-discussed and highly visible phenomenon over the past few years. One widespread, but hardly tested, assumption is the idea that articles published in predatory journals deviate substantially from those published in traditional journals. In this paper, we address this assumption by utilizing corpus linguistic tools. We compare the ‘academic-like’ nature of articles from two different journals in political science, one top-ranking and one alleged predatory. Our findings indicate that there is significant linguistic variation between the two corpora along the dimensions that we test. The articles display notable differences in the types and usage of keywords in the two journals. We conclude that articles published in so-called predatory journals do not conform to linguistic norms used in higher-quality journals. These findings may demonstrate a lack of quality control in predatory journals but may also indicate a lack of awareness and use of such linguistic norms by their authors. We also suggest that there is a need for the education of authors in science writing as this may enable them to publish in higher-ranked and quality-assured outlets.

10.
11.
Genre is considered to be an important element in scholarly communication and in the practice of scientific disciplines. However, scientometric studies have typically focused on a single genre, the journal article. The goal of this study is to understand the role that handbooks play in knowledge creation and diffusion and their relationship with the genre of journal articles, particularly in highly interdisciplinary and emergent social science and humanities disciplines. To shed light on these questions we focused on handbooks and journal articles published over the last four decades belonging to the research area of science and technology studies (STS), broadly defined. To get a detailed picture we used the full text of five handbooks (500,000 words) and a well-defined set of 11,700 STS articles. We confirmed the methodological split of STS into qualitative and quantitative (scientometric) approaches. Even when the two traditions explore similar topics (e.g., science and gender) they approach them from different starting points. The change in cognitive foci in both handbooks and articles partially reflects the changing trends in STS research, often driven by technology. Using text similarity measures we found that, in the case of STS, handbooks play no special role in either focusing the research efforts or marking their decline. In general, they do not represent the summaries of research directions that have emerged since the previous edition of the handbook.

12.
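The role of references in scientific papers and problems in their formatting   Total citations: 2, self-citations: 0, citations by others: 2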
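References are a part of a scientific paper. They support and corroborate the author's own research, and serve functions such as continuity with earlier work, argumentation, and explanation. By identifying the problems in current reference formatting, this paper reminds researchers to pay attention to the formatting of end-of-paper reference lists, with the aim of raising the quality of scientific papers.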

13.
Meta-analysis refers to the statistical methods used in research synthesis for combining and integrating results from individual studies. In this regard meta-analytical studies share with narrative reviews the goal of synthesizing the scientific literature on a particular topic, while, as in the case of standard articles, they present new results. This study aims to identify the potential similarities and differences between meta-analytical studies, reviews and standard articles as regards their impact and structural features in the field of psychology. To this end a random sample of 335 examples of each type of document was selected from the Thomson Reuters Web of Science database. The results showed that meta-analytical studies receive more citations than do both reviews and standard articles. All three types of documents showed a similar pattern in terms of institutional collaboration, while reviews and meta-analytical studies had a similar number of authors per document. However, reviews had a greater number of references and pages than did meta-analytical studies. The implications of these results for the scientific community are discussed.

14.
There have been many recent changes to PubMed to enhance its usefulness. Those changes include: LinkOut Libraries (local holding field), PubMed Central (full-text articles archived by the National Library of Medicine), and LinkOut (access to full-text articles right from the PubMed citation). Medical librarians should be aware of how these features work to best assist their clients. These new features offer the possibility of true desktop access for library patrons. Not only will patrons appreciate these new features, but their use in libraries will literally change what we do, who does it, and how it is done.

15.
We performed a citation analysis on the Web of Science publications consisting of more than 63 million articles and over a billion citations on 254 subjects from 1981 to 2020. We proposed the Article’s Scientific Prestige (ASP) metric and compared this metric to number of citations (#Cit) and journal grade in measuring the scientific impact of individual articles in the large-scale hierarchical and multi-disciplined citation network. In contrast to #Cit, ASP, which is computed based on the eigenvector centrality, considers both direct and indirect citations, and provides steady-state evaluation across different disciplines. We found that ASP and #Cit are not aligned for most articles, with a growing mismatch amongst the less cited articles. While both metrics are reliable for evaluating the prestige of articles such as Nobel Prize winning articles, ASP tends to provide more persuasive rankings than #Cit when the articles are not highly cited. The journal grade, which is eventually determined by a few highly cited articles, is unable to properly reflect the scientific impact of individual articles. The number of references and coauthors are less relevant to scientific impact, but subjects do make a difference.
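
The contrast between direct citation counts and a centrality-based score can be illustrated on a toy network. The sketch below is not the ASP metric itself, whose exact definition the abstract does not give. It swaps in a PageRank-style damped power iteration, a common eigenvector-centrality variant for directed citation graphs. The function name `prestige_scores`, the damping value, and the five-paper example network are all assumptions made for illustration.

```python
def prestige_scores(citations, damping=0.85, iterations=100):
    """PageRank-style score for each paper (an illustrative stand-in, not ASP).

    citations maps each paper to the list of papers it cites.
    """
    papers = sorted(set(citations) | {p for refs in citations.values() for p in refs})
    n = len(papers)
    score = {p: 1.0 / n for p in papers}
    for _ in range(iterations):
        # Every paper starts each round with the same small baseline share.
        new = {p: (1.0 - damping) / n for p in papers}
        for p in papers:
            refs = citations.get(p, [])
            if refs:
                # A citing paper passes its score on to the papers it cites.
                share = damping * score[p] / len(refs)
                for r in refs:
                    new[r] += share
            else:
                # A paper that cites nothing spreads its score evenly.
                share = damping * score[p] / n
                for q in papers:
                    new[q] += share
        score = new
    return score


# Hypothetical network: C and D each receive two direct citations,
# but D is cited by C, which is itself cited by A and B.
toy_network = {"A": ["C"], "B": ["C"], "C": ["D"], "E": ["D"]}
scores = prestige_scores(toy_network)
for paper in sorted(scores, key=scores.get, reverse=True):
    print(paper, round(scores[paper], 4))
```

In this toy network C and D have the same citation count, yet D ranks higher because one of its citations comes from a paper that is itself cited. This is the kind of indirect effect a raw count cannot capture.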

16.
The number of received citations has been used as an indicator of the impact of academic publications. Developing tools to find papers that have the potential to become highly cited has recently attracted increasing scientific attention. Topics of concern to scholars may change over time in accordance with research trends, resulting in changes in received citations. Author-defined keywords, title and abstract provide valuable information about a research article. This study applies a latent Dirichlet allocation technique to extract topics and keywords from articles; five keyword popularity (KP) features are defined as indicators of emerging trends of articles. A number of supervised learning techniques are used to build binary classification models that predict whether papers are highly cited or less highly cited. We empirically compare KP features of articles with other commonly used journal-related and author-related features proposed in previous studies. The results show that, with KP features, the prediction models are more effective than those with journal and/or author features, especially in the management information system discipline.

17.
The objective of this work was to examine the relationship between attitudes about publishing across disciplines and the scientific impact of authors. We conducted a web survey of 1066 authors randomly selected from four disciplines in the Web of Knowledge: economics, anthropology, water resources and biochemistry (approximately 250 from each discipline). Authors were asked questions about publishing norms within their discipline. The h-index of authors was subsequently calculated from data available from the Web of Knowledge. Authors in biochemistry had on average twice the h-index of those in economics, anthropology and water resources. Biochemists had higher expectations about the number of articles published for hire and promotion, more strongly valued interdisciplinary publishing, felt the cutting edge of their science was clearer, and had more defined patterns of author credit assignment than the other disciplines. Anthropologists exhibited a lower relationship between h-index and the number of years since their first publication. We conclude that attitudinal differences between disciplines may lead to differences in the recognition of scientific findings and therefore the establishment of normal science.
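
For readers unfamiliar with the h-index used above, a minimal sketch of the standard definition follows: an author has index h if h of their papers have at least h citations each. The function name `h_index` and the example citation counts are illustrative assumptions, not data from the study.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank
        else:
            # Counts are sorted in descending order, so no later paper can qualify.
            break
    return h


# Hypothetical citation counts for one author's five papers.
example_counts = [10, 8, 5, 4, 3]
print(h_index(example_counts))  # 4
```

Because the index depends on both the number of papers and the citations each receives, fields with different publication rates and citation habits tend to produce different typical values.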

18.
Research Strategies, 1998, 16(4): 301-307
The ever-growing number of full-text electronic databases and their increasing availability has helped to create greater expectations among the undergraduate students who use them in their research. A study was made of nearly three hundred students enrolled in composition classes to determine the indexes and databases most commonly used for assignments and whether having the full text of journal articles online played a role in shaping the nature of their research. The survey results confirmed the authors' belief from encounters at the reference desk that students are becoming dependent on the availability of full-text databases and are using them in some cases to the exclusion of all other information sources.

19.
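As academic resources are shared more widely, the full texts of more and more academic papers are becoming openly accessible at scale, providing a convenient data foundation and broad application prospects for full-text-based research on the diffusion of micro-entities. However, in terms of analytical granularity, previous studies have mostly treated documents, authors, or topics as the main carriers of knowledge diffusion, and have paid less attention to micro-entities drawn from the full text of the literature. In fact, as the main internal driver of knowledge diffusion, micro-entities are the substantive content transmitted through citation relationships. Taking molecular biology as an example, this paper selects 1,000 XML full-text articles from the field, manually annotates four classes of micro-entities (theories and concepts, tools and techniques, data and information, and domain-specific entities), and builds a micro-entity extraction model with BiLSTM-CRF, achieving precision, recall, and F1 of 0.7618, 0.7099, and 0.7349, respectively. On this basis, a micro-entity diffusion network is constructed, and the diffusion patterns of micro-entities at the macro and micro levels are shown through visualization. At the macro level, domain-specific micro-entities account for the largest share, indicating that scholars citing literature tend to cite micro-entities from within their own research field. At the micro level, the network clearly and intuitively reveals the paths along which specific micro-entities flow between documents, making it easier to grasp the direction in which micro-entities emerge and develop.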
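
The three evaluation figures reported above can be cross-checked, since F1 is the harmonic mean of precision and recall. The short sketch below recomputes it from the reported precision (0.7618) and recall (0.7099). The function name `f1_score` is an illustrative choice, not code from the study.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract.
reported_precision, reported_recall = 0.7618, 0.7099
print(round(f1_score(reported_precision, reported_recall), 4))  # 0.7349
```

The recomputed value agrees with the reported F1 of 0.7349 to four decimal places.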

20.
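[Purpose/Significance] In scientific research, research teams advance science through academic exchange and interaction. Taking computational linguistics as an example, this study identifies the roles of research teams in a scientific field and explores their characteristics. [Method/Process] An institution-author collaboration network is constructed and a community detection algorithm is applied to discover research teams. A team citation network is then built from paper citation relationships, and, based on the bow-tie model and network position theory, teams are divided into four roles: leaders, intermediaries, followers, and isolates. [Results/Conclusions] Teams in different roles differ in three respects: number of members, publication output, and collaboration intensity. For example, leader teams are the fewest in number but larger on average; follower teams are the most numerous but smaller; for intermediary teams, collaboration density is significantly negatively correlated with publication output; and isolate teams have almost no citing or cited relationships with other teams.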
