首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Applying LDA Topic Modeling in Communication Research: Toward a Valid and Reliable Methodology
Authors:Daniel Maier  A Waldherr  P Miltner  G Wiedemann  A Niekler  A Keinert
Institution:1. Institute for Media and Communication Studies, Free University Berlin, Berlin, Germanymaier@zedat.fu-berlin.de;3. Department of Communication, University of Münster, Münster, Germany;4. Institute for Media and Communication Studies, Free University Berlin, Berlin, Germany;5. Computer Science Institute, University of Leipzig, Leipzig, Germany
Abstract:ABSTRACT

Latent Dirichlet allocation (LDA) topic models are increasingly being used in communication research. Yet, questions regarding reliability and validity of the approach have received little attention thus far. In applying LDA to textual data, researchers need to tackle at least four major challenges that affect these criteria: (a) appropriate pre-processing of the text collection; (b) adequate selection of model parameters, including the number of topics to be generated; (c) evaluation of the model’s reliability; and (d) the process of validly interpreting the resulting topics. We review the research literature dealing with these questions and propose a methodology that approaches these challenges. Our overall goal is to make LDA topic modeling more accessible to communication researchers and to ensure compliance with disciplinary standards. Consequently, we develop a brief hands-on user guide for applying LDA topic modeling. We demonstrate the value of our approach with empirical data from an ongoing research project.
Keywords:
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号