Analyzing public sentiments online: combining human- and computer-based content analysis |
| |
Authors: | Leona Yi-Fan Su Michael A. Cacciatore Xuan Liang Dominique Brossard Dietram A. Scheufele Michael A. Xenos |
| |
Affiliation: | 1. Department of Life Sciences Communication, University of Wisconsin-Madison, Madison, WI, USA;2. Department of Advertising and Public Relations, University of Georgia, Athens, GA, USA;3. Department of Communication Arts, University of Wisconsin-Madison, Madison, WI, USA |
| |
Abstract: | Recent technological developments have created novel opportunities for analyzing and identifying patterns in large volumes of digital content. However, many content analysis tools require researchers to choose between the validity of human-based coding and the ability to analyze large volumes of content through computer-based techniques. This study argues for the use of supervised content analysis tools that capitalize on the strengths of human- and computer-based coding for assessing opinion expression. We begin by outlining the key methodological issues surrounding content analysis as performed by human coders and existing computational algorithms. After reviewing the most popular analytic approaches, we introduce an alternative, hybrid method that is aimed at improving reliability, validity, and efficiency when analyzing social media content. To demonstrate the usefulness of this method, we track nuclear energy- and nanotechnology-related opinion expression on Twitter surrounding the Fukushima Daiichi accident to examine the extent to which the volume and tone of tweets shift in directions consistent with the expected external influence of the event. Our analysis revealed substantial shifts in both the volume and tone of nuclear power-related tweets that were consistent with our expectations following the disaster event. Conversely, there was decidedly more stability in the volume and tone of tweets for our comparison issue. These analyses provide an empirical demonstration of how the presented hybrid method can analyze defined communication sentiment and topics from large-scale social media data sets. The implications for communication scholars are discussed. |
| |
Keywords: | Content analysis sentiment analysis human-based coding computer-based coding supervised machine learning Twitter |
|
|