Predicting survey responses: how and why semantics shape survey statistics on organizational behaviour

Arnulf JK; Larsen KR; Martinsen ØL; Bong CH

doi:10.1371/journal.pone.0106361

Arnulf JK ¹ , Larsen KR ² , Martinsen ØL ¹ , Bong CH ³

Affiliations

¹ Department of Leadership and Organizational Behaviour, BI Norwegian Business School, Oslo, Norway
² Management and Entrepreneurship Division, Leeds School of Business, University of Colorado at Boulder, Boulder, Colorado, United States of America
³ Faculty of Computer Science and Information Technology, University of Malaysia at Sarawak, Sarawak, Malaysia

PLoS One, 2014;9(9):e106361.

PMID: 25184672 DOI: 10.1371/journal.pone.0106361

Abstract

Some disciplines in the social sciences rely heavily on collecting survey responses to detect empirical relationships among variables. We explored whether these relationships were a priori predictable from the semantic properties of the survey items, using language processing algorithms which are now available as new research methods. Language processing algorithms were used to calculate the semantic similarity among all items in state-of-the-art surveys from Organisational Behaviour research. These surveys covered areas such as transformational leadership, work motivation and work outcomes. This information was used to explain and predict the response patterns from real subjects. Semantic algorithms explained 60-86% of the variance in the response patterns and allowed remarkably precise prediction of survey responses from humans, except in a personality test. Even the relationships between independent and their purported dependent variables were accurately predicted. This raises concern about the empirical nature of data collected through some surveys if results are already given a priori through the way subjects are being asked. Survey response patterns seem heavily determined by semantics. Language algorithms may suggest these prior to administering a survey. This study suggests that semantic algorithms are becoming new tools for the social sciences, opening perspectives on survey responses that prevalent psychometric theory cannot explain.
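The workflow described in the abstract (compute semantic similarity between every pair of survey items, then check how much of the observed response pattern that similarity explains) can be sketched roughly as follows. This is a minimal illustration only. The abstract does not say which language processing algorithms were used, so a plain bag-of-words cosine similarity stands in for them here. The item wordings, the placeholder correlation values, and all function names are hypothetical and are not taken from the study.

```python
import math
import re
from collections import Counter
from itertools import combinations


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def cosine_similarity(a, b):
    """Bag-of-words cosine similarity between two item texts.

    Assumption: a simple stand-in for the semantic algorithms
    mentioned in the abstract, which are not specified there.
    """
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[w] * vb[w] for w in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def pairwise_similarities(items):
    """Similarity for every unordered pair of items, in index order."""
    return [
        cosine_similarity(items[i], items[j])
        for i, j in combinations(range(len(items)), 2)
    ]


def r_squared(x, y):
    """Share of variance in y explained by a simple linear fit on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return (sxy * sxy) / (sxx * syy)


# Hypothetical item wordings, invented for illustration.
ITEMS = [
    "My manager talks optimistically about the future",
    "My manager expresses confidence about the future",
    "I enjoy the tasks in my job",
    "I plan to look for a new job",
]

# Placeholder inter-item correlations, one per item pair in the same
# order as pairwise_similarities(ITEMS). NOT data from the study.
PLACEHOLDER_CORRELATIONS = [0.70, 0.20, 0.10, 0.25, 0.05, 0.30]

if __name__ == "__main__":
    sims = pairwise_similarities(ITEMS)
    print("explained variance:", r_squared(sims, PLACEHOLDER_CORRELATIONS))
```

With real data, the placeholder list would be replaced by the correlations observed among respondents' answers, and the bag-of-words measure by whatever semantic algorithm is under study.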
