Preliminary Program

Extracting Social Power Relationships from Natural Language

Philip Bramsen¹, Martha Escobar-Molano², Ami Patel³, Rafael Alonso⁴
¹SBTS, ², ³MIT, ⁴SET Corporation

Abstract

Sociolinguists have long argued that social context influences language use in all manner of ways, resulting in lects. This paper explores a text classification problem we will call lect modeling, an example of what has been termed computational sociolinguistics. In particular, we use machine learning techniques to identify social power relationships between members of a social network, based purely on the content of their interpersonal communication. We rely on statistical methods, as opposed to language-specific engineering, to extract features which represent vocabulary and grammar usage indicative of social power lects. We then apply support vector machines to model the social power lects representing superior-subordinate communication in the Enron email corpus. Our results validate the treatment of lect modeling as a text classification problem – albeit a hard one – and constitute a case for future research in computational sociolinguistics.

Full paper: http://www.aclweb.org/anthology/P/P11/P11-1078.pdf