Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method

Yabin Zheng1,  Lixing Xie1,  Zhiyuan Liu1,  Maosong Sun1,  Yang Zhang2,  Liyun Ru2
1Tsinghua University, 2Sogou Inc.


Abstract

Chinese Pinyin input method is very important for Chinese language information processing. Users may make errors when they are typing in Chinese words. In this paper, we are concerned with the reasons that cause the errors. Inspired by the observation that pressing backspace is one of the most common user behaviors to modify the errors, we collect 54,309,334 error-correction pairs from a real-world data set that contains 2,277,786 users via backspace operations. In addition, we present a comparative analysis of the data to achieve a better understanding of users' input behaviors. Comparisons with English typos suggest that some language-specific properties result in a part of Chinese input errors.




Full paper: http://www.aclweb.org/anthology/P/P11/P11-2085.pdf