Piggyback: Using Search Engines for Robust Cross-Domain Named Entity Recognition

Stefan Rüd¹, Massimiliano Ciaramita², Jens Müller¹, Hinrich Schütze¹
¹IfNLP, ²Google


Abstract

We use search engine results to address a particularly difficult cross-domain language processing task, the adaptation of named entity recognition (NER) from news text to web queries. The key novelty of the method is that we submit a token with context to a search engine and use similar contexts in the search results as additional information for correctly classifying the token. We achieve strong gains in NER performance on news, in-domain and out-of-domain, and on web queries.
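
As a rough illustration of the idea described in the abstract, the sketch below builds a query from a token and its neighboring words, passes it to a search function, and derives simple features from how the token appears in the returned snippets. This is not the paper's implementation: the function names (`build_query`, `snippet_features`, `fake_search`), the context window size, the two features, and the canned snippets are all assumptions made for illustration, and the search engine is replaced by a stub.

```python
from typing import Callable, Dict, List


def build_query(tokens: List[str], i: int, window: int = 1) -> str:
    """Join the target token with up to `window` neighbors on each side."""
    lo = max(0, i - window)
    hi = min(len(tokens), i + window + 1)
    return " ".join(tokens[lo:hi])


def snippet_features(
    tokens: List[str],
    i: int,
    search: Callable[[str], List[str]],
    window: int = 1,
) -> Dict[str, float]:
    """Derive illustrative features for tokens[i] from search snippets.

    The features (hit count, capitalization ratio) are assumptions chosen
    for this sketch, not the feature set used in the paper.
    """
    target = tokens[i].lower()
    snippets = search(build_query(tokens, i, window))
    # Collect every snippet word that matches the target, ignoring case
    # and trailing or leading punctuation.
    hits = [
        w
        for s in snippets
        for w in s.split()
        if w.strip(".,;:!?").lower() == target
    ]
    if not hits:
        return {"hits": 0.0, "cap_ratio": 0.0}
    capitalized = sum(1 for w in hits if w[0].isupper())
    return {"hits": float(len(hits)), "cap_ratio": capitalized / len(hits)}


def fake_search(query: str) -> List[str]:
    """Hypothetical stand-in for a search engine; returns canned snippets."""
    return [
        "Cheap flights to London from New York.",
        "Visit london this summer",
        "London hotels and flights",
    ]


# A lowercase web query: capitalization is missing in the query itself,
# but the snippets show how the token is usually written.
features = snippet_features(["flights", "london", "cheap"], 1, fake_search)
```

In this toy setup, features like these would be passed to an NER classifier alongside its usual features, so that evidence from the snippets can compensate for missing cues such as capitalization in the query.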




Full paper: http://www.aclweb.org/anthology/P/P11/P11-1097.pdf