Mining and Modeling Relations between Formal and Informal Chinese Phrases from Web Corpora
 

Zhifei Li and David Yarowsky
Department of Computer Science and Center for Language and Speech Processing
Johns Hopkins University, Baltimore, MD 21218, USA
zhifei.work@gmail.com and yarowsky@cs.jhu.edu
Abstract
We present a novel method for discovering and modeling the relationship between informal Chinese expressions (including colloquialisms and instant-messaging slang) and their formal equivalents. Specifically, we propose a bootstrapping procedure to identify a list of candidate informal phrases in web corpora. Given an informal phrase, we retrieve contextual instances from the web using a search engine, generate hypotheses of formal equivalents from this data, and rank the hypotheses using a conditional log-linear model. In the log-linear model, we incorpo-
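As an illustration of the ranking step only, the following is a minimal sketch of scoring candidate formal equivalents with a conditional log-linear model, p(f | i) proportional to exp(w · φ(i, f)), normalized over the candidate set for a given informal phrase i. The feature names, weights, and candidate labels here are hypothetical placeholders; the excerpt above does not specify the features or training procedure the authors use.

```python
import math
from typing import Dict, List, Tuple

Features = Dict[str, float]


def score(weights: Features, feats: Features) -> float:
    """Linear score w . phi; features without a weight contribute zero."""
    return sum(weights.get(name, 0.0) * value for name, value in feats.items())


def rank_candidates(
    weights: Features, candidates: Dict[str, Features]
) -> List[Tuple[str, float]]:
    """Return (candidate, probability) pairs sorted by descending probability.

    Probabilities are a softmax of the linear scores over the candidate set,
    which is the conditional log-linear form. The maximum score is subtracted
    before exponentiating for numerical stability.
    """
    if not candidates:
        return []
    scores = {cand: score(weights, feats) for cand, feats in candidates.items()}
    max_score = max(scores.values())
    exps = {cand: math.exp(s - max_score) for cand, s in scores.items()}
    z = sum(exps.values())
    ranked = [(cand, e / z) for cand, e in exps.items()]
    ranked.sort(key=lambda pair: pair[1], reverse=True)
    return ranked


if __name__ == "__main__":
    # Hypothetical weights and feature values, chosen only for illustration.
    example_weights = {"cooccurrence": 1.0, "edit_similarity": 2.0}
    example_candidates = {
        "formal_a": {"cooccurrence": 0.5, "edit_similarity": 0.9},
        "formal_b": {"cooccurrence": 0.8, "edit_similarity": 0.1},
        "formal_c": {"cooccurrence": 0.1},
    }
    for cand, prob in rank_candidates(example_weights, example_candidates):
        print(f"{cand}\t{prob:.3f}")
```

In a real system the weights would be learned from data rather than set by hand, and the candidate set would come from the hypothesis-generation step described in the abstract.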

  

Source: Amir, Yair - Department of Computer Science, Johns Hopkins University

 

Collections: Computer Technologies and Information Sciences