Chris Dyer's Homepage

I am a PhD student in the Department of Linguistics at the University of Maryland at College Park working with Philip Resnik. My primary focus is computational linguists (my interests include machine translation and computational phonology and morphology), but problems from a variety of other areas interest me as well.

I participated in the 2006 Summer Workshop at Johns Hopkins which produced the Moses open source toolkit for statistical machine translation.

I'm working with Jimmy Lin in CLIS on using large networks of computers (provided by IBM and Google-- thanks!) to process large amounts of natural language data efficiently. Read the hype!

I'm spending the summer at Google, working on machine translation.

For fun, I play cello. I also like to think about the cognitive aspects of music theory as well as the relationship of language to music.

Here's a somewhat up-to-date CV.

I recently discovered I have an Erdös number of 4.

Refereed Publications/Presentations

C. Dyer, A. Cordova, A. Mont, and J. Lin. Fast, Cheap, and Easy: Construction of Statistical Machine Translation Models with MapReduce. In Proceedings of the ACL-2008 Workshop on Statistical Machine Translation (WMT-2008), Columbus, Ohio, July 2008.

C. Dyer, S. Muresan, and P. Resnik. Generalizing Word Lattice Translation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), Columbus, Ohio, July 2008.

C. Dyer. The University of Maryland Translation System for IWSLT 2007. In Proceedings of the International Workshop on Spoken Language Translation (IWSLT) 2007, Trento, Italy, October 2007.

C. Dyer. The 'noiser channel': Translation from morphologically complex languages. In Proceedings of the ACL-2007 Workshop on Statistical Machine Translation (WMT-2007), Prague, Czech Republic, June 2007.

P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, C. Dyer, O. Bojar, A. Constantin and E. Herbst. Moses: Open Source Toolkit for Statistical Machine Translation. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), Companion Volume, Prague, Czech Republic, June 2007.

A. Omaki, C. Dyer, S. Malhotra, J. Sprouse, J. Lidz, & C. Phillips. The time-course of anaphoric processing and syntactic reconstruction. Presented at CUNY 2007, San Diego, March 2007.

Other talks

C. Dyer and J. Lin. Computing in the Clouds: Applications of MapReduce in "Web-Scale" Information Processing. Presented at the University of Maryland CLIP Colloquium, October 17, 2007.

C. Dyer. Decoder guided backoff: using word lattices to improve translation from morphologically complex languages. Presented at the MT Marathon, Edinburgh University, April 2007.

C. Dyer and P. Resnik. Confusion network decoding in Hiero. Presented by P. Resnik at GALE 2007 PI Meeting, San Fransisco, March 2007.

C. Dyer. The Syntax of Tonal Music, Syntax Lunch Talk, University of Maryland, October 2006.

Software

Here's a rather tricky Telugu Morphological Analyzer.

Here are my patches for GIZA++ and MkCls so they compile under GCC 4.1

A Java implementation of the Buckwalter Morphological Analyzer v1 (GPL License, data (c) 2001 by the LDC).

Chinese place names (Chinese (utf-8) | English) which were mined from some huge lists at Wikipedia (URL list).


Odds and Ends

As a graduate student, one wears many hats. In this case, it was the chauffeur's hat.
Me with the Father of Linguistics

blog stats