Bibliography for Statistical Alignment and Machine Translation

1 Alignment models
2 Translation models
3 Preprocessing and complementary applications
4 Decoding
5 Related tasks
6 Useful methods and algorithms
  Bibliography


1 Alignment models


2 Translation models


3 Preprocessing and complementary applications

Statistical Machine Translation on Paraphrased Corpora
(Taro Watanabe)
Sonja Niessen's papers
Using POS Information for SMT into Morphologically Rich Languages
[Ueffing and Ney 2003] When translating from English to Spanish or Catalan, the generation of the correct verb fullform is especially difficult because the english verb forms contain fewer information. To cope with this, the english pronouns, modals and verbs are identified with POS tagging and grouped as ``new'' fullforms (ex: ``you will have'' ``you_will_have''). In questions, the order of verbal forms is inversed before forming the groups.


4 Decoding


5 Related tasks


6 Useful methods and algorithms


Bibliography

Lars Ahrenberg, Mikael Andersson, and Magnus Merkel. 1998. A simple hybrid aligner for generating lexical correspondences in parallel texts. In Proc. of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (COLING-ACL'98), pages 29-35, Montreal, Canada, August 10-14. PDF

Oliver Bender, Klaus Macherey, Franz Josef Och, and Hermann Ney. 2003. Comparison of alignment templates and maximum entropy models for natural language understanding. In Proc. of the 10th Conference of the European Chapter of the ACL (EACL), Budapest, Hungary, April. PS

Adam Berger, Stephen Della Pietra, and Vincent Della Pietra. 1996. A maximum entropy approach to natural language processing. Computational Linguistics, 22(1):39-72, March. PS

Peter F. Brown, Stephen A. Della Pietra, Vincent J. Della Pietra, and R. L. Mercer. 1993. The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics, 19(2):263-311. PDF

Francisco Casacuberta, Enrique Vidal, and Juan Miguel Vilar. 2002. Architectures for speech-to-speech translation using finite-state models. In Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems, pages 39-44, Philadelphia, July. PDF

Colin Cherry and Dekang Lin. 2003. A probability model to improve word alignment. In Proc. of the Annual Meeting of the Association for Computational Linguistics. PDF PS

George Foster. 2000. Incorporating position information into a maximum entropy/minimum divergence translation model. In Proc. of CoNLL-2000 and LLL-2000, pages 37-42, Lisbon, Portugal. PS

Ismael García Varea, Franz Josef Och, Hermann Ney, and Francisco Casacuberta. 2002. Improving alignment quality in statistical machine translation using context-dependent maximum entropy models. In Proc. 19thInt. Conf. on Computational Linguistics, pages 1051-1054, Taipei,Taiwan. PS.GZ

Philipp Koehn and Kevin Knight. 2002. Chunkmt: Statistical machine translation with richer linguistic knowledge. Draft. Unpublished. PS

Philipp Koehn, Franz Josef Och, and Daniel Marcu. 2003. Statistical phrase-based translation. In Proc. of the Annual Meeting of the Association for Computational Linguistics. PDF PS

John Lafferty. 1996. Gibbs-markov models. Computing Science and Statistics, 27:370-377. PS

Robert Malouf. 2002. A comparison of algorithms for maximum entropy parameter estimation. In Proceedings of the Sixth Conference on Natural Language Learning (CoNLL), pages 49-55 PDF

Daniel Marcu and William Wong. 2002. A phrase-based, joint probability model for statistical machine translation. In Proc. of the Conference on Empirical Methods in Natural Language Processing, Philadelphia, PA, July 6-7. PDF

Franz Josef Och and Hermann Ney. 2002. Dicriminative training and maximum entropy models for statistical machine translation. In Proc. of the Annual Meeting of the Association for Computational Linguistics, pages 295-302, Philadelphia, PA, July. PS

Franz Josef Och and Hermann Ney.2003. A systematic comparison of various statistical alignment models. Computational Linguistics, 29(1):19-51, March.

Franz Josef Och, Christoph Tillmann, and Hermann Ney. 1999. Improved alignment models for statistical machine translation. In Proc. of the Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, pages 20-28, University of Maryland, College Park, MD, June. PS

K. A. Papineni, S. Roukos, and R. T. Ward. 1998. Maximum likelihood and discriminative training of direct translation models. In Proc. Int. Conf. on Acoustics, Speech, and Signal processing, pages 189-192, Seattle, WA, May. PDF

Jörg Tiedemann. 2003. Combining clues for word alignment. In Proc. of the 10th Conference of the European Chapter of the ACL (EACL), Budapest, Hungary, April 12-17. PDF PS

Kristina Toutanova, H. Tolga Ilhan, and Christopher D. Manning. 2002. Extensions to hmm-based statistical word alignment models. In Proc. of the Conference on Empirical Methods in Natural Language Processing, Philadelphia, PA, July 6-7. PDF

Nicola Ueffing and Hermann Ney. 2003. Using pos information for statistical machine translation into morphologically rich languages. In Proc. of the 10th Conference of the European Chapter of the ACL (EACL), Budapest, Hungary, April. PS

Stephan Vogel, Hermann Ney, and Christoph Tillmann. 1996. Hmm-based word alignment in statistical translation. In COLING'96: The 16thInt. Conf. on Computational Linguistics, pages 836-841, Copenhagen, Denmark, August. PS

Ye-Yi Wang and Alex Waibel. 1998. Modeling with structures in statistical machine translation. In Proc. of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Montreal, Canada. PS.GZ

Taro Watanabe, Kenji Imamura, and Eiichiro Sumita. 2002. Statistical machine translation based on hierarchical phrase alignment. In Proc. of the 9th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI), pages 188-198, Keihanna, Japan, March. PDF

Kenji Yamada and Kevin Knight. 2001. A syntax-based statistical translation model. In Proc. of the Annual Meeting of the Association for Computational Linguistics, Toulouse, France. PS

Richard Zens, Franz Josef Och, and Hermann Ney. 2002. Phrase-based statistical machine translation. In Springer Verlag, editor, Proc. German Conference on Artificial Intelligence (KI), september. PS

Ying Zhang. unpublished. Integrated phrase segmentation and alignment model for statistical machine translation.


Adrià de Gispert        
Patrik Lambert        
Last update: July 31, 2003