Preface; Part I. Foundations: 1. Introduction; 2. Words, sentences, corpora; 3. Probability theory; Part II. Core Methods: 4. Word-based models; 5. Phrase-based models; 6. Decoding; 7. Language models; 8. Evaluation; Part III. Advanced Topics: 9. Discriminative training; 10. Integrating linguistic information; 11. Tree-based models; Bibliography; Author index; Index.
This first textbook on statistical machine translation shows students and developers how to build an automatic language translation system.
Philipp Koehn is a lecturer in the School of Informatics at the University of Edinburgh. He is the scientific co-ordinator of the European EuroMatrix project and also involved in research funded by DARPA in the USA. He has also collaborated with leading companies in the field, such as Systran and Asia Online. He implemented the widely used decoder Pharoah, and is leading the development of the open source machine translation toolkit Moses.
'Philipp Koehn has provided the first comprehensive text for the
rapidly growing field of statistical machine translation. This book
is an invaluable resource for students, researchers, and software
developers, providing a lucid and detailed presentation of all the
important ideas needed to understand or create a state-of-the-art
statistical machine translation system.' Robert C. Moore, Principal
Researcher, Microsoft Research
'The book primarily represents an ideal introduction to the field
of statistical machine translation, but also tackles many of the
recent results in this area. It is the product of the many years of
both active research and extensive teaching of the author ... Each
chapter is additionally endowed with a summary, further reading and
exercises, achieving thus completely the proposed goal of an
accessible introduction to the statistical machine translation
field. Apart from its formative role for beginners, the book also
stands as a complete guide for researchers in a domain of high
interest and rapid expansion ... For all these reasons, this book
should be welcomed as a highly valuable publication.' Zentralblatt
MATH
'... Statistical Machine Translation provides an excellent
synthesis of a vast amount of literature (the bibliography section
takes up 45 double-column pages) and presents it in a
well-structured and articulate way. Moreover, the book has been
class-tested and contains a set of exercises at the end of each
chapter, as well as numerous references to open source tools and
resources which enable the diligent reader to build MT systems for
any language pair.' Target: International Journal of Translation
Studies
![]() |
Ask a Question About this Product More... |
![]() |