Erasmus-Mundus Course on Machine Translation Course

Melbourne University, Nov 2011

Automatic translation between human languages has become an invaluable technology for enabling communication and access to textual data on the internet. High profile companies, e.g., Google, have demonstrated its utility by providing high quality translation services between many different languages. This course will cover the technologies behind the modern statistical approach to machine translation, as used in Google's system. Statistical machine translation takes a data-driven, machine learning view of the problem, seeking to learn how to translation purely from data and with minimal human input. The course will cover the predominant approaches to machine translation -- word-based, phrase-based and grammar-based translation -- and their accompanying learning and decoding algorithms.

This course will run for 5 days, with lectures from 2-5 or thereabouts.

Materials

The materials used in the lectures and the practical sessions are available for download below These slides are based on a number of other courses and tutorials on SMT, particularly Adam Lopez's ESSLLI course and other sources as noted above.

Assignment

Those taking the course for credit will need to do a small assignment.