11-733: Multilingual Speech-to-Speech Translation Lab/Seminar
Department: Language Technologies Institute (LTI)
Units: 6
Semester: Fall
Instructors: Alan Black, Stephan Vogel
Prerequisites: Instructor consent
Course description:
Building speech-to-speech translation (S-2-S) systems is an extremely
complex task, involving research in Automatic Speech Recognition (ASR),
Machine Translation (MT), Natural Language Understanding (NLU), and
Text-to-Speech (TTS); doing this for many languages makes it harder
still. Although substantial progress has been made in each
of these areas in recent years, integrating the individual
ASR, MT, NLU, and TTS components into a good S-2-S system is still
a very challenging task.
The seminar course on Multilingual Speech-to-Speech Translation will
cover important recent work in the areas of ASR, MT, NLU, and TTS,
with a special focus on language-portable approaches, and will discuss
solutions for rapidly building state-of-the-art speech-to-speech
translation systems.
In the first sessions, the instructors and invited lecturers
will give a brief introduction to the broad field. We will select
papers on particular topics to read each week. While everyone will
do all the readings and participate in the discussions, one person is
assigned per session to present the basic ideas of the topic-specific
papers and lead the concluding discussion.