(DE) (FR) (EN) – Microsoft Speech Language Translation (MSLT) Corpus | Microsoft


microsoft-gray.png

The Microsoft Speech Language Translation Corpus release contains conversational, bilingual speech test and tuning data for English, French, and German collected by Microsoft Research. The package includes audio data, transcripts, and translations and allows end-to-end testing of spoken language translation systems on real-world data. All data contained in this release has been created using a non-public version of Skype Translator. NO PRIVATE USER DATA HAS BEEN COLLECTED OR RELEASED. Instead we hired consultants to have loosely constrained conversations, giving them a list of predefined topics to talk about and a few related questions to start the conversations. Topical constraints were loosely enforced so as to …


via Microsoft Download Center

Advertisements

Leave a comment

Please log in using one of these methods to post your comment:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s