Obtaining the current release of the TDT2 Corpus

The TDT2 Text and Audio collections can be obtained by contacting Ilya Ahtaridis via email (ldc@ldc.upenn.edu), telephone (215-898-0464) or fax (215-573-2175).

The current release of the TDT2 Text corpus (English and Mandarin) is Version 4.0, (catalog number LDC2001T58). It is provided for free to those who have a 2001 LDC membership; non-members may purchase the text corpus for US$500.

Note that the TDT2 English Audio collection is quite large -- 73 cdroms (catalog number LDC99S84). Because of its size, the LDC must charge non-profit members a media fee to help cover the cost of cdrom replication; the fee is US$1460 for the full set. Non-members may purchase the full set for US$14,600.

The TDT2 Mandarin Audio collection is also available -- it is only 6 cdroms, and is available to LDC members at no additional cost (catalog number LDC2001S93). Non-members may purchase this data set for US$1200.

March, 2002


Comments to: graff@ldc.upenn.edu
Last Modified: Mar 7 13:18:03 EST 2002