To: Sheera_Knecht@dragonsys.com
From: Stephanie Strassel <strassel@ldc.upenn.edu>
Subject: Re: TDT2 topics with no stories
Date: Tue, 15 Aug 2000 11:27:47 -0400
Sheera,
While the LDC did perform complete annotation on all 100 topics for TDT2
in English, and for a subset of 20 topics in Mandarin, it is true that
no on-topic stories were found for the 4 topics you mention (and that
the seed stories that generated these 4 topics were also excluded from
the corpus after topic selection). Therefore, there are no on-topic
stories in the TDT2 corpus for topics 20003, 20045, 20049 and 20051.
Stephanie
Sheera_Knecht@dragonsys.com wrote:
>
> Chris:
>
> I seem to be finding that 4 topics (20003, 20045, 20049 and 20051) have no
> on-topic stories designated in either English or Mandarin for the entire
> TDT2 6 month time frame (jan-june).
> Is that correct? We'd like to know if all 100 topics (20001-20100) were
> annotated, at least for English, for TDT2?
>
> Thanks,
> Sheera
--
Stephanie Strassel
Linguistic Data Consortium
strassel@ldc.upenn.edu
Last updated Tue Sep 19 14:30:58 2000