(090) previous ~ index ~ next
To: tdt-distrib@unagi.cis.upenn.edu
From: David Graff <graff@unagi.cis.upenn.edu>
Subject: Problems in Mandarin topic_relevance.table
Date: Mon, 03 May 1999 14:58:59 EDT
Folks,
In the TDT2 Mandarin data release that I announced last Friday, it
turns out that the topic_relevance.table file (in the "tables"
directory) has a couple of serious problems.
I will fix these problems this afternoon, and post a correct version
of the table on our TDT web page (I'll send explicit instructions for
finding it once it's ready).
First, most of the topics in the original table were not numbered
correctly -- the table was created using sequential topic numbers
from 1 to 20, but those numbers need to be changed so that they match
the original TDT2 topic-ids (ranging between 1 and 100) that were
selected for annotating the mandarin data.
Second, there were 54 hits identified by annotators that somehow were
not included in the table -- including all five hits on one of the
topics.
Please DO NOT USE the topic_relevance.table file that was included in
last Friday's release.
I apologize for the inconvenience and confusion.
Dave Graff
(090) previous ~ index ~ next
Last updated Thu May 13 09:28:24 1999