(190) previous ~ index ~ next
To: David Graff <graff@unagi.cis.upenn.edu>
From: James Allan <allan@cs.umass.edu>
Subject: Re: Plans for full train+devtest TDT release
Date: Fri, 02 Oct 1998 15:40:59 -0400
Dave,
I think your release plan is fine. However, I fail to see how it
satisfies one of your goals:
> Another objective is to make sure that there be no confusion about how
> to partition the data into training and testing sets -- both in terms
> of the overall division of the corpus, and in terms of the "per-topic"
> cut-off points (based on "Nt-max") for the tracking task.
How is the overall division of the corpus clear in the asrtext
directory, for example?
-- james
(190) previous ~ index ~ next
Last updated Fri Oct 2 19:04:21 1998