To: tdt-distrib@unagi.cis.upenn.edu
From: Jon_Yamron@Dragonsys.com
Subject: Re: Clarification of tracking
Date: Thu, 5 Nov 1998 17:08:16 -0500
OK, I can use anything I find automatically, in the specified "non-topic"
stories and in the January-April data, either to train my topic model or my
background model. I would point out that this kind of messes up the idea
of demonstrating the effect of having different amounts of training data
(e.g., for many events, we may see no difference in performance for Nt=1,
2, 4, 8, or 16, because we are able to automatically extract many useful
examples from prior data, while for other events the differences will be
large). But if you don't care, I don't care.
Also, as far as I can tell the eval spec doesn't say anything about a
deferral for tracking, does it?
- Jon
James Allan <allan@cs.umass.edu> on 11/05/98 01:30:24 PM
To: Jaime Carbonell <jgc@NL.CS.CMU.EDU>
cc: Jon Yamron/Dragon Systems USA, tdt-distrib@unagi.cis.upenn.edu
Subject: Re: Clarification of tracking
Attacking a fly....
> untouched by human hands. In other words, we may use any test material
> up to and including the story on which we must make a judgement. But
> not next day stories or next month stories. The latter would allow
Tracking has a lookahead component, too. The contents of any story up
to the end of the lookahead component is valid information. If my
delay is 0, then I can use all of the story information in that "file"
(i.e., that half-hour of news). If I have a lookahead of 10, I get a
bunch more data to look at.
-- james
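
The lookahead rule described above can be sketched in a few lines. This is only an illustration, not anything taken from the eval spec: the `Story` type, the `file_index` field, and the `visible_stories` function are hypothetical names, and the sketch assumes the lookahead is counted in source files (one "file" being one half-hour of news), as the message suggests.

```python
from typing import List, NamedTuple


class Story(NamedTuple):
    story_id: str
    file_index: int  # assumed: position of the story's source file in time order


def visible_stories(stories: List[Story], current: Story, lookahead: int = 0) -> List[Story]:
    """Return the stories whose contents may be used when judging `current`.

    Assumption: with lookahead 0, everything up to the end of the current
    file is usable; each extra unit of lookahead admits one more file.
    """
    last_file = current.file_index + lookahead
    return [s for s in stories if s.file_index <= last_file]


if __name__ == "__main__":
    stories = [
        Story("a", 0), Story("b", 0), Story("c", 1),
        Story("d", 11), Story("e", 12),
    ]
    # Delay 0: only the stories in the same file as the one being judged.
    print([s.story_id for s in visible_stories(stories, stories[0], 0)])
    # Lookahead 10: stories from the next ten files become usable as well.
    print([s.story_id for s in visible_stories(stories, stories[2], 10)])
```

Under these assumptions, a story from the next day or the next month stays out of reach unless the lookahead window happens to extend that far.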
Last updated Fri Nov 6 15:29:23 1998