(199) previous ~ index ~ next

To: <tdt-distrib@unagi.cis.upenn.edu>
From: "Sreenivasa Sista" <ssista@bbn.com>
Subject: data problems!
Date: Tue, 28 Sep 1999 16:17:04 -0400

Hi TDTers,

While searching for on-topic stories with very low scores, I found that
on-topic training stories from mtas0 and mttkn that share the same DOCNO
discuss different topics. The two excerpts below show one such case. I'm not
sure whether there are more such stories in the corpus.
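
One rough way to look for more such cases is to compare the vocabulary of
the two versions of each DOCNO and flag pairs with almost no words in common.
The sketch below is only an illustration under stated assumptions: it assumes
the story text has already been extracted into <DOCNO>/<TEXT> form like the
excerpts in this message, and the function names and the 0.1 threshold are
hypothetical choices, not part of any TDT tool.

```python
import re

# Matches one story: its DOCNO followed by its TEXT body.
# Assumes the layout shown in the excerpts in this message.
DOC_RE = re.compile(
    r"<DOCNO>\s*(\S+)\s*</DOCNO>\s*<TEXT>(.*?)</TEXT>", re.DOTALL
)


def parse_docs(sgml):
    """Map each DOCNO to the set of lowercased word tokens in its TEXT."""
    docs = {}
    for docno, text in DOC_RE.findall(sgml):
        docs[docno] = set(re.findall(r"[a-z']+", text.lower()))
    return docs


def jaccard(a, b):
    """Jaccard overlap of two token sets (1.0 if both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def suspicious_docnos(mtas0_sgml, mttkn_sgml, threshold=0.1):
    """Return (DOCNO, overlap) for shared DOCNOs whose overlap is below threshold.

    The threshold is a guess; it would need tuning on real data.
    """
    a = parse_docs(mtas0_sgml)
    b = parse_docs(mttkn_sgml)
    flagged = []
    for docno in sorted(a.keys() & b.keys()):
        score = jaccard(a[docno], b[docno])
        if score < threshold:
            flagged.append((docno, score))
    return flagged
```

Two correct versions of the same story will still differ in wording, so a low
overlap is only a hint that the pair deserves a manual look, not proof of a
mismatch.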


---------------------------------------------------

trk_SRC=nwt+bnasr_TRAIN:SL=mul,CL=eng_TEST:SL=mul,CL=eng_TOPIC=20076.ndx:

# Topic_training_story VOM19980302.0900.0564
mtas0/19980302_0900_1000_VOA_MAN.mtas0 1217 1270

<DOCNO> VOM19980302.0900.0564 </DOCNO>
<TEXT>
Rescues official says. Because autumn harvests grain and paddy rice quickly
ate. Therefore spring is always most difficult time. North Han and Korea
Red Cross official's dialogue. Has fallen into deadlock since
last year December. One Korea controls department. Red Cross official says.
Though north Han's tendency is stern. No matter what this tissue.
</TEXT>
</DOC>

-----------------------------------------------------

trk_SRC=nwt+bnman_TRAIN:SL=man,CL=eng_TEST:SL=mul,CL=eng_TOPIC=20076.ndx:

# Topic_training_story VOM19980302.0900.0564
mttkn/19980302_0900_1000_VOA_MAN.mttkn 1218 1263

<DOCNO> VOM19980302.0900.0564 </DOCNO>
<TEXT>
In Indonesia capital Jakarta, several hundred students held short protest
activity. Appeal carries on politics and economics reform. Students
held short time assembly, one storm forced them to be terminated this
time protest activity. Although authority arranged police, but
they had not certainly carried on intervention.
</TEXT>
</DOC>

------------------------------------------------------

Sreenivasa Sista
ssista@bbn.com



Last updated Wed Sep 29 12:12:22 1999