High Accuracy Retrieval from Documents (HARD) 2004

The objective of the HARD 2004 project is to achieve high accuracy retrieval from documents by leveraging additional information about the searcher and/or the search context, through techniques such as passage retrieval, and using very targeted interaction with the searcher. HARD is in its second year, and is being run as an evaluation track within TREC, the Text REtrieval Conference sponsored by NIST.

The manner in which HARD researchers will leverage additional information about the searcher is through the use of "Metadata", which in this case means that potential searchers will select an array of specific values that will further refine their initial queries. LDC annotators will be responsible for topic creation, and will use the Topic Creation Web Interface to do so.

Project Links

HARD 2004

Evaluation Topics
Topics developed and annotated for the 2004 HARD evaluation

Annotation Guidelines
Guidelines for 2004 are available in .pdf format.

Project Timeline
Visit this page for more information about progress and deliverables.

Intranet HARD page
Annotators: Visit this page for more information about creating and submitting topics

Clarification Form Submission Instructions
Instructions on submitting clarification forms

HARD 2003 (Pilot)

Evaluation Topics
Topics developed and annotated for the 2003 HARD evaluation

HARD 2003
For more information about the first year of HARD, visit the HARD 2003 webpage.

2003 Guidelines
HARD 2003 annotation guidelines in pdf format.

HARD Project Overview
HARD project overview and guidelines for researchers at the University of Massachussetts

NIST's website

LDC's TIDES page
This site provides an overview of all TIDES projects that LDC supports, in addition to a link to the TIDES data matrix.



About LDC | Members | Catalog | Projects | Papers | LDC Online | Search / Help | Contact Us | UPenn | Home | Obtaining Data | Creating Data | Using Data | Providing Data

Contact ldc@ldc.upenn.edu
Last modified: Friday, 09-July-2004 13:24:55 EDT
© 1996-2000 Linguistic Data Consortium, University of Pennsylvania. All Rights Reserved.