Text Analysis Conference Knowledge Base Population (TAC KBP)
The Text Analysis Conference (TAC) is a series of evaluations and workshops organized to promote research in Natural Language Processing and related applications by providing a large test collection, common evaluation procedures, and a forum for organizations to share their results. The goal of TAC KBP is to develop and evaluate technologies for building and populating knowledge bases (KBs) about named entities from unstructured text. KBP systems must either populate an existing reference KB or build a KB from scratch.
Since 2009, LDC has supported TAC KBP through the development of source data and annotations, and by contributing to task development activities and discussions each year. In 2013, LDC is supporting six KBP tracks, all aimed at improving the ability to automatically populate knowledge bases from text:
- Entity Linking
  The entity linking task is to link names in a document collection to entities in a reference KB, or to new named entities discovered in the document collection. Tasks are offered in English, Spanish, and Chinese.
- English Slot Filling
  The slot filling task is to search a document collection to fill in values for predefined slots (attributes) for a given entity in a reference KB.
- Temporal Slot Filling
  The goal of temporal slot filling is to augment a KB with temporal constraints on slot filling relations.
- Cross-lingual Spanish Slot Filling
  The Spanish slot filling track extends English slot filling to the cross-lingual paradigm, in which English and Spanish documents are used to populate an English reference KB.
- Sentiment Slot Filling
  The overall goal of the Sentiment Slot Filling track is to assess the quality of detectors for scoped and attributed sentiment.
- Cold Start KBP
  The Cold Start track integrates entity linking and slot filling to build a knowledge base from scratch.
Additional Information
Xuansong Li, Stephanie M. Strassel, Heng Ji, Kira Griffitt, Joe Ellis
Linguistic Resources for Entity Linking Evaluation: from Monolingual to Cross-lingual
LREC 2012: 8th International Conference on Language Resources and Evaluation, Istanbul, May 21-27
Available: Paper in PDF

Paul McNamee, Hoa Trang Dang, Heather Simpson, Patrick Schone and Stephanie M. Strassel
An Evaluation of Technologies for Knowledge Base Population
LREC 2010, In Proceedings of the Seventh conference on International Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Paper in PDF

Heather Simpson, Stephanie Strassel, Robert Parker, Paul McNamee
Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population
LREC 2010, In Proceedings of the Seventh conference on International Language Resources and Evaluation. Valletta, Malta, May 2010
Available: Slides in PDF