Coreference resolution is a key ingredient for the automatic
interpretation of text. It has been studied mainly from a linguistic
perspective, with an emphasis on establishing potential antededents
for pronouns. Practical applications, such as Information Extraction
(IE), summarization and Question Answering (QA), require accurate
identification of coreference relations between noun phrases in
general. Computational systems for assigning such relations
automatically, require the availability of a sufficient amount of
annotated data for training and testing. For Dutch, annotated data is
scarce and coreference resolution systems are lacking.
In this COREA project, a two-year project which started in July 2005,
we aim to develop a robust system for assigning such relations
automatically, and we will investigate the effect of making
coreference relations explicit on the accuracy of systems for for IE
and QA. We will annotate a limited amount of application-specific
corpus material, which is required for the evaluation of the
coreference resolution system in the context of IE and QA.
|