Katja Markert's Research
My research interests centre on computational linguistics.
More specifically, I am interested in using knowledge-based
techniques for the automatic treatment of problems in semantics
and discourse. I am always interested in bright new PhD students with
an NLP or good computer science/mathematical background.
Recent and ongoing research includes:
- Anaphora Resolution: I currently work on the recognition of information status as well as the resolution of bridging, together with
the Heidelberg Institute for Theoretical Studies,
which I visited for a year in 2011-2012 as an Alexander von Humboldt fellow.
Previously, in collaboration with Malvina Nissim and Natalia
Modjeska, I used simple web queries for bridging the knowledge gap
in the resolution of non-pronominal anaphora. We have integrated a
feature into a machine learning algorithm for other-anaphora and
web-based algorithms for the resolution of definite NP
coreference as well.
- Automatic treatment of regular polysemy and metonymy. In the
Mascara project, Malvina
and I explored supervised and unsupervised machine learning techniques
for metonymy recognition. We also produced an annotation scheme and
developed a freely available corpus annotated for metonymies, which is
available from my Data page.
Our framework for annotation and learning has been adapted for German at the
University of Hagen as well as for French. The Business School at the
University of Leeds has used our data to research
the conceptualisation via metaphor and metonymy of organisations. We
also organised a
metonymy recognition competition in conjunction with SemEval 2007.
- I am interested in opinion mining. Together with Fangzhong Su, I
developed a system to recognise subjective and objective
senses of words in WordNet. Results have been published in COLING 2008,
NAACL 2009 and NAACL 2010. The gold standard data we used is available
from my Data page.
Soon we will make all of WordNet with sentiment annotations available.
- My students Andrew McKinlay and Amal Al-Saif are working on
discourse relations, the former on the recognition of discourse
relations for English, the latter for Arabic. As part of this work we
developed the Leeds Arabic Discourse Treebank, the first corpus with
annotated discourse relations for Arabic. This work was funded by the
British Academy. The Treebank will be available soon via the Linguistic Data Consortium.
- Google WebDoc: Together with Serge Sharoff and Zhili Wu,
we work on the automatic recognition of genre for web texts. This work
was funded by a Google research award.
- The recognition of textual entailment and textual inference. Johan Bos and I
participated in the Textual Entailment competitions organised in 2005 and 2006 and
developed a system for the automatic recognition of
textual entailment that integrates deep and shallow semantic analysis
within a machine learning framework. The part of the system performing
deep semantic analysis can be downloaded on the
- Entity Recognition: I was involved in the SEER project at the
University of Edinburgh and Stanford University, which has the generalization of
entity recognition tasks as its goal.
Please visit my publication page for
my research results.
created 1994-11-04, last modified 2008-23-06