Absract:

The success of knowledge extraction depends on many factors, such as the domain, the representativity of the corpus and the availability quallity of the linguistic resources, tools and algorithms. In this talk I will discuss various aspects of nlp-based information extraction and knowledge modelling in different domains, with an emphasis on the legal domain. I will describe how certain types of information can be gathered from text, whereby both manual and automatic text analysis guides the modelling. The results can be e.g. transferred into a thesaurus, re-engineered into an ontology, or used for ontology enrichment.

The Messy Quest of Knowledge

Dr. Wim Peters

Natural Language Processing Group, Department of Computer Science, University of Sheffield