NATURAL LANGUAGE PROCESSING research group
Language research in Computing is also known as
Natural Language Processing ,
Computational Linguistics ,
Corpus Linguistics ,or
Language Engineering .
Central to our research is the computational modelling of language data; a
CORPUS is a text dataset representative of the language
to be analysed. Our research at Leeds University focusses on bootstrapping
linguistic knowledge and resources from text, and is reported in our
Language research graduates have gone on to work in Web search, text analytics, translation and language consulting, online news, voice-to-text, the Search for Extra-Terrestrial Intelligence, and, of course, as University academics !
Eric Atwell, Associate Professor|
Corpus Linguistics, Data Mining with text in English, Arabic, and other languages; Text Analytics applied to Understanding the Quran, detecting terrorist activites, and Healthcare patient records. PUBLICATIONS.
Katja Markert, Reader (on sabbatical summer 2013)|
Data-intensive, corpus-based and web-based natural language processing, Anaphora Resolution, Figurative Language Resolution, Textual Entailment, Sentiment Analysis. PUBLICATIONS.
Claire Brierley, Senior Research Fellow|
Corpus Linguistics, Detecting terrorist activites, e-Health patient records, Prosody resources. PUBLICATIONS.
Owen Johnson, Senior Fellow|
Corpus linguistics for Health Informatics, Healthcare patient records research methods. PUBLICATIONS.
Majdi Sawalha, Research Fellow|
Web-as-Corpus for Islamic Studies; Arabic morphological analysis. PUBLICATIONS.
Justin Washtell, Research Fellow|
Corpus-based distributional models of lexical semantics PUBLICATIONS.
|Saman Hina, Research Fellow|
We collaborate with academic staff in the Centre for Translation Studies :
Evaluation of machine translation systems, Controlled languages, Natural Language Generation, Quality in translation and interpreting, Computer Supported Collaborative Working. PUBLICATIONS
Corpus linguistics, Natural Language Understanding, Natural Language Generation, Lexical semantics, Systemic-Functional grammar. PUBLICATIONS
R-to-L: Eric Atwell, Majdi Sawalha, Justin Washtell, Claire Brierley, Fangzhong Su, Josiah Wang, Owen Tregurtha Nancarrow - at the German Christmas Market next to Leeds University
|STUDENT +Supervisor(s)||RESEARCH TOPIC|
|Amal Alsaif (KM)||Human and automatic annotation of discourse relations for Arabic|
|Samuel Danso (EA,OJ)||Text Analytics to Predict Cause of Death in Verbal Autopsies|
|Kais Dukes (EA)||Arabic Language Computing Applied to the Quran|
|Saman Hina (EA,OJ)||SNOMED semantic tagger for medical corpus linguistics|
|Jaafar, Juliana (OJ,EA,SC)||Text Analytics and Data Mining for e-Health Informatics|
|Andrew McKinlay (KM)||Automatic Detection of Discourse Structure: Relation and Entity Graphs|
|Noushin Rezapour Asheghi (SS,KM)||Genre classification of web pages|
|Abdul-Baquee Muhammad Sharaf (EA)||A Computational Model for Knowledge Representation of the Quran|
|Josiah Wang (ME,KM)||Learning Visual Object Recognition from Text|
|Justin Washtell (EA,KM)||The benefits of proximity as opposed to frequency as a basis for modelling language|
Kais Dukes - University of Leeds Engineering Postgraduate Researcher of the year
Part-time fixed-term (20% for 3 months only)
Web Content Managers
Potential research collaborators and PhD students are very welcome to contact any of the academic staff (Atwell, Markert, Hartley, and Sharoff). Please send us an outline project proposal (see guidelines and example project ideas). You can apply for our PhD programme online.
Selected Publicationsor you can select a fuller list
|Brierley, C; Atwell, ES Non-Traditional Prosodic Features for Automated Phrase-Break Prediction. Literary and Linguistic Computing Journal, vol. 26, pp.279-284. 2011.||LINK|
|Dukes, K; Atwell, ES; Habash, N Supervised Collaboration for Syntactic Annotation of Quranic Arabic. Language Resources and Evaluation Journal, pp.1-30. 2011.||LINK|
|Rapp, AM; Erb, M; Langohr, K; Markert, K Neural Correlates of Metonymy Comprehension in Schizophrenia in: Schizophrenia Bulletin, vol. 37, pp.150-150. Oxford University Press. 2011.|
|Brierley, C; Atwell, ES Holy smoke: vocalic precursors of phrase breaks in Milton's Paradise Lost. Literary and Linguistic Computing Journal, vol. 25, pp.137-151. 2010.||LINK
|Hina, S; Atwell, ES; Johnson, O Semantic Tagging of Medical Narratives with Top Level Concepts from SNOMED CT Healthcare Data Standard. International Journal of Intelligent Computing Research (IJICR), vol. 1, pp.118-123. 2010.||LINK|
|Markert, K; Nissim, M Data and models for metonymy resolution. Language Resources and Evaluation, vol. 43, pp.123-138. 2009.|