School of Computing

FACULTY OF ENGINEERING

 

ALUMNI (+Supervisor)

Claire Brierley (EA)Prosody resources and symbolic prosodic features for automated phrase break prediction (PDF). 2011
Owen Tregurtha Nancarrow (EA)A comparative study of the tagging of adverbs in modern English corpora. 2011
Majdi Shaker Sawalha (EA)Open-source resources and standards for Arabic word structure analysis: fine grained morphological analysis of Arabic text corpora (PDF). 2011
Fangzhong Su (KM,EA) Computational modelling of word sense sentiment. 2011
Noorhan Abbas (EA)Qurany: A Tool to Search for Concepts in the Quran (PDF). 2009
Andrew Roberts (EA)
Grammatical Inference and Corpus Linguistics (PDF). 2008
Eric
Atwell

Corpus Linguistics and Language Learning: Bootstrapping Linguistic Knowledge and Resources from Text (PDF). 2008
Debra
Elliott
(EA,AH)
Corpus-based machine translation evaluation via automated error detection in output texts. 2007
Bayan
Abu Shawar
(EA)
A Corpus Based Approach to Generalise a Chatbot System (PDF). 2005
Bogdan(AH,EA)
Babych
Information Extraction techniques in Machine Translation. 2005
Mandy
Schiffrin
(CS)
Modelling Speech Acts in Conversational Discourse (PDF). 2005
John
Elliott (EA)
Natural Language Learning for the Search for Extraterrestrial Intelligence. 2004
Latifa
Al-Sulaiti
(EA)
Designing and Developing a Corpus of Contemporary Arabic (PDF). 2004
Toshifumi
Oba
(EA)
Using the HTK Speech Recogniser to Analyse Prosody in a Corpus of German Spoken Learner's English (PDF). 2003
Xiao Yuan
Duan
(EA)
Lexical Semantic Association Between Web Documents (PDF). 2002
Menno
van Zaanen
(RB)
Bootstrapping Structure into Language: Alignment-Based Learning (PDF). 2002
Xuegang
Wang
(PM)
Negation in logic and deductive databases (PDF). 2000
George
Demetriou
(EA,CS)
Lexical semantic information processing for large vocabulary human-computer speech communication. 1997
Clive
Souter (EA)
A corpus-trained parser for systemic-functional syntax (PDF). 1996
Adam
Bull
(EA)
The formal description of aerobic dance exercise. 1996
Gavin
Churcher
(EA,CS)
Improving the performance of speech driven applications using linguistic knowledge . 1996
Michael
Schillo
(EA)
Working while driving: corpus based modelling of a natural English voice user-interface to the in-car personal assistant (PDF). 1996
Xiaoda
Zhang
(EA)
MIRTH Chinese and English search engine: a multilingual retrieval tool hierarchy for a World Wide Web virtual corpus. 1996
Nik
Silver
(PM)
Inferencing methods using systemic functional grammar. 1995
Uwe
Jost

(EA)
Probabilistic language modelling for speech recognition. 1995
Simon
Arnfield
(EA)
Prosody and syntax in corpus-based analysis of spoken English. 1994
Alec
Grierson
(RP)
Generating cohesive texts from simulations used in computer-aided instruction. 1994
John S
Hughes
(EA)
Automatically acquiring a classification of words (PDF). 1994
Tim
O'Donoghue
(EA)
Reversing the process of generation in Systemic Grammar. 1993

NATURAL LANGUAGE PROCESSING research group

ALUMNI news PEOPLE projects PUBLICATIONS seminars VACANCIES useful links

Language research in Computing is also known as Natural Language Processing , Computational Linguistics , Corpus Linguistics ,or Language Engineering . Central to our research is the computational modelling of language data; a CORPUS is a text dataset representative of the language to be analysed. Our research at Leeds University focusses on bootstrapping linguistic knowledge and resources from text, and is reported in our PUBLICATIONS.

Language research graduates have gone on to work in Web search, text analytics, translation and language consulting, online news, voice-to-text, the Search for Extra-Terrestrial Intelligence, and, of course, as University academics !

Staff

Eric Atwell Eric Atwell, Associate Professor
Corpus Linguistics, Data Mining with text in English, Arabic, and other languages; Text Analytics applied to Understanding the Quran, detecting terrorist activites, and Healthcare patient records. PUBLICATIONS.
Katja Markert Katja Markert, Reader (on sabbatical summer 2013)
Data-intensive, corpus-based and web-based natural language processing, Anaphora Resolution, Figurative Language Resolution, Textual Entailment, Sentiment Analysis. PUBLICATIONS.

Claire Brierley Claire Brierley, Senior Research Fellow
Corpus Linguistics, Detecting terrorist activites, e-Health patient records, Prosody resources. PUBLICATIONS.
Owen Johnson Owen Johnson, Senior Fellow
Corpus linguistics for Health Informatics, Healthcare patient records research methods. PUBLICATIONS.
Majdi Sawalha Majdi Sawalha, Research Fellow
Web-as-Corpus for Islamic Studies; Arabic morphological analysis. PUBLICATIONS.
Justin Washtell Justin Washtell, Research Fellow
Corpus-based distributional models of lexical semantics PUBLICATIONS.
Saman Hina, Research Fellow
Anonymisation of e-Health patient records.

We collaborate with academic staff in the Centre for Translation Studies :

Tony Hartley Tony Hartley
Evaluation of machine translation systems, Controlled languages, Natural Language Generation, Quality in translation and interpreting, Computer Supported Collaborative Working. PUBLICATIONS
Serge Sharoff Serge Sharoff
Corpus linguistics, Natural Language Understanding, Natural Language Generation, Lexical semantics, Systemic-Functional grammar. PUBLICATIONS

At the German Christmas Market in Leeds
R-to-L: Eric Atwell, Majdi Sawalha, Justin Washtell, Claire Brierley, Fangzhong Su, Josiah Wang, Owen Tregurtha Nancarrow - at the German Christmas Market next to Leeds University

Research Students

STUDENT +Supervisor(s)RESEARCH TOPIC
Amal Alsaif (KM)Human and automatic annotation of discourse relations for Arabic
Samuel Danso (EA,OJ) Text Analytics to Predict Cause of Death in Verbal Autopsies
Kais Dukes (EA)Arabic Language Computing Applied to the Quran
Saman Hina (EA,OJ) SNOMED semantic tagger for medical corpus linguistics
Jaafar, Juliana (OJ,EA,SC)Text Analytics and Data Mining for e-Health Informatics
Andrew McKinlay (KM) Automatic Detection of Discourse Structure: Relation and Entity Graphs
Noushin Rezapour Asheghi (SS,KM)Genre classification of web pages
Abdul-Baquee Muhammad Sharaf (EA) A Computational Model for Knowledge Representation of the Quran
Josiah Wang (ME,KM) Learning Visual Object Recognition from Text
Justin Washtell (EA,KM) The benefits of proximity as opposed to frequency as a basis for modelling language


Kais Dukes - University of Leeds Engineering Postgraduate Researcher of the year

Potential research collaborators and PhD students are very welcome to contact any of the academic staff (Atwell, Markert, Hartley, and Sharoff). Please send us an outline project proposal (see guidelines and example project ideas). You can apply for our PhD programme online.


Selected Publications

or you can select a fuller list
Brierley, C; Atwell, ES Non-Traditional Prosodic Features for Automated Phrase-Break Prediction. Literary and Linguistic Computing Journal, vol. 26, pp.279-284. 2011.LINK
Dukes, K; Atwell, ES; Habash, N Supervised Collaboration for Syntactic Annotation of Quranic Arabic. Language Resources and Evaluation Journal, pp.1-30. 2011.LINK
Rapp, AM; Erb, M; Langohr, K; Markert, K Neural Correlates of Metonymy Comprehension in Schizophrenia in: Schizophrenia Bulletin, vol. 37, pp.150-150. Oxford University Press. 2011.
Brierley, C; Atwell, ES Holy smoke: vocalic precursors of phrase breaks in Milton's Paradise Lost. Literary and Linguistic Computing Journal, vol. 25, pp.137-151. 2010.LINK
DOI
Hina, S; Atwell, ES; Johnson, O Semantic Tagging of Medical Narratives with Top Level Concepts from SNOMED CT Healthcare Data Standard. International Journal of Intelligent Computing Research (IJICR), vol. 1, pp.118-123. 2010.LINK
Markert, K; Nissim, M Data and models for metonymy resolution. Language Resources and Evaluation, vol. 43, pp.123-138. 2009.

Useful links

  • Leeds University research seminars and groups in related subjects: Modern Languages and Cultures, Knowledge Management, Translation Studies, Linguistics and Phonetics, Language Education, English.

  • Bookmarks for corpus-based linguistics

  • International Conferences in Language Computing and Computer Science

  • Language Computing Journals:

  • ACM Transactions on Asian Language Information Processing
  • ACM Transactions on Speech and Language Processing
  • Computational Linguistics Journal
  • Computer Speech and Language Journal
  • Corpora journal
  • Corpus Linguistics and Linguistic Theory journal
  • Egyptian Journal of Language Engineering
  • IEEE Transactions on Audio, Speech and Language Processing
  • International Computer Archive of Modern and Medieval English ICAME Journal
  • International Journal of Asian Language Processing
  • International Journal of Computational Linguistics
  • International Journal of Computational Linguistics and Applications
  • International Journal of Computational Linguistics and Chinese Language Processing
  • International Journal of Computational Linguistics Research
  • International Journal of Computer Processing Of Languages
  • International Journal of Corpus Linguistics
  • International Journal on Natural Language Computing
  • Journal for Language Technology and Computational Linguistics
  • Journal of Logic, Language and Information
  • Journal of Interesting Negative Results in Natural Language Processing and Machine Learning
  • Journal of Quantitative Linguistics
  • Language Resources and Evaluation Journal
  • Linguamatica Journal for the Automatic Processing of the Iberic Languages
  • Linguistic Issues in Language Technology journal
  • Literary and Linguistic Computing Journal
  • Machine Translation Journal
  • Natural Language Engineering Journal
  • Northern European Journal of Language Technology
  • Prague Bulletin of Mathematical Linguistics
  • Procesamiento del Lenguaje Natural Journal
  • Research on Language and Computation
  • Traitement Automatique des Langues journal
    Google Scholar