My publications

Books

Mehler, A., Sharoff, S., Santini, M. (2010) (eds) Genres on the Web: computational models and empirical studies, Springer; book info at the Springer website

Journal papers

Sharoff, S., Babych, B., Hartley, A. (2009) 'Irrefragable answers' using comparable corpora to retrieve translation equivalents. In Language Resources and Evaluation Journal, 43, 15-25, Prepublication draft

Sharoff, S. (2006) Open-source corpora: using the net to fish for linguistic data. In International Journal of Corpus Linguistics 11(4), 435-462, Prepublication draft

Sharoff, S. (2005) The communicative potential of verbs of "away-from" motion in English, German and Russian. In Functions of Language 12(2), 205-240, Prepublication draft

Sharoff, S. (2004) Harnessing the lawless: using comparable corpora to find translation equivalents. Journal of Applied Linguistics 1(3), 333-350. Prepublication draft

Kagarlitsky, Yu., Pichhadze, A., Sharoff, S. (2004) Problemy sozdanija elektronnogo korpusa pamjatnikov drevnerusskoj pis'mennosti (Towards creation of an electronic corpus of Old Russian manuscripts). NTI Series 2, N10, 1-6

Sharoff, S. (2003) Predstavitelnyj korpus russkogo jazyka v kontekste mirovogo opyta. (in Russian: The Russian Reference Corpus and its relationship to the state of the art). NTI Series 2, N6, 8-18. Prepublication draft

Book chapters

Sharoff, S. (2010) In Mehler et al., editors, Genres on the Web: computational models and empirical studies, Springer. Prepublication draft

Mudraya, O., Piao, S., Rayson, P., Babych, B., Sharoff, S., Lofberg, L. (2008) Automatic Extraction of Translation Equivalents of Phrasal and Light Verbs in English and Russian. In Phraseology: an interdisciplinary perspective, S Granger and F Meunier (eds.), Benjamins, Amsterdam

Sharoff, S. (2006) Creating general-purpose corpora using automated search engine queries. In Marco Baroni and Silvia Bernardini, editors, WaCky! Working papers on the Web as Corpus. Gedit, Bologna. PDF

Sharoff, S. (2006) How to handle lexical semantics in SFL: a corpus study of purposes for using size adjectives. In: S.Hunston, G.Thompson (eds.) Systemic Linguistics and Corpus. London: Equinox, pages 184-205. PDF

Sharoff, S. (2005) Methods and tools for development of the Russian Reference Corpus. In D. Archer, A. Wilson, P. Rayson (eds.) Corpus Linguistics Around the World. Amsterdam: Rodopi, pages 167-180. PDF

Sharoff, S. (2005) Phenomenology and cognitive science. In S.Franchi, G.Guzeldere, Constructions of the Mind. Cambridge, MA: MIT Press. The prepublication version of the paper appeared in The Stanford Humanities Review, Vol. 4, No 2, 1995, pp. 190-206.

Sharoff, S. (2004) Istorija sozdanija korpusov russkogo jazyka i parametry bol'shogo predstavitel'nogo korpusa (History of Russian corpora and parameters of a large representative corpus). In: J. Wawrzynczyk (ed.) Korpusy Jezyka Rosyjskiego w Polsce i na Swiecie, Lodz: Abagraf, pp. 23-38.

Sharoff, S. (2003) 'When I use a word, it means just what I choose it to mean'. In C.Inchaurralde and C. Floren (eds.) Interaction and Cognition in Linguistics. Frankfurt: Peter Lang, pages 277-293

Boldasov M.V., Sokolova E.G., Sharoff, S. and Zhigalov V.A. (2002) Towards a GIS-based multimodal question answering system. In Renate Pajusalu and Tiit Hennoste (eds.) Tähendusepüüdja/Catcher of the meaning. Festschrift for Professor Haldur Oim on the occasion of his 60th birthday. University of Tartu. Tartu, pages 42-65

Sharoff, S. (2000) Lexis: between the Grammar and the Conceptual Model. In: A. Melby, A. Lommel (eds.) 26th LACUS Forum, Chapel Hill, NC: LACUS, 2000, pp. 369-380.

O.Fedorova, Sharoff, S., L.Sokolova, T.Sosenskaya (1999) Modelirovanie lingvisticheskih znanij v informatsionno-poiskovyh sistemah (On modeling of linguistic knowledge in information-retrieval systems). In E.Rakhilina, Ya.Testelets (eds.) Typology and Linguistic Theory: From description to explanation. Moscow: Jazyki Russkoj Kultury, 1999, pp. 600-612.

Kononenko I., Sharoff, S. (1996) Understanding short texts with integration of knowledge representation methods. In D. Bjorner, M. Broy, I.V. Pottosin (Eds.) Perspectives of System Informatics. Springer Lecture Notes in Computer Science Vol. 1181, Berlin: Springer Verlag, pages 111-121.

Published conference proceedings

Serge Sharoff, Svitlana Kurella, and Tony Hartley. Seeking needles in the Web haystack: finding texts suitable for language learners. In Proc. of Teaching and Language Corpora Conference, TaLC 2008, Lisbon, 2008. PDF

Svitlana Kurella, Serge Sharoff, and Tony Hartley. Rhetorical text structure in acquiring reading skills in L3. In Proc. of Teaching and Language Corpora Conference, TaLC 2008, Lisbon, 2008.

Bogdan Babych, Tony Hartley, and Serge Sharoff. Generalising lexical translation strategies for MT using comparable corpora. In Proc. of the Sixth Language Resources and Evaluation Conference, LREC 2008, Marrakech, 2008.

Marco Baroni, Francis Chantree, Adam Kilgarriff, and Serge Sharoff. Cleaneval: a competition for cleaning web pages. In Proc. of the Sixth Language Resources and Evaluation Conference, LREC 2008, Marrakech, 2008. PDF

Svitlana Kurella, Tony Hartley, and Serge Sharoff. Corpus-based tools for computer-assisted acquisition of reading abilities in cognate languages. In Proc. of the Sixth Language Resources and Evaluation Conference, LREC 2008, Marrakech, 2008.

Serge Sharoff, Mikhail Kopotev, Tomaz Erjavec, Anna Feldman, and Dagmar Divjak. Designing and evaluating a Russian tagset. In Proceedings of the Sixth Language Resources and Evaluation Conference, LREC 2008, Marrakech, 2008. PDF

Sharoff, S., Secară, A. (2007) Who is talking to whom? Linguistic cues for engaging with the audience. Proceedings of Affective Communications in Design. Challenges for Researchers, Leeds, June 2007, 175-182, PDF

Sharoff, S. (2007) Classifying Web corpora into domain and genre using automatic feature identification. In Proc. of Web as Corpus Workshop, Louvain-la-Neuve, September, 2007. PDF

Babych, B., Hartley, A., Sharoff, S. (2007) Translating from under-resourced languages: Comparing direct transfer against pivot translation. In Proc. of MT Summit IX, Copenhagen, August, 2007. PDF

Pendar, N., Sharoff, S. (2007) Supervised Lexical Acquisition for Persian from a Web Corpus. In Proc. of CAASL, Stanford, July, 2007. PDF

Babych, B., Hartley, A., Sharoff, S., Mudraya, O. (2007) Assisting Translators in Indirect Lexical Transfer. In Proc. of ACL, Prague, June, 2007, pp. 136-143. PDF

Sharoff, S., Babych, B., Hartley, A. (2006) Using comparable corpora to solve problems difficult for human translators. In Proc. of joint COLING-ACL, Sydney, July, 2006. PDF

Sharoff, S. (2006) Translation as problem solving: uses of comparable corpora. In Proc. of Language resources for translation research and practice at LREC2006, Genoa, May, 2006. PDF

Ciobanu, D., Hartley, A., Sharoff, S. (2006) Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language. In Proc. of LREC2006, Genoa, May, 2006. PDF

Sharoff, S. (2006) A Uniform Interface to Large-Scale Linguistic Resources. In Proc. of LREC2006, Genoa, May, 2006. PDF

Sharoff, S., Babych, B., Hartley, A. (2006) Using collocations from comparable corpora to find translation equivalents. In Proc. of LREC2006, Genoa, May, 2006. PDF

Sharoff, S., Babych, B., Rayson, P., Mudraya, O., Piao, S. (2006) ASSIST: Automated semantic assistance for translators. Proc. EACL2006, Trento, April, 2006. PDF

Sharoff, S., A.Hartley and P.Llewellyn-Jones (2004) Sentence Generation in British Sign Language. In A. Belz, R. Evans and P. Piwek (eds.) Extended abstracts of posters presented at the Third International Conference on Natural Language Generation, Brighton, UK, July 2004, pages 40-43.

Sharoff, S. (2004) What is at stake: a case study of Russian expressions starting with a preposition. In Proc. of ACL04 Workshop "Multiword Expressions: Integrating Processing". Barcelona, Spain, July, 2004. 17-23. PDF

Sharoff, S. (2004) Towards basic categories for describing properties of texts in a corpus. In Proc. of LREC2004, Lisbon, Portugal, May, 2004, 1743-1746.

Sharoff, S. (2003) K sozdaniju predstavitel'nogo korpusa sovremennogo russkogo jazyka. (Towards development of a representative corpus of modern Russian). In Proc. of DIALOGUE-2003.

Sharoff, S. (2003) Methods and tools for development of the Russian Reference Corpus. In Proc. of Corpus Linguistics Conference. March, 2003, Lancaster, UK. 680.

Sharoff, S. (2002) Meaning as use: exploitation of aligned corpora for the contrastive study of lexical semantics. In Proc. of Language Resources and Evaluation Conference (LREC02). May, 2002, Las Palmas, Spain. 447-452. PDF

Gibbon D., Trippel T., Sharoff, S. (2001) Concordancing for Parallel Spoken Language Corpora. Proc. of Eurospeech 2001. The 7th European Conference on Speech Communication and Technology. September 3-7, 2001. Aalborg, Denmark, pp. 2063-2066.

Sharoff, S., Volchenkov D. (2001) On the application of the dynamical systems theory to social models. Proceedings of the research project: The Sciences of Complexity. N 2001/49. Zentrum für interdisziplinäre Forschung, Universität Bielefeld. 2001, 36 pages. Also available as a chapter on the CD-ROM "The Sciences of Complexity: From Mathematics to Technology to a Sustainable World" from the University of Bielefeld.

Sharoff, S. (2001) Through the looking glass of parallel texts. Proc. of the Corpus Linguistics Conference. Lancaster, March-April, 2001, pp. 543-552.

Sharoff, S., Sokolova, L. (2001) Semanticization of language-independent concepts in a multilingual system. Proc. of International Workshop on Computational Semantics (IWCS-4). Utrecht, Holland, January, 2001, pp. 433-446.

Sharoff, S. (2000) Husserl's model of linguistic communication. Proc. of the International Conference "Cognitive Modeling in Linguistics", October, 2000. (in Russian)

Sharoff, S., L.Sokolova (2000) Multilingual generation of user manuals in Slavic languages. Proc. of the 7th Russian National AI Conference, (in Russian), Pereslavl, October 2000.

Geert-Jan Kruijff, John Bateman, Danail Dochev, Jiri Hana, Ivana Kruijff-Korbayová, Hana Skoumalová, Lena Sokolova, Kamenka Staykova, Elke Teich, Sharoff, S. (2000) Multilinguality in a Text Generation System for Three Slavic Languages. Proc. of COLING-2000, Luxemburg, August, 2000, pp. 474-480.

John Bateman, Elke Teich, Geert-Jan Kruijff, Ivana Kruijff-Korbayová, Hana Skoumalová, Sharoff, S. (2000) Resources for multilingual text generation in three Slavic languages. Proc. of LREC2000: Language Resources and Evaluation Conference, Athens, Greece, May 30-June 2, 2000, pp. 1763-1767.

Sharoff, S. (2000) Application of Object-Oriented Approach to Linguistic Modeling. Proc. of CICLing-2000, Mexico City, Mexico, February 13-19, 2000, pp. 88-100.

Sharoff, S., V. Zhigalov (1999) Register-domain separation as a methodology for development of natural-language interfaces to databases. Proc. of INTERACT'99 (The Seventh IFIP Conference on Human-Computer Interaction), Edinburgh, UK, September, 1999, pp. 79-85.

Sharoff, S. (1999) About women: meeting phenomenology and cognitive science. Proc. of the First Russian Cognitive Science Conference, (in Russian), September, 1999.

Sharoff, S. (1999) The semiotics of "Choose Multiline from the Draw toolbar". Proc. of the 5th International Congress on Terminology and Knowledge Engineering, Innsbruck (Austria), 24-28 August 1999, pp. 594-602.

Sharoff, S. (1999) On interaction between terms and a domain model. Proc. of the international workshop DIALOGUE'99, (in Russian), Tarusa, June 1999.

Sharoff, S. (1998) On mathematical models of cognitive mechanisms. Proc. of the cognitive modeling workshop at the 6th Russian National AI Conference, (in Russian). October 1998, pp. 222-231. These proceedings have also been published by the Web Journal of Formal, Computational & Cognitive Linguistics (zipped doc file).

Sharoff, S. (1998) On difference between natural language ontology and ontology of the problem domain. Proc. of the 6th Russian National AI Conference, (in Russian). October 1998, pp. 41-49.

Towards multilingual generation of manuals: the initial stage of the AGILE project, The international workshop DIALOGUE'98 (with L.Sokolova, in Russian). October 1998, pp. 848-859.

Bateman, J., Sharoff, S. (1998) Multilingual Grammars and Multilingual Lexicons for Multilingual Text Generation. Proc. ECAI'98 Workshop on Multilinguality in the Lexicon, Brighton, U.K., August 1998, pp. 1-8.

Sharoff, S. (1997) Meaning and/or Context. Proc. of the Cognitive Science Society Conference, Stanford, August, 1997.

Sharoff, S. (1997) Towards meaning description mechanisms. Proc. of the international workshop DIALOGUE'97, Yasnaya Polyana, June, 1997, (in Russian), pp. 300-302.

Sharoff, S. (1996) Merging linguistic and extralinguistic knowledge in grammar theory. Proc. of the 5th Russian National AI Conference, (in Russian), Kazan, September, 1996. pp. 101-105.

Sharoff, S., L.Sokolova (1996) Contrastive analysis of software manuals. Proc. MULSAIC'96, workshop at ECAI'96, Budapest, August, 1996, pp. 57-60.

Sharoff, S., Sokolova, L. (1996) Interlingua for multilingual generation. Proc. of the international workshop DIALOGUE'96, Pushino, 1996. (in Russian), pp. 240-248.

Sharoff, S., L.Sokolova (1995) Analysis of rhetorical structures in technical manuals and their multilingual generation. Proc. of the workshop on multilingual generation at IJCAI'95, Montreal, 1995, pp. 119-128.

Sharoff, S. (1995) An application of object-oriented programming to linguistic models. Proc. of the international workshop DIALOGUE'95, (in Russian), Kazan, 1995, pp. 332-339.

Sharoff, S., L.Sokolova (1995) Representation of technical manuals by means of rhetorical relations. Proc. of the international workshop DIALOGUE'95, (in Russian), Kazan, 1995, pp. 283-293.

Computational philosophy: intentionality and internal time consciousness. Proc. of the "Reaching for mind" workshop at AISB'95.

Sharoff, S. (1993) SNOOP: A System for Development of Linguistic Processors. Proc. of EWAIC'93, Moscow, 1993, pp. 184-188.

*A summary in English is available for all papers in Russian.

Unpublished talks and presentations

The corpus study of purposes for using size adjectives. Presentation at the 29th International Systemic-Functional Congress (ISFC29), Liverpool, 2002.
'When I use a word, it means just what I choose it to mean'. Presented at the 11th Susan Hübner Workshop on Cognitive Linguistics, Zaragoza, February, 2001. Selected papers published in: C.Inchaurralde and C. Floren (eds.) Interaction and Cognition in Linguistics. 2002. Frankfurt: Peter Lang.
Two presentations: AGILE: Automated Generation of Instructions in Languages of Eastern Europe, and On the computational models of lexical meanings. University of Heidelberg, April, 2001
Studying lexical semantics in parallel corpora. Centre for Corpus Linguistics, University of Birmingham, June, 2001
Lexis: between the Grammar and the Conceptual Model. Presented at the 26th LACUS Forum. Selected papers have been published in: A. Melby, A. Lommel (eds.) Selected papers presented at the 26th LACUS Forum, Chapel Hill, NC: LACUS, 2000, pp. 369-380.
Natural Language Understanding and Generation activities at the Russian Research Institute for Artificial Intelligence. Presented at the University of Heidelberg, February, 1998
Use of Ontologies in practical NLP applications. Presented at the Euresco Conference on Logic, Language and Information, Portugal, December, 1994

 

Drafts

The core lexicon of emotions in English, German and Russian. PDF
Concepts vs. Instances in Linguistic Models. PDF

Last modified on 05/07/10 by Serge Sharoff, s.sharoff@leeds.ac.uk