Automatic Mapping Among Lexico-Grammatical Annotation Models (AMALGAM)




AMALGAM Home PagePrevious PageUp A LevelNext Page

AMALGAM HOMEPAGE | PREVIOUS PAGE | UP A LEVEL | NEXT PAGE

The Lancaster-Oslo/Bergen (LOB) Corpus Tag-set

Listed alphabetically below are the tags used in the LOB corpus. Each tag has examples of the tokens that were annotated with that tag. The examples are taken directly from the LOB corpus. If the list of examples ends with an ellipsis marker then the tag category can be assumed to be an open class.

The original LOB tagger did not take raw text as input but text that had some features pre-annotated such as indicators to mark abbreviations or special characters. As a result, the examples given in some categories may appear strange. Please consult the notes after the table for further details.

Further information on the tagged LOB corpus can be found at the International Computer Archive of Modern English (ICAME) corpus collection.

Further reading:

Johansson, S., E. Atwell, R. Garside & G. Leech. 1986. The Tagged LOB Corpus: Users' manual. ICAME, The Norwegian Computing Centre for the Humanities, Bergen University, Norway.

Tag

Description

Examples

&FO

formula

10*:-1**: dE *:238**:U a*;n**; T*:-3/2**: E*;p**;(P) R*?8r(cdE.cde) ... [See note 1]

&FW

foreign word

de Welt von Retour Flamme route Musique Ancienne Pro unheimliche Opus baraka Biennale Internationale Novum sine die cantabile letzt bru"cke ...

!

exclamation mark

!

(

opening parenthesis

(

)

closing parenthesis

)

*'

opening quotation mark

*' *" [See note 2]

**'

closing quotation mark

**' **" [See note 2]

*-

dash

*- [See note 2]

,

comma

,

.

full stop

.

...

ellipsis

...

:

colon

:

;

semicolon

;

?

question mark

?

ABL

determiner/pronoun, pre-qualifier

such quite rather such-and-such

ABN

determiner/pronoun, pre-quantifier

all half

ABX

determiner/pronoun, double conjunction or pre-quantifier

both

AP

determiner/pronoun, post-determiner

more most last several next own other many much same less former only very few fewer latter least overmuch ain kast

AP"

determiner/pronoun, post-determiner, ditto

few good many little [See note 3]

AP$

determiner/pronoun, post-determiner, genitive

latter's former's other's

APS

determiner/pronoun, post-determiner, plural

others

APS$

determiner/pronoun, post-determiner, plural, genitive

others'

AT

article, singular

a an every

ATI

article, singular or plural

the no nae ye zee de ze

BE

verb "to be", infinitive or imperitive

be

BED

verb "to be", past tense, 2nd person singular or all persons plural

were

BEDZ

verb "to be", past tense, 1st and 3rd person singular

was

BEG

verb "to be", present participle or gerund

being

BEM

verb "to be", present tense, 1st person singular

am 'm

BEN

verb "to be", past participle

been

BER

verb "to be", present tense, 2nd person singular or all persons plural

are 're art 'rt ai

BEZ

verb "to be", present tense, 3rd person singular

is 's iss ees ai

CC

conjunction, coordinating

and but or nor as & yet / 'n and/or an' only n'

CC"

conjunction, coordinating, ditto

well as

CD

numeral, cardinal

1958 13 two 280,000 20 1959 28.5 400 eight 2 1949 six seven ten 1,400 16 9.40 five 100 four fifty 89 5.30 287 million 1/2 2.35 forty nine 6.55 ...

CD$

numeral, cardinal, genitive

8's 3's 5's 4's

CD-CD

numeral, cardinal, hyphenated pair

1955-6 15-20 1861-1940 1-6 2-0 3-1 33-1 12-1 300-400 0-3 10,000-15,000 1611-1961 six-five 51.1-3 280-338/39 ...

CD1

numeral, cardinal, one

one 1 'un

CD1$

numeral, cardinal, one, genitive

one's 1's

CD1S

numeral, cardinal, one, plural

ones 'uns

CDS

numeral, cardinal, plural

hundreds thousands dozens fifties two-thirds millions 1830's '30s forties middle-thirties sevens '20s 30's 1750s 'forties nines five-sixths zeros ...

CS

conjunction, subordinating

though that as while if because before than since whether for once except until provided unless although even lest now till such so but albeit whereas in considering nisi like whilst 'n whereupon save altho' tho' directly 'cos 'cause immediately

CS"

conjunction, subordinating, ditto

if that as though so far order

DO

verb "to do", uninflected present tense, infinitive or imperitive

do

DOD

verb "to do", past tense

did

DOZ

verb "to do", present tense, 3rd person singular

does doth

DT

determiner/pronoun, singular

another this that each zis zat anudder

DT$

determiner/pronoun, singular, genitive

another's

DTI

determiner/pronoun, singular or plural

any some enough

DTS

determiner/pronoun, plural

these those

DTX

determiner, pronoun or double conjuction

either neither

EX

existential there

there

HV

verb "to have", uninflected present tense, infinitive or imperitive

have 've hast of 'ave

HVD

verb "to have, past tense

had 'd

HVG

verb "to have", present participle or gerund

having havin'

HVN

verb "to have", past participle

had

HVZ

verb "to have", present tense, 3rd person singular

has 's hath ai

IN

preposition

by from at of on for into since in to with despite round as about over without towards behind during under beyond after because against outside including among like apart ...

IN"

preposition, ditto

of from spite with to front as for means opposed top between against la regards board versus

JJ

adjective

large likely out-dated adequate nationalist federal united national elected colonial full proposed secret central final unsatisfactory gross unconstitutional angry human heavy hostile economic monstrous warm-hearted ...

JJ"

adjective, ditto

up off luxe round cut weight priori hoc vires lived board fashioned

JJB

adjective, attributive-only

left-wing rival chief overall main once-and-for-all prime past nuclear-disarming anti-apartheid American-born built-in 89-year-old joint pro-communist centre second-row top ...

JJB"

adjective, attributive-only, ditto

army called

JJR

adjective, comparative

higher better worse easier wider tougher lesser nicer fairer worthier prettier neater noisier deeper happier nobler nearer slower bolder shallower faster-moving ...

JJR"

adjective, comparative, ditto

wearing

JJT

adjective, superlative

best fiercest bitterest largest toughest thorniest rarest humblest freshest sweetest clearest best-regulated kindest simplest-of-all handsomest strangest tallest ...

JJT"

adjective, superlative

selling

JNP

adjective, word-initial capital

African British Rhodesian anti-Negro German Manchu-Edwardian Yugoslav Asian Congolese inter-African Nazi Olympic Ritzy Anglo-American Persian Elizabethan Teutonic un-Italian Marxist Ritzy Norman Viking Luddite Presbyterian Churchillian Orwellian Kentish ...

MD

modal auxillary

may will should would can must might could need 'll shall 'd ought wilt mayest dared maun cou'd dare shoud shoulda 'ud wikk

NC

cited word

many thanks Jimmy ret-key s nonsense Directors' emoluments always only high decadent mouse sous Gita Ghita explode ...

NN

noun, singular, common

life move bill existence institution sentiment abolition independence association bureau investigation service post present spot sum drain answer rejection blow taxation ...

NN"

noun, singular, common, ditto

mortem blanche d'oeuvre douloureux garde d'hotel how up obscura hoccery d'affaires pectoris grata ego between

NN$

noun, singular, common, genitive

protectorate's labour's doctor's parliament's man's child's hour's airliner's week's conference's regiment's unit's pilot's orchestra's river's library's ...

NNP

noun, singular, common, word-initial capital

Chinese Irishman American Australian Negress English Scottish Briton Vansittartism Sinhalese Jew Whitgiftian Berliner Gaitskellism Jesuit Lancastrian Belgian Augustinianism Amorite Anabaptist Druze Celt Rugbeian Highlander ...

NNP$

noun, singular, common, word-initial capital, genitive

Englishman's Russian's Greek's Hungarian's Eskimo's Genoese's Turk's Frenchman's Prussian's Corsican's Canadian's ...

NNPS

noun, plural, common, word-initial capital

Africans Americans French Germans Nazis Arabs Anglo-Saxons Scandinavians Romans Pan-Somalis Berliners Ceylonese Cestrians Wearsiders Czechs Hessians Victoriana Dalmatians Brownists Bantu Kelts Slavs Medes Hindoos ...

NNPS$

noun, plural, common, word-initial capital, genitive

Germans' Tunisians' Americans' Africans' Nyasalanders' Spaniards'

NNS

noun, plural, common

peers nominees steps plans discussions organisations drugs conditions opponents details cuts changes foods families deeds words supplies measures police cars demonstrators ...

NNS"

noun, plural, common, ditto

d'appui ups

NNS$

noun, plural, common, genitive

settlers' neutrals' years' pro-communists' nightingales' players' footballers' sportsmen's officers' women's staffs' contributors' employers' hairdressers' breeders' ...

NNU

noun, abbreviated unit of measurement

\0s \0min \0mph \0d \0in \0p.c \0lb *+1,755million *+900,000 *+3,607,000 *+2 *+720 ... [See note 4]

NNU"

noun, abbreviated unit of measurement, ditto

\0cent cent \0yd [See note 4]

NNUS

noun, abbreviated unit of measurement, plural

\0pts \0yds \0gns *+s \0pp \0mins \0hrs \0revs \0galls \0lbs \0ins [See note 4]

NP

noun, singular, proper

Trevor Williams Michael Manchester Foot-Griffiths Bell Karen Roy Dennis Welensky Rhodesia Nkumbula Macleod Julius Accra Ellender Adenauer George Enoch France Corell-Barnes Selwyn ...

NP$

noun, singular, proper, genitive

Cheung's Griffith's Oxford's England's Guy's Swansea's Conroy's Zealand's Kent's London's Reid's Margaret's Windsor's Chatterley's Nancy's Sibelius's Shakespeare's Khruschev's ...

NPL

noun, singular, locative, word-initial capital

House Sea Hotel Airport Square Plain Island Palace Loch Cape Town River Gallery Yard Cove Park University Parade Mount Head Fountain Colliery Shipyard Citadel ...

NPL$

noun, singular, locative, word-initial capital, genitive

City's Garden's Moor's Theatre's Church's Marsh's College's

NPLS

noun, plural, locative, word-initial capital

Plains Locks Cottages Hills Colleges Universities Schools Churches Sands Galleries Marshes Downs Grottos Islands Isles Farms Pikes Roads Straits Broads Mountains Steps Levels Meadows Precincts Fields Counties Halls Buildings Gardens Galeries Woods Fens Towers Prisons Banks Moors Villages

NPLS$

noun, plural, locative, word-initial capital, genitive

Universities'

NPS

noun, plural, proper

Maritimes Wolves Barbarians Wasps Alps Penguins Brittains Debenhams Beechams Bents Mitchells Courtaulds Salems Spurs Wileys Balkans Lennons Greyfaces Tele-Bins Loyals ...

NPS$

noun, plural, proper, genitive

Bents' Cortaulds' Spurs' Wolves' Josephs' Rovers' Mudlarks' Beddises' Loyals' Shadows' Merwes' Caxtons' Marshams' Barkers' Frys' Slaytons' Stevens' Apaches' Pentlands' Cadwells' Swansons' Robertsons'

NPT

noun, singular, titular, word-initial capital

\0Mr \0MP Sir Premier Secretary Prime Minister President Senator \0Dr Prince \0St \0Pres Professor Herr \0Mrs \0C.I.G.S Rector \0P.C \0Hon \0Det.-Con \0D.C \0D \0PC \0Chief-Insp ... [See note 4]

NPT"

noun, singular, titular, word-initial capital, ditto

\0P

NPT$

noun, singular, titular, word-initial capital, genitive

President's Minister's Premier's Queen's Princess's Duke's King's Ambassador's Earl's Chancellor's Regent's Director's Lord's Pope's Registrar's Emperor's Lady's Laird's Vicar's Commander's Captain's Ma's Rector's Prince's General's ...

NPTS

noun, plural, titular, word-initial capital

\0MPs Lords Chiefs Ministers Comrades \0M.P.s Premiers Deans Representatives Saints Presidents Commandants Sons \0Messrs \0Clrs Mayors Aldermen Ambassadors Directors \0M.P.'s Tsars Emperors \0C.O.'s Rabbis Masters Knights Kings Brothers ...

NPTS$

noun, plural, titular, word-initial capital, genitive

Speakers' \0MPs' Sons' Directors'

NR

noun, singular, adverbial

tomorrow today yesterday south February Monday home July tonight Sunday north October September December January west east March \0Nov Wednesday to-night south-east Tuesday August Saturday May June to-day \0Feb April north-west \0W ...

NR$

noun, singular, adverbial, genitive

today's yesterday's to-night's Wednesday's to-morrow's west's Sunday's Saturday's tonight's Monday's tomorrow's home's

NRS

noun, plural, adverbial

homes Saturdays Tuesdays Mondays Sundays Fridays Thursdays

NRS$

noun, plural, adverbial, genitive

[See note 5]

OD

numeral, ordinal

second first third thirty-ninth fourth sixth seventh 75th 19th 3rd 4th 6th 2nd twentieth 1,000th 44th fifteenth ...

OD$

numeral, ordinal, genitive

[See note 5]

PN

pronoun, nominal

so anybody nothing no anyone none everything something anything nobody someone everybody somebody no-one some nuffin' ought somethin' nothin' summat nossings somep'n

PN"

pronoun, nominal, ditto

one

PN$

pronoun, nominal, genitive

everyone's everybody's anybody's something's anyone's no someone's

PN$"

pronoun, nominal, genitive, ditto

one's

PP$

determiner, possessive

his their my our its her your thy thine tha 'is yer me

PP$$

pronoun, possessive

ours mine theirs yours his hers thine

PP1A

pronoun, personal, nominative, 1st person singular

I

PP1AS

pronoun, personal, nominative, 1st person plural

we wee

PP1O

pronoun, personal, accusative, 1st person singular

me

PP1OS

pronoun, personal, accusative, 1st person plural

us 's

PP2

pronoun, personal, nominative or accusative, 2nd person

you ye thee thou y' ya tha yuh

PP3

pronoun, personal, nominative or accusative, 3rd person singular

it 't

PP3A

pronoun, personal, nominative, 3rd person singular

he she 'e

PP3AS

pronoun, personal, nominative, 3rd person plural

they

PP3O

pronoun, personal, accusative, 3rd person singular

him her 'er 'im

PP3OS

pronoun, personal, accusative, 3rd person plural

them 'em

PPL

pronoun, singular, reflexive

himself itself myself herself yourself oneself ourself thyself

PPLS

pronoun, plural, reflexive

themselves ourselves each one yourselves one-another

PPLS"

pronoun, plural, reflexive, ditto

other another another's other's

QL

qualifier, pre

too least most very as so more that less mighty real awfully stark this sound precious

QLP

qualifier, post

enough indeed

RB

adverb

forward still together violently once immediately bluntly clearly long obviously somewhere too truly seriously accurately profoundly rapidly superbly about ever entirely overseas ...

RB"

adverb, ditto

and large right particular least course once little last short again than well general from full long so on common certain alia the less main facto first yet se brief but ...

RB$

adverb, genitive

else's

RBR

adverb, comparative

later longer earlier more less better faster worse sooner deeper higher harder closer nearer farther cheaper heavier lower poorer oftener slower louder quicker ...

RBT

adverb, superlative

best most least fastest lowest worst nearest farthest closest longest furthest

RI

preposition, adverbial, lacking compliment

before within between above since after with near against alongside opposite but without below besides beyond beneath underneath like

RN

adverb, nominal

now then here there downstairs upstairs indoors tho inland here-and-now down-town zen hyar downtown

RP

adverb, particle

down up in through on off out apart about back over round away outside across around aboard inside aside by past behind under forth to oot

TO

infinitival to

to so in

TO"

infinitival to, ditto

as to order

UH

interjection

yes please well \0O.K oh no aw goodbye ah gee whiz sure hey presto wham amen say why dear good-morning hurrah aye welcome boy oi bang er hi hullo goddammit ...

VB

verb, base: uninflected present, imperitive or infinitive

stop gather turn abolish put take appear prop favour drop meet discuss want fall give consult delay sit attend pass pay help try mention point say increase run know ...

VB"

verb, base: uninflected present, imperitive or infinitive; ditto

pedal stitch shop

VBD

verb, past tense

opposed brought said telephoned went denied told remained added renewed arose wanted meant talked flew covered noted hoped thanked delivered supported felt noticed agreed ...

VBG

verb, present participle or gerund

replying addressing changing switching recommending hoping preserving provoking preventing wearing working winding accepting recalling ordering listening using ...

VBN

verb, past participle

made put backed abolished created recommended transmitted jailed decided presented cancelled prepared discussed used posted regarded written indicated favoured ...

VBZ

verb, present tense, 3rd person singular

believes remains gives smears shocks cables says needs goes seems professes reports becomes publishes hopes attacks cracks feels regards claims suggests comes speaks outlines ...

WDT

WH-determiner, interrogative

what whatsoever whatever which whichsoever whichever vich

WDT"

WH-determiner, interrogative, ditto

ever

WDTR

WH-determiner, relative

which

WP

WH-pronoun, interrogative, nominative or accusative

who whoever wot

WP$

WH-pronoun, interrogative, genitive

whose

WP$R

WH-pronoun, relative, genitive

whose

WPA

WH-pronoun, nominative

whosoever

WPO

WH-pronoun, interrogative, accusative

whom-ever whom

WPOR

WH-pronoun, relative, accusative

whom

WPR

WH-pronoun, relative, nominative or accusative

who that

WRB

WH-adverb

when wherever where how why however whenever wherein whereby whence whereof whereunto whereon

XNOT

negator

not n't na

ZZ

letter of the alphabet

G-91 F B zh2014 A T-34 bf alp P A20 X D pi O F11309 a b Q M R4 x z q y H M1 J1 M2 J2 n S U c e d f ...

Notes

  1. Formulas contain many symbols that were represented by special character sequences in the pre-formatting stage before the tagger was applied to the LOB corpus. This explains the strange appearence of these examples of formulas taken directly from the corpus.
  2. The LOB corpus tagger assumed that the input had been pre-formatted. Mostly the text was left as it would appear on a typed page but some characters where represented differently. Amongst these were the quote character and the dash character. The AMALGAM version of the LOB tagger will recognise and annotate these characters in their usual forms: ` ' " -
  3. Ditto tags were applied to words whose role changes from their normal syntax when applied in cerain combinations. The first word of the combination is tagged as normal and all subsequent words are given the first word's tag plus the ditto symbol ("). For example, the combination "so as to" is tagged TO TO" TO".
  4. Abbreviations in LOB were signalled by adding '\0' to the start of the abbreviated token. The AMALGAM version of the LOB tagger will handle abbreviations whether or not they have the '\0' prefix.
  5. The tag classes NRS$ and OD$ were designed by the LOB corpus developers but have no examples because there weren't any in the LOB corpus itself. NRS$ could be used to tag a word like Sundays' and OD$ the word third's.



AMALGAM Home PagePrevious PageUp A LevelNext Page

AMALGAM HOMEPAGE | PREVIOUS PAGE | UP A LEVEL | NEXT PAGE