Automatic Mapping Among Lexico-Grammatical Annotation Models (AMALGAM)


THE MULTI-TAGGED CORPUS



AMALGAMHomepagePrevious PageUp A LevelNext Page

AMALGAM HOMEPAGE | PREVIOUS PAGE | UP A LEVEL | NEXT PAGE

Using the AMALGAM multi-tagger a corpus of 180 sentences was created. The "multi-tagged" corpus consists of the following texts:

The texts were tagged (raw output from the AMALGAM tagger, including errors) using the Brown, ICE, LLC, LOB, UNIX Parts, POW, SEC and UPenn tagging schemes.

The tagged texts were also proofread and edited by human experts in order to remove any errors made by the AMALGAM tagger.