Automatic Mapping Among Lexico-Grammatical Annotation Models (AMALGAM)
This program is effectively a wrapper for Eric Brill's
rule-based tagger, retrained at Leeds with eight alternative tagging schemes.
Brill's tagger works by reading in its lexicon, bigram lists and rules from
external files. AMALGAM's tagger redirects Brill's tagger to
read in alternative versions of these defining files, so that it can annotate
text according to any of the eight schemes.
The text to be tagged is first passed through a tokeniser, which applies various formatting rules to the text. Tokenisation can be turned on or off when mailing amalgam-tagger; more details are in the help file.
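The file-redirection idea described above can be illustrated with a minimal sketch. It is not AMALGAM's actual code: the directory layout (one sub-directory per scheme), the file names `lexicon`, `bigrams` and `rules`, the helper names, and the order of the command-line arguments are all assumptions made for illustration only.

```python
from pathlib import Path

# Hypothetical names for the three kinds of defining file mentioned above.
DEFINING_FILES = ("lexicon", "bigrams", "rules")


def scheme_files(root, scheme):
    """Map each defining file to its path under root/<scheme>/ (assumed layout)."""
    base = Path(root) / scheme
    return {name: base / name for name in DEFINING_FILES}


def tagger_command(tagger, root, scheme, corpus):
    """Build an argument list pointing the tagger at one scheme's files.

    The argument order is an assumption, not the documented interface
    of Brill's tagger.
    """
    files = scheme_files(root, scheme)
    return [
        str(tagger),
        str(files["lexicon"]),
        str(corpus),
        str(files["bigrams"]),
        str(files["rules"]),
    ]
```

Switching annotation scheme then amounts to changing the `scheme` argument; the tagger binary itself stays the same, which is the sense in which the program is a wrapper.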
A description of the training procedure for Brill's tagger that allows it to acquire each new scheme is available.
This site is developed and maintained by Eric Atwell (eric@comp.leeds.ac.uk)