|
|
|
||||
|
|
|
|
References and creditsThe hand-tagged closed corpus is a joint effort of the Spanish team, supervised by Uwe Kjær Nissen. The open Spanish system is based on multi-level Constraint Grammar disambiguation and is being developed by Eckhard Bick on the basis of a similar project for Portuguese.
The morphological analyzer used is based on a lexicon of 60.000 base forms, and its output is processed by some 5.000 Constraint Grammar rules for morphological, syntactic (and - in part - semantic) disambiguation. For an introduction to Constraint Grammar theory, see "Fred Karlsson et.al., Constraint Grammar: a language-independent system for parsing unrestricted text, Berlin 1995". The present version of the system uses the CG-2 rule compiler developed and licensed by Pasi Tapanainen. |
|||||||||||||||||