Pre-analyzed Portuguese sentences
Floresta Sintá(c)tica sentences (Newspaper corpus treebank)
This treebank is a joint venture (Floresta Sintá(c)tica) between the Portuguese section of VISL and the project (Processamento computacional do português).
The corpus is based on a 1 million word excerpt from the CETEMPúblico corpus, and consists of manually proof-read PALAVRAS-output, using modified Constraint Grammar category labels. Extensive documentation is available on the project site, as well as a FAQ list and download links.
Browse the sentences:
Newspaper corpus treebank (Floresta)
In the box above, you can type in either a whole sentence from the pre-analyzed set,
or a unique string from the sentence, or the relevant identifying code found at the left of
each sentence. Alternatively, you can click on the icon to the left of each sentence, if there is one.
load_body_global File Not Found: cetemcorpus.pt.html