University of Southern Denmark

Corpus Search

Our corpus server (overview) currently has corpora available for the following languages.

The Danish corpora, most of the Portuguese and Esperanto corpora, and the Europarl corpora for all languages can be accessed without a password. Access to the other corpora is currently password-restricted to people and projects affiliated with the Institute of Language and Communication at SDU - Odense University.

The VISL project leader, Eckhard Bick, has developed search engines for these corpora which recognize regular expressions and supply search results in the form of concordances, with search hits highlighted in boldface. For those who may be unfamiliar with regular expressions or VISL's grammatical annotation system, the Corpus Search pages offer brief on-site user manuals, while in-depth definitions and examples of grammatical categories and tags are provided in the info-folders in the relevant language section at the main VISL site. Further information on regular expressions can be found in A Gentle Introduction to Regular Expressions (PDF format) by VISL project members John Dienhart and Henrik Kasch.
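
For readers new to the idea, a regular-expression concordance search can be sketched in a few lines of Python. This is only an illustration of the general technique, not the VISL search engine itself: the function name `concordance`, the `width` parameter, the sample sentence, and the use of `**` to stand in for boldface are all assumptions made for this example.

```python
import re


def concordance(text, pattern, width=30):
    """Return one keyword-in-context line per regex match in `text`.

    `width` is the number of context characters kept on each side.
    The hit is wrapped in ** as a plain-text stand-in for boldface.
    """
    lines = []
    for match in re.finditer(pattern, text):
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        lines.append(f"{left}**{match.group(0)}**{right}")
    return lines


# Hypothetical sample text; a real corpus would be far larger and annotated.
sample = "The cat sat on the mat. A cat is not a car."

# \bca[tr]\b matches the whole words "cat" and "car".
hits = concordance(sample, r"\bca[tr]\b", width=8)
for line in hits:
    print(line)
```

Each printed line shows one hit with a few characters of context on either side, which is the basic shape of a concordance. Searching an annotated corpus would additionally match against grammatical tags, which this sketch does not attempt.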

On the corpus overview page, rectangular flag links indicate (old) interfaces based on the use of regular expressions (reg.ex.), while round flag buttons indicate (new) menu-based CQP interfaces, which have been developed with "non-computational" users in mind. Tree flags indicate treebank corpora, which allow structured constituent searches.

Information about a wide range of additional corpora and on-line search engines can be found by visiting the corpus index developed by Jens Ahlmann Hansen.



Copyright 1996-2017