AnCora-DEP-CA Corpus Text

Resource Name AnCora-DEP-CA
Description AnCora-DEP-CA is the AnCora-Es multilevel annotated corpus of Catalan in dependency-based representation, consisting of 500,000 words approximately. AnCora-DEP-Es can be used as source of information for inducing grammars, developing, improving and/or evaluating syntactic parsers and algorithms for semantic role labelling, dependency-based. This corpus is used in the CoNLL Shared Task 2009: Syntactic and Semantic Dependencies in Multiple Languages, where the core of the task is to predict syntactic and semantic dependencies and their labelling.
Language Name Catalan
Url http://clic.ub.edu/corpus/
Documentation
Annotation Standoff false
Annotation Type
Character Encoding Utf 8
Contact Person M Antònia Martí Antonín
Creation Mode Mixed
Funding Project
Identifier UB_ancoraCAdep
Language Code http://www.fao.org/aims/aos/languagecode.owl#cat
Language Identifier ca
Licence Gpl
Linguality Monolingual
Media Type Media Type
Meta Share Identifier NOT_DEFINED_FOR_V2
Mime Type http://purl.org/NET/mediatypes/text/xml
Resource Creator
Resource Short Name AnCora-DEP-CA
Segmentation Level
Size Information http://lodserver.iula.upf.edu/Metashare/resource/size_N177E4
Tagset other