AnCora-CA Corpus Text

Resource Name AnCora-CA
Description The AnCora-CA is a Catalan corpus annotated at different levels: Lemma and Part of Speech; Syntactic constituents and functions; Argument structure and thematic roles; Semantic classes of the verb; Denotative type of deverbal nouns; Nouns related to WordNet synsets; Named Entities and Coreference relations; AnCora corpus is mainly based on journalist texts. Detailed information can be found at the corpus web site: http://clic.ub.edu/corpus/en.
Language Name Catalan
Url http://clic.ub.edu/corpus/
Documentation
Annotation Standoff false
Annotation Type
Character Encoding Utf 8
Contact Person M Antònia Martí Antonín
Creation Mode Mixed
Funding Project
Identifier UB_ancoraCA
Language Code http://www.fao.org/aims/aos/languagecode.owl#cat
Language Identifier ca
Licence Gpl
Linguality Monolingual
Media Type Media Type
Meta Share Identifier NOT_DEFINED_FOR_V2
Mime Type http://purl.org/NET/mediatypes/text/xml
Resource Creator
Resource Short Name AnCora-CA
Segmentation Level
Size Information
Tagset other