AnCora-CA Corpus Text

Resource Name	AnCora-CA
Description	AnCora-CA is a Catalan corpus annotated at several levels: lemma and part of speech; syntactic constituents and functions; argument structure and thematic roles; semantic classes of the verb; denotative type of deverbal nouns; nouns linked to WordNet synsets; named entities; and coreference relations. The corpus consists mainly of journalistic texts. Detailed information can be found at the corpus web site: http://clic.ub.edu/corpus/en.
Language Name	Catalan
Url	http://clic.ub.edu/corpus/
Documentation	AnCora-CO: Coreferentially Annotated Corpora for Spanish and Catalan; AnCora: Multilevel Annotated Corpora for Catalan and Spanish; AnCora | Corpus
Annotation Standoff	false
Annotation Type	Morphosyntactic Annotation: POS Tagging; Semantic Annotation: Semantic Roles; Semantic Annotation: Word Senses; Syntactic Annotation: Shallow Parsing
Character Encoding	UTF-8
Contact Person	M. Antònia Martí Antonín
Creation Mode	Mixed
Funding Project	Lang2World: Discovering the World Knowledge Codified in the Language; 3LB: Building a Syntactic-Semantic Trees Based Database; PRAXEM: Semantic and Pragmatic Annotation of the CESS-ECE Corpus; CESS-ECE: Syntactically and Semantically Annotated Corpora (Spanish, Catalan, Basque)
Identifier	UB_ancoraCA
Language Code	http://www.fao.org/aims/aos/languagecode.owl#cat
Language Identifier	ca
Licence	GPL
Linguality	Monolingual
Media Type	Media Type
Meta Share Identifier	NOT_DEFINED_FOR_V2
Mime Type	http://purl.org/NET/mediatypes/text/xml
Resource Creator	Universitat Politècnica de Catalunya. TALP Research Center; Universitat de Barcelona. Centre de Llenguatge i Computació
Resource Short Name	AnCora-CA
Segmentation Level	Phrase; Word
Size Information	http://lodserver.iula.upf.edu/Metashare/resource/size_N16372 http://lodserver.iula.upf.edu/Metashare/resource/size_N16377
Tagset	other