Tibidabo Treebank and IULA Spanish LSP Treebank Train and Test Partitions Corpus Text

Resource Name Tibidabo Treebank and IULA Spanish LSP Treebank Train and Test Partitions
Description This package contains a partition of the Iula Spanish LSP Treebank into train and test sets to perform Machine Learning experiments. In that way the same partitions can be used by different researchers and their results can be directly compared. In this package we also deliver the Tibidabo Treebank (Marimon 2010) which contains a set of sentences extracted from Ancora corpus annotated in the same way than the Iula Treebank. Tibidabo Treebank is a very good test set for models trained with Iula Spanish LSP Treebank since the sentences that form it from a very different domain than those of the Iula Spanish LSP Treebank.
Language Name Spanish
Url http://www.iula.upf.edu
Documentation The Tibidabo Treebank
Annotation Mode Manual
Annotation Standoff false
Annotation Tool
  • FreeLing
  • logon
Annotation Type Syntactic Annotation: Treebanks
Character Encoding Utf 8
Contact Person Núria Bel
Creation Mode Manual
Domain
  • general
  • medicine
  • economy
  • environment
  • computer science
  • law
Funding Project Metanet4 U – Enhancing The European Linguistic Infrastructure
Identifier http://hdl.handle.net/10230/20408
Language Code http://www.fao.org/aims/aos/languagecode.owl#spa
Language Identifier es
Licence Cc By
Linguality Monolingual
Media Type Media Type
Meta Share Identifier NOT_DEFINED_FOR_V2
Original Source
Resource Creator
Resource Short Name Tibidabo Treebank and IULA Spanish LSP Treebank Train and Test Partitions
Segmentation Level Word
Size Information
Tagset EAGLES tagset for Spanish