Resource Name
|
Tibidabo Treebank and IULA Spanish LSP Treebank Train and Test Partitions
|
Description
|
This package contains a partition of the IULA Spanish LSP Treebank into train and test sets for machine learning experiments, so that different researchers can use the same partitions and compare their results directly. The package also includes the Tibidabo Treebank (Marimon 2010), a set of sentences extracted from the AnCora corpus and annotated in the same way as the IULA Treebank. The Tibidabo Treebank is a very good test set for models trained on the IULA Spanish LSP Treebank, since its sentences come from a very different domain than those of the IULA Spanish LSP Treebank.
|
Language Name
|
Spanish
|
Url
|
http://www.iula.upf.edu
|
Documentation
|
The Tibidabo Treebank
|
Annotation Mode
|
Manual
|
Annotation Standoff
|
false
|
Annotation Tool
|
|
Annotation Type
|
Syntactic Annotation: Treebanks
|
Character Encoding
|
UTF-8
|
Contact Person
|
Núria Bel
|
Creation Mode
|
Manual
|
Domain
|
- general
- medicine
- economy
- environment
- computer science
- law
|
Funding Project
|
METANET4U – Enhancing the European Linguistic Infrastructure
|
Identifier
|
http://hdl.handle.net/10230/20408
|
Language Code
|
http://www.fao.org/aims/aos/languagecode.owl#spa
|
Language Identifier
|
es
|
Licence
|
CC BY
|
Linguality
|
Monolingual
|
Media Type
|
Media Type
|
Meta Share Identifier
|
NOT_DEFINED_FOR_V2
|
Original Source
|
|
Resource Creator
|
|
Resource Short Name
|
Tibidabo Treebank and IULA Spanish LSP Treebank Train and Test Partitions
|
Segmentation Level
|
Word
|
Size Information
|
|
Tagset
|
EAGLES tagset for Spanish
|