This is the LMF version of the Asturian Freeling lexicon. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is al... more
This is the LMF version of the Apertium bilingual dictionary for Spanish and Asturian languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry... more
This is the LMF version of the Basic Vocabulary of Human Genome from the UPF-IULA located at http://www.iula.upf.edu/rec/vbgenoma/esp/index.html The project Vocabulary of the Human Genome (Biotechnology 2), approved in the 2003 plenary meeting of REALITER, incorporates the basic termino... more
RDF version of the Apertium bilingual dictionary CA-IT. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17105. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repos... more
This is the LMF version of the SenSem database created by the Spanish Inter-University Research Group GRIAL. As part of SenSem project, a corpus of sentences annotated at the semantic and syntactic levels was created. The source corpus is made up of around 13 million words extracted fro... more
This is the LMF version of the Apertium bilingual dictionary for Occitan and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
This is the LMF version of the Catalan Freeling lexicon. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is als... more
This is the LMF version of the Apertium bilingual dictionary for English and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
RDF version of the Apertium bilingual dictionary ES-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17127 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
The Catalan Parole/Simple 'lemon' Lexicon is the OWL version of the Spanish Parole & Simple lexicons (defined during the PAROLE LE2-4017 and SIMPLE LE4-8346 projects) once mapped to Lexinfo Model (http://lexinfo.net/). This data set has been published as Linked Open Data in the Data Hub... more
This is the Catalan EuroWordNet-Lemon lexicon. The lexicon was created from the Catalan Word-Net-LMF lexicon which is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR). The lexicon conforms to the 'lemon' specifications and it is linked to the Catalan Pa... more
RDF version of the Apertium bilingual dictionary PT-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17120. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
This is the Catalan Word-Net-LMF lexicon. The Catalan lexicon is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR) and contains 23399 lexical Entries. The MCR currently integrates in the same EuroWordNet framework wordnets from five different languages: E... more
This is the LMF version of the Apertium bilingual dictionary for Italian and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
This is the LMF version of the Catalan Freeling Sense. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is also ... more
RDF version of the Apertium bilingual dictionary EN-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17096 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This is the LMF version of the Multilingual Vocabulary of Economics Resource developed within the frame of the research project RICOTERM2. Financed by the Ministry of Science and Technology, within the Programa Nacional de Promoción General del Conocimiento (2004-2007).
This is the LMF version of the Catalan Parole-Simple lexicon. The original PAROLE lexica (20,000 entries per language) were built conform to a model based on EAGLES guidelines and GENELEX results, underlying a common lexical tool adapted from the EUREKA-GENELEX project. This software to... more
This is the LMF version of the Catalan Apertium ditionary. Monolingual dictionaries for Spanish, Catalan, Gallego and Euskera have been generated from the Apertium expanded lexicons of the es-ca (for both Spanish and Catalan) es-gl (for Galician) and eu-es (for Basque). Apertium is a fr... more
RDF version of the Apertium bilingual dictionary OC-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17107. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
This is the LMF version of the Spanish Bank of Neologisms at the Observatori de Neologia (UPF). The Observatori de Neologia (OBNEO), under the direction by Dr. M. Teresa Cabré, is a public-funded consolidated group within the Institut Universitari de Lingüística Aplicada at Universitat ... more
This is the LMF version of the Apertium bilingual dictionary for Portugues and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is the LMF version of the Apertium bilingual dictionary for French and Catalan languags. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) w... more
This is the LMF version of the Apertium bilingual dictionary for Spanish and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
RDF version of the Apertium bilingual dictionary FR-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17097. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
This glossary contains German entries with multiple English transfers, 22.000 transfers in total. It is lemma-based, annotated with part-of-speech information and with several probabilities, to support transfer selection for multiple translation possibilities. It is the result of PANACE... more
This glossary contains German entries with multiple English transfers, 22.000 transfers in total. It is lemma-based, annotated with part-of-speech information and with source language conceptual contexts which indicate which translation to select in a given context, to support transfer ... more
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
This folder contains files for bilingual glossary creation from factored phrase tables that includes part of speech tagged text for for EL-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speec... more
This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for EL-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech seq... more
This is a domain-specific lexicon of for English for environtment (ENV) domain. This lexicon contains a set of nouns classified into seven different semantic classes. It has been automatically created using the PANACEA web services for noun classification and the crawled data for this d... more
This is a domain-specific lexicon of for English for labour (LAB) domain. This lexicon contains a set of nouns classified into seven different semantic classes. It has been automatically created using the PANACEA web services for noun classification and the crawled data for this domain ... more
This is the LMF version of the Apertium bilingual dictionary for English and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
We present a set of English gold-standards for different noun classes created in PANACEA to train and test automatic classifiers. To create these gold-standards we used we the data from the SemEval 2007 workshop Task 07: Coarse Grained English All-Words (Navigli et al., 2007). The words... more
This is the LMF version of the Apertium English dictionary. Monolingual dictionary for English was generated from the Apertium expanded lexicon of the en-es pair system (English/Spanish). Apertium is a free/open-source machine translation platform, initially aimed at related-language pa... more
This lexicon was produced using an inductive SCF classifier, the tpc_subcat_inductive webservice in the PANACEA project. The lexicon was automatically produced from the PANACEA MCv2 crawled corpus, by parsing the data with the RASP parser (Third Release, Open-Source Version, February 20... more
RDF version of the Apertium bilingual dictionary EN-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/1711 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposit... more
This lexicon was produced using an inductive SCF classifier, the tpc_subcat_inductive webservice in the PANACEA project. The lexicon was automatically produced from the PANACEA MCv2 crawled corpus, by parsing the data with the RASP parser (Third Release, Open-Source Version, February 20... more
This is the LMF version of the Basic Vocabulary of Human Genome from the UPF-IULA located at http://www.iula.upf.edu/rec/vbgenoma/esp/index.html The project Vocabulary of the Human Genome (Biotechnology 2), approved in the 2003 plenary meeting of REALITER, incorporates the basic termino... more
This is a domain-specific lexicon for English for environment (ENV) domain. This lexicon contain both, subcategorization frames for verbs and lexical semantic classes for nouns. This lexicon has been automatically created using PANACEA webservices using crawled data. The crawled data wa... more
This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for EL-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech seq... more
This lexical resource is the LMF version of the Termoteca, a multilingual terminological database based on the monolingual and parallel speciality texts collected in the corpora of the University of Vigo, namely in the CLUVI Corpus and in the Galician Technical Corpus.
This glossary contains terminology in French-to-English, with a focus on labour legislation terms, resulting from PANACEA research. It contains about 2441 entries, both single words and multiwords, with part-of-speech information, manually validated.
This is a domain-specific lexicon for English for labour (LAB) domain. This lexicon contain both, subcategorization frames for verbs and lexical semantic classes for nouns. This lexicon has been automatically created using PANACEA webservices using crawled data. The crawled data was obt... more
This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for for FR-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech... more
RDF version of the Apertium bilingual dictionary EN-GL. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17109 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This glossary contains German entries with multiple English transfers, 22.000 transfers in total. It is lemma-based, annotated with part-of-speech information and with source language conceptual contexts which indicate which translation to select in a given context, to support transfer ... more
This is a domain-specific gold-standard for English subcategorization frames, in the case, for environment (ENV) domain. This gold-standard was manually developed, choosing a set of 28 verbs and 200 senteces for each verb. For each sentence, the SCFs present for the studied verb were ma... more
The Parole/Simple 'lexinfo' Ontology is the OWL version of the Parole & Simple model (defined during the PAROLE LE2-4017 and SIMPLE LE4-8346 projects) once mapped to Lexinfo Model (http://lexinfo.net/). This data set has been published as Linked Open Data in the Data Hub (http://thedata... more
This is the LMF version of the Apertium bilingual dictionary for English and Galician languages. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English-Catal... more
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
This is the LMF version of the English-Galician CLUVI Dictionary developed under the direction of Xavier Gómez Guinovart (2005-2012) from parallel texts in the CLUVI Corpus of the University of Vigo.
This is the LMF version of the Apertium bilingual dictionary for English and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
This glossary contains terminology in French-to-English, with a focus on environmental terms, resulting from PANACEA research. It contains about 3846 entries, both single words and multiwords, with part-of-speech information, manually validated.
This is a domain-specific gold-standard for English subcategorization frames, in the case, for labour (LAB) domain. This gold-standard was manually developed, choosing a set of 29 verbs and 200 senteces for each verb. For each sentence, the SCFs present for the studied verb were manuall... more
This glossary contains German entries with multiple English transfers, 22.000 transfers in total. It is lemma-based, annotated with part-of-speech information and with several probabilities, to support transfer selection for multiple translation possibilities. It is the result of PANACE... more
This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for FR-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech seq... more
This folder contains files for bilingual glossary creation from factored phrase tables that includes part of speech tagged text for for EL-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speec... more
RDF version of the Apertium bilingual dictionary EN-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17096 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This is the LMF version of the Apertium bilingual dictionary for Esperanto and English languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is the LMF version of the Apertium bilingual dictionary for Basque and English languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) ... more
This is the LMF version of the Apertium bilingual dictionary for Spanish and Aragonese languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is a domain-specific lexicon for Spanish for environment (ENV) domain. This lexicon contain both, subcategorization frames for verbs and lexical semantic classes for nouns. This lexicon has been automatically created using PANACEA webservices using crawled data.
This is a domain-specific lexicon of for Spanish for labour (LAB) domain. This lexicon contains the a set of nouns classified into nine different semantic classes. It has been automatically created using the PANACEA web services for noun classification and the crawled data for this doma... more
RDF version of the Apertium bilingual dictionary ES-RO. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17122 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
RDF version of the Apertium bilingual dictionary FR-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17090. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
RDF version of the Apertium bilingual dictionary OC-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17124 . Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://rep... more
This is a domain-specific lexicon of for Spanish for environment (ENV) domain. This lexicon contains the a set of nouns classified into nine different semantic classes. It has been automatically created using the PANACEA web services for noun classification and the crawled data for this... more
This is a domain-specific gold-standard for Spanish subcategorization frames, in the case, for labour legislation (LAB) domain. This gold-standard was manually developed, choosing a set of 30 verbs and 200 senteces for each verb. For each sentence, the SCFs present for the studied verb ... more
This is a domain-specific lexicon for Spanish subcategorization frames for labour (LAB) domain. This lexicon has been automatically created using the PANACEA web service named tpc_subcat_inductive (http://registry.elda.org/services/223) and the crawled data for this domain and language,... more
RDF version of the Apertium bilingual dictionary EN-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/1711 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposit... more
This is the LMF version of the Apertium bilingual dictionary for Spanish Galician languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) we... more
This is the LMF version of the Apertium bilingual dictionary for Spanish and Portuguese languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEnt... more
This is the LMF version of the Apertium bilingual dictionary for Basque and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) ... more
This is the LMF version of the SenSem database created by the Spanish Inter-University Research Group GRIAL. As part of SenSem project, a corpus of sentences annotated at the semantic and syntactic levels was created. The source corpus is made up of around 13 million words extracted fro... more
The Spanish Parole/Simple 'lemon' Lexicon is the OWL version of the Spanish Parole & Simple lexicons (defined during the PAROLE LE2-4017 and SIMPLE LE4-8346 projects) once mapped to Lexinfo Model (http://lexinfo.net/). This data set has been published as Linked Open Data in the Data Hub... more
This is the LMF version of the Apertium Spanish dictionary. Monolingual dictionaries for Spanish, Catalan, Gallego and Euskera have been generated from the Apertium expanded lexicons of the es-ca (for both Spanish and Catalan) es-gl (for Galician) and eu-es (for Basque). Apertium is a f... more
This is a domain-specific lexicon for Spanish for labour (LAB) domain. This lexicon contain both, subcategorization frames for verbs and lexical semantic classes for nouns. This lexicon has been automatically created using PANACEA webservices using crawled data.
This is the Spanish Word-Net-LMF lexicon. The Spanish lexicon is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR) and contains 37,876 lexical Entries. The MCR currently integrates in the same EuroWordNet framework wordnets from five different languages: ... more
This is the LMF version of the Apertium bilingual dictionary for English and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
We present a set of Spanish gold-standards for different noun classes created in PANACEA to train and test automatic classifiers. To create these gold-standards we used the Spanish working lexicon of the Incyta Machine Translation system (Alonso and Bocsák, 2005) to create the abstract,... more
This is the LMF version of the Multilingual Vocabulary of Economics Resource developed within the frame of the research project RICOTERM2. Financed by the Ministry of Science and Technology, within the Programa Nacional de Promoción General del Conocimiento (2004-2007).
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
This is the LMF version of the Apertium bilingual dictionary for Occitan and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
This is the LMF version of the Spanish Parole-Simple lexicon. The original PAROLE lexica (20,000 entries per language) were built conform to a model based on EAGLES guidelines and GENELEX results, underlying a common lexical tool adapted from the EUREKA-GENELEX project. This software to... more
This is a domain-specific gold-standard for Spanish subcategorization frames, in the case, for environment (ENV) domain. This gold-standard was manually developed, choosing a set of 30 verbs and 200 senteces for each verb. For each sentence, the SCFs present for the studied verb were ma... more
This is the LMF version of the Spanish Freeling lexicon. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is als... more
This is a multi-level, multi-domain lexicon for Spanish. It combines the automatically acquired lexica for ENV and LAB domains using PANACEA platform and some general domain lexica, manually developed. The automatically acquired lexica consist of semantic classification of nouns in 9 di... more
This is the LMF version of the Apertium bilingual dictionary for Italian and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
RDF version of the Apertium bilingual dictionary ES-PT. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17101 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This is the LMF version of the Apertium bilingual dictionary for Spanish Romanian languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) we... more
RDF version of the Apertium bilingual dictionary ES-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17127 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
RDF version of the Apertium bilingual dictionary EU-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17095. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
This is a domain-specific lexicon for Spanish subcategorization frames for environment (ENV) domain. This lexicon has been automatically created using the PANACEA web service named tpc_subcat_inductive (http://registry.elda.org/services/223) and the crawled data for this domain and lang... more
RDF version of the Apertium bilingual dictionary ES-GL. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17121 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This is the LMF version of the Spanish Parole lexicon. The original PAROLE lexica (20,000 entries per language) were built conform to a model based on EAGLES guidelines and GENELEX results, underlying a common lexical tool adapted from the EUREKA-GENELEX project. This software tool was ... more
This is the LMF version of the Apertium bilingual dictionary for Spanish and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
This is the LMF version of the Spanish Freeling Sense. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is also ... more
This is the LMF version of the Spanish Bank of Neologisms at the Observatori de Neologia (UPF). The Observatori de Neologia (OBNEO), under the direction by Dr. M. Teresa Cabré, is a public-funded consolidated group within the Institut Universitari de Lingüística Aplicada at Universitat ... more
This is the LMF version of the Apertium bilingual dictionary for French and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) ... more
This is the LMF version of the Basic Vocabulary of Human Genome from the UPF-IULA located at http://www.iula.upf.edu/rec/vbgenoma/esp/index.html The project Vocabulary of the Human Genome (Biotechnology 2), approved in the 2003 plenary meeting of REALITER, incorporates the basic termino... more
This is the LMF version of the Apertium bilingual dictionary for Spanish and Asturian languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry... more
This lexical resource is the LMF version of the Termoteca, a multilingual terminological database based on the monolingual and parallel speciality texts collected in the corpora of the University of Vigo, namely in the CLUVI Corpus and in the Galician Technical Corpus.
Gold-standard for Spanish verbal subcategorization frames. The gold-standard was built merging two manually developed dictionaries: the Spanish working lexicon of the Incyta Machine Translation system (Alonso and Bocsák, 2005) and the Spanish Resource Grammar (Marimon 2010). See Necsule... more
This is the Spanish EuroWordNet-Lemon lexicon. The lexicon was created from the Spanish Word-Net-LMF lexicon which is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR). The lexicon conforms to the 'lemon' specification.
This is the LMF version of the Apertium bilingual dictionary for Esperanto and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is the LMF version of the Apertium bilingual dictionary for Basque and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) ... more
This is the Basque Word-Net-LMF lexicon. The Basque lexicon is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR) and contains 23399 lexical Entries. The MCR currently integrates in the same EuroWordNet framework wordnets from five different languages: Eng... more
RDF version of the Apertium bilingual dictionary EU-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17095. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
This is the LMF version of the Basque Apertium dictionary. Monolingual dictionaries for Spanish, Catalan, Gallego and Euskera have been generated from the Apertium expanded lexicons of the es-ca (for both Spanish andCatalan) es-gl (for Galician) and eu-es (for Basque). Apertium is a fre... more
This is the LMF version of the Multilingual Vocabulary of Economics Resource developed within the frame of the research project RICOTERM2. Financed by the Ministry of Science and Technology, within the Programa Nacional de Promoción General del Conocimiento (2004-2007).
This is the LMF version of the Apertium bilingual dictionary for Basque and English languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) ... more
This is the Basque EuroWordNet-Lemon lexicon. The lexicon was created from the Basque Word-Net-LMF lexicon which is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR). The lexicon conforms to the 'lemon' specification.
This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for FR-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech seq... more
This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for for FR-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech... more
This lexical resource is the LMF version of the Termoteca, a multilingual terminological database based on the monolingual and parallel speciality texts collected in the corpora of the University of Vigo, namely in the CLUVI Corpus and in the Galician Technical Corpus.
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
RDF version of the Apertium bilingual dictionary FR-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17097. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
This glossary contains terminology in French-to-English, with a focus on labour legislation terms, resulting from PANACEA research. It contains about 2441 entries, both single words and multiwords, with part-of-speech information, manually validated.
RDF version of the Apertium bilingual dictionary FR-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17090. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
This is the LMF version of the Apertium French dictionary. Monolingual dictionary for French was generated from the Apertium expanded lexicon of the fr-es pair system (French/Spanish). Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs... more
This is the LMF version of the Apertium bilingual dictionary for French and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) ... more
This glossary contains terminology in French-to-English, with a focus on environmental terms, resulting from PANACEA research. It contains about 3846 entries, both single words and multiwords, with part-of-speech information, manually validated.
This is the LMF version of the Apertium bilingual dictionary for French and Catalan languags. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) w... more
This is the LMF version of the Apertium bilingual dictionary for Esperanto and French languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry... more
RDF version of the Apertium bilingual dictionary ES-GL. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17121 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This is the LMF version of the Apertium bilingual ditionary for Portugues and Galician languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is the Galician Word-Net-LMF lexicon. The Galician lexicon is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR) and contains 23399 lexical Entries. The MCR currently integrates in the same EuroWordNet framework wordnets from five different languages:... more
RDF version of the Apertium bilingual dictionary PT-GL. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17115 . Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://rep... more
This is the LMF version of the Galician Apertium dictionary. Monolingual dictionaries for Spanish, Catalan, Galician and Euskera have been generated from the Apertium expanded lexicons of the es-ca (for both Spanish and Catalan) es-gl (for Galician) and eu-es (for Basque). Apertium is a... more
This is the LMF version of the Multilingual Vocabulary of Economics Resource developed within the frame of the research project RICOTERM2. Financed by the Ministry of Science and Technology, within the Programa Nacional de Promoción General del Conocimiento (2004-2007).
This is the LMF version of the Galician Freeling dictionary. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is... more
RDF version of the Apertium bilingual dictionary EN-GL. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17109 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This is the LMF version of the English-Galician CLUVI Dictionary developed under the direction of Xavier Gómez Guinovart (2005-2012) from parallel texts in the CLUVI Corpus of the University of Vigo.
This lexical resource is the LMF version of the Termoteca, a multilingual terminological database based on the monolingual and parallel speciality texts collected in the corpora of the University of Vigo, namely in the CLUVI Corpus and in the Galician Technical Corpus.
This is the LMF version of the Apertium bilingual dictionary for Spanish Galician languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) we... more
This is the LMF version of the Apertium bilingual dictionary for English and Galician languages. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English-Catal... more
This is the Galician EuroWordNet-Lemon lexicon. The lexicon was created from the Galician Word-Net-LMF lexicon which is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR). The lexicon conforms to the 'lemon' specification.
TThe PANACEA_SCF_Gold_LAB_IT is a manually created gold-standard lexicon of verbal subcategorisation frames for 27 verb lemmas. The language is Italian and the domain is Labour Legislation. The lexicon was created through manual annotation of 200 random contexts drawn from the MCv1_LAB_... more
The PAROLE-SCF-31-IT is a lexicon of verb subcategorisation frames for 31 verb lemmas extracted from the PAROLE Italian Lexicon (Ruimy et a. 2003).
TThis is a lexicon of verb subcategorisation frames automatically extracted from a 300Mio words newspaper corpus using a language independent SCF acquisition software. The lexicon was produced at CNR-ILC, Pisa, Italy as an outcome of the PANACEA EU-FP7 Funded Project under Grant Agreeme... more
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
This is the LMF version of the Apertium bilingual dictionary for Italian and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
The OpenDomain SCF Italian Lexicon is a lexicon of verb subcategorisation frames automatically extracted from a 300Mio words newspaper corpus using a language dependent SCF acquisition software. The lexicon was produced at CNR-ILC, Pisa, Italy as an outcome of the PANACEA EU-FP7 Funded ... more
The PANACEA_SCF_Gold_ENV_IT is a manually created gold-standard lexicon of verbal subcategorisation frames for 26 verb lemmas. The language is Italian and the domain is Environment. The lexicon was created through manual annotation of 200 random contexts drawn from the MC_v1_ENV_IT corp... more
The Italian PANACEA_ENV_MWE_SCF_merged.lmf.xml lexicon is obtained by merging two automatically extracted lexicons: a domain lexicon (environment) for SCFs, PANACEA_SCF_IT_environment.lmf.xml and a MWE Italian lexicon env-mw.lmf.xml. The lexicon was produced at CNR-ILC, Pisa, Italy as a... more
RDF version of the Apertium bilingual dictionary CA-IT. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17105. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repos... more
The PANACEA_SCF_IT_LAB is an automatically created lexicon of verbal subcategorisation frames for 27 verb lemmas. The language is Italian and the domain is Labour Legislation. The lexicon was acquired from a monolingual corpus automatically crawled, normalised, cleaned, and NLP processe... more
The PANACEA_SCF_IT_ENV is an automatically created lexicon of verbal subcategorisation frames for 26 verb lemmas. The language is Italian and the domain is Environment. The lexicon was acquired from a monolingual corpus automatically crawled, normalised, cleaned, and NLP processed (Corp... more
The Environment MW Italian Lexicon is a lexicon of noun-noun multiword expressions automatically extracted from a 36Mio word web crawled corpus in the environmental domain. The lexicon was produced at CNR-ILC, Pisa, Italy as an outcome of the PANACEA EU-FP7 Funded Project under Grant Ag... more
This is the LMF version of the Apertium bilingual dictionary for Italian and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
The PANACEA_SCF_Lexicon_ITA_Gold is an manually created gold-standard lexicon of verbal subcategorisation frames for 30 verb lemmas. The language is Italian and the domain is general. The lexicon was manually created using three already existing lexicons, two manually curated (PAROLE-IT... more
The Italian PANACEA_LAB_SCF_MWE_merged.lmf.xml lexicon is obtained by merging two automatically extracted lexicons: a domain lexicon (labour) for SCFs, PANACEA_SCF_IT_labour.lmf.xml and a MWE Italian lexicon lab-mw.lmf.xml. The lexicon was produced at CNR-ILC, Pisa, Italy as an outcome ... more
The Italian PAROLE_env_merged.lmf.xml is SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (environment) PANACEA_SCF_IT_environment.lmf.xml and a the SCFs Italian lexicon PAROLE_lmf_subcat_ita.lmf.xml, generated from PAROLE SIMPLE Lexicon. The lexico... more
This is the LMF version of the Apertium Italian dictionary. Monolingual dictionary for Italian was generated from the Apertium expanded lexicon of the es-it. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to d... more
The Italian PANACEA_rep_env_merged.lmf.xml is SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (environment)for SCFs, PANACEA_SCF_IT_environment.lmf.xml and a general domain SCFs Italian lexicon repubblica.scf_extracted.lmf. The lexicon was produced... more
The Italian PAROLE_lab_merged.lmf.xml is SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (labour) PANACEA_SCF_IT_labour.lmf.xml and a the SCFs Italian lexicon PAROLE_lmf_subcat_ita.lmf.xml, generated from PAROLE SIMPLE Lexicon. The lexicon was prod... more
The Italian PANACEA_rep_lab_merged.lmf.xml is SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (labour) PANACEA_SCF_IT_labour.lmf.xml and a general domain SCFs Italian lexicon repubblica.scf_extracted.lmf. The lexicon was produced at CNR-ILC, Pisa, ... more
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
RDF version of the Apertium bilingual dictionary OC-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17107. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
This is the LMF version of the Apertium bilingual dictionary for Occitan and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
RDF version of the Apertium bilingual dictionary OC-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17124 . Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://rep... more
This is the LMF version of the Apertium bilingual dictionary for Occitan and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more
This is the LMF version of the Apertium bilingual dictionary for Spanish and Portuguese languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEnt... more
This is the LMF version of the Apertium bilingual ditionary for Portugues and Galician languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
RDF version of the Apertium bilingual dictionary PT-GL. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17115 . Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://rep... more
This lexical resource is the LMF version of the Termoteca, a multilingual terminological database based on the monolingual and parallel speciality texts collected in the corpora of the University of Vigo, namely in the CLUVI Corpus and in the Galician Technical Corpus.
This is the LMF version of the Apertium bilingual dictionary for Portugues and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
RDF version of the Apertium bilingual dictionary ES-PT. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17101 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This is the LMF version of the Apertium Portuguese dictionary. Monolingual dictionary for Portuguese was generated from the Apertium expanded lexicon of the pt-ca pair system. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but rece... more
RDF version of the Apertium bilingual dictionary PT-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17120. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
RDF version of the Apertium bilingual dictionary ES-RO. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17122 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more
This is the LMF version of the Apertium bilingual dictionary for Spanish Romanian languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) we... more
This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resort for the consultation and diffusion of terminological projects elaborated by s... more
This is the LMF version of the Apertium bilingual dictionary for Esperanto and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is the LMF version of the Apertium bilingual dictionary for Esperanto and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is the LMF version of the Apertium bilingual dictionary for Esperanto and English languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is the LMF version of the Apertium bilingual dictionary for Esperanto and French languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry... more
This is the LMF version of the Apertium bilingual dictionary for Esperanto and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
This is the LMF version of the Apertium bilingual dictionary for Spanish and Aragonese languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more
The Place Names Database (KNAB) is a systematic computerized collection of data on geographical names from both Estonia and abroad that is being developed at the Institute of Estonian Language. Its purpose is to facilitate the study and standardization of geographical names by providing... more