Lexica

Lexica by Linguality

Bilingual

  • Apertium RDF En Ca

    RDF version of the Apertium bilingual dictionary EN-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17096 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more

  • Apertium RDF Es Ca

    RDF version of the Apertium bilingual dictionary ES-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17127 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more

  • Basque Spanish LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Basque and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) ... more

  • Apertium RDF Oc Es

    RDF version of the Apertium bilingual dictionary OC-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17124 . Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://rep... more

  • Apertium RDF En Es

    RDF version of the Apertium bilingual dictionary EN-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/1711 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposit... more

  • English Spanish LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for English and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more

  • Apertium RDF Ca It

    RDF version of the Apertium bilingual dictionary CA-IT. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17105. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repos... more

  • Apertium RDF Pt Gl

    RDF version of the Apertium bilingual dictionary PT-GL. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17115 . Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://rep... more

  • PANACEA Environment Bilingual Glossary El En (Greek English)

    This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for EL-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech seq... more

  • PANACEA Environment Bilingual Glossary French To English

    This glossary contains terminology in French-to-English, with a focus on environmental terms, resulting from PANACEA research. It contains about 3846 entries, both single words and multiwords, with part-of-speech information, manually validated.

  • Italian Catalan LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Italian and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more

  • Portuguese Galician LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Portuguese and Galician languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more

  • PANACEA Bilingual Glossary German English With Contextual Transfer Information

    This glossary contains German entries with multiple English transfers, 22,000 transfers in total. It is lemma-based, annotated with part-of-speech information and with source language conceptual contexts which indicate which translation to select in a given context, to support transfer ... more

  • PANACEA Labour Bilingual Glossary El En (Greek English)

    This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for EL-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speec... more

  • PANACEA Labour Bilingual Glossary Fr En (French English)

    This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for FR-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech... more

  • Italian Spanish LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Italian and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more

  • Apertium RDF Fr Ca

    RDF version of the Apertium bilingual dictionary FR-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17097. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more

  • Apertium RDF Es Pt

    RDF version of the Apertium bilingual dictionary ES-PT. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17101 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more

  • Occitan Catalan LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Occitan and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more

  • Spanish Galician LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Spanish and Galician languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) we... more

  • Apertium RDF Es Gl

    RDF version of the Apertium bilingual dictionary ES-GL. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17121 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more

  • Apertium RDF En Gl

    RDF version of the Apertium bilingual dictionary EN-GL. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17109 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more

  • French Catalan LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for French and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) w... more

  • English Galician LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for English and Galician languages. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to deal with more divergent language pairs (such as English-Catal... more

  • Esperanto English LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Esperanto and English languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more

  • Spanish Catalan LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Spanish and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more

  • Spanish Asturian LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Spanish and Asturian languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry... more

  • French Spanish LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for French and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) ... more

  • Spanish Romanian LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Spanish and Romanian languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) we... more

  • Occitan Spanish LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Occitan and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more

  • PANACEA Bilingual Glossary German English With Probabilities

    This glossary contains German entries with multiple English transfers, 22,000 transfers in total. It is lemma-based, annotated with part-of-speech information and with several probabilities, to support transfer selection for multiple translation possibilities. It is the result of PANACE... more

  • Apertium RDF Oc Ca

    RDF version of the Apertium bilingual dictionary OC-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17107. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more

  • Spanish Aragonese LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Spanish and Aragonese languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more

  • English Galician Cluvi Dictionary

    This is the LMF version of the English-Galician CLUVI Dictionary developed under the direction of Xavier Gómez Guinovart (2005-2012) from parallel texts in the CLUVI Corpus of the University of Vigo.

  • Portuguese Catalan LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Portuguese and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more

  • Spanish Portuguese LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Spanish and Portuguese languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEnt... more

  • PANACEA Environment Bilingual Glossary Fr En (French English)

    This folder contains files for bilingual glossary creation from factored phrase tables that include part of speech tagged text for FR-EN language pair. The tables are firstly filtered using part of speech tag sequences for each language so that entries with unsuitable part of speech seq... more

  • Esperanto Spanish LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Esperanto and Spanish languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more

  • Apertium RDF Eu Es

    RDF version of the Apertium bilingual dictionary EU-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17095. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more

  • Apertium RDF Pt Ca

    RDF version of the Apertium bilingual dictionary PT-CA. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17120. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more

  • Apertium RDF Es Ro

    RDF version of the Apertium bilingual dictionary ES-RO. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17122 Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://reposi... more

  • Esperanto Catalan LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Esperanto and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntr... more

  • Basque English LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Basque and English languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry) ... more

  • PANACEA Labour Legislation Bilingual Glossary French To English

    This glossary contains terminology in French-to-English, with a focus on labour legislation terms, resulting from PANACEA research. It contains about 2441 entries, both single words and multiwords, with part-of-speech information, manually validated.

  • Apertium RDF Fr Es

    RDF version of the Apertium bilingual dictionary FR-ES. The original dataset (in LMF) comes from http://hdl.handle.net/10230/17090. Apertium is a free/open-source machine translation platform. The LMF (Lexical Markup Framework) version of their linguistic data (available at http://repo... more

  • Esperanto French LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for Esperanto and French languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry... more

  • English Catalan LMF Apertium Bilingual Dictionary

    This is the LMF version of the Apertium bilingual dictionary for English and Catalan languages. Bilingual LMF dictionaries were generated from Apertium bilingual dix files. For each Apertium bilingual correspondence, the corresponding source and target monolingual entries (LexicalEntry)... more

Monolingual

  • LMF Version Of The Sen Sem Spanish Data Base

    This is the LMF version of the SenSem database created by the Spanish Inter-University Research Group GRIAL. As part of SenSem project, a corpus of sentences annotated at the semantic and syntactic levels was created. The source corpus is made up of around 13 million words extracted fro... more

  • Spanish Parole Simple Lemon Lexicon

    The Spanish Parole/Simple 'lemon' Lexicon is the OWL version of the Spanish Parole & Simple lexicons (defined during the PAROLE LE2-4017 and SIMPLE LE4-8346 projects) once mapped to Lexinfo Model (http://lexinfo.net/). This data set has been published as Linked Open Data in the Data Hub... more

  • Basque Euro Word Net Lemon

    This is the Basque EuroWordNet-Lemon lexicon. The lexicon was created from the Basque Word-Net-LMF lexicon which is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR). The lexicon conforms to the 'lemon' specification.

  • PANACEA Spanish Automatically Acquired Lexicon For Env Domain: Subcategorization Frames And Lexical Semantic Classes For Nouns

    This is a domain-specific Spanish lexicon for the environment (ENV) domain. It contains both subcategorization frames for verbs and lexical semantic classes for nouns. It was created automatically from crawled data using PANACEA web services.

  • Portuguese LMF Apertium Dictionary

    This is the LMF version of the Apertium Portuguese dictionary. Monolingual dictionary for Portuguese was generated from the Apertium expanded lexicon of the pt-ca pair system. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but rece... more

  • Basque Word Net LMF

    This is the Basque Word-Net-LMF lexicon. The Basque lexicon is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR) and contains 23399 lexical Entries. The MCR currently integrates in the same EuroWordNet framework wordnets from five different languages: Eng... more

  • PANACEA Spanish Automatically Acquired Lexicon For Lab Domain: Subcategorization Frames And Lexical Semantic Classes For Nouns

    This is a domain-specific Spanish lexicon for the labour (LAB) domain. It contains both subcategorization frames for verbs and lexical semantic classes for nouns. It was created automatically from crawled data using PANACEA web services.

  • Catalan LMF Apertium Dictionary

    This is the LMF version of the Catalan Apertium dictionary. Monolingual dictionaries for Spanish, Catalan, Galician and Basque have been generated from the Apertium expanded lexicons of the es-ca (for both Spanish and Catalan), es-gl (for Galician) and eu-es (for Basque). Apertium is a fr... more

  • PANACEA Italian Automatically Acquired Lexicon For Lab Domain: Subcategorization Frames (V Subcat)

    The PANACEA_SCF_IT_LAB is an automatically created lexicon of verbal subcategorisation frames for 27 verb lemmas. The language is Italian and the domain is Labour Legislation. The lexicon was acquired from a monolingual corpus automatically crawled, normalised, cleaned, and NLP processe... more

  • Catalan LMF Freeling Sense

    This is the LMF version of the Catalan Freeling Sense. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is also ... more

  • PANACEA Italian V Subcat Gold Standard For Lab Domain

    The PANACEA_SCF_Gold_LAB_IT is a manually created gold-standard lexicon of verbal subcategorisation frames for 27 verb lemmas. The language is Italian and the domain is Labour Legislation. The lexicon was created through manual annotation of 200 random contexts drawn from the MCv1_LAB_... more

  • PANACEA Spanish Automatically Acquired Lexicon For Lab Domain: Subcategorization Frames (V Subcat)

    This is a domain-specific lexicon for Spanish subcategorization frames for labour (LAB) domain. This lexicon has been automatically created using the PANACEA web service named tpc_subcat_inductive (http://registry.elda.org/services/223) and the crawled data for this domain and language,... more

  • Asturian LMF Freeling Lexicon

    This is the LMF version of the Asturian Freeling lexicon. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is al... more

  • Spanish LMF Parole/Simple Lexicon

    This is the LMF version of the Spanish Parole-Simple lexicon. The original PAROLE lexica (20,000 entries per language) were built conform to a model based on EAGLES guidelines and GENELEX results, underlying a common lexical tool adapted from the EUREKA-GENELEX project. This software to... more

  • PANACEA Italian V Subcat Gold Standard For Env Domain

    The PANACEA_SCF_Gold_ENV_IT is a manually created gold-standard lexicon of verbal subcategorisation frames for 26 verb lemmas. The language is Italian and the domain is Environment. The lexicon was created through manual annotation of 200 random contexts drawn from the MC_v1_ENV_IT corp... more

  • LMF Version Of The Sen Sem Catalan Data Base

    This is the LMF version of the SenSem database created by the Spanish Inter-University Research Group GRIAL. As part of SenSem project, a corpus of sentences annotated at the semantic and syntactic levels was created. The source corpus is made up of around 13 million words extracted fro... more

  • Spanish Euro Word Net Lemon

    This is the Spanish EuroWordNet-Lemon lexicon. The lexicon was created from the Spanish Word-Net-LMF lexicon which is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR). The lexicon conforms to the 'lemon' specification.

  • Parole Simple Lex Info Ontology

    The Parole/Simple 'lexinfo' Ontology is the OWL version of the Parole & Simple model (defined during the PAROLE LE2-4017 and SIMPLE LE4-8346 projects) once mapped to Lexinfo Model (http://lexinfo.net/). This data set has been published as Linked Open Data in the Data Hub (http://thedata... more

  • PANACEA English Automatically Acquired Lexicon For Lab Domain: Subcategorization Frames And Lexical Semantic Classes For Nouns

    This is a domain-specific English lexicon for the labour (LAB) domain. It contains both subcategorization frames for verbs and lexical semantic classes for nouns. It was created automatically from crawled data using PANACEA web services. The crawled data was obt... more

  • PANACEA Labour Scf Mwe Merged Italian Lexicon

    The Italian PANACEA_LAB_SCF_MWE_merged.lmf.xml lexicon is obtained by merging two automatically extracted lexicons: a domain lexicon (labour) for SCFs, PANACEA_SCF_IT_labour.lmf.xml and a MWE Italian lexicon lab-mw.lmf.xml. The lexicon was produced at CNR-ILC, Pisa, Italy as an outcome ... more

  • PANACEA Italian Automatically Acquired Lexicon For Env Domain: Subcategorization Frames (V Subcat)

    The PANACEA_SCF_IT_ENV is an automatically created lexicon of verbal subcategorisation frames for 26 verb lemmas. The language is Italian and the domain is Environment. The lexicon was acquired from a monolingual corpus automatically crawled, normalised, cleaned, and NLP processed (Corp... more

  • Catalan Word Net LMF

    This is the Catalan Word-Net-LMF lexicon. The Catalan lexicon is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR) and contains 23399 lexical Entries. The MCR currently integrates in the same EuroWordNet framework wordnets from five different languages: E... more

  • PANACEA Spanish V Subcat Gold Standard Lexicon Lab Domain

    This is a domain-specific gold standard for Spanish subcategorization frames, in this case for the labour legislation (LAB) domain. This gold standard was manually developed, choosing a set of 30 verbs and 200 sentences for each verb. For each sentence, the SCFs present for the studied verb ... more

  • PANACEA Italian Parole V Subcat Gold Standard Lexicon

    The PAROLE-SCF-31-IT is a lexicon of verb subcategorisation frames for 31 verb lemmas extracted from the PAROLE Italian Lexicon (Ruimy et al. 2003).

  • Spanish LMF Freeling Sense

    This is the LMF version of the Spanish Freeling Sense. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is also ... more

  • PANACEA Italian V Subcat Repubblica Lexicon (Language Independent Extractor)

    This is a lexicon of verb subcategorisation frames automatically extracted from a 300-million-word newspaper corpus using language-independent SCF acquisition software. The lexicon was produced at CNR-ILC, Pisa, Italy as an outcome of the PANACEA EU-FP7 Funded Project under Grant Agreeme... more

  • Catalan LMF Parole/Simple Lexicon

    This is the LMF version of the Catalan Parole-Simple lexicon. The original PAROLE lexica (20,000 entries per language) were built conform to a model based on EAGLES guidelines and GENELEX results, underlying a common lexical tool adapted from the EUREKA-GENELEX project. This software to... more

  • PANACEA English Automatically Acquired Lexicon For Env Domain: Subcategorization Frames And Lexical Semantic Classes For Nouns

    This is a domain-specific English lexicon for the environment (ENV) domain. It contains both subcategorization frames for verbs and lexical semantic classes for nouns. It was created automatically from crawled data using PANACEA web services. The crawled data wa... more

  • PANACEA Spanish Automatically Acquired Lexicon For Env Domain: Subcategorization Frames (V Subcat)

    This is a domain-specific lexicon for Spanish subcategorization frames for environment (ENV) domain. This lexicon has been automatically created using the PANACEA web service named tpc_subcat_inductive (http://registry.elda.org/services/223) and the crawled data for this domain and lang... more

  • PANACEA Spanish V Subcat Gold Standard Lexicon Env Domain

    This is a domain-specific gold standard for Spanish subcategorization frames, in this case for the environment (ENV) domain. This gold standard was manually developed, choosing a set of 30 verbs and 200 sentences for each verb. For each sentence, the SCFs present for the studied verb were ma... more

  • Spanish LMF Parole Lexicon

    This is the LMF version of the Spanish Parole lexicon. The original PAROLE lexica (20,000 entries per language) were built conform to a model based on EAGLES guidelines and GENELEX results, underlying a common lexical tool adapted from the EUREKA-GENELEX project. This software tool was ... more

  • PANACEA English Automatically Acquired Lexicon For Env Domain: Subcategorization Frames (V Subcat)

    This lexicon was produced using an inductive SCF classifier, the tpc_subcat_inductive webservice in the PANACEA project. The lexicon was automatically produced from the PANACEA MCv2 crawled corpus, by parsing the data with the RASP parser (Third Release, Open-Source Version, February 20... more

  • Spanish LMF Apertium Dictionary

    This is the LMF version of the Apertium Spanish dictionary. Monolingual dictionaries for Spanish, Catalan, Galician and Basque have been generated from the Apertium expanded lexicons of the es-ca (for both Spanish and Catalan), es-gl (for Galician) and eu-es (for Basque). Apertium is a f... more

  • Galician Word Net LMF

    This is the Galician Word-Net-LMF lexicon. The Galician lexicon is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR) and contains 23399 lexical Entries. The MCR currently integrates in the same EuroWordNet framework wordnets from five different languages:... more

  • PANACEA Environment And Repubblica Merged Italian Lexicon

    The Italian PANACEA_rep_env_merged.lmf.xml is an SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (environment) for SCFs, PANACEA_SCF_IT_environment.lmf.xml, and a general-domain Italian SCF lexicon, repubblica.scf_extracted.lmf. The lexicon was produced... more

  • PANACEA English V Subcat Gold Standard For Env Domain

    This is a domain-specific gold standard for English subcategorization frames, in this case for the environment (ENV) domain. This gold standard was manually developed, choosing a set of 28 verbs and 200 sentences for each verb. For each sentence, the SCFs present for the studied verb were ma... more

  • Catalan LMF Freeling Lexicon

    This is the LMF version of the Catalan Freeling lexicon. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of services. Nevertheless, a simple main program is als... more

  • English LMF Apertium Dictionary

    This is the LMF version of the Apertium English dictionary. Monolingual dictionary for English was generated from the Apertium expanded lexicon of the en-es pair system (English/Spanish). Apertium is a free/open-source machine translation platform, initially aimed at related-language pa... more

  • Bank Of Spanish Neologisms

    This is the LMF version of the Spanish Bank of Neologisms at the Observatori de Neologia (UPF). The Observatori de Neologia (OBNEO), under the direction of Dr. M. Teresa Cabré, is a publicly funded consolidated group within the Institut Universitari de Lingüística Aplicada at Universitat ... more

  • PANACEA Environment And Parole Merged Italian Lexicon

    The Italian PAROLE_env_merged.lmf.xml is an SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (environment), PANACEA_SCF_IT_environment.lmf.xml, and the Italian SCF lexicon PAROLE_lmf_subcat_ita.lmf.xml, generated from PAROLE SIMPLE Lexicon. The lexico... more

  • Catalan Parole Simple Lemon Lexicon

    The Catalan Parole/Simple 'lemon' Lexicon is the OWL version of the Spanish Parole & Simple lexicons (defined during the PAROLE LE2-4017 and SIMPLE LE4-8346 projects) once mapped to Lexinfo Model (http://lexinfo.net/). This data set has been published as Linked Open Data in the Data Hub... more

  • PANACEA Environment Multi Word Italian Lexicon

    The Environment MW Italian Lexicon is a lexicon of noun-noun multiword expressions automatically extracted from a 36Mio word web crawled corpus in the environmental domain. The lexicon was produced at CNR-ILC, Pisa, Italy as an outcome of the PANACEA EU-FP7 Funded Project under Grant Ag... more

  • PANACEA Labour And Parole Merged Italian Lexicon

    The Italian PAROLE_lab_merged.lmf.xml is an SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (labour), PANACEA_SCF_IT_labour.lmf.xml, and the Italian SCF lexicon PAROLE_lmf_subcat_ita.lmf.xml, generated from PAROLE SIMPLE Lexicon. The lexicon was prod... more

  • Trl Spanish V Subcat Lexicon: LMF Format

    Gold-standard for Spanish verbal subcategorization frames. The gold-standard was built merging two manually developed dictionaries: the Spanish working lexicon of the Incyta Machine Translation system (Alonso and Bocsák, 2005) and the Spanish Resource Grammar (Marimon 2010). See Necsule... more

  • PANACEA English Automatically Acquired Lexicon For Env Domain: Lexical Semantic Classes For Nouns

    This is a domain-specific English lexicon for the environment (ENV) domain. This lexicon contains a set of nouns classified into seven different semantic classes. It has been automatically created using the PANACEA web services for noun classification and the crawled data for this d... more

  • PANACEA Italian V Subcat Repubblica Lexicon (Language Dependent Extractor)

    The OpenDomain SCF Italian Lexicon is a lexicon of verb subcategorisation frames automatically extracted from a 300-million-word newspaper corpus using language-dependent SCF acquisition software. The lexicon was produced at CNR-ILC, Pisa, Italy as an outcome of the PANACEA EU-FP7 Funded ... more

  • PANACEA Spanish Multi Level, Multi Domain Lexicon

    This is a multi-level, multi-domain lexicon for Spanish. It combines the lexica automatically acquired for the ENV and LAB domains using the PANACEA platform with some manually developed general-domain lexica. The automatically acquired lexica consist of semantic classification of nouns in 9 di... more

  • Catalan Euro Word Net Lemon

    This is the Catalan EuroWordNet-Lemon lexicon. The lexicon was created from the Catalan Word-Net-LMF lexicon which is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR). The lexicon conforms to the 'lemon' specifications and it is linked to the Catalan Pa... more

  • PANACEA English Gold Standard For Lexical Semantic Classification

    We present a set of English gold standards for different noun classes created in PANACEA to train and test automatic classifiers. To create these gold standards we used the data from the SemEval 2007 workshop Task 07: Coarse Grained English All-Words (Navigli et al., 2007). The words... more

  • PANACEA Labour And Repubblica Merged Italian Lexicon

    The Italian PANACEA_rep_lab_merged.lmf.xml is an SCF lexicon obtained by merging two automatically extracted lexicons: a domain lexicon (labour), PANACEA_SCF_IT_labour.lmf.xml, and a general-domain Italian SCF lexicon, repubblica.scf_extracted.lmf. The lexicon was produced at CNR-ILC, Pisa, ... more

  • PANACEA Italian V Subcat Gold Standard Lexicon

    The PANACEA_SCF_Lexicon_ITA_Gold is a manually created gold-standard lexicon of verbal subcategorisation frames for 30 verb lemmas. The language is Italian and the domain is general. The lexicon was manually created using three already existing lexicons, two manually curated (PAROLE-IT... more

  • Spanish LMF Freeling Lexicon

    This is the LMF version of the Spanish Freeling lexicon. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of service. Nevertheless, a simple main program is als... more

  • PANACEA English Automatically Acquired Lexicon For Lab Domain: Lexical Semantic Classes For Nouns

    This is a domain-specific lexicon for English for the labour (LAB) domain. This lexicon contains a set of nouns classified into seven different semantic classes. It has been automatically created using the PANACEA web services for noun classification and the crawled data for this domain ... more

  • Bank Of Catalan Neologisms

    This is the LMF version of the Spanish Bank of Neologisms at the Observatori de Neologia (UPF). The Observatori de Neologia (OBNEO), under the direction of Dr. M. Teresa Cabré, is a publicly funded consolidated group within the Institut Universitari de Lingüística Aplicada at Universitat ... more

  • Basque LMF Apertium Dictionary

    This is the LMF version of the Basque Apertium dictionary. Monolingual dictionaries for Spanish, Catalan, Galician and Euskera have been generated from the Apertium expanded lexicons of the es-ca (for both Spanish and Catalan), es-gl (for Galician) and eu-es (for Basque). Apertium is a fre... more

  • French LMF Apertium Dictionary

    This is the LMF version of the Apertium French dictionary. The monolingual dictionary for French was generated from the Apertium expanded lexicon of the fr-es pair system (French/Spanish). Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs... more

  • Italian LMF Apertium Dictionary

    This is the LMF version of the Apertium Italian dictionary. The monolingual dictionary for Italian was generated from the Apertium expanded lexicon of the es-it pair. Apertium is a free/open-source machine translation platform, initially aimed at related-language pairs but recently expanded to d... more

  • Galician Euro Word Net Lemon

    This is the Galician EuroWordNet-Lemon lexicon. The lexicon was created from the Galician Word-Net-LMF lexicon which is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR). The lexicon conforms to the 'lemon' specification.

  • PANACEA English Automatically Acquired Lexicon For Lab Domain: Subcategorization Frames (V Subcat)

    This lexicon was produced using an inductive SCF classifier, the tpc_subcat_inductive webservice in the PANACEA project. The lexicon was automatically produced from the PANACEA MCv2 crawled corpus, by parsing the data with the RASP parser (Third Release, Open-Source Version, February 20... more

  • PANACEA English V Subcat Gold Standard For Lab Domain

    This is a domain-specific gold standard for English subcategorization frames, in this case for the labour (LAB) domain. This gold standard was manually developed, choosing a set of 29 verbs and 200 sentences for each verb. For each sentence, the SCFs present for the studied verb were manuall... more

  • Knab, Place Names Database

    The Place Names Database (KNAB) is a systematic computerized collection of data on geographical names from both Estonia and abroad that is being developed at the Institute of Estonian Language. Its purpose is to facilitate the study and standardization of geographical names by providing... more

  • Galician LMF Freeling Lexicon

    This is the LMF version of the Galician Freeling dictionary. FreeLing is a developer-oriented library providing language analysis services. FreeLing is designed to be used as an external library from any application requiring this kind of service. Nevertheless, a simple main program is... more

  • PANACEA Spanish Automatically Acquired Lexicon For Env Domain: Lexical Semantic Classes For Nouns

    This is a domain-specific lexicon for Spanish for the environment (ENV) domain. This lexicon contains a set of nouns classified into nine different semantic classes. It has been automatically created using the PANACEA web services for noun classification and the crawled data for this... more

  • PANACEA Spanish Automatically Acquired Lexicon For Lab Domain: Lexical Semantic Classes For Nouns

    This is a domain-specific lexicon for Spanish for the labour (LAB) domain. This lexicon contains a set of nouns classified into nine different semantic classes. It has been automatically created using the PANACEA web services for noun classification and the crawled data for this doma... more

  • Spanish Word Net LMF

    This is the Spanish Word-Net-LMF lexicon. The Spanish lexicon is part of the Multilingual Central Repository (MCR http://adimen.si.ehu.es/web/MCR) and contains 37,876 lexical Entries. The MCR currently integrates in the same EuroWordNet framework wordnets from five different languages: ... more

  • Galician LMF Apertium Dictionary

    This is the LMF version of the Galician Apertium dictionary. Monolingual dictionaries for Spanish, Catalan, Galician and Euskera have been generated from the Apertium expanded lexicons of the es-ca (for both Spanish and Catalan), es-gl (for Galician) and eu-es (for Basque). Apertium is a... more

  • PANACEA Environment Scf Mwe Merged Italian Lexicon

    The Italian PANACEA_ENV_MWE_SCF_merged.lmf.xml lexicon is obtained by merging two automatically extracted lexicons: a domain lexicon (environment) for SCFs, PANACEA_SCF_IT_environment.lmf.xml, and an Italian MWE lexicon, env-mw.lmf.xml. The lexicon was produced at CNR-ILC, Pisa, Italy as a... more

  • PANACEA Spanish Gold Standard For Lexical Semantic Classification

    We present a set of Spanish gold-standards for different noun classes created in PANACEA to train and test automatic classifiers. To create these gold-standards we used the Spanish working lexicon of the Incyta Machine Translation system (Alonso and Bocsák, 2005) to create the abstract,... more

Multilingual

  • Multilingual Vocabulary Of Economics

    This is the LMF version of the Multilingual Vocabulary of Economics Resource, developed within the framework of the research project RICOTERM2 and financed by the Ministry of Science and Technology within the Programa Nacional de Promoción General del Conocimiento (2004-2007).

  • Termoteca

    This lexical resource is the LMF version of the Termoteca, a multilingual terminological database based on the monolingual and parallel speciality texts collected in the corpora of the University of Vigo, namely in the CLUVI Corpus and in the Galician Technical Corpus.

  • LMF Upf Term

    This is the LMF version of the LMF UPF Term located at http://www.iula.upf.edu/rec/upfterm/cat/index.htm. The UPF_TERM terminological data bank was created with the purpose of establishing an electronic resource for the consultation and diffusion of terminological projects elaborated by s... more

  • Basic Vocabulary Of Human Genome

    This is the LMF version of the Basic Vocabulary of Human Genome from the UPF-IULA, located at http://www.iula.upf.edu/rec/vbgenoma/esp/index.html. The project Vocabulary of the Human Genome (Biotechnology 2), approved in the 2003 plenary meeting of REALITER, incorporates the basic termino... more
