The project's goal was to identify and track topics about the Greco-Roman world as they appear in multilingual public document collections (Internet Archive, JSTOR, HathiTrust, etc.). This project aimed to create an environment and to generate 'dynamic variorum' editions of texts based ... more
This project is a multi-institutional, international collaboration between environmental historians in Canada and computer scientists in the UK that uses text-mining software to explore thousands of pages of historical documents related to international commodity trading in the British ... more
The project aimed to develop new ways of discovering and analyzing language patterns embedded in historical newspaper databases. Its approach was to combine text mining and geospatial visualization methods to explore massive collections of electronic texts.
This collaborative project analyzed the degree to which the effects of the Enlightenment can be observed in the writing of people of various occupations in a corpus of 53,000 18th-century letters called Electronic Enlightenment (EE). The hypothesis of the project presents a new perspect... more
Project's goal is to develop ways of searching and visualizing the interactions between humanities and sciences based on models of argumentation by means of tool generation. The research behind this project looks to establish what volume of collections of text (such as HathiTrust/Googl... more
This project is researching the application of tools to detect, link and visualize events, trends, people, organizations, and other entities of interest to social history. Having text-mining-based rich semantic metadata extraction for collections' indexing, clustering and classification... more
This project focuses on the extraction of information about places, people and events in their lives from medieval charters by using NLP technologies as NER, among others.
The project's goal was to visualize certain correspondence networks from digital scholarly editions of historical letters as a way of exploring a bundle of historical questions about the geographic range, diversity, and interactions among intellectuals during seventeenth, eighteenth, an... more
A collaboration among interdisciplinary researchers at Concordia University and the University of Wisconsin-Madison, Project Arclight is developing a web-based tool that enables the study of 20th century American media through comparisons across time and space. The Arclight tool uses... more
This project aimed at providing a historic place-name gazetteer covering a thousand years of history and providing links to attestations in old texts and maps', NER techniques were used to extract new data from digitized English Place -Name Survey.
A collaboration among interdisciplinary researchers at Concordia University and the University of Wisconsin-Madison, Project Arclight is developing a web-based tool that enables the study of 20th century American media through comparisons across time and space. The Arclight tool uses... more
This is an e-communication and e-politics project that analyses the 2011 municipal election campaign in Spain. The main objective of the project is to analyse new tendencies in e-politics and to demonstrate the impact of internet on the electoral process. CLARIN contributed to the resea... more
This project investigates androcentric practices in Spanish general press since late 80s. CLARIN contributed in the research by providing services and tools to run an experiment that automatically analised a corpus of Spanish press since 2002. 150,000 newspaper headlines were analysed.... more
This is an e-communication and e-politics project that analyses the 2011 municipal election campaign in Spain. The main objective of the project is to analyse new tendencies in e-politics and to demonstrate the impact of internet on the electoral process. CLARIN contributed to the resea... more
The project aims to improve the search on a Nineteenth-Century American Newspapers corpus using and developing data mining tools. It is working spaces-efficient n-gram indexing to identify candidate newspapers and then exploits local models of alignment to identify reprinted fragments u... more
The project aims to detect “temporal traces and interconnecting relations of text passages in German language novels from 1500 and 1990, as well as social science texts created since 1909”.
The project "aims at generating specific knowledge from ancient texts and will provide this knowledge via an open web-portal to the scientific community for future empirical studies. For this purpose researches from the fields of Computer Science and Ancient Science will cooperate to a... more