Complete list of projects
Please note that the project factsheets will no longer be updated. All information relevant to the projects can be found on the CORDIS factsheets. These are updated on a regular basis with public deliverables, etc. Links to the CORDIS factsheets are given below, and on the respective project pages.
ABack to top |
ACCEPT - Automated Community Content Editing PorTal The use of machine translation (MT) is becoming much more pervasive. At the same time, Web 2.0 paradigms are democratising content creation - stressing the value of communities of users creating content for each other. However, right now these two trends are fairly incompatible. MT engines, even statistical engines, cannot produce acceptable results for community content due to the extreme variability within the content. The ACCEPT project will address this issue by developing new technologies designed specifically to help MT work better in this environment. Start date: 1 January 2012 End date: 31 December 2014 Project officer: Pierre-Paul Sondag Factsheet on Cordis Factsheet Website |
Lack of sufficient linguistic resources for many languages and domains currently is one of the major obstacle in further advancement of automated translation. The main goal of the ACCURAT research is to find, analyze and evaluate novel methods how comparable corpora can compensate for this shortage of linguistic resources to improve MT quality significantly for under-resourced languages and narrow domains. The ACCURAT project will provide researchers and developers with novel methodology and fully functional model for exploiting comparable corpora to increase translation quality of existing and emerging MT systems. Start date: 1 January 2010 End date: 30 June 2012 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Factsheet Website |
AnnoMarket - Annotation Resource Marketplace in the Cloud The project aims to revolutionise the text annotation domain, by delivering an affordable, open marketplace for pay-as-you-go, cloud-based extraction resources and services, in multiple languages. This project will be driven by a consortium dominated by commercial partners, from three EU countries and with 43% of the budget assigned to SMEs. Start date: 1 June 2012 End date: 31 May 2014 Project officer: Susan Fraser Factsheet on CORDIS Website |
ATLAS - Applied Techology for Language-Aided CMS The advent of the Web revolutionized the way in which content is manipulated and delivered. As a result, digital content in various languages has become widely available on the Internet and its sheer volume and language diversity have presented an opportunity for embracing new methods and tools for content creation and distribution. Although significant improvements have been made lately in the field of web content management, there is still a growing demand for online content services that incorporate language-based technology. Mechanisms such as automatic annotation of important words, phrases and names, text summarization and categorization, and computer-aided translation could facilitate the process of manipulating heterogeneous multilingual content as well as enhance end-user experience by allowing for better content navigation. This project unifies such mechanisms in a common software platform called ATLAS and builds three separate solutions around this platform. Start date: 1 March 2010 End date: 28 February 2013 Project officer: Pierre-Paul Sondag Factsheet on EUROPA Factsheet Website |
BBack to top |
There is a continuing increasing need for educational institutes to provide course syllabi documentation and other educational information in English. Access to translated course syllabi and degree programmes plays a crucial role in the degree to which universities effectively attract foreign students and, more importantly, have an impact on international profiling. To present all education information in English is a major challenge for most higher educational institutes and the figures and trends show that investment in traditional human translation services is prohibitive, consequently course materials and degree programmes are often provided in local languages only. BOLOGNA aims to provide a solution to the problem by offering a low-cost, web-based, high-quality translation service already in use by several Swedish universities. Start date: 1 March 2011 End date: 28 February 2013 Project officer: Pierre-Paul Sondag Factsheet on EUROPA Factsheet Website |
CBack to top |
The CASMACAT project will build the next generation translators workbench to improve productivity, quality, and work practices in the translation industry. We will carry out cognitive studies of actual unaltered translator behaviour based on key logging and eye tracking. The acquired data will be examined for how interfaces with enriched information are used, to determine translator types and styles, and to build a cognitive model of the translation process. Start date: 1 November 2011 End date: 31 October 2014 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Factsheet Website |
CESAR - Central and South-East European Resources Human language technologies crucially depend on language resources and tools that are useable, useful and available. However, even where language resources and respective tools are available they have been developed mostly in a sporadic manner, in response to specific project needs and with relatively little regard to their long-term sustainability, IPR status, interoperability, reusability in different contexts as well as to their potential deployment in multilingual applications. CESAR, in close harmony with META-NET intends to address this issue by enhancing, upgrading, standardising and cross-linking a wide variety of language resources and tools and making them available, thus contributing to an open linguistic infrastructure. Start date: 1 February 2011 End date: 31 January 2013 Project officer: Kimmo Rossi Factsheet on EUROPA Factsheet Website |
CLASSiC - Computational Learning in Adapptive systems for Spoken Conversation The overall goal of the CLASSiC project is to facilitate the rapid deployment of accurate and robust spoken dialogue systems that can learn from experience. The approach is based on statistical learning methods with a unified treatment of uncertainty across the entire systems (speech recognition, spoken language and understanding, dialogue management, natural language generation and speech synthesis). This will result in a modular processing framework with an explicit representation of uncertainty connecting the various sources of uncertainty (understanding errors, ambiguity, etc) to the constraints to be exploited (task, dialogue and user contexts). the architecture supports a layered hierarch of supervised learning and reinforcement learning in order to facilitate mathematically principled optimisation and adaptation techniques. It is being developed in close co-operation with an industrial partner in order to ensure a practical deployment platform as well as a flexible research test-bed. Start date: 1 March 2008 End date: 28 February 2011 Project officer: Philippe Gelin Factsheet on CORDIS Website |
Co-friend - Cognitive and flexible learning system operating robust interpretation of extended real scenes by multi-sensors datafusion Co-FRIEND aims to design a framework for understanding human activities in real environments, through an artificial cognitive vision system, identifying objects and events, and extracting sense from scene observation. It will manage uncertainty and change, and will create analysis meaning. Start date: 1 February 2008 End date: 28 February 2011 Project officer: Michel Brochard Factsheet on CORDIS |
COSYNE - Multi-Lingual Content Synchronisation with WIKIS The combination of dynamic user-generated content and multi-lingual aspects is particularly prominent in Wiki sites. Wikis have gained increased popularity over the last few years as a means of collaborative content creation as they allow users to set up and edit web pages directly. A growing number of organizations use Wikis as an efficient means to provide and maintain information across several sites. Currently, multi-lingual Wikis rely on users to manually translate different Wiki pages on the same subject. This is not only a time-consuming procedure but also the source of many inconsistencies, as users update the different language versions separately, and every update would require translators to compare the different language versions and synchronize the updates. Start date: 1 March 2010 End date: 28 February 2013 Project officer: Stefano Bertolo Factsheet on CORDIS Website |
DBack to top |
Dicta-Sign - Sign language Recognition, Generation and Modelling with application in Deaf Communication Dicta-Sign addresses the need for communication between deaf individuals and communication via natural language by deaf users with various human computer interfaces (HCI) environments. Dicta-Sign is one of two FP7 projects addressing sign language. The ultimate goal is to enable deaf users to fully integrate into the information society and use interactive social media in their mother tongue, the sign language, and in communication between themselves and with hearing people who use "normal" written language. The technology to be developed will enable automatic recognition of sign language, transcription and translation into written language, and automatic generation of sign language by online avatars. Start date: 1 February 2009 End date: 31 January 2012 Project officer: Kimmo Rossi Factsheet on CORDIS Factsheet Website |
DIRHA - Distant speech Interaction for Robust Home Applications The DIRHA project addresses the development of voice-enabled automated home environments based on distant-speech interaction in different languages. A distributed microphone network is installed in the rooms of a house in order to monitor selectively acoustic and speech activities observable inside any space, and to eventually run a spoken dialogue session with a given user in order to implement a service or to have access to appliances and other devices. Start date: 1 January 2012 End date: 31 December 2014 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Factsheet Website |
EBack to top |
EASTIN-CL - Crosslingual and multimodal Search in a Portal for Support of Assisted Living The project will support the social participation of disabled and elderly people by providing a crosslingual and multimodal portal containing information on assistive tools and technology. There are large collections of assistive technology products and additional information, searched by both professionals (doctors, insurances, manufacturers) and end users. Eastin-CL will improve access to this information. Start date: 1 March 2010 End date: 31 May 2012 Project officer: Michel Brochard Factsheet on EUROPA Factsheet |
EMIME - Effective Multilingual Interaction in Mobile Environments The EMIME project will help to overcome the language barrier by developing a mobile device that performs personalised speech-to-speech translation, such that the a user’s spoken input in one language is used to produce spoken output in another language, while continuing to sound like the user’s voice. Personalisation of systems for cross-lingual spoken communication is an important, but little explored, topic. It is essential for providing more natural interaction and making the computing device a less obtrusive element when assisting human-human interactions. Start date: 1 March 2008 End date: 28 February 2011 Project officer: Philippe Gelin Factsheet on CORDIS Website |
EUMSSI - Event Understanding through Multimodal Social Stream Interpreting The main objective of EUMSSI is developing technologies for identifying and aggregating data presented as unstructured information in sources of very different nature (video, image, audio, speech, text and social context), including both online (e.g., YouTube) and traditional media (e.g. audiovisual repositories), and for dealing with information of very different degrees of granularity. The multimodal analytics will help organize, classify and cluster cross-media streams, by enriching its associated metadata. A core idea is that the process of integrating content from different media sources is carried out in an interactive manner, so that the data resulting from one media helps reinforce the aggregation of information from other media, in a cross-modal interoperable semantic representation framework. This will be accomplished thanks to the integration in a multimodal platform of state-of-the-art information extraction and analysis techniques from the different fields involved. Interoperability and interactive reinforcement of the data aggregation and a high-level semantic, conceptual and eventive representation will distinguish this proposal from others that incorporate multimodal search. The resulting platform will be potentially useful for any application in need of cross-media data analysis and interpretation, such as intelligent content management systems, personalized recommendation, real time event tracking, content filtering, etc. The project brings together 5 universities and research centres, a public service broadcaster and a SME providing solutions for the media industry. The real-world necessities of the 2 user partners motivate two strong user cases that have immediate market applicability. We also expect EUMSSI, which covers English, German, Spanish and French, to promote interaction and mutual knowledge among the diverse linguistic communities within Europe. Start date: 1 December 2013 End date: 30 November 2016 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Website |
EU-BRIDGE - Bridges Across the Language Divide Today Europe is facing larger and more critical language challenges than ever before. The production of multilingual content now far outpaces our ability to translate it by human effort and we must turn to automatic methods to cope. Thus, effective and innovative alternatives must be provided to Europes citizens and businesses. High performing machine translation technology can be part of the solution. Recent advances in machine translation (MT) technology now show great promise, as systems can be trained automatically from data and achieve respectable performance, even from speech input. However, MT still has very high maintenance costs, and is unsuited to cope with many of todays digital medias relentlessly changing streams of information, across different topics, styles, and genres. "Bridges Across the Language Divide" (EU-BRIDGE) proposes to advance speech translation to the point where it can deal with the varying input conditions occurring in digital media, and is able to automatically adapt itself to the changing domains. Start date: 1 February 2012 End date: 31 January 2015 Project officer: Carola Carstens Factsheet on CORDIS Factsheet Website |
EuroMatrixPlus - Bringing Machine Translation for European Languages to the User EuroMatrixPlus is our "flagship" project in machine translation. It addresses all 23 EU official languages and progressively improves the quality of automatic translations by moving towards the self-learning machine translation (able to learn from its mistakes). To be noted that this theme is currently being expanded by recent projects from FP7 Call 4, currently under negotiation. Europe provides a challenge due to its vast diversity of languages, and machine translation technology will provide means to address this challenge. EuroMatrixPlus will lift the research strategy and infrastructure for statistical and hybrid machine translation developed within EuroMatrix to a higher level by adding new scientific components, by experimenting with novel community-based methods for testing and data collection and by building and utilizing stronger connections into the user world. At the same time EuroMatrixPlus will preserve the established brand name for the well accepted infrastructure, continue the most promising research strands, and extend the growing open source research community hat has formed around the EuroMatrix project. Start date: 1 March 2009 End date: 30 April 2012 Project officer: Michel Brochard Factsheet on CORDIS Factsheet Website |
EUROSENTIMENT - Language Resource Pool for Sentiment Analysis in European Languages During the last years, there has been a high increase in the use of social networks and blogs so that citizens and consumers express now widely their opinions about different topics like politics, society and media, through these channels. However the development of systems for sentiment analysis of these opinions is hampered by difficulties to access and get the necessary language resources, for several reasons: - language resource owners fears for losing competitiveness; - lack of agreed language resource schemas for sentiment analysis and not normalised magnitudes for measuring sentiment strength; - high costs for adapting existing language resources for sentiment analysis; - reduced visibility, accessibility and interoperability of the language resources. Start date: 1 September 2012 End date: 31 August 2014 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Website |
EXCITEMENT - EXploring Customer Interactions Through textual EntailMENT Identifying semantic inference relations between texts is a major underlying language processing task, needed in practically all text understanding applications. For example, Question Answering and Information Extraction systems should verify that extracted answers and relations are indeed inferred from the text passages; multi-document text summarization needs to infer that one sentence entails another in order to avoid redundantly including both in a summary; and so on. While such apparently similar inferences are broadly needed, there are currently no generic semantic "engines" or platforms for broad textual inference. Rather, tools exist for narrow semantic tasks, but systems have to independently assemble and augment them to obtain a complete inference process. Start date: 1 January 2012 End date: 31 December 2014 Project officer: Carola Carstens Factsheet on CORDIS Factsheet Website |
FBack to top |
FAUST - Feedback Analysis for User adaptive Statistical Translation The FAUST project will develop machine translation (MT) systems which respond rapidly and intelligently to user feedback. Current web-based MT systems provide high-volume translation without real-time. Most systems provide no opportunity for users to offer opinions or corrections for translation results. Other systems ask users for feedback on translation, however the user does not see any benefit to providing feedback: the translation does not change in response to the feedback. Our goal is to develop high-volume translation systems capable of adapting to user feedback in real-time. Start date: 1 February 2010 End date: 31 January 2013 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Factsheet Website |
FLaReNet was submitted and selected for funding under the
eContentplus
Programme. This programme came to an end on 31
December 2008. Measures to make digital content more accessible will
continue however under the ICT Policy Support Programme (
ICT-PSP
).
FLaReNet International cooperation and re-creation of a community are the most important drivers for a coherent evolution of the Language Resource (LR) area in the next years. FLaReNet will be a European forum to facilitate interaction among LR stakeholders. Its structure considers that LRs present various dimensions and must be approached from many perspectives: technical, but also organisational, economic, legal, political. The Network addresses also multicultural and multilingual aspects, essential when facing access and use of digital content in today's Europe. The Proceedings of the FLaReNet Launch event "Shaping the Future of the Multilingual Digital Europe" held on 12-13 February 2009 in Vienna are available for download below. FLaReNet Proceedings (1.17MB) Start date: 1 September 2008 End date: 31 August 2011 Project officer: Kimmo Rossi Factsheet on CORDIS Website |
FLAVIUS - Foreign LAnguage Versions of Internet and User generated Sites The web has become the largest place to publish and share all kind of information, from news to user-generated content. The answer to almost any question can be found online, but not necessarily in the working language of each user. Thus, despite of the development of translation tools, language is still a barrier, as most people access information written in their own language only. The FLAVIUS project aims at bridging the language gap between content publishers and users by providing an online platform accessible to websites owners that will enable them to generate multilingual versions of their site, quickly, easily and efficiently in as many languages as they want. Start date: 1 April 2010 End date: 30 September 2012 Project officer: Aleksandra Wesolowska Factsheet on EUROPA Factsheet Website |
GBack to top |
GALATEAS - Generalized Analysis of Logs for Automatic Translation and Episodic Analysis of Searches With the growth of digital libraries and digital library federation (as well as partially unstructured collections of documents such as web sites), a large set of vendors is offering engines for retrieving contents and metadata via search requests by the end user (queries). In most cases these queries are just unstructured fragments of text in a specific language. The first service offered by GALATEAS (LangLog) is focussed on getting meaning out of these lists of queries and it is addressed to library/federation/site managers. Start date: 1 April 2010 End date: 31 March 2013 Project officer: Pierre-Paul Sondag Factsheet on EUROPA Factsheet Website |
GET HOME SAFE - Extended Multimodal Search and Communication Systems for Safe In-Car Application The aim of the proposed project is to develop a system for safe information access (search, navigation, point of interest) and communication (texting) while driving. In order to reach that goal, we approach the problem from a holistic view, investigating the underlying "driving forces", studying the goals underlying searching and texting. Special attention will be paid to task and context factors such as multi-tasking and the associated cognitive load for the driver. Start date: 1 January 2012 End date: 31 December 2014 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Factsheet Website |
IBack to top |
iTRANSLATE4 - Internet Translators for all European Languages The goal of this project is to integrate the best machine translation services of all the major European MT providers in a single website that will offer free online machine translation from any official European language to any other. Translation between all European language pairs will be available by the partners direct or linked translators. Service providers will advertise their extra translation services and also make profit from adverts sold here. Start date: 1 March 2010 End date: 29 February 2012 Project officer: Philippe Gelin Factsheet on EUROPA Factsheet Website |
LBack to top |
LetsMT! - Platform for Online Sharing of Training Data and Building User Tailored MT In recent years, statistical machine translation (SMT) has become the leading paradigm for machine translation. SMT systems are built by analyzing huge volumes of parallel corpus and learning translation models from this data. The quality of SMT systems largely depends on the size of training data. Since the majority of parallel data is in major languages, SMT systems for larger languages are of much better quality compared to systems for smaller languages. The cost and the know-how required for building custom MT solutions deter many small-to-medium companies from utilizing the power of MT technologies. To fully exploit the huge potential of existing open SMT technologies we propose to build an innovative online collaborative platform for data sharing and MT building. Start date: 1 March 2010 End date: 31 August 2012 Project officer: Kimmo Rossi Factsheet on EUROPA Factsheet Website |
LIDER - Linked Data as an enabler of cross-media and multilingual content analytics for enterprises across Europe The explosive growth of content in volume, velocity and variety on the Web demands new approaches to content analytics, addressing issues in large scale analysis and interpretation of heterogeneous data sets, originating in different media, human languages, jurisdictions, etc. Among these, language diversity in particular has become a ubiquitous aspect of the Web in light of increasing globalization. Recently, semantic-level, i.e., language- and media-independent, data analysis and representation methods such as those provided by Linked Data and Semantic Web technologies, have been introduced to provide innovative content analytics solutions for such heterogeneous, multilingual and multimedia content. An important missing component however, is the representation of language- and media-specific information that will be needed for interpreting such data correctly - across different media and across the increasing variety of human languages used nowadays on the Web. Start date: 1 November 2013 End date: 31 October 2015 Project officer: Susan Fraser Factsheet on CORDIS Website |
LiMoSINe - Linguistically Motivated Semantic aggregation engiNes We increasingly live our life online. Information is accumulated on a wide range of human activities, from science and facts, to personal content, opinions, and trends. Across the globe, peoples knowledge, experiences and interactions effortlessly find their way to online outlets, alongside traditional edited content, ready to be shared with millions. LiMoSINe will integrate the research activities of leading researchers across diverse topics with a view to enabling new kinds of language-based search technology. Start date: 1 November 2011 End date: 31 October 2014 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Factsheet Website |
LIREC - Living with robots and interactive companions LIREC aims to establish a multi-faceted theory of artificial long-term companions (including memory, emotions, cognition, communication, learning, etc.), embody this theory in robust and innovative technology and experimentally verify both the theory and technology in real social environments. Whether as robots, social toys or graphical and mobile synthetic characters, interactive and sociable technology is advancing rapidly. However, the social, psychological and cognitive foundations and consequences of such technological artefacts entering our daily lives - at work, or in the home - are less well understood. The project has produced some entertaining and educational videos about their work. Start date: 1 March 2008 End date: 31 August 2012 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Website |
LISE - Legal Language Interoperability Services There is an urgent need for consolidated administrative nomenclatures and legal terminologies as tools to enhance interoperability and cross-border collaboration. Without high quality and standards-based terminologies, it is impossible to reach precision, efficiency, and transparency within and across any services, processes and systems in the areas of legal and administrative work. Start date: 1 February 2011 End date: 31 July 2013 Project officer: Susan Fraser Factsheet on EUROPA Factsheet Website |
LTCompass - Guiding Language Technology Paths from Research to Market COMPASS is a Support Action for the Language Technology European research community and industry, that aims to: 1.Promote faster, wider and smoother market take-up of Language Technology (LT) research results by facilitating closer collaboration between research and industry players, and by generating better knowledge and understanding of demand/user needs and expectations 2.Foster the process of consolidation of LT stakeholders as a self-sustainable and innovative global industry, strategically co-ordinated under a common vision and approach to consumers' needs, and acting upon a shared innovation agenda for the continuous improvement of their technological base. Start date: 1 November 2011 End date: 28 February 2014 Project officer: Kimmo Rossi Factsheet on CORDIS Factsheet Website |
LTWeb - Language Technology in the Web
The goal of LT-Web is to set the foundation for the integration of language technologies into core Web technologies, via the creation of a standard defining three kinds of metadata about: 1) information in Web content being relevant for language technology processing; 2) processes for creating Web content via localisation and content management work flows; 3) language technology applications and resources used in these applications. Start date: 1 January 2012 End date: 31 December 2013 Project officer: Kimmo Rossi Factsheet on CORDIS Factsheet Website |
MBack to top |
MANTRA - Multilingual Annotation of Named Entities and Terminology Resources This project will provide multilingual terminologies and semantically annotated multilingual documents, e.g., patent texts, to improve the accessibility of scientific information from multilingual documents. Start date: 1 July 2012 End date: 30 June 2014 Project officer: Saila Rinne Factsheet on CORDIS Website |
MateCat - Machine Translation Enhanced Computer Assisted Translation Worldwide demand of translation services has dramatically accelerated in the last decade, as an effect of the market globalization and the growth of the Information Society. Computer Assisted Translation (CAT) tools are currently the dominant technology in the translation and localization market. These include spell checkers, terminology managers, electronic dictionaries, full-text search tools, concordancers, bitexts, translation memory (TM) managers, and machine translation (MT) engines. Recent achievements by the so- called statistical MT approach have raised new expectations in the translation industry. So far, statistical MT has focused on providing ready-to-use translations, rather than outputs that minimize the effort of a human translator. The MateCat project aims at pushing what can be considered the new frontier of CAT technology: how to effectively integrate statistical MT within the translation workflow. Start date: 1 November 2011 End date: 31 October 2014 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Factsheet Website |
MEDAR - Mediterranean Arabic Language and Speech Technology
MEDAR addresses International Cooperation between the EU and the Mediterranean region on Speech and Language Technologies for Arabic. The development of language resources and tools for the Arabic language will promote exchanges in the fields of culture and economy. By focussing on Arabic language technology and making both the technology and content available in Arabic helps to address the needs of citizens and make Arabic more accessible to the non-Arabic world, in order to stimulate business, research and mutual understanding. Start date: 1 February 2008 End date: 31 July 2010 Project officer: Kimmo Rossi Website |
METALOGUE - Multiperspective Multimodal Dialogue: dialogue system with metacognitive abilities The goal of METALOGUE is to produce a multimodal dialogue system that is able to implement an interactive behaviour that seems natural to users and is flexible enough to exploit the full potential of multimodal interaction. It will be achieved by understanding, controlling and manipulating system's own and users' cognitive processes. The new dialogue manager will incorporate a cognitive model based on metacognitive skills that will enable planning and deployment of appropriate dialogue strategies. The system will be able to monitor both its own and users' interactive performance, reason about the dialogue progress, guess the users' knowledge and intentions, and thereby adapt and regulate the dialogue behaviour over time. Start date: 1 November 2013 End date: 31 October 2016 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Website |
META-NET (T4ME) - Technologies for the Multilingual European Information Society Linguistic diversity is a corner stone of our multicultural European society. To preserve this essential asset in the age of the emerging information and knowledge society, Europe needs ICT technologies and applications at affordable costs that enable communication, collaboration and participation across language boundaries, secure their language users equal access to the information and knowledge society, and support each language in the advanced functionalities of networked ICT. Start date: 1 February 2010 End date: 31 January 2013 Project officer: Kimmo Rossi Factsheet on CORDIS Factsheet Website |
METANET4U - Enhancing the European Linguistic Infrastructure METANET4U aims to contribute to the establishment of a pan-European digital platform that makes available language resources and services, encompassing both datasets and software tools, for speech and language processing, and supports a new generation of exchange facilities for them. Start date: 1 February 2011 End date: 31 January 2013 Project officer: Kimmo Rossi Factsheet on EUROPA Factsheet Website |
META-NORD - Baltic and Nordic Parts of the European Open Linguistic Infrastructure META-NORD aims to establish an open linguistic infrastructure in the Baltic and Nordic countries by providing a description of the national landscape in terms of language use, contributing to a pan-European digital resource exchange facility, building and operating interconnected repositories and moblizing national and regional actors. Start date: 1 February 2011 End date: 31 January 2013 Project officer: Kimmo Rossi Factsheet on EUROPA Factsheet Website |
MICO - Media in Context With the tremendous increase in multimedia content on the Web and in corporate intranets, discovering hidden meaning in raw multimedia is becoming one of the biggest challenges. Analysing multimedia content is still in its infancy, requires expert knowledge, and the few available products are associated with excessive price tags, while still not delivering sufficient quality for many tasks. This makes it almost impossible for normal companies, particularly SMEs, to make use of this technology. Also, analysis components typically operate in isolation and do not consider the context (e.g. embedding text) of a media resource. Start date: 1 November 2013 End date: 31 October 2016 Project officer: Susan Fraser Factsheet on CORDIS Website |
Mimics - multimodal immersive motion rehabilitation with interactive cognitive systems The main hypothesis of this project is that movement training for neurorehabilitation can be substantially improved through immersive and multimodal sensory feedback. The approach is real-time acquisition of behavioural and physiological data from patients and the use of this to adaptively and dynamically change the displays of an immersive virtual reality system, with the goal of maximising patient motivation. This will result in complex systems that are natural, user-friendly and easy to use. Start date: 1 January 2008 End date: 31 December 2010 Project officer: Philippe Gelin Factsheet on CORDIS Website |
MLi - Towards a MultiLingual Data Services infrastructure This Support Action will deliver the strategic vision and operational specifications needed for building the MLi (European MultiLingual data & services Infrastructure), formulate a multiannual plan for its development and deployment, and shape the multi-stakeholders alliances ensuring its long term sustainability. Start date: 1 November 2013 End date: 31 October 2015 Project officer: Kimmo Rossi Factsheet on CORDIS Website |
MOLTO - Multilingual On-Line Translation MOLTO's goal is to develop a set of tools for translating texts between multiple languages in real time with high quality. Languages are separate modules in the tool and can be varied; prototypes covering a majority of the EU's 23 official languages will be built. Start date: 1 March 2010 End date: 28 February 2013 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Factsheet Website |
MONNET - Multilingual Ontologies for Networked Knowledge The Monnet project will provide a semantics-based solution for integrated information access across language barriers, which is of growing importance to industry as exemplified by the Monnet use cases. A key solution to this problem is to deal with information at the semantic level, i.e. by abstracting away over language and form, allowing for more advanced and uniform: i) integration, ii) aggregation, iii) querying and iv) presentation of information across languages. Start date: 1 March 2010 End date: 28 February 2013 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Factsheet |
MORMED - Multilingual Organic Information Management in the Medical Domain MORMED proposes a multilingual community platform combining Web 2.0 social software applications with semantic interpretation of domain relevant content, enhanced with automatic translation capabilities, fine-tuned for a specific domain. MORMED will be piloted upon the community interested in Lupus or Antiphospholipid Syndrome (Hughes Syndrome), involving researchers, medical doctors, general practitioners, patients and patient support groups. Start date: 1 April 2010 End date: 31 August 2012 Project officer: Susan Fraser Factsheet on Europa Factsheet Website |
MosesCore - Moses Open Source Evaluation and Support Co-Ordination for Outreach and Exploitation Machine translation (MT) has increasingly become an indispensable tool for coping with Europes linguistic diversity. However machine translation systems are complex software applications requiring significant levels of cooperation and coordination to enable research to continue to flourish, and to improve the usage rate amongst commercial translators, and public bodies. Start date: 1 February 2012 End date: 31 January 2015 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Factsheet Website |
MultilingualWeb - Advancing the Multilingual Web, Thematic Network Given the importance of the World Wide Web to communication in all walks of life, and as the share of English Web pages decreases and that of languages spoken in the European Union increases, the importance of ensuring the multilingual viability of the World Wide Web is paramount. In order to build on current internationalization of the Web and move it forward, it is important to raise awareness of existing best practices and standards related to managing content on the multilingual Web, and look forward to what remains to be done. The project is coordinated by the World Wide Web Consortium (W3C), an organization of currently over 400 members worldwide from research and industry, headed by the Web's inventor, Sir Tim Berners-Lee. The other partners represent stakeholders from a range of affected areas. Start date: 1 April 2010 End date: 31 March 2012 Project officer: Kimmo Rossi Factsheet on EUROPA Factsheet Website |
MULTISENSOR - Mining and Understanding of multilinguaL contenT for Intelligent Sentiment Enriched coNtext and Social Oriented inteRpretation The consumption of large amounts of multilingual and multimedia content regardless of its reliability and cross-validation can have important consequences on the society. An indicative example is the current crisis of the financial markets in Europe, which has created an extremely unstable ground for economic transactions and caused insecurity in the population. The fact that the national mass media provide exaggerated and contradictory information and the inaptness to understand local contexts from different countries has a considerable share in the aggravation of the crisis. To break this spiral, we need multilingual technologies with sentiment, social and spatiotemporal competence that are able to interpret, relate and summarize economic information and news created from various local subjective and biased views and disseminated via TV, radio, mass media websites and social media. Start date: 1 November 2013 End date: 31 October 2016 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Website |
OBack to top |
OpeNER - Open Polarity Enhanced Named Entity Recognition Currently there are a multitude of companies offering Content Analytics and Social Internet Mining services for the purposes of Opinion Mining and Sentiment Analysis. Truly effective Sentiment Analysis is a complex NLP task in monolingual contexts alone. In multilingual contexts the complexity increase many-fold and also presents the challenge of comparison of opinion across languages and cultures. Named Entity Recognition and Classification is also key element to this challenge. As with is often the case in many innovative sectors and industries, a high percentage of SMEs are active offering niche solutions to specific segments of the market and/or domains. Acquiring or developing the base qualifying technologies require to enter this market is and expensive undertaking that redirects limited the resources of SMEs away from offering products and services that the market demands. The OpeNER project has as the goal of reuse and repurposing of exiting language resources and data sets to provide a set of underlying technologies to the broader community. OpeNER will focus on the provision of a supplementary sentiment lexicon with culturally normalised and graduated values. Start date: 1 July 2012 End date: 30 June 2014 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Website |
ORGANIC - Self-organised recurrent neural learning for language processing The human brain is an unrivalled “engine” for speech processing and language understanding. It integrates a large variety of learning, adaptation, optimization and self-stabilization mechanisms across many dynamically interacting levels of processing. The result of this highly entwined mesh of processes is supreme robustness, efficiency, and versatility. ORGANIC adopts principles of cortical architecture and self-organizing neurodynamics for the design of a new type of cognitive architectures for linguistic processing tasks. The consortium brings together pioneers in recurrent neural network research, cortical architectures for speech and language processing, speech processing research and an industrial partner who is leading in text recognition. Start date: 1 April 2009 End date: 31 March 2012 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Factsheet |
Organic.Lingua - Demonstrating the potential of a multilingual web portal for Sustainable Agricultural & Environmental Education The Organic.Lingua project will extend the current Organic.Edunet portal to fill gaps in multilingual support and cross-language resource organisation and search, by significantly expanding its linguistic coverage. By doing so it aims to cover both the public sector needs by providing agricultural and environmental researchers and educators with a pan-European information, communication and collaboration platform, as well as by creating new business opportunities for the relevant private sector by demonstrating how a commercial system that will serve a global education market can be deployed. Start date: 1 March 2011 End date: 28 February 2014 Project officer: Susan Fraser Factsheet on EUROPA Factsheet |
PBack to top |
PANACEA - Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies A strategic challenge for Europe in today's globalised economy is to overcome language barriers through technological means. In particular, Machine Translation systems are expected to have a significant impact in the managing of multilingualism in Europe. PANACEA is facing the most critical aspect for Machine Translation to produce this expected impact in Europe: the so-called resources bottleneck. MT technologies are in themselves language independent, but they are inherently tied to the availability of language dependent resources. Start date: 1 January 2010 End date: 31 December 2012 Project officer: Susan Fraser Factsheet on CORDIS Factsheet |
PARLANCE - Probabalistic Adaptive Real-Time Learning and Natural Conversational Engine The project goal is to design and build mobile applications that approach human performance in conversational interaction, specifically in terms of the interactional skills needed to do so. These skills will include recognising and generating conversational speech incrementally in real-time, adapting to new concepts without manual intervention, and personalising interaction. All of these skills will be learned or adapted using real data, and will be used to build systems for interactive hyper-local search in three languages (English, Spanish and Mandarin) and for two domains such as property search and tourist information. Start date: 1 November 2011 End date: 31 October 2014 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Factsheet Website |
PHEME - Computing Veracity Across Media, Languages, and Social Networks Social media poses three major computational challenges, dubbed by Gartner the 3Vs of big data: volume, velocity, and variety. Content analytics methods have faced additional difficulties, arising from the short, noisy, and strongly contextualised nature of social media. In order to address the 3Vs of social media, new language technologies have emerged, e.g. using locality sensitive hashing to detect breaking news stories from media streams (volume), predicting stock market movements from microblog sentiment (velocity), and recommending blogs and news articles based on user content (variety). PHEME will focus on a fourth crucial, but hitherto largely unstudied, challenge: veracity. It will model, identify, and verify phemes (internet memes with added truthfulness or deception), as they spread across media, languages, and social networks. Start date: 1 January 2014 End date: 31 December 2016 Project officer: Susan Fraser Factsheet on CORDIS Website |
PinView - Personal Information Navigator Adapting Through Viewing The PinView consortium combines pioneering application expertise with a solid machine learning background in content-based information retrieval. the project will focus on developing new information retrieval principles needed for replacing or complementing explicit search queries. The research will facilitate a prototype of a proactive personal information navigator that allows retrieval of multimodal information (still images, text, video) available on the web and versatile databases. During browsing and searching with a task-dependent interface, the goals of the user will be inferred from explicit and implicit feedback signals and interaction (eye movements, pointer traces and clicks, speech) complemented with social filtering. The collected rich multimodal responses from the user are processed with new advanced machine learning methods to infer the implicit topic of the user's interest as well as the sense in which it is interesting in the current context. Start date: 1 January 2008 End date: 31 March 2011 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Website |
PLuTO - Patent Language Translations Online PLuTO will build on existing state-of-the-art machine translation tools, currently successfully used for trademarks, and adapt them to the wider area of IP protection, and to patent translations in particular. Based on the experience of the existing consortium members, an online machine translation system will be built that is capable of assisting patent searchers with their multi-lingual information needs, much more reliably than general-purpose MT tools and much faster than human-based translations. Start date: 1 April 2010 End date: 31 March 2013 Project officer: Susan Fraser Factsheet on EUROPA Factsheet Website |
PortDial- Language Resources for Portable Multilingual Spoken Dialogue Systems The PortDial project brings together European SMEs that are developing state-of-the-art spoken dialogue systems (SDS) and the handcrafted semantic components with research institutions at the forefront of progress in the automatic creation or enrichment of semantic language resources. The aim is to apply these technologies towards the creation of domain-specific multilingual SDS resources, specifically, data-linked ontologies and grammars. Start date: 1 June 2012 End date: 31 May 2014 Project officer: Stefano Bertolo Factsheet on CORDIS Factsheet Website |
PRESEMT - Pattern REcognition-based Statistically Enhanced MT PRESEMT is a flexible and adaptable MT system, based on a language-independent method, whose principles ensure easy portability to new language pairs. This method attempts to overcome well-known problems of other MT approaches, e.g. bilingual corpora compilation or creation of new rules per language pair. PRESEMT will address the issue of effectively managing multilingual content and is expected to suggest a language-independent machine-learning-based methodology. Start date: 1 January 2010 End date: 31 December 2012 Project officer: Michel Brochard Factsheet on CORDIS Factsheet Website |
Prometheus - prediction and interpretation of human behaviour based on probabilistic structures and heterogeneous sensors The project intends to establish a link between fundamental sensing tasks and automated cognition processes that concern the understanding a short-term prediction of human behaviour as well as complex human interaction. The analysis of human behaviour is unrestricted environments, including localization and tracking of multiple people and recognition of their activities, currently constitutes a topic of intensive research in the signal processing and computer vision communities. Start date: 1 January 2008 End date: 31 December 2010 Project officer: Michel Brochard Factsheet on CORDIS Website |
PROMISLingua - Performance Operational and Multilingual Interactive Services to support Compliance for SMEs in Europe PROMISLingua's objectives are the translation, localisation and rollout of the existing PROMIS® online service with a total of eight languages in order to deliver a cost-efficient and easy-to-use internet-based service enabling SMEs to comply with safety, health, environment and quality regulations. Start date: 1 April 2011 End date: 30 September 2013 Project officer: Pierre-Paul Sondag Factsheet on EUROPA Factsheet Website |
QBack to top |
QTLaunchPad - Preparation and Launch of a large-scale action for Quality Translation Technology The support action will prepare the grounds for a new type of collaborative MT research dedicated to overcoming existing quality barriers. QTLaunchPad will assemble and provide needed data and tools in-cluding specialised translation corpora, test suites and tools for quality assessment, create a shared quality metrics for human and machine translation, improve automatic translation quality estimation, extend an existing platform for resource-sharing to the needs of quality MT research, define strategies and challenges and then plan and launch a large-scale research and innovation action for a breakthrough in quality translation technology. European research has the potential to overtake Google in the pursuit of automatic quality translation! Start date: 1 July 2012 End date: 30 June 2014 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Factsheet Website |
QTLeap - Quality Translation by Deep Language Engineering Approaches The support action will prepare the grounds for a new type of collaborative MT research dedicated to overcoming existing quality barriers. QTLaunchPad will assemble and provide needed data and tools in-cluding specialised translation corpora, test suites and tools for quality assessment, create a shared quality metrics for human and machine translation, improve automatic translation quality estimation, extend an existing platform for resource-sharing to the needs of quality MT research, define strategies and challenges and then plan and launch a large-scale research and innovation action for a breakthrough in quality translation technology. European research has the potential to overtake Google in the pursuit of automatic quality translation! Start date: 1 November 2013 End date: 31 October 2016 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Website |
RBack to top |
Robocast - robot and sensors integration as guidance for enhanced computer assisted surgery and therapy The ROBOCAST project aims to develop ICT scientific methods and technologies which focus on robot assisted keyhole neurosurgery. A modular system, allowing a reduction of the footprint, will be developed with two robots and one active bio-mimetic probe, able to cooperate among themselves in a biomimetic sensory-motor integrated framework. A gross positioning 3-axes robot will support a miniature parallel robot holding the probe to be introduced through a "keyhole" opening into the skull of the patient. Optical trackers (tracking the end effector and the patient), an imaging endoscope camera, and electromagnetic position and force sensors (on the probe) will extend robot perception by providing the control system with position and force feedback from the operating tools, and with visual information of the surgical field. Start date: 1 January 2008 End date: 31 December 2010 Project officer: Michel Brochard Factsheet on CORDIS Website |
ROCKIT - Roadmap for Conversational Interaction Technologies ROCKIT is a strategic roadmapping proposal for research and innovation in the area of natural conversational interaction. The primary scientific focus concerns interactive agents which are proactive, multimodal, social, and autonomous. A second focus concerns systems which can extract and exploit rich context and knowledge from heterogenous data sources. The main goal of ROCKIT is the development of a Research and Innovation Roadmap which integrates the vision and innovation agendas of those organisations (concerned with R&D and exploitation) in the field across Europe, with a broad coverage across sectors. A key goal is to bring together public sector research organisations with commercial organisations at all scales, with a particular focus on SMEs that represent the majority of fragmented commercial activity in Europe. Start date: 1 December 2013 End date: 30 November 2015 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Website |
SBack to top |
SAVAS - Sharing AudioVisual Language Resources for Automatic Subtitling Nowadays, due to the quantity of the demand and the cost of the process, manual subtitling is no longer feasible. Broadcasters and subtitling companies are seeking for more productive subtitling alternatives. In this context, SAVAS partners aim to acquire, share and reuse audiovisual resources of broadcasters and subtitling companies so that high-tech European ASR (Automatic Speech Recognition) companies can use the shared data to develop domain specific Large Vocabulary Continuous Speech Recognisers (LVCSRs) in new languages to solve the automated subtitling needs of the media industry. Within the project, data and LVCSR technology for automated subtitling will be collected, shared and developed for the following six languages: Basque, Spanish, Italian, French, German and Portuguese. Start date: 1 May 2012 End date: 30 April 2014 Project officer: Susan Fraser Factsheet on CORDIS Factsheet Website |
SEMAINE - sustained Emotionally colooured Machine-human Interaction using Nonverbal Expression The aim of the SEMAINE project is to build a Sensitive Artificial Listener – a multimodal dialogue system with the social interaction skills needed for a sustained conversation with a human user. The system will emphasise “soft” communication skills, i.e. non-verbal, social and emotional perception, interaction and behaviour capabilities. The Sensitive Artificial Listener paradigm involves only very limited verbal capabilities, but has been shown to be suited for prolonged human-machine interaction. In this paradigm, we will build a real-time, robust interactive system perceiving a human user's facial expression, gaze, and voice, and engaging with the user through an Embodied Conversational Agent's body, face and voice. The agent will exhibit audiovisual listener feedback in real time while the user is speaking, and will take the user's feedback into account while the agent is speaking. The agent will pursue different dialogue strategies depending on the user's state; it will learn to interpret the user's non-verbal behaviour and adapt its own behaviour accordingly. Start date: 1 January 2008 End date: 31 December 2010 Project officer: Philippe Gelin Factsheet on CORDIS Website |
SENSEI - Making Sense of Human-Human Conversation Data The overall goals of the SENSEI project are twofold. First, SENSEI will develop summarization/analytics technology to help users make sense of human conversation streams from diverse media channels. Second, SENSEI will design and evaluate its summarization technology in ecological environments, aiming to improve task performance and productivity of end-users. Conversational interaction is the most natural and persistent paradigm for business relations with end-customers or users. In contact centres millions of customer spoken conversations are handled daily. On social media platforms hundreds of millions of blog posts are delivered through generalist or proprietary platforms. In both cases, conversations have little impact on the intended target "listeners" due to the volume, velocity and diversity (media, style, social context) of the document streams (spoken conversations and blog posts). Most language analytics technology is limited in that it performs keyword search, which does not provide automatic descriptions of what happened, who said what, which opinions are held on what subject, in a coherent, readable and executable form. Start date: 1 November 2013 End date: 31 October 2016 Project officer: Carola Carstens Factsheet on CORDIS Website |
SERA - Social Engagement with Robots and Agents The project SERA aims to advance science in the field of social acceptability of verbally interactive robots and agents, with a view especially to their applications in assistive technologies (companions, virtual butlers). To this aim, the project will undertake a field study in three iterations to collect data of real-life, long-term and open-ended relationships of subjects with robotic devices. The three iterations test different conditions (functionalities) of the equipment, which will consist of a room equipped with sensors at the subjects' home, a computer and a simple robotic device (the Nabaztag) as the front-end for interaction. The project partners will analyse the collected audio and video data in parallel, using different, mainly qualitative, methods. Data analysis will be prepared and accompanied by theoretical and methodological research in order to a) take into account the state of the art and b) ensure the quality of the field study. The project will use findings from the field study to specify, build and implement a reference architecture for social engagement, and use it for developing a showcase system of combined speech based service applications with relevance to the target field and audience. Start date: 1 January 2009 End date: 31 December 2010 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Website |
SignSpeak - Scientific understanding and vision-based technological development for continuous sign language recognition and translation Deaf communities revolve around sign languages as they are their natural means of communication. Although deaf, hard of hearing and hearing signers can communicate without boundaries amongst themselves, there is a serious challenge for the deaf community in trying to integrate into educational, social and work environments, as the vast majority of Europeans do not have signing skills. The overall goal of SignSpeaker is to develop a new vision-based technology for translating continuous sign language to text, in order to improve the communication between deaf and hearing communities. This is the second FP7 project addressing sign language. Start date: 1 April 2009 End date: 31 March 2012 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Factsheet Website |
SIMPLE4ALL - Speech synthesis that improves through adaptive learning The Simple4All project will create speech synthesis technology that learns from data with little or no expert supervision and continually improves itself, simply by being used. In order to be accepted by users, the voice of a spoken interaction system must be natural and appropriate for the content. Using the same voice for every application is not acceptable to users. But creating a speech synthesiser for a new language or domain is too expensive, because current technology relies on labelled data and human expertise. Start date: 1 November 2011 End date: 31 October 2014 Project officer: Leonhard Maqua Factsheet on CORDIS Factsheet Website |
SSPNET - Social Signal Processing Network The ability to understand and manage social signals of a person we are communicating with is the core of social intelligence. Social intelligence is a facet of human intelligence that has been argued to be indispensable and perhaps the most important for success in life. Although each one of us understands the importance of social signals in everyday life situations, and in spite of recent advances in machine analysis and synthesis of relevant behavioural cues like blinks, smiles, crossed arms, laughter, etc., the research efforts in machine analysis and synthesis of human social signals like empathy, politeness, and (dis)agreement, are few and tentative. Start date: 1 February 2009 End date: 31 January 2014 Project officer: Philippe Gelin Factsheet on CORDIS Website |
SUMAT - an online service for subtitling by machine translation Subtitling plays a very important role as it is the preferred multimedia content translation method in most of the European countries and for most of the genres in order to make audiovisual content widely accessible across languages. The increasing use and transmission of digital multimedia and multilingual content through the Web, DVDs and the current European and national policis promoting the subtitling of content broadcast by public TV stations have had as a consequence that subtitling demands have increased in recent years. Start date: 1 April 2011 End date: 31 March 2013 Project officer: Susan Fraser Factsheet on EUROPA Factsheet Website |
TBack to top |
TaaS - Terminology as a Service The aim of the TaaS project is to create a cloud-based platform for acquiring, cleaning up, sharing, and reusing multilingual terminological data, one of the most important language resources for industry, academia, and society in general. The motivation of the TaaS project is to address an evident need for instant access to the most recent terms and direct user involvement in the creation and sharing of terminology data. Such a platform will provide a variety of online services for key terminology tasks becoming an integral part of the multifaceted global cloud-based service infrastructure. TaaS services will perform term identification in the user provided documents. The platform will perform terminology extraction and return a list of term candidates. Start date: 01 June 2012 End date: 31 May 2014 Project officer: Aleksandra Wesolowska Factsheet on CORDIS Website |
transLectures - Transcription and Translation of Video Lectures Online educational repositories of video lectures are rapidly growing on the basis of increasingly available and standardised infrastructure. A well-known example of this is the VideoLectures web portal, a free and open access educational video lectures repository, and a major player in the development of the widely used Opencast Matterhorn platform for educational video management. As in other repositories, transcription and translation of video lectures in VideoLectures is needed to make them accessible to speakers of different languages and to people with disabilities. However, also as in other repositories, most lectures in VideoLectures are neither transcribed nor translated because of the lack of efficient solutions to obtain them at a reasonable level of accuracy. The aim of transLectures is to develop innovative, cost-effective solutions to produce accurate transcriptions and translations in VideoLectures, with generality across other Matterhorn-related repositories. Start date: 1 November 2011 End date: 31 October 2014 Project officer: Susan Fraser Factsheet on CORDIS Factsheet Website |
TrendMiner - Large Scale, Cross-lingual Trend Mining and Summarisation of Real-Time Media Streams The recent massive growth in online media and the rise of user-authored content (e.g weblogs, Twitter, Facebook) has lead to challenges of how to access and interpret these strongly multilingual data, in a timely, efficient, and affordable manner. Scientifically, streaming online media pose new challenges, due to their shorter, noisier, and more colloquial nature. Moreover, they form a temporal stream strongly grounded in events and context. Consequently, existing language technologies fall short on accuracy, scalability and portability. Start date: 1 November 2011 End date: 31 October 2014 Project officer: Susan Fraser Factsheet on CORDIS Factsheet Website |
TTC - Terminology Extraction, Translation Tools and Comparable Corpora The TTC project aims at improving machine translation tools, computer-assisted translation tools and multilingual content management tools by automatically generating bilingual terminologies from comparable corpora in several European languages (i.e. English, French, German and Latvian), as well as in Chinese and Russian. Start date: 1 January 2010 End date: 31 December 2012 Project officer: Pierre-Paul Sondag Factsheet on CORDIS Factsheet Website |
XBack to top |
X-LIKE - Cross-Lingual Knowledge Extraction The goal of the X-LIKE project is to develop technology to monitor and aggregate knowledge that is currently spread across global mainstream and social media, and to enable cross-lingual services for publishers, media monitoring and business intelligence. In terms of research contributions, the aim is to combine scientific insights from several scientific areas to contribute in the area of cross-lingual text understanding. By combining modern computational linguistics, machine learning, text mining and semantic technologies we plan to deal with the following two key open research problems: - to extract and integrate formal knowledge from multilingual texts with cross-lingual knowledge bases, and - to adapt linguistic techniques and crowdsourcing to deal with irregularities in informal language used primarily in social media. Start date: 1 January 2012 End date: 31 December 2014 Project officer: Susan Fraser Factsheet on CORDIS Factsheet Website |
xLiMe – crossLingual crossMedia knowledge extraction Europe is different from other large media markets such as the US or China in that information is being generated in different languages and distributed via diverse streams of localised media channels. Automatic analysis is complicated further by different content types (audio, video, text) and different channels (mainstream, social media). Thus, information can only be analysed independently for each dimension. This restricts the extractable knowledge and keeps it fragmented, which ultimately constrains the exchange of information. xLiMe proposes to extract knowledge from different media channels and languages and relate it to cross-lingual, cross-media knowledge bases. Start date: 1 November 2013 End date: 31 October 2016 Project officer: Saila Rinne Factsheet on CORDIS Website Source: http://cordis.europa.eu |