Want the latest?

And more!

LTO investigated the possibility to use ESIF (European Structural and Investment Funds) for language technology project ideas. Have a look at our dedicated web page with opportunities country by country, often including regional "smart specialisations". Take an active step and find new ways to get your project funded!

Last charrete “Collaborating around language resources for the digital single market”

Charrete “Collaborating around language resources for the digital single market”

 

The LT Observatory project aims at facilitating the access to language resources (LR) for research and for a commercial use, in an operational way.  As result a catalogue of  LRs (Lt-observe) has been implemented. In this charrette, held in Paris (8 of December) “Collaborating around language resources for the digital single market”, a dialogue among attendants was promoted for analysing:

  • The best actions to support the sustainability of the Lt-observe Catalogue.
  • The next steps and the main barriers to promote Language Technologies as a key enable in the Digital Single Market.  

As result of this dialogue, some collaborative actions among repository infrastructures were defined to improve Lt-observe Catalogue and to define its evolution. Additionally, some agreements were reached to start with the definition of a LT strategy that will really contribute to launch language technologies as a key enabler of the Digital Single Market.   

The charrette was co-organised by ELDA/ELRA premises and counted with 9 attendants representing key stakeholders in Language Resources: repositories infrastructures, research and industry.

FETLT 2016 and Final Workshop LT-Observatory

LTO Final Event FETLT 2016 Sevilla (Spain) - Presentations

 

Future and Emerging Trends in Language Technologies 2016, Sevilla, Spain brought together researchers, academics and industry representatives for a 3-days interactive conference. The second half of the first day was dedicated to the Final Workshop of the LT-Observatory project.

See our summary on Storify

Download the presentations:

Future and Emerging Trends in Language Technologies, Machine Learning and Big Data.

Luz Esparza, ZABALA Innovation Consulting. 

A Path Through the Funding Maze: Funding Programmes for Language Technologies.

Margaretha Mazura, EMF.

Access is King: The LTO LR on-line Catalogue

Philippe Wacker, LT-Innovate.

What MT professionals need to know and what decision makers should know” -  A practical vademecum.

Vesna Lušicky, University of Vienna.

Multilingual Europe in late 2016 A Strategic Research and Innovation Agenda for the Multilingual Digital Single Market.

Georg Rehm, Coordinator CRACKER, General Secretary.

The challenge of collecting and evaluating LRs for commercial use.

Bente Maegaard, CLARIN ERIC(and University of Copenhagen).

Workshop on open data and language processing technologies: An opportunity not to be missed pre event IODC 2016

open data and language processing technologies: An opportunity not to be missed pre event IODC 2016

 

The LT Observatory team has participated recently in the workshop on open data and language processing technologies: An opportunity not to be missed pre event of the IODC 2016 conference. The event was held in Madrid and some interest key issues related to the future of open data language resources were exposed to the attendees.

This presentation tried to show both the potential and the challenges that open data language resources (LRs) represent for the industry. Which is the current position of industry towards open LRs? Which open LRs are available? Which would be the best way to encourage open data LRs?... were some of the main points of discussion.

In this context, the EU’s political and economic goal of establishing a Digital Single Market (DSM) to boost on-line commerce and foster exchange of content and data across Europe is being held back by linguistic fragmentation. The European Commission predicts that the transition to the integrated DSM will deliver up to €400 billion in economic growth by 2020. However, the connected and truly integrated DSM can only exist once all language barriers have been overcome and all languages are connected through technologies.

Some of the main conclusions after discussions in the workshop were the following:

  • There are Open Language Resources, but we should to make them more useful for a research, operational and commercial use: How? Enriching them with useful metadata and providing tools for statistical Machine Translation.
  • “Share” is a key word. There are a lot of isolated research and industrial solutions, regarding LR.  If they were more viewable, they could produce a big advance in language technologies.
  • IPR and copyright key issues must be understood, simplified and harmonize all over Europe.

Charrette “Optimising knowledge and practice for operational language resources”

Charrette Paris

 

The LT Observatory project aimed at looking forward to the kind of technologies, environment and use cases that are likely to emerge in the LR-MT space in the next 2 to 5 years. The charrette recently held in Paris (22 of September) “Optimising knowledge and practice for operational language resources” was focused on promoting dialogue among attendants on:

  • Standards of best practice in LR management (find, create, share, evaluate, use, improve)
  • Tools to enable such practices, and also to rapidly expand the range of available LRs across more (EU) language pairs

Also a discussion about Standards for LR metadata issues that may in future target such textual dimensions as semantics, domain-centric, rhetorical, stylistic, etc. for future applications of MT was perfomed. That is, going beyond the current formal criteria of file type etc., and anticipating rich content-based metadata criteria that can better (and automatically) target LRs that meet the right quality standards.

The charrette was co-organised by ELDA/ELRA premises, counted with 19 attendants with different profiles within the MT, LR field: researchers, companies, public entities…

The results of this charrette  will feed into our Final Conference, to be organised in November 2016, at which the tools and findings of the LTO project will be presented to a broad cross-selection of stakeholders.

Call for Participation: Neural MT Marathon 2016

MT Marathon 2016

 

MT Marathon 2016
Prague, September 12-17, 2016
www.statmt.org/mtm16

The programme is ready and we put an unprecedented focus on neural machine translation.

In addition to your laptop and hacking projects in mind, make sure to bring:

  • your high school memories of algebra
  • pencil and paper for old-style exercise lesson and exam
  • your Amazon account to get a $50 voucher for (GPU) computations

Register at: www.statmt.org/mtm16/registration
(Free of charge but limited to 100 places.)

** Project proposals are still more than welcome **

As always, project topics will get finalized on the first day of the Marathon, but it was found useful in the past to announce and refine project proposals ahead of time.

If you have an idea what you'd like to implement in a small team of fellow participants, or if you just want to peek at what is going to be proposed, have a look or edit the live document linked from:

www.statmt.org/mtm16/projects.html

Dialogue Day in the 5th Language Technology Industry Summit - 17-18 May 2016

lt-innovate lti-summit programme

For the fifth consecutive year, LT-Innovate, Europe's language technology industry association, organised the LT Industry Summit on 17-18 May 2016 in Brussels. The LT Industry Summit is the yearly point of convergence between the language technology industry, its clients, research partners and policy makers.

This year's summit programme included best practice presentations by language technology suppliers and users, presentations of new solutions and technologies, buyer presentations of needs in terms of language technologies and presentations of research output. Taking advantage of the fact that the most relevant stakeholders in the area were present, the second Dialogue Day of the LT_Observatory project was held with the topic: “Language  infrastructures and Language Resources: Pulling the Pieces of the Jigsaw Together”.

You can find the presentations at: http://www.lt-innovate.org/lti-summit/programme

You can find the summary about the event in Storify

1st review meeting and coordination meeting (Luxembourg, 3-4 February 2016)

Second Charrette (Brussels, 4 December 2015)

Objectives: 1) get the expertise of a group of experts on the nature and quality of the first set of linguistic resources collected by the Observatory LT-Catalog; (2) work with experts to determine the essential criterion for practical use of the Catalog; and (3) explore all other ways to develop and capture opportunities for the LT-O Catalog.

1st Charrette (Vienna, 8/9/10 June 2015)

It was held on the occasion of the 20th European Symposium on Languages for Special Purposes (Vienna) in the Centre for Translation Studies (Zentrum für Translationswissenschaft) of the University of Vienna, Austria.

Objective: get input from terminology experts on best practices in terminology construction/collection, management and delivery for practical applications in an automated translation scenario in Europe. The consortium was particularly interested in identifying the main gaps in the existing coverage of terminology and in the national and local languages in which terminology is expressed. 

flickr PICTURES on our FLICKR

Dialogue Workshop on Language Resources (Brussels 25/26 June 2015)

The first LTO Dialogue Workshop entitled "Language Resources - Foundations of the Multilingual Digital Single Market" took place on 25 June 2015 in Brussels, on the occasion of the LT-Innovate Summit. High-level keynote speakers set the scene in the morning plenary, followed by the EC's CEF. 

Workshop Summary 

In three sessions, representatives from industry, research and public administrations discussed the demand and supply of language resources, their availability and accessibility, followed by a third sessoin on how to reconcile both and close the gap of missing language resources. Very early in the discussion, it became apparent that the purpose of the use of LR is paramount when defining usability. While existing repositories like META-SHARE, ELRA or CLARIN VLE did a remarkable work in compiling LRs, their usability differs depending on the user. Industry users often prefer their networks combined with clients' resources. New sources like Wikipedia translate may be future sources based on crowd-sourcing. Opus or Babelnet are other open access LRs, like the Acquis Communautaire of the EU. 

More information 

 PICTURES on our FLICKR