MSR-WS2020

Third Workshop on Multilingual Surface Realisation

Barcelona, December 12th, 2020

MSR'20 - SIGGEN event

COLING'20

COLING FAQ

Call for papers

↑

Universal Dependencies (UD)

parsing

MSR’20 invites contributions on all topics that are related to multilingual and monolingual surface realisation in NLG, specifically including and encouraging reversible methods. We welcome all submissions that address problems of surface-oriented generation such as grammatical and/or information structure-driven word order determination, inflection, functional word determination, paraphrasing, etc. We particularly encourage the submission of papers that make a clear contribution to the progress in robust multilingual surface generation, i.e. present methods easily portable from one language to another and clearly scalable. Topics of interest include, but are not limited to:

Linearisation in NLG
Multilingual approaches to surface realisation
Function word generation
Inflection in NLG
Joint generation from abstract representations
Surface-oriented text simplification
Surface-oriented spoken language generation
Application of surface realisation for grammatical error correction
NLG in surface-oriented paraphrasing
Deep learning approaches to NLG

Shared Task

↑

SR’19 Shared Task

SR’20 webpage

As in previous years, the goal of the shared task is to generate a well-formed sentence given the input structure, and there are two tracks with different levels of complexity:

Shallow Track (Track 1): This track starts from vanilla UD structures in which word order information has been removed and tokens have been lemmatised, i.e. the inputs are unordered dependency trees with lemmatised nodes that contain PoS tags and morphological information as found in the original annotations. The task is equivalent to determining the word order and inflecting the words. As indicated above, there will be both a closed (T1a) and an open (T1b) subtrack.
Deep Track (Track 2): This track starts from UD structures from which functional words (in particular, auxiliaries, functional prepositions and conjunctions) and surface-oriented morphological information have been removed. In addition to what has to be done for the Shallow Track, the Deep Track thus involves introducing the removed functional words and morphological features. Again, there will be a closed (T2a) and an open (T2b) subtracks.
Deep learning approaches to NLG

SR'20 website

Important Dates

↑

12 July 2020

28 July 2020

20 August, 2020

~~8 October 2020~~ 18 October

~~20 October 2020~~ 25 October

31 October 2020

12 December 2020

Submissions

↑

We invite long papers (8 pages) and short papers (4 pages). Both long and short papers have unlimited references, and their final versions will be given one additional page (up to 9 and 5 pages, respectively, in the proceedings and unlimited pages for references).

Softconf START conference management system

To encourage inclusiveness and the presentation of speculative and recent work, inclusion in the conference proceedings will be made optional. The author’s preference should be indicated with the final submission.

Multiple submissions policy:

Templates, guidelines and other policies:

COLING website

Registration

↑

COLING’20 registration page

Program

The workshop will consist of technical presentations, the presentation of the shared task results, an invited talk and a discussion session.

Yue Zhang

14:00 Opening

14:15

Invited Talk: Yue Zhang

AMR to text generation -- a brief review and a case study using back-parsing

15:00

Oral presentation

The Third Multilingual Surface Realisation Shared Task: Overview and Evaluation Results
Simon Mille, Anja Belz, Bernd Bohnet, Thiago Castro Ferreira, Yvette Graham, Leo Wanner

15:30 Break

15:50

15:50

16:00

16:10

16:20

16:30

16:40

Short presentation and Q&A with authors

BME-TUW at SR’20: Lexical grammar induction for surface realization
Gábor Recski, Ádám Kovács, Kinga Gémes, Judit Ács and Andras Kornai

ADAPT at SR’20: How Preprocessing and Data Augmentation Help to Improve Surface Realization
Henry Elder

IMSurReal Too: IMS in the Surface Realization Shared Task 2020
Xiang Yu, Simon Tannert, Ngoc Thang Vu and Jonas Kuhn

Lexical Induction of Morphological and Orthographic Forms for Low-Resourced Languages
Taha Tobaili

NILC at SR’20: Exploring Pre-Trained Models in Surface Realisation
Marco Antonio Sobrevilla Cabezudo and Thiago Pardo

Surface Realization Using Pretrained Language Models
Farhood Farahnak, Laya Rafiee, Leila Kosseim and Thomas Fevens

16:50 Break

17:00 Panel/Discussions

18:00 Closing

Proceedings

↑

You can download the proceedings from the ACL Anthology and see the details of the task results and participating systems:

MSR'20 workshop proceedings

MSR'19 workshop proceedings

MSR'18 workshop proceedings

Programme Committee

↑

Contact

msr.organizers@gmail.com

Organising committee

↑

Simon Mille	TALN Pompeu Fabra University, Barcelona, Spain
Anya Belz	University of Brighton UK
Bernd Bohnet	Google Research, London, UK
Thiago Castro Ferreira	Federal University of Minas Gerais, Brasil
Yvette Graham	ADAPT Center, Dublin City University, Ireland
Leo Wanner	TALN Pompeu Fabra University and ICREA, Barcelona, Spain

Funding

Photo by Christopher Burns on Unsplash

14:00	Opening
14:15	Invited Talk: Yue Zhang AMR to text generation -- a brief review and a case study using back-parsing
15:00	Oral presentation The Third Multilingual Surface Realisation Shared Task: Overview and Evaluation Results Simon Mille, Anja Belz, Bernd Bohnet, Thiago Castro Ferreira, Yvette Graham, Leo Wanner
15:30	Break
15:50 15:50 16:00 16:10 16:20 16:30 16:40	Short presentation and Q&A with authors BME-TUW at SR’20: Lexical grammar induction for surface realization Gábor Recski, Ádám Kovács, Kinga Gémes, Judit Ács and Andras Kornai ADAPT at SR’20: How Preprocessing and Data Augmentation Help to Improve Surface Realization Henry Elder IMSurReal Too: IMS in the Surface Realization Shared Task 2020 Xiang Yu, Simon Tannert, Ngoc Thang Vu and Jonas Kuhn Lexical Induction of Morphological and Orthographic Forms for Low-Resourced Languages Taha Tobaili NILC at SR’20: Exploring Pre-Trained Models in Surface Realisation Marco Antonio Sobrevilla Cabezudo and Thiago Pardo Surface Realization Using Pretrained Language Models Farhood Farahnak, Laya Rafiee, Leila Kosseim and Thomas Fevens
16:50	Break
17:00	Panel/Discussions
18:00	Closing