StatNLP HD Blog

This is the blog of the Statistical NLP Group at the Department of Computational Linguistics, Heidelberg University. Our research addresses various aspects of the problem of the confusion of languages, by means of statistical learning for natural language processing.

We blog about pitfalls in methodologies, recent advances and important problems in this field of research.

Recent Posts

RL in NMT: The Good, the Bad and the Ugly

Discussing good, bad and ugly practices of reinforcement learning in neural machine translation.

Julia Kreutzer

Updated: January 16, 2020

Tags: BLEU, expected reward, exposure bias, future challenges, monolingual data, neural machine translation, policy gradient, reinforcement learning, sampling

Categories: opinion piece, paper discussion

Translating Middle Egyptian Hieroglyphs

This blog post gives an overview of the paper “Multi-Task Modeling of Phonographic Languages: Translating Middle Egyptian Hieroglyphs”.

Philipp Wiesenbach

Updated: October 29, 2019

Tags: low resource scenario, multi-task modeling, neural machine translation

Categories: new publication

Joey NMT - A Minimalist NMT Toolkit for Novices

Introducing Joey NMT, a minimalist neural machine translation framework for novices built on Pytorch.

Julia Kreutzer

Updated: October 23, 2019

Tags: beginners, neural machine translation

Categories: new publication, software

Response-Based and Counterfactual Learning for Sequence-to-Sequence Tasks in NLP: An Overview

This post presents a summary of my PhD thesis. I explored how to learn from feedback given to model outputs when the collection of direct supervision signals...

Carolin Lawrence

Updated: August 15, 2019

Tags: machine translation, reinforcement learning, semantic parsing

Categories: overview, PhD thesis

The Real Challenge of Real-World Reinforcement Learning: The Human Factor

How can we give RL agents that learn from human feedback a possible advantage to succeed in this difficult learning scenario?

Stefan Riezler

Updated: July 26, 2019

Tags: human factor, neural machine translation, reinforcement learning

Categories: opinion piece, paper discussion

Counterfactual Learning of Semantic Parsers When Even Gold Answers Are Unattainable

How can we train semantic parsers if neither question-parse nor question-answer pairs can be collected?

Carolin Lawrence

Updated: January 14, 2019

Tags: MRT, reinforcement learning, REINFORCE, semantic parsing

Categories: overview

Taming Wild Reward Functions: The Score Function Gradient Estimator Trick

This post explains the need for the score function gradient estimator trick and how it works.

Carolin Lawrence

Updated: November 12, 2018

Tags: expected reward, exposure bias, MRT, neural machine translation, policy gradient, REINFORCE, sampling, score function gradient estimator, semantic parsing

Categories: explanation