Welcome to the Statistical Natural Language Processing Group at the Institute for Computational Linguistics at Heidelberg University. Our research is on the intersection of machine learning and natural language processing, with a special focus on interactive statistical learning techniques. For example, we work on interactive neural machine translation and neural question answering systems, where an artificial intelligence agent learns from human reinforcement/bandit feedback.

We organize the weekly Statistical NLP Colloquium.

group photo
StatNLP group @ Botanical Gardens (next to our department)

Latest news

New publication at IWSLT 2023

New research from the StatNLP group on Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic Transcripts will be presented at IWSLT 2023. The paper is available here.

New publication at EAMT2023

New research from the StatNLP group about Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training will be presented at EAMT 2023. The paper is available here.

New publication at ICASSP 2023

New publication from the StatNLP group on Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation appears at ICASSP 2023. The paper is available here.

New publication at ICLR 2023

New research from the StatNLP group on Towards Inferential Reproducibility of Machine Learning Research appears at ICLR 2023. The paper is available here.

New publication at EMNLP 2022 Demo Track

JoeyS2T will be presented at the demo track in EMNLP 2022. The paper is available on arxiv. We released pre-trained models for the LibriSpeech ASR and MuST-C en-de Speech Translation benchmarks here.