Ruprecht-Karls-Universität Heidelberg
Institut für Computerlinguistik

Bilder vom Neuenheimer Feld, Heidelberg und der Universität Heidelberg
Siegel der Uni Heidelberg

Data release: Two new language pairs in PatTR

The ICL Statistical Natural Language Processing Group has recentley added 19M English-French and 5M French-German sentence pairs to the open source parallel corpus PatTR. The data was extracted from EPO, WIPO and USPTO patents and automatically aligned at the sentence level. For more information and download see the PatTR-webpage.

zum Seitenanfang