Data release: Two new language pairs in PatTR
The ICL Statistical Natural Language Processing Group has recentley added 19M English-French and 5M French-German sentence pairs to the open source parallel corpus PatTR. The data was extracted from EPO, WIPO and USPTO patents and automatically aligned at the sentence level. For more information and download see the PatTR-webpage.