Profile
go back

TAUS Corona Crisis Corpus

Description

This corpus is generated by applying Matching Data selection to TAUS DataCloud and ParaCrawl data. The query corpus used is crawled from web for latest Corona virus related articles and news. The selected data is related to virology, epidemic, medicine and healthcare. Each file contains two tab seperated columns: first column is source text and second is the target. Anyone who is training their own MT engines can download these corpora and use them to improve their translation services and systems.


Comments

Leave a Reply

  1. Add to list add to your list
See also
Saved current search
Add сollection