TAUS Corona Crisis Corpus


This corpus is generated by applying Matching Data selection to TAUS DataCloud and ParaCrawl data. The query corpus used is crawled from web for latest Corona virus related articles and news. The selected data is related to virology, epidemic, medicine and healthcare. Each file contains two tab seperated columns: first column is source text and second is the target. Anyone who is training their own MT engines can download these corpora and use them to improve their translation services and systems.


Добавить комментарий

  1. Добавить в избранное add to your list
Смотреть также
Сохранить текущий поиск
Добавить подборку