TranslatorEvals

We (Mikhail and Rodion) could not find a good independent benchmark of major online translators, so we decided to build one ourselves. Our bets were mostly on DeepL and Yandex respectively, and we also include Google as a baseline.

Method

Data preparation

To select diverse English sentences, we took the first 300 sentences from the C4 dataset. After filtering out sentences with 5000+ characters, 278 remained. See data/unparsed_texts.txt and parsing.ipynb.
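
The selection step can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from parsing.ipynb: the function name `select_texts` and its parameters are hypothetical, and how the C4 texts were actually loaded is not shown here.

```python
from itertools import islice


def select_texts(texts, n_first=300, max_chars=5000):
    """Take the first `n_first` texts and drop those with `max_chars` or more characters.

    `texts` can be any iterable of strings (a list, a file, a streamed dataset).
    The defaults mirror the numbers quoted above; the names are hypothetical.
    """
    head = islice(texts, n_first)  # only the first n_first items are consumed
    return [t for t in head if len(t) < max_chars]
```

Called on any iterable of strings, `select_texts(texts)` returns the texts that survive the length filter, in their original order.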

Translation

To simplify the process, texts were translated using the APIs of these three translators. Unfortunately, the Yandex API did not work for us, so we asked another person, for whom it did work, to help with the translation. Results are in trans_dl_go.csv and trans_ya.json, or all together in dataset.csv.
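
Combining the two result files into one table could look roughly like the sketch below. The file layouts are assumptions made for illustration only: the CSV is assumed to have the columns `en`, `deepl` and `google`, and the JSON is assumed to be a list of Yandex translations in the same order as the CSV rows. The real files and dataset.ipynb may be organized differently, and `merge_translations` is a hypothetical helper.

```python
import csv
import io
import json


def merge_translations(dl_go_csv_text, ya_json_text):
    """Attach Yandex translations to the DeepL/Google rows, matching by position.

    Assumed (not confirmed) layout:
      - CSV text with header columns "en", "deepl", "google"
      - JSON text holding a list of Yandex translations in the same row order
    """
    rows = list(csv.DictReader(io.StringIO(dl_go_csv_text)))
    yandex = json.loads(ya_json_text)
    if len(rows) != len(yandex):
        raise ValueError(
            f"row count mismatch: {len(rows)} CSV rows vs {len(yandex)} Yandex translations"
        )
    # Build new dicts so the input rows are left unmodified.
    return [{**row, "yandex": ya} for row, ya in zip(rows, yandex)]
```

Matching by position only works if both files preserve the original sentence order, which is why the sketch refuses to merge inputs of different lengths.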

Evaluation

Translations were compared automatically through the OpenAI API. For each sentence, the model received the following prompt:

"you are a Russian native speaker and fluent in English. you are given an original sentence in English and three candidate translations. which of them conveys the original meaning in the most accurate and fluent way? print 1, 2 or 3."
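
The judging loop around this prompt can be sketched as below. It is not the code from auto_evaluation.ipynb. The message layout, the row keys (`en` plus one key per translator) and the helper names are assumptions, and `ask_model` is a hypothetical callable standing in for the actual OpenAI API request, so the sketch makes no network calls.

```python
import re
from collections import Counter

# Prompt text as quoted above.
PROMPT = (
    "you are a Russian native speaker and fluent in English. "
    "you are given an original sentence in English and three candidate translations. "
    "which of them conveys the original meaning in the most accurate and fluent way? "
    "print 1, 2 or 3."
)


def build_user_message(original, candidates):
    """Format the original sentence and the numbered candidates (assumed layout)."""
    lines = [f"Original: {original}"]
    lines += [f"{i}. {text}" for i, text in enumerate(candidates, start=1)]
    return "\n".join(lines)


def parse_choice(reply):
    """Return the first 1, 2 or 3 found in the reply, or None if there is none."""
    match = re.search(r"[123]", reply)
    return int(match.group()) if match else None


def count_wins(rows, systems, ask_model):
    """Count how often each system's translation is picked.

    `ask_model(prompt, message)` is a hypothetical callable that returns the
    model's reply as a string; replies without a valid choice are skipped.
    """
    wins = Counter()
    for row in rows:
        candidates = [row[name] for name in systems]
        reply = ask_model(PROMPT, build_user_message(row["en"], candidates))
        choice = parse_choice(reply)
        if choice is not None:
            wins[systems[choice - 1]] += 1
    return wins
```

Passing the model call in as a function keeps the counting logic testable with a stub. Note that this sketch always presents the candidates in the same order; whether the original evaluation fixed or shuffled the order is not stated here.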
