09/04/2026 1:49 PM

⬅️ [08/04/2026 1:13 PM](<./08_04_2026 1_13 PM.md>) | ⬆️ [2026 - April](<./README.md>) | [14/04/2026 10:30 AM](<./14_04_2026 10_30 AM.md>) ➡️

Screenshot 2026-04-09 at 1.49.36 PM.png

AI-generated translations make native speakers go, "Huh, that's not how I would say that."
Might bring that into the argument.

Screenshot 2026-04-09 at 1.53.52 PM.png

Haha, the benchmarks were themselves built from generated data. So training on auto-translated data increased benchmark scores but reduced real-world performance, while training on real data decreased benchmark scores.

Quality and diversity are more important than quantity.

Screenshot 2026-04-09 at 2.15.21 PM.png
