DNA 2.1
Collection
Making Qwen3 Think in Korean with Reinforcement Learning https://arxiv.org/abs/2508.10355
•
2 items
•
Updated
DNA 2.1 is a fine-tuned Qwen3 14B model that thinks natively in Korean through a two-stage training approach. This model is released alongside the paper Making Qwen3 Think in Korean with Reinforcement Learning.
This model builds upon Smoothie Qwen3, which reduces Chinese token emission probabilities and enhances Korean reasoning capabilities.
If you use this model in your research, please cite our paper:
@misc{lee2025makingqwen3thinkkorean,
title={Making Qwen3 Think in Korean with Reinforcement Learning},
author={Jungyup Lee and Jemin Kim and Sang Park and SeungJae Lee},
year={2025},
eprint={2508.10355},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2508.10355},
}