Skip to content

SimpleStrat: Diversifying Language Model Generation with Stratification

⬅️ [TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Tr](<./TPO_ Aligning Large Language Models with Multi-branch & Multi-step Preference Tr.md>) | ⬆️ [Reading List](<./README.md>) | [RL In Name Only](<./RL In Name Only.md>) ➡️

SimpleStrat: Diversifying Language Model Generation with Stratification

https://arxiv.org/abs/2410.09038


⬅️ [TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Tr](<./TPO_ Aligning Large Language Models with Multi-branch & Multi-step Preference Tr.md>) | ⬆️ [Reading List](<./README.md>) | [RL In Name Only](<./RL In Name Only.md>) ➡️