SimpleStrat: Diversifying Language Model Generation with Stratification
⬅️ [TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Tr](<./TPO_ Aligning Large Language Models with Multi-branch & Multi-step Preference Tr.md>) | ⬆️ [Reading List](<./README.md>) | [RL In Name Only](<./RL In Name Only.md>) ➡️
SimpleStrat: Diversifying Language Model Generation with Stratification
https://arxiv.org/abs/2410.09038
⬅️ [TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Tr](<./TPO_ Aligning Large Language Models with Multi-branch & Multi-step Preference Tr.md>) | ⬆️ [Reading List](<./README.md>) | [RL In Name Only](<./RL In Name Only.md>) ➡️