#ai-alignment
Every summary, chronological. Filter by category, tag, or source from the rail.
Tag · #ai-alignment
Tandem Reinforcement Learning: Aligning AI Reasoning with Humans
Tandem Reinforcement Learning (TRL) forces stronger models to co-generate reasoning with weaker models, resulting in more legible, robust, and human-compatible chains of thought without sacrificing performance.
arXiv cs.AI
Showing 1 of 1