Top-rated papers from ICLR 2025
Scaling In-the-Wild Training for Diffusion-based Illumination Harmonization and Editing by Imposing Consistent Light Transport – Rating: 9.0
– https://openreview.net/pdf?id=u1cQYxRI1H…
OLMoE: Open Mixture-of-Experts Language Models
– Rating: 8.67
– https://openreview.net/pdf?id=xXTkbTBmqq…
Compositional Entailment Learning for Hyperbolic Vision-Language Models
– Rating: 8.0
– https://openreview.net/pdf?id=3i13Gev2hV…
The Complexity of Two-Team Polymatrix Games with Independent Adversaries
– Rating: 8.0
– https://openreview.net/pdf?id=9VGTk2NYjF…
Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment
– Rating: 8.0
– https://openreview.net/pdf?id=BPgK5XW1Nb…
SAM 2: Segment Anything in Images and Videos
– Rating: 8.0
– https://openreview.net/pdf?id=Ha6RTeWMd0…
Streaming Algorithms For $\ell_p$ Flows and $\ell_p$ Regression
– Rating: 8.0
– https://openreview.net/pdf?id=Kpjvm2mB0K…
Differential Transformer
– Rating: 8.0
– https://openreview.net/pdf?id=OvoCm1gGhN…
LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization
– Rating: 8.0
– https://openreview.net/pdf?id=VpWki1v2P8…
Spider 2.0: Can Language Models Resolve Real-World Enterprise Text-to-SQL Workflows?
– Rating: 8.0
– https://openreview.net/pdf?id=XmProj9cPs…
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
– Rating: 8.0
– https://openreview.net/pdf?id=YrycTjllL0…
Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models
– Rating: 8.0
– https://openreview.net/pdf?id=tc90LV0yRL…
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
– Rating: 8.0
MAP: Multi-Human-Value Alignment Palette
– Rating: 8.0
Scaling and evaluating sparse autoencoders
– Rating: 7.8
– https://openreview.net/pdf?id=tcsZt9ZNKD…
Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment
– Rating: 7.75
– https://openreview.net/pdf?id=mtSSFiqW6y…
Simplifying, Stabilizing and Scaling Continuous-time Consistency Models
– Rating: 7.6
– https://openreview.net/pdf?id=LyJi5ugyJx…
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
– Rating: 7.6

![Slow Perception: Let’s Perceive Geometric Figures Step-by-step[缓慢感知:让我们逐步感知几何图形]-AI论文](https://assh83.com/wp-content/uploads/2025/01/1-1-360x180.png)
![Large Concept Models:Language Modeling in a Sentence Representation Space[大型概念模型:在句子表示空间中的语言建模]-AI论文](https://assh83.com/wp-content/uploads/2025/01/image-1-360x180.png)
![Cultural Evolution of Cooperation among LLM Agents[大型语言模型代理间合作的文化演化]-AI论文](https://assh83.com/wp-content/uploads/2025/01/image-360x180.png)

![Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs[不要过度思考2+3等于几 在类LLM的过度思考上]-AI论文](https://assh83.com/wp-content/uploads/2025/01/1-2-350x250.png)
![Slow Perception: Let’s Perceive Geometric Figures Step-by-step[缓慢感知:让我们逐步感知几何图形]-AI论文](https://assh83.com/wp-content/uploads/2025/01/1-1-350x250.png)
![Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning[结合大型语言模型与过程奖励引导的树搜索以提升复杂推理能力]-AI论文](https://assh83.com/wp-content/uploads/2025/01/1-350x248.png)
![Large Concept Models:Language Modeling in a Sentence Representation Space[大型概念模型:在句子表示空间中的语言建模]-AI论文](https://assh83.com/wp-content/uploads/2025/01/image-1-350x250.png)