Top-rated papers from ICLR 2025
Scaling In-the-Wild Training for Diffusion-based Illumination Harmonization and Editing by Imposing Consistent Light Transport
– Rating: 9.0
– https://openreview.net/pdf?id=u1cQYxRI1H…
OLMoE: Open Mixture-of-Experts Language Models
– Rating: 8.67
– https://openreview.net/pdf?id=xXTkbTBmqq…
Compositional Entailment Learning for Hyperbolic Vision-Language Models
– Rating: 8.0
– https://openreview.net/pdf?id=3i13Gev2hV…
The Complexity of Two-Team Polymatrix Games with Independent Adversaries
– Rating: 8.0
– https://openreview.net/pdf?id=9VGTk2NYjF…
Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment
– Rating: 8.0
– https://openreview.net/pdf?id=BPgK5XW1Nb…
SAM 2: Segment Anything in Images and Videos
– Rating: 8.0
– https://openreview.net/pdf?id=Ha6RTeWMd0…
Streaming Algorithms For $\ell_p$ Flows and $\ell_p$ Regression
– Rating: 8.0
– https://openreview.net/pdf?id=Kpjvm2mB0K…
Differential Transformer
– Rating: 8.0
– https://openreview.net/pdf?id=OvoCm1gGhN…
LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization
– Rating: 8.0
– https://openreview.net/pdf?id=VpWki1v2P8…
Spider 2.0: Can Language Models Resolve Real-World Enterprise Text-to-SQL Workflows?
– Rating: 8.0
– https://openreview.net/pdf?id=XmProj9cPs…
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
– Rating: 8.0
– https://openreview.net/pdf?id=YrycTjllL0…
Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models
– Rating: 8.0
– https://openreview.net/pdf?id=tc90LV0yRL…
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
– Rating: 8.0
MAP: Multi-Human-Value Alignment Palette
– Rating: 8.0
Scaling and evaluating sparse autoencoders
– Rating: 7.8
– https://openreview.net/pdf?id=tcsZt9ZNKD…
Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment
– Rating: 7.75
– https://openreview.net/pdf?id=mtSSFiqW6y…
Simplifying, Stabilizing and Scaling Continuous-time Consistency Models
– Rating: 7.6
– https://openreview.net/pdf?id=LyJi5ugyJx…
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
– Rating: 7.6