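Understanding the principles behind OpenAI o1: Monte Carlo tree search guides large language models to self-train [ReST-MCTS∗: LLM Self-Training via Process Reward Guided Tree Search] - AI Papers by 小远 2024-11-04 The title of this paper is "ReST-MCTS∗: ...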
Scaling laws for pre-training agents and world models [SCALING LAWS FOR PRE-TRAINING AGENTS AND WORLD MODELS] - AI Papers by 小远 2024-11-20 1. Introduction: In this chapter...
SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models - AI Papers by 小远 2024-11-19 1. Abstract: In the abstract, the paper proposes a ... named S...
Tencent: More Agents Is All You Need - AI Papers by 小远 2024-11-19 1. Introduction ...