OpenAI o1 Principles Explained [Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs] - AI Papers, by 小远, 2024-10-21. 1. Introduction ...
OpenAI o1 Principles Explained: Q* Reinforcement Learning and Heuristic Search Reasoning Framework [Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning] - AI Papers, by 小远, 2024-09-27. 1. Introduction ...
Scaling Laws for Pre-training Agents and World Models - AI Papers, by 小远, 2024-11-20. 1. Introduction: In this chapter...
SynthVLM: High-Efficiency and High-Quality Synthetic Data for Vision Language Models - AI Papers, by 小远, 2024-11-19. 1. Abstract: In the abstract, the paper proposes a method named S...
Tencent: More Agents Is All You Need - AI Papers, by 小远, 2024-11-19. 1. Introduction ...