本期的 9 篇论文如下:
00:23 🧮 ProcessBench: Identifying Process Errors in Mathematical Reasoning(ProcessBench:识别数学推理中的过程错误)
01:13 🧠 Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation(揭开强化学习代理中记忆复杂性的分类与评估方法)
01:58 🧠 Training Large Language Models to Reason in a Continuous Latent Space(在连续潜在空间中训练大型语言模型进行推理)
02:38 🌐 Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models(探索多粒度概念注释在多模态大语言模型中的应用)
03:22 🎥 Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation(Divot:基于扩散模型的视频理解与生成)
04:09 🎥 You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale(所见即所得:在无姿态视频上大规模学习3D创作)
04:53 🌍 Global and Dense Embeddings of Earth: Major TOM Floating in the Latent Space(地球的全局与密集嵌入:潜在空间中的Major TOM浮动)
05:31 🌐 Robust Multi-bit Text Watermark with LLM-based Paraphrasers(基于LLM的鲁棒多比特文本水印)
06:15 🤖 CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction(CARP:通过粗到细自回归预测进行视觉运动策略学习)
![](https://image.xyzcdn.net/Fk_JHl8Usx_sSQKGUXteZlas7O1r.webp)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递