← Today/2026-03-15

📅 Sunday, March 15, 2026

Top ML papers scored in PaperBrief digests on this day.

📅

Top Papers — Mar 15

The highest-scoring arXiv ML papers from Mar 15, ranked by LLM relevance.

Share
1📄 Notable
Security Considerations for Artificial Intelligence Agents

Ninghui Li +3 · 2026-03-12

5.0

This article, a lightly adapted version of Perplexity's response to NIST/CAISI Request for Information 2025-0035, details our observations and recommendations concerning the security of frontier AI ag…

2📄 Notable
XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Guanyu Jiang +4 · 2026-03-12

5.0

Multimodal agents can now tackle complex reasoning tasks with diverse tools, yet they still suffer from inefficient tool use and inflexible orchestration in open-ended settings. A central challenge is…

3📄 Notable
Mock Worlds, Real Skills: Building Small Agentic Language Models with Synthetic Tasks, Simulated Environments, and Rubric-Based Rewards

Yuanjie Lyu +4 · 2026-01-30

5.0

Small LLMs often struggle to match the agentic capabilities of large, costly models. While reinforcement learning can help, progress has been limited by two structural bottlenecks: existing open-sourc…

4📄 Notable
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Łukasz Borchmann +14 · 2026-03-12

4.0

Multimodal agents offer a promising path to automating complex document-intensive workflows. Yet, a critical question remains: do these agents demonstrate genuine strategic reasoning, or merely stocha…

5📄 Notable
Can Small Language Models Use What They Retrieve? An Empirical Study of Retrieval Utilization Across Model Scale

Sanchit Pandey · 2026-03-12

4.0

Retrieval augmented generation RAG is widely deployed to improve factual accuracy in language models yet it remains unclear whether smaller models of size 7B parameters or less can effectively utilize…

6📄 Notable
Try, Check and Retry: A Divide-and-Conquer Framework for Boosting Long-context Tool-Calling Performance of LLMs

Kunfeng Chen +4 · 2026-03-12

4.0

Tool-calling empowers Large Language Models (LLMs) to interact with external environments. However, current methods often struggle to handle massive and noisy candidate tools in long-context tool-call…

7📄 Notable
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Weixun Wang +89 · 2025-12-31

4.0

Agentic crafting requires LLMs to operate in real-world environments over multiple turns by taking actions, observing outcomes, and iteratively refining artifacts. Despite its importance, the open-sou…

8📄 Notable
Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation

Yuxiang Zhou +4 · 2025-11-15

4.0

Mobile agents show immense potential, yet current state-of-the-art (SoTA) agents exhibit inadequate success rates on real-world, long-horizon, cross-application tasks. We attribute this bottleneck to …

9📄 Notable
TURA: Tool-Augmented Unified Retrieval Agent for AI Search

Zhejun Zhao +10 · 2025-08-06

4.0

The advent of Large Language Models (LLMs) is transforming search engines into conversational AI search products, primarily using Retrieval-Augmented Generation (RAG) on web corpora. However, this par…

10📄 Notable
CARROT: A Learned Cost-Constrained Retrieval Optimization System for RAG

Ziting Wang +4 · 2024-11-01

4.0

Large Language Models (LLMs) have demonstrated impressive ability in generation and reasoning tasks but struggle with handling up-to-date knowledge, leading to inaccuracies or hallucinations. Retrieva…

Want papers like these in your inbox?

PaperBrief sends you a personalised daily digest of the arXiv papers that actually matter for your research track.

Get your personalised digest →