Rigorous Error Certification for Neural PDE Solvers: From Empirical Residuals to Solution GuaranteesAmartya Mukherjee, Maxwell Fitzsimmons, David C. Del Rey Fernández et al.
cs.LGmath.APmath.FAMar 19, 2026
Uncertainty quantification for partial differential equations is traditionally grounded in discretization theory, where solution error is controlled via mesh/grid refinement. Physics-informed neural n…
SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels Against Hardware LimitsEdward Lin, Sahil Modi, Siva Kumar Sastry Hari et al.
cs.LGcs.AIMar 19, 2026
As agentic AI systems become increasingly capable of generating and optimizing GPU kernels, progress is constrained by benchmarks that reward speedup over software baselines rather than proximity to h…
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy DistillationZhuolin Yang, Zihan Liu, Yang Chen et al.
cs.CLcs.AIcs.LGMar 19, 2026
We introduce Nemotron-Cascade 2, an open 30B MoE model with 3B activated parameters that delivers best-in-class reasoning and strong agentic capabilities. Despite its compact size, its mathematical an…
NavTrust: Benchmarking Trustworthiness for Embodied NavigationHuaide Jiang, Yash Chaudhary, Yuping Wang et al.
cs.ROcs.AIcs.CVcs.LGMar 19, 2026
There are two major categories of embodied navigation: Vision-Language Navigation (VLN), where agents navigate by following natural language instructions; and Object-Goal Navigation (OGN), where agent…
DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent DenoisingTianjiao Yu, Xinzhuo Li, Muntasir Wahed et al.
cs.CVcs.AIcs.LGMar 19, 2026
Understanding and generating 3D objects as compositions of meaningful parts is fundamental to human perception and reasoning. However, most text-to-3D methods overlook the semantic and functional stru…
How Uncertainty Estimation Scales with Sampling in Reasoning ModelsMaksym Del, Markus Kängsepp, Marharyta Domnich et al.
cs.AIcs.CLcs.LGMar 19, 2026
Uncertainty estimation is critical for deploying reasoning language models, yet remains poorly understood under extended chain-of-thought reasoning. We study parallel sampling as a fully black-box app…
Meanings and Measurements: Multi-Agent Probabilistic Grounding for Vision-Language NavigationSwagat Padhan, Lakshya Jain, Bhavya Minesh Shah et al.
cs.ROcs.AIcs.CLcs.CVMar 19, 2026
Robots collaborating with humans must convert natural language goals into actionable, physically grounded decisions. For example, executing a command such as "go two meters to the right of the fridge"…
SAVeS: Steering Safety Judgments in Vision-Language Models via Semantic CuesCarlos Hinojosa, Clemens Grange, Bernard Ghanem
cs.CVcs.AIcs.CLcs.LGMar 19, 2026
Vision-language models (VLMs) are increasingly deployed in real-world and embodied settings where safety decisions depend on visual context. However, it remains unclear which visual evidence drives th…
Adaptive Regime-Aware Stock Price Prediction Using Autoencoder-Gated Dual Node Transformers with Reinforcement Learning ControlMohammad Al Ridhawi, Mahtab Haj Ali, Hussein Al Osman
cs.LGcs.AIq-fin.STMar 19, 2026
Stock markets exhibit regime-dependent behavior where prediction models optimized for stable conditions often fail during volatile periods. Existing approaches typically treat all market states unifor…
CustomTex: High-fidelity Indoor Scene Texturing via Multi-Reference CustomizationWeilin Chen, Jiahao Rao, Wenhao Wang et al.
cs.CVcs.AIMar 19, 2026
The creation of high-fidelity, customizable 3D indoor scene textures remains a significant challenge. While text-driven methods offer flexibility, they lack the precision for fine-grained, instance-le…
D5P4: Partition Determinantal Point Process for Diversity in Parallel Discrete Diffusion DecodingJonathan Lys, Vincent Gripon, Bastien Pasdeloup et al.
cs.AIcs.LGMar 19, 2026
Discrete diffusion models are promising alternatives to autoregressive approaches for text generation, yet their decoding methods remain under-studied. Standard decoding methods for autoregressive mod…
ARIADNE: A Perception-Reasoning Synergy Framework for Trustworthy Coronary Angiography AnalysisZhan Jin, Yu Luo, Yizhou Zhang et al.
cs.CVcs.AIMar 19, 2026
Conventional pixel-wise loss functions fail to enforce topological constraints in coronary vessel segmentation, producing fragmented vascular trees despite high pixel-level accuracy. We present ARIADN…
Hypothesis-Conditioned Query Rewriting for Decision-Useful RetrievalHangeol Chang, Changsun Lee, Seungjoon Rho et al.
cs.CLcs.AIcs.LGMar 19, 2026
Retrieval-Augmented Generation (RAG) improves Large Language Models (LLMs) by grounding generation in external, non-parametric knowledge. However, when a task requires choosing among competing options…
dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language ModelsWenxuan Zhang, Lemeng Wu, Changsheng Zhao et al.
cs.AIMar 19, 2026
Diffusion Large Language Models (dLLMs) introduce a new paradigm for language generation, which in turn presents new challenges for aligning them with human preferences. In this work, we aim to improv…
Regret Bounds for Competitive Resource Allocation with Endogenous CostsRui Chai
cs.AIcs.DScs.GTcs.LGMar 19, 2026
We study online resource allocation among N interacting modules over T rounds. Unlike standard online optimization, costs are endogenous: they depend on the full allocation vector through an interacti…
From Accuracy to Readiness: Metrics and Benchmarks for Human-AI Decision-MakingMin Hun Lee
cs.HCcs.AIcs.LGMar 19, 2026
Artificial intelligence (AI) systems are deployed as collaborators in human decision-making. Yet, evaluation practices focus primarily on model accuracy rather than whether human-AI teams are prepared…
Em-Garde: A Propose-Match Framework for Proactive Streaming Video UnderstandingYikai Zheng, Xin Ding, Yifan Yang et al.
cs.CVcs.AIMar 19, 2026
Recent advances in Streaming Video Understanding has enabled a new interaction paradigm where models respond proactively to user queries. Current proactive VideoLLMs rely on per-frame triggering decis…
Geography According to ChatGPT -- How Generative AI Represents and Reasons about GeographyKrzysztof Janowicz, Gengchen Mai, Rui Zhu et al.
cs.AIcs.CYMar 19, 2026
Understanding how AI will represent and reason about geography should be a key concern for all of us, as the broader public increasingly interacts with spaces and places through these systems. Similar…
Foundations of Schrödinger Bridges for Generative ModelingSophia Tang
cs.LGcs.AIMar 19, 2026
At the core of modern generative modeling frameworks, including diffusion models, score-based models, and flow matching, is the task of transforming a simple prior distribution into a complex target d…
AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data ScienceAn Luo, Jin Du, Xun Xian et al.
cs.LGcs.AIstat.MEMar 19, 2026
Data science plays a critical role in transforming complex data into actionable insights across numerous domains. Recent developments in large language models (LLMs) and artificial intelligence (AI) a…
SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language ModelsQuentin Guimard, Federico Bartsch, Simone Caldarella et al.
cs.CVcs.AIcs.LGMar 19, 2026
Models that bridge vision and language, such as CLIP, are key components of multimodal AI, yet their large-scale, uncurated training data introduce severe social and spurious biases. Existing post-hoc…
FedTrident: Resilient Road Condition Classification Against Poisoning Attacks in Federated LearningSheng Liu, Panos Papadimitratos
cs.CRcs.AIcs.DCcs.LGMar 19, 2026
FL has emerged as a transformative paradigm for ITS, notably camera-based Road Condition Classification (RCC). However, by enabling collaboration, FL-based RCC exposes the system to adversarial partic…
Evaluating Game Difficulty in Tetris Block PuzzleChun-Jui Wang, Jian-Ting Guo, Hung Guei et al.
cs.AIcs.LGMar 19, 2026
Tetris Block Puzzle is a single player stochastic puzzle in which a player places blocks on an 8 x 8 grid to complete lines; its popular variants have amassed tens of millions of downloads. Despite th…
Translating MRI to PET through Conditional Diffusion Models with Enhanced Pathology AwarenessYitong Li, Igor Yakushev, Dennis M. Hedderich et al.
cs.CVcs.AIMar 19, 2026
Positron emission tomography (PET) is a widely recognized technique for diagnosing neurodegenerative diseases, offering critical functional insights. However, its high costs and radiation exposure hin…
MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language ModelYoungwan Lee, Soojin Jang, Yoorhim Cho et al.
cs.CVcs.AIMar 19, 2026
Spatial reasoning is foundational for Vision-Language Models (VLMs), particularly when deployed as Vision-Language-Action (VLA) agents in physical environments. However, existing benchmarks predominan…
RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language ModelsXiao Feng, Bo Han, Zhanke Zhou et al.
cs.AIcs.CLcs.LGMar 19, 2026
Reinforcement learning (RL) holds significant promise for enhancing the agentic reasoning capabilities of large language models (LLMs) with external environments. However, the inherent sparsity of ter…
Motion-o: Trajectory-Grounded Video ReasoningBishoy Galoaa, Shayda Moezzi, Xiangyu Bai et al.
cs.CVcs.AIMar 19, 2026
Recent research has made substantial progress on video reasoning, with many models leveraging spatio-temporal evidence chains to strengthen their inference capabilities. At the same time, a growing se…
Are complicated loss functions necessary for teaching LLMs to reason?Gabriele Carrino, Andrea Sassella, Nicolo Brunello et al.
cs.LGcs.AIcs.CLMar 19, 2026
Recent advances in large language models (LLMs) highlight the importance of post training techniques for improving reasoning and mathematical ability. Group Relative Policy Optimization (GRPO) has sho…
Perceptio: Perception Enhanced Vision Language Models via Spatial Token GenerationYuchen Li, Amanmeet Garg, Shalini Chaudhuri et al.
cs.CVcs.AIMar 19, 2026
Large Vision Language Models (LVLMs) excel at semantic understanding but struggle with fine grained spatial grounding, as the model must implicitly infer complex geometry without ever producing a spat…
Online Learning and Equilibrium Computation with Ranking FeedbackMingyang Liu, Yongshan Chen, Zhiyuan Fan et al.
cs.LGcs.CLcs.GTMar 19, 2026
Online learning in arbitrary, and possibly adversarial, environments has been extensively studied in sequential decision-making, and it is closely connected to equilibrium computation in game theory. …
Track Diffusion & Generative — Get notified when new papers are scored
Sign up free and get daily digests tailored to your research interests.
Sign up free