Differentiable latent structure discovery for interpretable forecasting in clinical time seriesIvan Lerner, Jean Feydy, Alexandre Kalimouttou et al.
cs.LGApr 30, 2026
Background: Timely, uncertainty-aware forecasting from irregular electronic health records (EHR) can support critical-care decisions, yet most approaches either impute to a grid or sacrifice interpret…
Strait: Perceiving Priority and Interference in ML Inference ServingHaidong Zhao, Nikolaos Georgantas
cs.LGApr 30, 2026
Machine learning (ML) inference serving systems host deep neural network (DNN) models and schedule incoming inference requests across deployed GPUs. However, limited support for task prioritization an…
TwinGate: Stateful Defense against Decompositional Jailbreaks in Untraceable Traffic via Asymmetric Contrastive LearningBowen Sun, Chaozhuo Li, Yaodong Yang et al.
cs.CRcs.CLcs.LGApr 30, 2026
Decompositional jailbreaks pose a critical threat to large language models (LLMs) by allowing adversaries to fragment a malicious objective into a sequence of individually benign queries that collecti…
PRISM: Pre-alignment via Black-box On-policy Distillation for Multimodal Reinforcement LearningSudong Wang, Weiquan Huang, Xiaomin Yu et al.
cs.CVcs.AIcs.CLApr 30, 2026
The standard post-training recipe for large multimodal models (LMMs) applies supervised fine-tuning (SFT) on curated demonstrations followed by reinforcement learning with verifiable rewards (RLVR). H…
MIFair: A Mutual-Information Framework for Intersectionality and Multiclass FairnessJeanne Monnier, Thomas George, Frédéric Guyard et al.
cs.LGcs.AIcs.CYcs.ITApr 30, 2026
Fairness in machine learning remains challenging due to its ethical complexity, the absence of a universal definition, and the need for context-specific bias metrics. Existing methods still struggle w…
TransVLM: A Vision-Language Framework and Benchmark for Detecting Any Shot TransitionsCe Chen, Yi Ren, Yuanming Li et al.
cs.CVcs.AIApr 30, 2026
Traditional Shot Boundary Detection (SBD) inherently struggles with complex transitions by formulating the task around isolated cut points, frequently yielding corrupted video shots. We address this f…
VibroML: an automated toolkit for high-throughput vibrational analysis and dynamic instability remediation of crystalline materials using machine-learned potentialsRogério Almeida Gouvêa, Gian-Marco Rignanese
cond-mat.mtrl-scics.AIcs.LGphysics.comp-phApr 30, 2026
While machine-learned interatomic potentials (MLIPs) accelerate phonon dispersion calculations, merely identifying dynamical instabilities in computationally predicted materials is insufficient; autom…
Heterogeneous Scientific Foundation Model CollaborationZihao Li, Jiaru Zou, Feihao Fang et al.
cs.AIcs.CLcs.LGApr 30, 2026
Agentic large language model systems have demonstrated strong capabilities. However, their reliance on language as the universal interface fundamentally limits their applicability to many real-world p…
GUI Agents with Reinforcement Learning: Toward Digital InhabitantsJunan Hu, Jian Liu, Jingxiang Lai et al.
cs.AIcs.CVApr 30, 2026
Graphical User Interface (GUI) agents have emerged as a promising paradigm for intelligent systems that perceive and interact with graphical interfaces visually. Yet supervised fine-tuning alone canno…
Training-Free Tunnel Defect Inspection and Engineering Interpretation via Visual Recalibration and Entity ReconstructionShipeng Liu, Liang Zhao, Dengfeng Chen et al.
cs.CVcs.AIApr 30, 2026
Tunnel inspection requires outputs that can support defect localization, measurement, severity grading, and engineering documentation. Existing training-free foundation-model pipelines usually stop at…
Synthetic Computers at Scale for Long-Horizon Productivity SimulationTao Ge, Baolin Peng, Hao Cheng et al.
cs.AIcs.CLcs.LGApr 30, 2026
Realistic long-horizon productivity work is strongly conditioned on user-specific computer environments, where much of the work context is stored and organized through directory structures and content…
In-Context Prompting Obsoletes Agent Orchestration for Procedural TasksSimon Dennis, Michael Diamond, Rivaan Patil et al.
cs.AIcs.LGApr 30, 2026
Agent orchestration frameworks -- LangGraph, CrewAI, Google ADK, OpenAI Agents SDK, and others -- place an external orchestrator above the LLM, tracking state and injecting routing instructions at eve…
The Effects of Visual Priming on Cooperative Behavior in Vision-Language ModelsKenneth J. K. Ong
cs.AIcs.CVApr 30, 2026
As Vision-Language Models (VLMs) become increasingly integrated into decision-making systems, it is essential to understand how visual inputs influence their behavior. This paper investigates the effe…
Measuring research data reuse in scholarly publications using generative artificial intelligence: Open Science Indicator development and preliminary resultsLauren Cadwallader, Iain Hrynaszkiewicz, parth sarin et al.
cs.DLcs.CLApr 30, 2026
Numerous metascience studies and other initiatives have begun to monitor the prevalence of open science practices when it is more important to understand the 'downstream' effects or impacts of open sc…
Simulating clinical interventions with a generative multimodal model of human physiologyGuy Lutsker, Gal Sapir, Jordi Merino et al.
cs.AIApr 30, 2026
Understanding how human health changes over time, and why responses to interventions vary between individuals, remains a central challenge in medicine. Here we present HealthFormer, a decoder-only tra…
Defending Quantum Classifiers against Adversarial Perturbations through Quantum AutoencodersEmma Andrews, Sahan Sanjaya, Prabhat Mishra
quant-phcs.LGApr 30, 2026
Machine learning models can learn from data samples to carry out various tasks efficiently. When data samples are adversarially manipulated, such as by insertion of carefully crafted noise, it can cau…
FlexiTac: A Low-Cost, Open-Source, Scalable Tactile Sensing Solution for Robotic SystemsBinghao Huang, Yunzhu Li
cs.ROcs.AIcs.LGApr 30, 2026
We present FlexiTac, a low-cost, open-source, and scalable piezoresistive tactile sensing solution designed for robotic end-effectors. FlexiTac is a practical "plug-in" module consisting of (i) thin, …
Efficient Multivector Retrieval with Token-Aware Clustering and Hierarchical IndexingSilvio Martinico, Franco Maria Nardini, Cosimo Rulli et al.
cs.IRcs.LGApr 30, 2026
Multivector retrieval models achieve state-of-the-art effectiveness through fine-grained token-level representations, but their deployment incurs substantial computational and memory costs. Current so…
Sequential Inference for Gaussian Processes: A Signal Processing PerspectiveDaniel Waxman, Fernando Llorente, Petar M. Djurić
eess.SPcs.LGstat.COstat.MLApr 30, 2026
The proliferation of capable and efficient machine learning (ML) models marks one of the strongest methodological shifts in signal processing (SP) in its nearly 100-year history. ML models support the…
Differential Subgroup Discovery: Characterizing Where Two Populations Differ, and WhySascha Xu, Jilles Vreeken
cs.LGApr 30, 2026
We study the problem of understanding where two populations differ within a feature space, which we formalize in the concept of a differential subgroup: a subset of individuals from both populations w…
Explainable Load Forecasting with Covariate-Informed Time Series Foundation ModelsMatthias Hertel, Alexandra Nikoltchovska, Sebastian Pütz et al.
cs.LGApr 30, 2026
Time Series Foundation Models (TSFMs) have recently emerged as general-purpose forecasting models and show considerable potential for applications in energy systems. However, applications in critical …
Mind the Gap: Structure-Aware Consistency in Preference LearningMehryar Mohri, Yutao Zhong
cs.LGstat.MLApr 30, 2026
Preference learning has become the foundation of aligning Large Language Models (LLMs) with human intent. Popular methods, such as Direct Preference Optimization (DPO), minimize surrogate losses as pr…
Linear-Core Surrogates: Smooth Loss Functions with Linear Rates for Classification and Structured PredictionMehryar Mohri, Yutao Zhong
cs.LGstat.MLApr 30, 2026
The choice of loss function in classification involves a fundamental trade-off: smooth losses (like Cross-Entropy) enable fast optimization rates but yield slow square-root consistency bounds, while p…
Exploration Hacking: Can LLMs Learn to Resist RL Training?Eyon Jang, Damon Falck, Joschka Braun et al.
cs.LGcs.CLApr 30, 2026
Reinforcement learning (RL) has become essential to the post-training of large language models (LLMs) for reasoning, agentic capabilities and alignment. Successful RL relies on sufficient exploration …
Why Self-Supervised Encoders Want to Be NormalYuval Domb
cs.ITcs.AIcs.LGApr 30, 2026
We develop a geometric and information-theoretic framework for encoder-decoder learning built on the Information Bottleneck (IB) principle. Recasting IB as a rate-distortion problem with Kullback-Leib…
Learning When to Remember: Risk-Sensitive Contextual Bandits for Abstention-Aware Memory Retrieval in LLM-Based Coding AgentsMehmet Iscan
cs.CLcs.AIcs.LGApr 30, 2026
Large language model (LLM)-based coding agents increasingly rely on external memory to reuse prior debugging experience, repair traces, and repository-local operational knowledge. However, retrieved m…
FedHarmony: Harmonizing Heterogeneous Label Correlations in Federated Multi-Label LearningZhiqiang Kou, Junxiang Wu, Wenke Huang et al.
cs.LGApr 30, 2026
Federated Multi-Label Learning is a distributed paradigm where multiple clients possess heterogeneous multi-label data and perform collaborative learning under privacy constraints without sharing raw …
ITS-Mina: A Harris Hawks Optimization-Based All-MLP Framework with Iterative Refinement and External Attention for Multivariate Time Series ForecastingPourya Zamanvaziri, Amirhossein Sadr, Aida Pakniyat et al.
cs.LGcs.AIApr 30, 2026
Multivariate time series forecasting plays a pivotal role in numerous real-world applications, including financial analysis, energy management, and traffic planning. While Transformer-based architectu…
Linguistically Informed Multimodal Fusion for Vietnamese Scene-Text Image Captioning: Dataset, Graph Framework, and Phonological AttentionNhi Ngoc-Yen Nguyen, Anh-Duc Nguyen, Nghia Hieu Nguyen et al.
cs.CVcs.CLApr 30, 2026
Scene-text image captioning requires fusing three information streams -- visual features, OCR-detected text, and linguistic knowledge -- to generate descriptions that faithfully integrate text visible…
Kernelized Advantage Estimation: From Nonparametric Statistics to LLM ReasoningShijin Gong, Kai Ye, Jin Zhu et al.
cs.LGstat.MLApr 30, 2026
Recent advances in large language models (LLMs) have increasingly relied on reinforcement learning (RL) to improve their reasoning capabilities. Three approaches have been widely adopted: (i) Proximal…
Track Diffusion & Generative — Get notified when new papers are scored
Sign up free and get daily digests tailored to your research interests.
Sign up free