← Back to Search

Structure-Preserving Graph Contrastive Learning for Mathematical Information Retrieval

β˜†β˜†β˜†β˜†β˜†Mar 9, 2026arxiv β†’

Chun-Hsi Ku, Hung-Hsuan Chen

Abstract

This paper introduces Variable Substitution as a domain-specific graph augmentation technique for graph contrastive learning (GCL) in the context of searching for mathematical formulas. Standard GCL augmentation techniques often distort the semantic meaning of mathematical formulas, particularly for small and highly structured graphs. Variable Substitution, on the other hand, preserves the core algebraic relationships and formula structure. To demonstrate the effectiveness of our technique, we apply it to a classic GCL-based retrieval model. Experiments show that this straightforward approach significantly improves retrieval performance compared to generic augmentation strategies. We release the code on GitHub.\footnote{https://github.com/lazywulf/formula_ret_aug}.

Explain this paper

Ask this paper

Loading chat…

Rate this paper