Linear causal discovery with interventional constraints

Guo, Zhigao and Dong, Feng (2026) Linear causal discovery with interventional constraints. Machine Learning, 115 (3). 35. ISSN 1573-0565 (https://doi.org/10.1007/s10994-026-06998-z)

[thumbnail of Guo-Dong-ML-2026-Linear-causal-discovery-with-interventional-constraints]
Preview
Text. Filename: Guo-Dong-ML-2026-Linear-causal-discovery-with-interventional-constraints.pdf
Final Published Version
License: Creative Commons Attribution 4.0 logo

Download (5MB)| Preview

Abstract

Incorporating causal knowledge and mechanisms is essential for refining causal models and improving downstream tasks, such as designing new treatments. In this paper, we introduce a novel concept in causal discovery, termed interven-tional constraints, which differs fundamentally from interventional data. While interventional data require direct perturbations of variables, interventional con-straints encode high-level causal knowledge in the form of inequality constraints on causal effects. For instance, in the Sachs dataset, Akt has been shown to be activated by PIP3, meaning PIP3 exerts a positive causal effect on Akt. Existing causal discovery methods allow enforcing structural constraints (e.g., requiring a causal path from PIP3 to Akt), but they may still produce incorrect causal con-clusions, such as learning that “PIP3 inhibits Akt.” Interventional constraints bridge this gap by explicitly constraining the total causal effect between vari-able pairs, ensuring learned models respect known causal influences. To formalize interventional constraints, we adopt a metric to quantify total causal effects for linear causal models and formulate the problem as a constrained optimization task, solved using a two-stage constrained optimization method. We evaluate our approach on real-world datasets and demonstrate that integrating interven-tional constraints not only improves model accuracy and ensures consistency with established findings, making models more explainable, but also facilitates the discovery of new causal relationships that would otherwise be costly to identify.