Publications

You can also find my articles on my Google Scholar profile.

Causal meta-analysis: Rethinking the foundations of evidence-based medicine

Published in arXiv, 2026

Meta-analysis, by synthesizing effect estimates from multiple studies conducted in diverse settings, stands at the top of the evidence hierarchy in clinical research. Yet, conventional approaches based on fixed- or random-effects models lack a causal framework, which may limit their interpretability and utility for public policy. Incorporating causal inference reframes meta-analysis as the estimation of well-defined causal effects on clearly specified populations, enabling a principled approach to handling study heterogeneity. We show that classical meta-analysis estimators have a clear causal interpretation when effects are measured as risk differences. However, this breaks down for nonlinear measures like the risk ratio and odds ratio. To address this, we introduce novel causal aggregation formulas that remain compatible with standard meta-analysis practices and do not require access to individual-level data. To evaluate real-world impact, we apply both classical and causal meta-analysis methods to 500 published meta-analyses. While the conclusions often align, notable discrepancies emerge, revealing cases where conventional methods may suggest a treatment is beneficial when, under a causal lens, it is in fact harmful.
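To make the contrast between linear and nonlinear effect measures concrete, here is a minimal Python sketch. It is not the aggregation formula from the paper. The two studies and their risks are hypothetical, the risks are treated as known rather than estimated, and sample-size shares stand in for the target population mix. The sketch only illustrates that a weighted average of study-level risk differences equals the risk difference of the combined population, while an average of log risk ratios can land on the other side of 1 from the ratio of population-level risks.

```python
import math

# Hypothetical studies: (sample size, risk under treatment, risk under control).
# Risks are treated as known here; real studies only provide estimates.
STUDIES = [
    (100, 0.04, 0.01),  # low baseline risk, study-level risk ratio 4.0
    (100, 0.40, 0.50),  # high baseline risk, study-level risk ratio 0.8
]


def weights(studies):
    """Sample-size shares, used as an assumed target population mix."""
    total = sum(n for n, _, _ in studies)
    return [n / total for n, _, _ in studies]


def pooled_risks(studies):
    """Risks under treatment and control in the combined population."""
    w = weights(studies)
    p1 = sum(wk * s[1] for wk, s in zip(w, studies))
    p0 = sum(wk * s[2] for wk, s in zip(w, studies))
    return p1, p0


def averaged_risk_difference(studies):
    """Weighted average of the study-level risk differences."""
    w = weights(studies)
    return sum(wk * (s[1] - s[2]) for wk, s in zip(w, studies))


def averaged_log_risk_ratio(studies):
    """Weighted average of log risk ratios, mapped back to the ratio scale."""
    w = weights(studies)
    return math.exp(sum(wk * math.log(s[1] / s[2]) for wk, s in zip(w, studies)))


def population_risk_ratio(studies):
    """Ratio of the combined-population risks."""
    p1, p0 = pooled_risks(studies)
    return p1 / p0


if __name__ == "__main__":
    p1, p0 = pooled_risks(STUDIES)
    print("averaged RD:", averaged_risk_difference(STUDIES), "population RD:", p1 - p0)
    print("averaged log RR (as ratio):", averaged_log_risk_ratio(STUDIES))
    print("population RR:", population_risk_ratio(STUDIES))
```

With these made-up numbers the two risk differences coincide, whereas the averaged ratio is above 1 and the population ratio is below 1. Classical meta-analysis typically uses inverse-variance weights instead of sample-size shares, so this is only a simplified picture of the phenomenon.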

Recommended citation: Berenfeld, C., Boughdiri, A., Colnet, B., van Amsterdam, W. A., Bellet, A., Khellaf, R., Scornet, E. & Josse, J. (2025). "Causal meta-analysis: Rethinking the foundations of evidence-based medicine." Conference Article.
Download Paper

Principled Federated Random Forests for Heterogeneous Data

Published in arXiv, 2026

Random Forests (RF) are among the most powerful and widely used predictive models for centralized tabular data, yet few methods exist to adapt them to the federated learning setting. Unlike most federated learning approaches, the piecewise-constant nature of RF prevents exact gradient-based optimization. As a result, existing federated RF implementations rely on unprincipled heuristics: for instance, aggregating decision trees trained independently on clients fails to optimize the global impurity criterion, even under simple distribution shifts. We propose FedForest, a new federated RF algorithm for horizontally partitioned data that naturally accommodates diverse forms of client data heterogeneity, from covariate shift to more complex outcome shift mechanisms. We prove that our splitting procedure, based on aggregating carefully chosen client statistics, closely approximates the split selected by a centralized algorithm. Moreover, FedForest allows splits on client indicators, enabling a non-parametric form of personalization that is absent from prior federated random forest methods. Empirically, we demonstrate that the resulting federated forests closely match centralized performance across heterogeneous benchmarks while remaining communication-efficient.
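The idea of choosing a split from aggregated client statistics can be illustrated with a small Python sketch. This is not the FedForest procedure itself. It assumes a regression tree with squared-error impurity, a single feature, and a candidate threshold grid shared by all clients (the grid and all function names are hypothetical). Under those assumptions, each client only sends counts, sums, and sums of squares of its outcomes, and the server recovers the same split a centralized search over that grid would pick.

```python
def client_stats(xs, ys, thresholds):
    """Per-threshold (count, sum, sum of squares) of y for x <= t, plus totals."""
    left = []
    for t in thresholds:
        sel = [y for x, y in zip(xs, ys) if x <= t]
        left.append((len(sel), sum(sel), sum(y * y for y in sel)))
    total = (len(ys), sum(ys), sum(y * y for y in ys))
    return left, total


def sse(n, s, q):
    """Sum of squared errors around the mean, from (count, sum, sum of squares)."""
    return q - s * s / n if n > 0 else 0.0


def best_split(all_stats, thresholds):
    """Server side: add up client statistics and minimize the global impurity."""
    tn = sum(total[0] for _, total in all_stats)
    ts = sum(total[1] for _, total in all_stats)
    tq = sum(total[2] for _, total in all_stats)
    best = None
    for i, thr in enumerate(thresholds):
        ln = sum(left[i][0] for left, _ in all_stats)
        ls = sum(left[i][1] for left, _ in all_stats)
        lq = sum(left[i][2] for left, _ in all_stats)
        rn, rs, rq = tn - ln, ts - ls, tq - lq
        if ln == 0 or rn == 0:
            continue  # skip splits that leave one side empty
        cost = sse(ln, ls, lq) + sse(rn, rs, rq)
        if best is None or cost < best[0]:
            best = (cost, thr)
    return best[1]


# Two hypothetical clients whose feature ranges differ (covariate shift).
GRID = [1.5, 2.5, 3.5, 4.5, 5.5]
CLIENT_A = ([1, 2, 3, 4], [0, 0, 0, 1])
CLIENT_B = ([3, 4, 5, 6], [0, 1, 1, 1])

federated = best_split(
    [client_stats(*CLIENT_A, GRID), client_stats(*CLIENT_B, GRID)], GRID
)
pooled_xs = CLIENT_A[0] + CLIENT_B[0]
pooled_ys = CLIENT_A[1] + CLIENT_B[1]
centralized = best_split([client_stats(pooled_xs, pooled_ys, GRID)], GRID)
```

In this toy example `federated` and `centralized` agree because the statistics are additive across clients. Any gap between federated and centralized splits in practice would come from choices this sketch fixes by assumption, such as how candidate thresholds are obtained.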

Recommended citation: Khellaf, R., Scornet, E., Bellet, A., & Josse, J. (2026). "Principled Federated Random Forests for Heterogeneous Data." Conference Article.
Download Paper

Handling Covariate Mismatch in Federated Linear Prediction

Published in arXiv, 2026

Federated learning enables institutions to train predictive models collaboratively without sharing raw data, addressing privacy and regulatory constraints. In the standard horizontal setting, clients hold disjoint cohorts of individuals and collaborate to learn a shared predictor. Most existing methods, however, assume that all clients measure the same features. We study the more realistic setting of covariate mismatch, where each client observes a different subset of features, which typically arises in multicenter collaborations with no prior agreement on data collection. We formalize learning a linear prediction under client-wise missing-completely-at-random (MCAR) patterns and develop two modular approaches tailored to the dimensional regime and communication budget. In the low-dimensional setting, we propose a plug-in estimator that approximates the oracle linear predictor by aggregating sufficient statistics to estimate the covariance and cross-moment terms. In higher dimensions, we study an impute-then-regress strategy: (i) impute missing covariates using any exchangeability-preserving imputation procedure, and (ii) fit a ridge-regularized linear model on the completed data. We provide asymptotic and finite-sample learning rates for our predictors, explicitly characterizing their behaviour with the global dimension, the client-specific feature partition, and the distribution of samples across sites.
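A rough Python sketch of the plug-in idea follows. It is a simplified illustration, not the estimator analyzed in the paper. It assumes uncentered second moments with no intercept, an outcome observed by every client, and at least one client observing each pair of features. All names and data are hypothetical. Each client contributes sums and counts only for the features it measures, and the server solves the resulting linear system.

```python
def aggregate_moments(clients, d):
    """clients: list of (observed feature indices, rows, outcomes)."""
    s_xx = [[0.0] * d for _ in range(d)]
    n_xx = [[0] * d for _ in range(d)]
    s_xy = [0.0] * d
    n_xy = [0] * d
    for observed, rows, ys in clients:
        # Only sums and counts over observed features would leave a client.
        for row, y in zip(rows, ys):
            for a, j in enumerate(observed):
                s_xy[j] += row[a] * y
                n_xy[j] += 1
                for b, k in enumerate(observed):
                    s_xx[j][k] += row[a] * row[b]
                    n_xx[j][k] += 1
    if any(n == 0 for r in n_xx for n in r):
        raise ValueError("some feature pair is never observed together")
    sigma = [[s_xx[j][k] / n_xx[j][k] for k in range(d)] for j in range(d)]
    cross = [s_xy[j] / n_xy[j] for j in range(d)]
    return sigma, cross


def solve(a, b):
    """Gauss-Jordan elimination with partial pivoting for a small system."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("moment matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def plug_in_coefficients(clients, d):
    sigma, cross = aggregate_moments(clients, d)
    return solve(sigma, cross)


# Hypothetical noise-free data with y = 2 * x1 - x2. Each client holds the same
# four individuals' values but measures a different subset of features.
YS = [2.0, -1.0, 1.0, 5.0]
CLIENTS = [
    ([0], [[1.0], [0.0], [1.0], [2.0]], YS),                          # x1 only
    ([1], [[0.0], [1.0], [1.0], [-1.0]], YS),                         # x2 only
    ([0, 1], [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, -1.0]], YS),  # both
]
BETA = plug_in_coefficients(CLIENTS, 2)
```

The toy data are built so that every client shares the same empirical distribution, which is why `BETA` recovers the coefficients (2, -1) up to rounding. With real data, each moment is estimated from a different subset of clients, so the assembled matrix is only an approximation and may need regularization.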

Recommended citation: Ayme, A., & Khellaf, R. (2026). "Handling Covariate Mismatch in Federated Linear Prediction." Conference Article.
Download Paper

Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation

Published in arXiv, 2025

Causal inference typically assumes centralized access to individual-level data. Yet, in practice, data are often decentralized across multiple sites, making centralization infeasible due to privacy, logistical, or legal constraints. We address this problem by estimating the Average Treatment Effect (ATE) from decentralized observational data via a Federated Learning (FL) approach, allowing inference through the exchange of aggregate statistics rather than individual-level data. We propose a novel method to estimate propensity scores via a federated weighted average of local scores using Membership Weights (MW), defined as probabilities of site membership conditional on covariates. MW can be flexibly estimated with parametric or non-parametric classification models using standard FL algorithms. The resulting propensity scores are used to construct Federated Inverse Propensity Weighting (Fed-IPW) and Augmented IPW (Fed-AIPW) estimators. In contrast to meta-analysis methods, which fail when any site violates positivity, our approach exploits heterogeneity in treatment assignment across sites to improve overlap. We show that Fed-IPW and Fed-AIPW perform well under site-level heterogeneity in sample sizes, treatment mechanisms, and covariate distributions. Theoretical analysis and experiments on simulated and real-world data demonstrate clear advantages over meta-analysis and related approaches.
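The weighted average of local propensity scores can be sketched in a few lines of Python. This is an illustration under strong simplifying assumptions, not the estimation procedure from the paper: there is a single binary covariate, and membership weights and local scores are plain empirical frequencies instead of fitted classification models. The site names and counts are hypothetical. By the law of total probability, the pooled propensity score is P(T=1 | X=x) = Σ_k P(site=k | X=x) · P(T=1 | X=x, site=k), which the sketch checks numerically.

```python
# Hypothetical counts per site and covariate value x: (individuals, treated).
COUNTS = {
    "site_a": {0: (50, 25), 1: (40, 40)},  # everyone with x = 1 is treated here
    "site_b": {0: (50, 10), 1: (60, 10)},
}


def local_propensity(site, x):
    """Share of treated individuals with covariate x within one site."""
    n, treated = COUNTS[site][x]
    return treated / n


def membership_weight(site, x):
    """Empirical probability of belonging to `site` given covariate x."""
    n_site = COUNTS[site][x][0]
    n_all = sum(c[x][0] for c in COUNTS.values())
    return n_site / n_all


def federated_propensity(x):
    """Membership-weighted average of the local propensity scores."""
    return sum(membership_weight(s, x) * local_propensity(s, x) for s in COUNTS)


def pooled_propensity(x):
    """Propensity computed as if all data were centralized, for comparison."""
    n_all = sum(c[x][0] for c in COUNTS.values())
    treated_all = sum(c[x][1] for c in COUNTS.values())
    return treated_all / n_all


if __name__ == "__main__":
    for x in (0, 1):
        print(x, federated_propensity(x), pooled_propensity(x))
```

At x = 1 the local score in `site_a` equals 1, so positivity fails within that site, while the aggregated score is 0.5. This is the sense in which differences in treatment assignment across sites can improve overlap.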

Recommended citation: Khellaf, R., Bellet, A., & Josse, J. (2025). "Federated Causal Inference from Multi-Site Observational Data via Propensity Score Aggregation." Conference Article.
Download Paper | Download Slides

Federated Causal Inference: Multi-Study ATE Estimation beyond Meta-Analysis

Published in AISTATS, 2024

In this paper, we study Federated Causal Inference, an approach to estimate treatment effects from decentralized data across studies, or data centers. We compare three classes of Average Treatment Effect (ATE) estimators derived from the Plug-in G-Formula, ranging from simple meta-analysis to one-shot and multi-shot federated learning, the latter leveraging the full data to learn the outcome model (albeit requiring more communication). Focusing on Randomized Controlled Trials (RCTs), we derive the asymptotic variance of these estimators for linear models. Our results provide practical guidance on selecting the appropriate estimator for various scenarios, including heterogeneity in sample sizes, covariate distributions, treatment assignment schemes, and center effects. We validate these findings with a simulation study.

Recommended citation: Khellaf, R., Bellet, A., & Josse, J. (2024). "Federated Causal Inference: Multi-sources ATE estimation." Conference Article.
Download Paper | Download Slides

Rémi Khellaf
