Toward Efficient Membership Inference Attacks against Federated Large Language Models: A Projection Residual Approach

Deng, Guilin; Chen, Silong; Luo, Yuchuan; Liu, Yi; Wang, Songlei; Cai, Zhiping; Liu, Lin; Jia, Xiaohua; Fu, Shaojing

Computer Science > Machine Learning

arXiv:2604.21197 (cs)

[Submitted on 23 Apr 2026]

Title:Toward Efficient Membership Inference Attacks against Federated Large Language Models: A Projection Residual Approach

Authors:Guilin Deng, Silong Chen, Yuchuan Luo, Yi Liu, Songlei Wang, Zhiping Cai, Lin Liu, Xiaohua Jia, Shaojing Fu

View PDF HTML (experimental)

Abstract:Federated Large Language Models (FedLLMs) enable multiple parties to collaboratively fine-tune LLMs without sharing raw data, addressing challenges of limited resources and privacy concerns. Despite data localization, shared gradients can still expose sensitive information through membership inference attacks (MIAs). However, FedLLMs' unique properties, i.e. massive parameter scales, rapid convergence, and sparse, non-orthogonal gradients, render existing MIAs ineffective. To address this gap, we propose ProjRes, the first projection residuals-based passive MIA tailored for FedLLMs. ProjRes leverages hidden embedding vectors as sample representations and analyzes their projection residuals on the gradient subspace to uncover the intrinsic link between gradients and inputs. It requires no shadow models, auxiliary classifiers, or historical updates, ensuring efficiency and robustness. Experiments on four benchmarks and four LLMs show that ProjRes achieves near 100% accuracy, outperforming prior methods by up to 75.75%, and remains effective even under strong differential privacy defenses. Our findings reveal a previously overlooked privacy vulnerability in FedLLMs and call for a re-examination of their security assumptions. Our code and data are available at $\href{this https URL}{link}$.

Comments:	This is the full version (including complete appendices and supplementary materials) of the paper accepted for publication at the 2026 IEEE Symposium on Security and Privacy
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2604.21197 [cs.LG]
	(or arXiv:2604.21197v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2604.21197

Submission history

From: Silong Chen [view email]
[v1] Thu, 23 Apr 2026 01:44:04 UTC (1,481 KB)

Computer Science > Machine Learning

Title:Toward Efficient Membership Inference Attacks against Federated Large Language Models: A Projection Residual Approach

Submission history

Access Paper:

Current browse context:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Toward Efficient Membership Inference Attacks against Federated Large Language Models: A Projection Residual Approach

Submission history

Access Paper:

Current browse context:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators