VLA-Forget: Vision-Language-Action Unlearning for Embodied Foundation Models

Ranjan, Ravi; Polyzou, Agoritsa

Computer Science > Computer Vision and Pattern Recognition

arXiv:2604.03956 (cs)

[Submitted on 5 Apr 2026 (v1), last revised 22 Apr 2026 (this version, v2)]

Title:VLA-Forget: Vision-Language-Action Unlearning for Embodied Foundation Models

Authors:Ravi Ranjan, Agoritsa Polyzou

View PDF HTML (experimental)

Abstract:Vision-language-action (VLA) models are emerging as embodied foundation models for robotic manipulation, but their deployment introduces a new unlearning challenge: removing unsafe, spurious, or privacy-sensitive behaviors without degrading perception, language grounding, and action control. In OpenVLA-style policies, behavior is produced through a fused visual encoder, a cross-modal projector, and a language backbone that predicts tokenized robot actions, so undesirable knowledge can be distributed across perception, alignment, and reasoning/action layers rather than confined to a single module. Consequently, partial unlearning applied only to the vision stack or only to the language backbone is often insufficient, while conventional unlearning baselines designed for standalone vision or language models may leave residual forgetting or incur unnecessary utility loss in embodied settings. We propose VLA-Forget, a hybrid unlearning framework that combines ratio-aware selective editing for perception and cross-modal specificity with layer-selective reasoning/action unlearning for utility-preserving forgetting. VLA-Forget jointly optimizes three objectives: targeted forgetting, perceptual preservation, and reasoning retention, through staged updates over the visual encoder, projector, and upper action-generating transformer blocks. Across forget-set behavior probes and retain-task evaluations, VLA-Forget improves forgetting efficacy by 10%, preserves perceptual specificity by 22%, retains reasoning and task success by 9%, and reduces post-quantization recovery by 55% relative to strong unlearning baselines.

Comments:	18 pages, 9 figures, Accepted to ACL-2026, KnowFM
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2604.03956 [cs.CV]
	(or arXiv:2604.03956v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2604.03956

Submission history

From: Ravi Ranjan Kumar [view email]
[v1] Sun, 5 Apr 2026 04:23:18 UTC (22,815 KB)
[v2] Wed, 22 Apr 2026 18:43:28 UTC (22,815 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:VLA-Forget: Vision-Language-Action Unlearning for Embodied Foundation Models

Submission history

Access Paper:

Current browse context:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:VLA-Forget: Vision-Language-Action Unlearning for Embodied Foundation Models

Submission history

Access Paper:

Current browse context:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators