← Back ICRA 2026

Latent-RAG: Identity Retrieval-Guided Latent Augmentation for Privacy-Preserving Person Re-Identification

Seung-hyeok Back, Eungi Lee, Hyung-Il Kim, Seok Bong Yoo

PDF

AI summary

Key figure (auto-extracted from paper)

Latent-RAG balances strong visual privacy and robust defense against recovery attacks with high person re-identification accuracy without requiring additional model training.

Person re-identification Privacy preservation Latent space manipulation Retrieval-augmented generation Recovery attack defense Visual obfuscation

Problem

Current privacy-preserving person re-ID methods either degrade recognition accuracy through heavy obfuscation or leak structural cues that remain vulnerable to image recovery attacks.

Approach

The framework retrieves identity-similar latent codes from a vector store, fuses them via inverse self-attention to maximize visual distortion, and applies gradient-based latent manipulation to preserve identity vectors while disrupting structural cues.

Key results

Achieves lowest PSNR and SSIM scores among baselines for superior visual obfuscation
Maintains high Rank-1 and mAP re-ID accuracy across Market-1501, MSMT17, and CUHK03
Demonstrates robust resistance to black-box recovery attacks using advanced reconstruction models
Operates effectively with frozen pretrained generator parameters, eliminating the need for additional training

Why it matters

It enables secure, real-time visual data processing for robotics and surveillance systems by effectively neutralizing recovery attacks without compromising identity recognition utility.

Abstract

Person re-identification (re-ID) is crucial for se- curity applications, including autonomous robots that monitor individuals via continuous image acquisition. Such data are transmitted to a database; however, if stored without adequate protection, they can be intercepted, posing privacy risks. In re- sponse, the existing methods balance privacy and accuracy, but protected images still reveal structural cues, such as silhouettes or edges. These methods rely on randomness to defend against recovery attacks, limiting the guarantee of complete protection. Thus, this work proposes latent retrieval-augmented genera- tion (RAG), an identity retrieval-guided latent augmentation framework for privacy-preserving person re-ID that balances the re-ID performance with privacy protection. The proposed method generates augmented codes that distort appearance and disrupt mapping to the original input by retrieving identity- similar latent codes and applying inverse self-attention, en- hancing its robustness to recovery attacks. Next, this approach employs gradient-based latent code manipulation to preserve identity vectors to maintain re-ID accuracy. The hierarchical latent codes are concurrently adjusted to eliminate structural cues that could threaten privacy. The experimental results demonstrate that Latent-RAG induces strong visual distortion, reliable re-ID accuracy and a robust defense against recovery attacks, even without additional training with a few frozen parameters in a pretrained generator. Our code is available at https://github.com/BACKAI/Latent-RAG.

Index terms

Deep Learning for Visual Perception Surveillance Robotic Systems Recognition