← Back ICRA 2026

Data-Efficient Constrained Robot Learning with Probabilistic Lagrangian Control

Shiming He, Yuzhe Ding

PDF

AI summary

Key figure (auto-extracted from paper)

GIBO-Lag eliminates oscillatory dynamics in constrained robot learning by using probabilistic inference to adaptively control the Lagrange multiplier, yielding safer and more data-efficient policy search.

Constrained robot learning Bayesian optimization Lagrangian relaxation Gaussian processes Data efficiency Safety-aware control

Problem

Lagrangian methods in constrained robot learning often exhibit oscillatory primal-dual updates that cause safety violations and slow convergence. Existing Bayesian optimization techniques fail to scale to high-dimensional policy spaces while maintaining strict safety guarantees.

Approach

The method combines gradient-information Bayesian optimization with a Jacobian Gaussian process model, using the posterior probability of constraint satisfaction to actively minimize the Lagrange multiplier while ensuring safety.

Key results

Reduces oscillatory dynamics in multiplier updates
Achieves lower regret in high-dimensional synthetic domains
Finds feasible policies in simulated and real robot tasks
Scales to 64-dimensional policy spaces without manual tuning

Why it matters

Enables safe, sample-efficient deployment of learning-based controllers on physical robots where trial-and-error is costly and hazardous.

Abstract

We propose a novel framework for data-efficient black-box robot learning under constraints. Our approach in- tegrates probabilistic inference with Lagrangian optimization. With the guide of a learned Gaussian process model, the La- grange multiplier is controlled by the probability of whether the constraints would be satisfied. This reduces the typical oscillations seen in primal-dual updates and therefore improves both data efficiency and safety during learning. Both synthetic results and robot experiments demonstrate that our method is a scalable and effective solution for constrained robot learning problems.

Index terms

Probabilistic Inference Reinforcement Learning Compliance and Impedance Control