-
100 relevanceCRAFT: Adapting VLA Models to Contact-Rich Manipulation Via Force-Aware Curriculum Fine-TuningThe paper directly addresses the topic by proposing a framework specifically designed to adapt VLA models for contact-rich manipulation.
-
100 relevanceAudio-VLA: Adding Contact Audio Perception to Vision-Language-Action Model for Robotic ManipulationThe paper directly proposes a Vision-Language-Action (VLA) model specifically designed to enhance contact-rich manipulation by incorporating audio perception.
-
100 relevanceFD-VLA: Force-Distilled Vision-Language-Action Model for Contact-Rich ManipulationThe paper explicitly focuses on developing a Vision-Language-Action (VLA) model specifically for contact-rich manipulation.
-
100 relevanceEnhancing VLA Precision in Robotic Manipulation Via FiLM-Based Force/Torque-Vision IntegrationThe paper directly proposes a method to enhance VLA models specifically for contact-rich robotic manipulation tasks using Force/Torque integration.
-
100 relevanceLearning End-To-End Dexterous Arm-Hand VLA Policies with Shared Autonomy: DexGrasp AI Copilot for Efficient TeleoperationThe paper directly addresses the development of an end-to-end VLA policy for dexterous arm-hand manipulation using tactile feedback and force-adaptive actions, which is a core example of contact-rich manipulation.
-
100 relevanceMLA: A Multisensory Language�Action Model for Multimodal Understanding and Forecasting in Robotic ManipulationThe paper explicitly focuses on a Multisensory Language-Action model designed specifically to improve complex and contact-rich robotic manipulation by integrating tactile tokens and other sensory modalities.
-
95 relevanceDexora: Open-Source VLA for High-DoF Bimanual DexterityThe paper presents a VLA system specifically designed for high-DoF bimanual dexterity, which is inherently centered on contact-rich manipulation.
-
95 relevanceNeuroVLA: Surgical Scenario-Aware Learning of Debulking Skills in Endoscopic Robotic Neurosurgery Via Vision-Language-Action ModelThe paper explicitly proposes a Vision-Language-Action (VLA) model for surgical debulking, which inherently involves contact-rich manipulation tasks such as grasping and transferring.
-
90 relevanceIMPACT: Intelligent Motion Planning with Acceptable Contact Trajectories Via Vision-Language ModelsThe paper directly addresses contact-rich manipulation by leveraging Vision-Language Models (VLMs) to determine acceptable contacts and guide motion planning.
-
85 relevanceRethinking the Practicality of Vision-Language-Action Model: A Comprehensive Benchmark and an Improved BaselineThe paper directly addresses Vision-Language-Action (VLA) models for various manipulation tasks across different embodiments, which is central to contact-rich manipulation.
-
85 relevanceDAM-VLA: A Dynamic Action Model-Based Vision-Language-Action Framework for Robot ManipulationThe paper presents a VLA framework that explicitly addresses the need for precise manipulation by separating gross arm movement from fine gripper control, which is fundamental for contact-rich tasks.
-
85 relevanceOmniVLA: Physically-Grounded Multimodal VLA with Unified Multi-Sensor Perception for Robotic ManipulationThe paper proposes a VLA model integrating non-RGB sensors (radar, infrared, audio) which are critical for overcoming visual occlusions and perceiving physical interactions typical of contact-rich manipulation.
-
85 relevanceManipForce: Force-Guided Policy Learning with Frequency-Aware Representation for Contact-Rich ManipulationThe paper directly addresses contact-rich manipulation using a multimodal transformer policy combining vision and force data, although it focuses on Vision-Force-Action rather than incorporating Language.
-
85 relevanceIVRA: Improving Visual-Token Relations for Robot Action Policy with Training-Free Hint-Based GuidanceThe paper focuses on improving the spatial understanding of VLA models for precise robotic manipulation across benchmarks like LIBERO, which is highly relevant to the requirements of contact-rich tasks.
-
75 relevanceAugVLA-3D: Depth-Driven Feature Augmentation for Vision-Language-Action ModelsThe paper focuses on improving VLA models' spatial understanding via depth augmentation, which is critical for precise robotic manipulation, although it does not explicitly address the tactile or force-feedback aspects typical of 'contact-rich' tasks.
-
75 relevanceRealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AIThe paper presents a comprehensive VLA platform and benchmark for humanoid robots, which are central to contact-rich manipulation, though it focuses more on the infrastructure and Sim2Real transfer than specific contact-rich techniques.
-
75 relevanceGoal-VLA: Image-Generative VLMs as Object-Centric World Models Empowering Zero-shot Robot ManipulationThe paper proposes a VLA framework for robotic manipulation using generative VLMs, but it focuses more on high-level goal generation and spatial reasoning than specifically on contact-rich dynamics.
-
65 relevanceVLA-Reasoner: Empowering Vision-Language-Action Models with Reasoning Via Online Monte Carlo Tree SearchThe paper focuses on improving VLA performance for long-horizon general robotic manipulation via MCTS and reasoning, but does not explicitly address the specific challenges of contact-rich manipulation such as force control or tactile feedback.
-
45 relevanceThe Better You Learn, the Smarter You Prune: Towards Efficient Vision-Language-Action Models Via Differentiable Token PruningWhile the paper focuses on improving the efficiency of VLA models used in robotic manipulation, it addresses computational overhead rather than the specific physical or control challenges associated with contact-rich manipulation.
-
40 relevanceIntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human�Robot InteractionWhile the paper focuses on VLA models for manipulation and HRI, it emphasizes high-level intention reasoning rather than the low-level physics or control challenges specific to contact-rich manipulation.
-
30 relevanceEveryDayVLA: A Vision-Language-Action Model for Affordable Robotic ManipulationWhile the paper discusses Vision-Language-Action (VLA) models for robotic manipulation, it focuses on hardware affordability and general task success rather than the specific challenges of contact-rich manipulation.
-
30 relevanceShapeForce: Low-Cost Soft Robotic Wrist for Contact-Rich ManipulationThe paper focuses on a hardware sensor for contact-rich manipulation but does not mention or utilize Vision-Language-Action (VLA) models.
-
20 relevanceExploiting Vulnerabilities: Universal Adversarial Attacks on Vision-Language-Action Models in RoboticsWhile the paper discusses Vision-Language-Action (VLA) models in robotics, its focus is on adversarial security attacks rather than the mechanics or implementation of contact-rich manipulation.
-
10 relevanceContact Detection and Manipulation with a Shape-Memory Alloy Based Soft GripperWhile the paper addresses contact-rich manipulation via soft robotics and sensing, it does not involve Vision-Language-Action (VLA) models or AI-driven policy learning.
-
10 relevanceVSL-Skin: Individually Addressable Phase-Change Voxel Skin for Variable-Stiffness and Virtual Joints Bridging Soft and Rigid RobotsThe paper focuses on hardware materials for variable stiffness and morphological control, whereas the topic of interest is high-level Vision-Language-Action (VLA) models.