Zero-Shot Recognition of Test Tube Types by Automatically Collecting and Labeling RGB Data
Yu Tang, Weiwei Wan, Hao Chen, Masaki Matsushita, Jun Takahashi, Takeyuki Kotaka, Kensuke Harada
AI summary
Problem
Current vision-based lab automation relies on supervised learning, requiring extensive manually labeled datasets that break down when new or non-standardized test tubes are introduced.
Approach
The system first predicts rack slot occupancy globally, then locally crops and segments cap images using SAM2, automatically clustering them with ViT features and HDBSCAN to incrementally build a recognition library without human supervision.
Key results
- Zero-shot recognition of novel test tube types without prior training
- Automatic on-the-fly clustering and library expansion via HDBSCAN
- Robust slot occupancy estimation using simulated data and Depth Anything V2
- Successful real-world validation on a robotic manipulator with diverse tube geometries
Why it matters
Empowers adaptive laboratory automation by eliminating the need for costly retraining, allowing robots to instantly handle new experimental apparatus.
Abstract
This work presents a method for automatically detecting and recognizing test tube types in a rack. It leverages automatic segmentation, clustering, and labeling processes to eliminate the need for explicitly preparing training data. These processes are addressed by using combined global prediction and local cropping, where global prediction estimates the slot occupation states of a rack, and local cropping extracts tube pictures in the local regions of each slot for clustering and labeling. With the help of the proposed method, the robotic tube manipulation system no longer needs tailored data and explicit training in the presence of new tubes, thus achieving flexibil- ity and efficiency. Experimental evaluations conducted with a RealSense D405 camera and the UFactory xArm Lite6 robot manipulator confirm the method’s effectiveness in accurately identifying novel test tube types under real-world conditions.