From the perspective of object state modeling, visual object tracking can be regarded as a unified process that combines object state estimation and object localization. In this framework, state ...
Open-vocabulary object detection (OVD) is a critical research area in computer vision, particularly for applications in autonomous driving and robotics. Many existing OVD methods adopt transformer ...