Publication | Closed Access
Coupled Detection and Trajectory Estimation for Multi-Object Tracking
275
Citations
20
References
2007
Year
Unknown Venue
EngineeringMachine LearningVideo ProcessingLocalizationImage AnalysisPattern RecognitionSpacetime TrajectoriesObject TrackingRobot LearningMultiple Object TrackingMachine VisionObject DetectionSpacetime Trajectory EstimationMoving Object TrackingComputer ScienceDeep LearningComputer VisionTrajectory EstimationEye TrackingTracking System
The study proposes a coupled optimization framework that jointly performs object detection and spacetime trajectory estimation for multi‑object tracking. The method formulates tracking as a hypothesis‑selection problem, iteratively selecting globally optimal spacetime trajectories using a pedestrian detector, enforcing non‑overlap constraints, and feeding back successful trajectories to guide future detections while maintaining efficiency through incremental updates and pruning. The resulting approach can automatically initialize and track a large and varying number of persons over long periods and through complex scenes with clutter, occlusions, and large‑scale background changes, and it can recover from mismatches and temporarily lost tracks.
We present a novel approach for multi-object tracking which considers object detection and spacetime trajectory estimation as a coupled optimization problem. It is formulated in a hypothesis selection framework and builds upon a state-of-the-art pedestrian detector. At each time instant, it searches for the globally optimal set of spacetime trajectories which provides the best explanation for the current image and for all evidence collected so far, while satisfying the constraints that no two objects may occupy the same physical space, nor explain the same image pixels at any point in time. Successful trajectory hypotheses are fed back to guide object detection in future frames. The optimization procedure is kept efficient through incremental computation and conservative hypothesis pruning. The resulting approach can initialize automatically and track a large and varying number of persons over long periods and through complex scenes with clutter, occlusions, and large-scale background changes. Also, the global optimization framework allows our system to recover from mismatches and temporarily lost tracks. We demonstrate the feasibility of the proposed approach on several challenging video sequences.
| Year | Citations | |
|---|---|---|
Page 1
Page 1