Publication | Open Access
Joint 2D-3D-Semantic Data for Indoor Scene Understanding
684
Citations
7
References
2017
Year
Geometric LearningEngineeringMachine Learning3D Computer VisionImage AnalysisData ScienceImage-based ModelingJoint 2D-3d-semantic DataComputational GeometryMachine VisionGeometric Feature ModelingLarge-scale Indoor SpacesSemantic AnnotationsComputer Science3D Object RecognitionComputer VisionNatural SciencesRegistered ModalitiesScene UnderstandingScene Modeling
We introduce a large‑scale indoor dataset that provides mutually registered 2D, 2.5D, and 3D modalities with instance‑level semantic and geometric annotations. The dataset covers over 6,000 m², includes more than 70,000 RGB images, corresponding depths, surface normals, semantic labels, global XYZ images (both regular and 360° equirectangular), camera metadata, and registered raw and semantically annotated 3D meshes and point clouds. The dataset facilitates joint and cross‑modal learning models and supports unsupervised methods that exploit regularities in large‑scale indoor spaces. The dataset is available at http://3Dsemantics.stanford.edu/.
We present a dataset of large-scale indoor spaces that provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotations. The dataset covers over 6,000m2 and contains over 70,000 RGB images, along with the corresponding depths, surface normals, semantic annotations, global XYZ images (all in forms of both regular and 360° equirectangular images) as well as camera information. It also includes registered raw and semantically annotated 3D meshes and point clouds. The dataset enables development of joint and cross-modal learning models and potentially unsupervised approaches utilizing the regularities present in large-scale indoor spaces. The dataset is available here: http://3Dsemantics.stanford.edu/
| Year | Citations | |
|---|---|---|
Page 1
Page 1