Object Weight Prediction Using RGBD Imaging and Semantic Metadata Integration


Abstract

We propose a novel dataset and method for predicting object weight using multimodal data captured with an RGBD camera. The dataset integrates RGB images, depth maps, and detailed metadata, facilitating a comprehensive analysis of the relationship between visual and physical properties of objects.
The data acquisition system was designed to ensure uniformity, reproducibility, and efficiency. It comprises a motorized turntable for precise multi-angle captures, a high-resolution RGBD camera for simultaneous acquisition of visual and depth information, and custom software for automation and synchronization. For each object, 12 RGB images and 12 corresponding depth maps were recorded at regular angular intervals, accompanied by metadata including exact weight measurements, material composition, and hierarchical taxonomy based on WordNet. This semantic organization supports advanced contextual and relational analyses of objects in scenes.
The dataset stands out for its multimodal richness, methodological rigor, and semantic interoperability, making it a valuable resource for advancing computer vision research. Applications include weight prediction, object recognition, and semantic scene understanding, particularly in studying the correlation between visual cues and material properties. This work lays a foundation for further exploration of multimodal approaches in vision-based object analysis.



Halim Djerroud ¹, Nico Steyn ², Abderraouf Benali ¹

¹Université Paris-Saclay, UVSQ, LISV, 78140, Vélizy-Villacoublay, France.
² Tshwane University of Technology. Department of Electrical Engineering, Pretoria, South Africa.


Paper (comming)   Dataset   Code
weight prediction rgb depth metadata

Demo


Dataset

The dataset is organized as follows:

       dataset_export/
            ├── metadata.json
            ├── images_depth
            ├── ....
            └── ....
            ├── images_gray
            ├── ....
            └── ....
            ├── images_rgb
            ├── ....
            └── ....
            └── README.md
            
            
weight prediction rgb depth metadata

Software