AI Navigate

Joint Optimization of Storage and Loading for High-Performance 3D Point Cloud Data Processing

arXiv cs.CV / 3/19/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • PcRecord is introduced as a unified storage format for 3D point clouds to reduce storage footprint and accelerate data loading and processing across large-scale datasets.
  • A multi-stage parallel data processing pipeline is proposed to optimize compute resource usage and achieve substantial speedups.
  • The approach handles diverse storage formats (PLY, XYZ, BIN) and demonstrates significant cross-dataset performance gains on GPU and Ascend hardware.
  • The work targets high-impact 3D vision applications such as autonomous driving, robotic perception, and augmented reality by enabling faster processing of large-scale point cloud data.

Abstract

With the rapid development of computer vision and deep learning, significant advancements have been made in 3D vision, partic- ularly in autonomous driving, robotic perception, and augmented reality. 3D point cloud data, as a crucial representation of 3D information, has gained widespread attention. However, the vast scale and complexity of point cloud data present significant chal- lenges for loading and processing and traditional algorithms struggle to handle large-scale datasets.The diversity of storage formats for point cloud datasets (e.g., PLY, XYZ, BIN) adds complexity to data handling and results in inefficiencies in data preparation. Al- though binary formats like BIN and NPY have been used to speed up data access, they still do not fully address the time-consuming data loading and processing phase. To overcome these challenges, we propose the .PcRecord format, a unified data storage solution designed to reduce the storage occupation and accelerate the processing of point cloud data. We also introduce a high-performance data processing pipeline equipped with multiple modules. By leveraging a multi-stage parallel pipeline architecture, our system optimizes the use of computational resources, significantly improving processing speed and efficiency. This paper details the im- plementation of this system and demonstrates its effectiveness in addressing the challenges of handling large-scale point cloud datasets.On average, our system achieves performance improvements of 6.61x (ModelNet40), 2.69x (S3DIS), 2.23x (ShapeNet), 3.09x (Kitti), 8.07x (SUN RGB-D), and 5.67x (ScanNet) with GPU and 6.9x, 1.88x, 1.29x, 2.28x, 25.4x, and 19.3x with Ascend.