3D Gaussian Splatting (3DGS) has recently demonstrated outstanding performance in 3D reconstruction. However, its scalability to large scenes remains limited by single-GPU memory constraints. We propose ScaleGS, a scalable distributed training framework for large-scale 3DGS with constant-degree cross-GPU communication. (1) We first present a median-guided binary partitioning algorithm and pixel-tile parallelism to reduce memory pressure on a single GPU. To address the boundary artifacts caused by partitioning, we introduce an autonomous partition growth mechanism that maintains global Gaussian uniqueness and cross-GPU parameter synchronization. (2) To resolve the scalability challenges, we design a greedy GPU-Tile remapping strategy based on pixel-tile parallelism to achieve \(O(1)\) cross-GPU communication complexity for nearly all GPUs in representative scenes. (3) Our framework finally introduces adaptive load balancing that periodically monitors workloads and efficiently migrates Gaussians between neighboring GPUs with negligible overhead. Evaluations show that ScaleGS outperforms state-of-the-art methods, achieving up to 20% faster training and approximately 20% model size reduction on 8 Tesla P40 GPUs without compromising reconstruction quality.
BibTex Code Here