3dgs-engineering-guide

star 105

Guide for deploying 3DGS from research to production: 10 industry verticals, engineering stack, GIS toolchain solutions, cross-platform deployment, and common pitfalls

jaccen By jaccen schedule Updated 5/20/2026

name: 3dgs-engineering-guide description: "Guide for deploying 3DGS from research to production: 10 industry verticals, engineering stack, GIS toolchain solutions, cross-platform deployment, and common pitfalls. References 713+ methods." version: 1.9.0 author: jaccen tags: ["3dgs", "gaussian-splatting", "engineering", "deployment", "digital-twin", "autonomous-driving"]

3DGS Engineering Guide

Bridging the gap from academic research to production deployment for 3D Gaussian Splatting.

Agent Instructions

When invoked, follow this workflow:

  1. Identify use case — determine application domain and constraints (platform, scale, real-time, budget)
  2. Recommend pipeline — select tools and pipeline from sections below
  3. Reference papers — point to methods in references/3dgs-methods-overview.md and references/methods-systems-apps.md
  4. Provide concrete next steps — actionable items, not generic advice
  5. Warn about pitfalls — highlight domain-specific failure modes from Section 5

1. Industry Application Landscape

1.1 Autonomous Driving Simulation

Maturity: Engineering | Players: aiSim, Li Auto mindVLA, NVIDIA DRIVE Sim

Pipeline: Real-world scan (LiDAR + multi-camera) → 3DGS reconstruction → Sensor simulation → HIL/SIL testing

Key papers: GSDrive, GS-Playground (10^4 FPS, RSS 2026), GS-Surrogate, FieryGS, Nighttime AD GS, Real2Sim (4DGS + differentiable MPM), GS-SCNet, Ground4D, ULF-Loc (CVPR 2026 highlight), ConFixGS [2605.09688], FRUC [2605.29997] (feed-forward cooperative driving), DeGO [2605.28587] (deformable Gaussian occupancy, CVPR 2026)

Quality bar: Sensor sim error < 0.02, LiDAR > 30 FPS, LPIPS < 0.1, Radar ±3 dB

Notes: ConFixGS provides plug-and-play confidence-aware diffusion repair for +3.68 dB PSNR on Waymo, applicable to pretrained feedforward models; FRUC enables calibration-free multi-agent reconstruction; DeGO decouples rigid/non-rigid motion for human-centric occupancy; LiDAR sim requires opaque surface Gaussians; OpenDRIVE co-registration mandatory; nighttime needs separate IR-adjacent training; FastGS enables 100-second training with comparable quality — applicable for rapid iteration in AD sensor sim pipeline

1.2 Digital Twin & Smart City

Maturity: Commercial | Players: SuperMap, FantoVision, LCC

Pipeline: Aerial + streetview → Large-scale 3DGS → S3M conversion → GIS integration → IoT fusion

Key papers: DiffSoup, Street Gaussians, GlobalSplat, Large-Scale HQ Head, ArtiTwinSplat (interactable digital twin from RGB-D)

Standards: S3M (Chinese GIS), OGC 3D Tiles, glTF/glb, CityGML

Notes: City-level = 10^9–10^10 Gaussians; WGS84→ENU→3DGS alignment critical; streaming LOD mandatory; S3M needs custom exporter

1.3 Cultural Heritage & Museum

Maturity: Commercial

Pipeline: Controlled-lighting photography → High-fidelity 3DGS → Color calibration → Digital archive → VR/AR exhibition

Quality: Sub-mm geometry, ΔE < 2 (CIE76), 2048×2048+ texture, lossless compression

Notes: Dome/array lighting > flash; attach DOI/catalog metadata; store raw images + COLMAP + checkpoint + compressed .ply

1.4 Film & Game Production

Maturity: Exploration | Players: Volcengine, UE team, Tencent

Pipeline: Multi-camera capture → 3DGS → Mesh extraction (SuGaR/2DGS) → UE5 import → Virtual production

Notes: 3DGS→mesh needed for DCC; SuGaR (TSDF) > naive marching cubes; material separation (GOR-IS/SSD-GS) for relighting; 4DGS (GauFRe/DeformGS) for temporal consistency; UE5 Nanite+Lumen experimental

1.5 E-commerce 3D Display

Maturity: Commercial

Pipeline: Turntable photography → 3DGS → Compression (MobileGS/GETA-3DGS) → Web AR preview

Requirements: < 50 MB, browser-renderable (WebGPU/WebGL2 via gsplat.js), < 5s load on 4G

Notes: 50x+ compression needed for web; mesh fallback for low-end; AR needs mesh (Quick Look/Scene Viewer)

1.6 Industrial Inspection

Maturity: Engineering

Pipeline: Drone capture → 3DGS → AI defect detection → Measurement → Report

Key papers: EnerGS (LiDAR-3DGS fusion), RGS (CBCT inspection), E2EGS (end-to-end field)

Notes: GPS geotagging for defect correlation; EnerGS for LiDAR+cam fusion; detect ≥ 5mm at 10m; CAAC/FAA compliance

1.7 AR/VR/MR

Maturity: Exploration

Pipeline: Real-time headset scan → 3DGS → 6DoF tracking + low-latency render → MR overlay

Key papers: Mobile Avatar, GS-Playground, CoherentRaster (subpixel rasterization for light field)

Notes: < 20ms motion-to-photon; VkSplat for cross-VR; hybrid 3DGS+mesh for occlusion physics; Vision Pro = ARKit+Metal, Quest = OpenXR+Vulkan

1.8 BIM & Architecture

Maturity: Engineering | Players: LumenBIM × LCC

Pipeline: TLS + drone → 3DGS → IFC alignment → As-built verification → LCC delivery

Key papers: BrepGaussian (B-rep aware), CADFS (CAD feature saliency)

Notes: ICP registration before overlay; IFC coordinate mapping; LCC proprietary streaming format

1.9 Robotics & Embodied AI

Maturity: Rapidly Growing

Pipeline: 3DGS environment → Physics sim (GS-Playground) → Policy learning (sim-to-real) → Deployment

Key papers:

  • GaussianGrasper (IEEE T-RO 2024) — Open-vocabulary grasping via SAM+CLIP feature distillation into 3DGS
  • GraspSplats (CoRL 2024) — Zero-shot manipulation with 3D feature splatting; scene editing support
  • ManiGaussian (ECCV 2024) — Dynamic GS world model for multi-task manipulation via future scene prediction
  • GSMem (arXiv 2026) — 3DGS as persistent spatial memory for zero-shot embodied exploration & QA
  • RoboSplat (RSS 2025) — Diverse data generation via Gaussian primitive manipulation; 87.8% success rate
  • VR-Robo (RAL 2025) — Real-to-Sim-to-Real for visual robot navigation without depth sensors
  • GS-Playground (RSS 2026) — 10^4 FPS batch 3DGS + parallel physics for robot learning
  • Forecast-GS (arXiv 2026) — Predictive 3DGS for goal-directed manipulation planning

Sub-directions:

  1. Grasping & Manipulation — GaussianGrasper, GraspSplats, ManiGaussian, RoboSplat
  2. Navigation & Locomotion — VR-Robo, GS-Playground, MAGICIAN
  3. Embodied Reasoning — GSMem (spatial memory), Forecast-GS (predictive planning), ESI-Bench (spatial intelligence evaluation)
  4. Driving Policy RL — GSDrive (3DGS environment for reinforcement learning), SpaceDrive (VLM spatial awareness for AD)
  5. Embodied Simulation — LEGS (embodied GS simulation, arXiv:2606.01458)

Toolchain: ROS2 (point cloud/depth topics), MuJoCo/Isaac Sim physics backend, GS-Playground (high-throughput sim)

Notes: 10^4 FPS sim transforms sample efficiency; ROS2 as point cloud/depth topics; debias with real-world fine-tuning; GraspSplats demonstrates NeRF unsuitable for scene changes — prefer 3DGS for manipulation tasks requiring scene editing

1.10 Military Simulation

Maturity: Early, classified | Security: GuardMarkGS (unified watermarking + edit deterrence for 3DGS assets)

Requirements: Air-gapped deployment, indigenous tools, > 60 FPS, sub-meter terrain, multi-spectral (visible+IR+SAR)

Notes: No foreign cloud/API; DEM/DSM fusion; no sensitive data in checkpoints

World Model Integration

3DGS is emerging as a core 3D primitive for world models across multiple domains:

Domain Method 3DGS Role Maturity
Autonomous Driving Simulation RAD, DLWM, X-World Twin digital world for RL/IL training Production (XPeng, Momenta)
Robot Manipulation GS-World, Spark 2.0 Differentiable simulation engine Research → Early Production
Interactive 3D World Generation GWM, FlashWorld Dynamics modeling primitive Research
Web-Native World Model Rendering Visionary WebGPU rendering platform Open Source (Shanghai AI Lab)

Engineering considerations:

  • Sim2Real gap: 3DGS simulation fidelity directly impacts policy transfer quality (RAD shows closed-loop RL in 3DGS reduces IL causal confusion)
  • Real-time constraint: World models require ≥20fps for interactive use; 3DGS rendering speed is often the bottleneck
  • Physical consistency: Standard 3DGS lacks physics; GS-World adds differentiable physics as simulation engine layer
  • Scalability: Urban-scale world models need distributed 3DGS (BlitzGS pattern) + streaming (PD-4DGS pattern)
  • Web deployment: Visionary demonstrates WebGPU + ONNX as viable path for browser-native world models

2. Engineering Technology Stack

2.1 Data Acquisition

Device Type Use Case Key Requirements
DSLR/Mirrorless High-fidelity capture Manual exposure, fixed focal length
Drone (RTK) Aerial survey > 80% forward, > 60% side overlap
LiDAR AD simulation, inspection Time-synced with cameras
Mobile (LiDAR) Quick indoor scan iPad Pro/iPhone for rapid scouting
TLS Architectural, industrial Sub-mm accuracy for as-built

Software: COLMAP (SfM+MVS standard), ORB-SLAM3/BLEPS (visual SLAM), LIO-SAM/FAST-LIO2 (LiDAR SLAM), FreeMoCap (AGPL-3.0, markerless MoCap from webcams, outputs .trc/.c3d/.fbx, pip install freemocap)

Key considerations: Camera calibration consistency, manual/HDR exposure, > 60% image overlap, GCPs for georeferencing, overcast preferred

2.2 Reconstruction

Framework Language Best For
original 3DGS CUDA/Python Research, benchmarking
gsplat PyTorch/CUDA Custom training, differentiable
2DGS CUDA/Python Mesh-extraction pipelines
Scaffold-GS CUDA/Python Large-scale scenes
OpenGaussian OpenGL Non-CUDA rendering
Scale Gaussians Training GPU
Object/room 100K–1M 10–30 min RTX 4070
Building 1M–10M 1–3 h RTX 4090
City block 10M–100M 3–7 h A100 80GB
City district 100M–1B 12–24 h A100/H100 cluster

Compression: HAC (100x), MobileGS (CPU-runnable), GETA-3DGS (5x), MesonGS++ (34x, SOTA rate-distortion), AdaGScale (adaptive), CodecSplat (ultra-compact feed-forward, 20–108 KiB/scene, ArXiv 2605.25563)

Rule: No compression for prototyping → add when deployment demands; validate compressed vs original.

2.3 Post-processing

Mesh extraction: SuGaR (TSDF, clean meshes), 2DGS+Poisson, Marching Cubes (baseline, blobby), NeuS2-GS (hybrid SDF+Gaussian)

Material separation: GOR-IS (albedo/shading/normals), SSD-GS (scatter+shadow) — enables relighting

Relighting: GS³ (SH-based), GaRe, LumiMotion — critical for virtual production and e-commerce

Relighting (feed-forward): F-RNG (ArXiv 2605.25975) — feed-forward relightable 3DGS, ~25× faster than optimization-based relighting; recommended for production relighting pipelines where iterative optimization is prohibitive

Editing: GaussianEditor, ObjectMorpher, TransSplat, SuperSplat (PlayCanvas, MIT, browser-based: inspect/edit/compress/publish PLY & SOG; https://superspl.at/editor)

Toolchain: splat-transform (PlayCanvas, MIT, CLI) — PLY→SOG (~20x), PLY→streamed SOG (LOD), -K collision mesh (.collision.glb); npm install -g @playcanvas/splat-transform

MoCap input: FreeMoCap (AGPL-3.0) — webcam MoCap → SMPL/FLAME → drive GaussianAvatar/EmoTaG; same rig for MoCap + 3DGS training images; note: AGPL-3.0 not MIT-compatible for commercial use

2.4 Deployment

Engine Backend Platform 3DGS Native?
original 3DGS CUDA NVIDIA GPU Yes
VkSplat Vulkan Cross-platform Yes
GSeurat Vulkan C++23 Cross-platform Yes
BlitzGS Multi-GPU (parity sharding) Distributed Yes
msplat Metal macOS/iOS Yes
tortuise CPU (Rust) Any CPU Yes
PlayCanvas Engine WebGL2/WebGPU Web Yes (first-class)
gsplat.js WebGPU/WebGL2 Web Yes
@playcanvas/react WebGL2/WebGPU Web Yes (Splats component)
UE5 plugin DX12 Desktop/Console Plugin
Unity renderer Vulkan/DX12 Multi-platform Plugin

Streaming: CAGS (VQ + LoD, ~7x, chunked with global codebook), AV1-3DGS (AV1 motion vectors for SfM, 63% training reduction), PD-4DGS (progressive 4D streaming, DASH/HLS-compatible), progressive loading (coarse→fine), view-dependent prioritization, 20–50 Mbps for 1080p

Formats: .ply (uncompressed), .splat (compact binary, web-friendly), .sog (PlayCanvas, ~20x, streaming LOD, chunked with manifest), .spz (Niantic, ~10x, mobile/AR), custom (HAC/MesonGS++), future: 3D Tiles + Gaussian extension

2.5 Integration

GIS: SuperMap S3M extension, Cesium ion, ArcGIS (experimental)

BIM: IFC/STEP via BrepGaussian, Navisworks federated review, Revit as-built comparison

AD: OpenDRIVE + 3DGS co-registration, aiSim 6, ROS2 sensor topics

Game engines: UE5 (experimental Nanite-compatible), Unity (gsplat package), Godot (community, early), PlayCanvas (MIT, first-class 3DGS + collision + navmesh + physics + WebXR, @playcanvas/react)

Robotics: ROS2 scene server, MuJoCo/Isaac Sim, GS-Playground

2.6 The GIS Toolchain Gap: "3DGS Looks Good but Does Nothing"

The #1 pain point blocking 3DGS from production use (based on industry practitioner analysis, particularly WebGIS engineer xjjdjj).

After expensive drone surveys and 3DGS reconstruction, the resulting PLY file cannot: measure distances, cut cross-sections, calculate volumes, compute surface areas, query semantics, or overlay real-time video.

5 Root Causes:

  1. Format mismatch: 3DGS = unstructured Gaussian primitives; GIS expects structured geometry (mesh faces, point clouds with topology). No standard conversion layer.
  2. No spatial reference: 3DGS lives in arbitrary local coordinates; GIS requires WGS84/projected CRS.
  3. No semantic layer: No notion of "this group is a building" / "this surface is a road."
  4. No analysis primitives: GIS operates on mesh faces/edges/vertices; ray-Gaussian intersection is not a standard GIS operation.
  5. No real-time data fusion: 3DGS is static; live video overlay requires camera pose estimation + temporal sync + occlusion handling.

6 Solution Categories:

  1. Distance measurement: Raycasting through Gaussian field → surface point → Euclidean distance; or KNN surface estimation; project to vertical/horizontal plane first
  2. Cross-section clipping: Plane-Gaussian intersection; GPU shader real-time clipping; use cases: geological, architectural, pipeline
  3. Volume calculation: Voxelization (occupancy grid × voxel volume) or Gaussian integral (probability mass above reference plane); needs closed-surface assumption
  4. Surface area: Multi-view projected area (SH degree-0) or mesh extraction first (SuGaR/2DGS)
  5. Semantic enrichment: SAM/SAGA segment 2D → project to 3D Gaussians; or CLIP embeddings for semantic queries; map to CityGML/OGC
  6. Real-time video fusion: Camera calibration + SLAM pose → frame-to-3D projection → depth z-buffering → temporal progressive update

PlayCanvas Pipeline (3 CLI commands — first end-to-end open-source making 3DGS scenes interactable in browser; source: PlayCanvas Blog 2026-04):

splat-transform scene.ply --seed-pos 0,1,0 --voxel-params 0.05,0.1 \
  --voxel-carve 1.6,0.2 -K scene.sog
npx glb-to-navmesh scene.collision.glb navmesh.bin
# Step 3: Bake lightness probes (in-engine, ~15s, ~40KB JSON)
Component Tool Output Size
Collision mesh splat-transform -K (voxelization + flood-fill) .collision.glb ~1 MB
Nav mesh recast-navigation navmesh.bin ~100 KB
Lightness grid Probe script (cubemap luminance, Rec.601) lightness.json ~40 KB
Streamed SOG splat-transform (LOD partitioning) Multi-chunk .sog/ + manifest ~5% of PLY

Key insights: Voxelization + flood-fill = sealed collision meshes (no manual cleanup); lightness probes as JSON (no runtime raytracing, mobile-friendly); SOG streaming enables mobile deployment of million-Gaussian scenes.

GIS Toolchain Solutions:

Task Tool Notes
PLY → 3D Tiles libTileSplat, supermap-3dtiles Cesium-compatible
PLY → collision mesh splat-transform -K Voxelization + flood-fill
PLY → nav mesh splat-transform + recast-navigation Collision GLB → Recast
PLY → compressed SOG splat-transform 20x, streaming LOD
Web 3DGS editor SuperSplat Browser-based, PWA
Spatial analysis Custom Python (NumPy + plyfile) Build custom GIS layer
Semantic labeling SAGA SAM → 3D projection
Lightness baking PlayCanvas probe script ~15s bake, ~40KB
Volume calculation Custom voxelizer + PLY parser Not yet standard
Cesium rendering gsplat.js, cesium-3dgs-plugin Three.js limited native support

Standards progress: CSM group standard for 3DGS modeling initiated (2026-04); S3M extended for 3DGS; 3D Tiles extension proposals; Spatial-TTT (ECCV 2026): streaming spatial memory for continuous city-scale understanding; Holi-Spatial (ICML 2026 Oral): automated 4M+ spatial data from video streams


3. Best Practices

3.1 Quality Assurance

Geometric: Chamfer Distance, F-Score (τ ∈ {1mm, 5mm, 10mm}), normal consistency

Visual: PSNR/SSIM/LPIPS — WARNING: insufficient for engineering use; human evaluation required for sign-off

Engineering metrics: sensor sim fidelity vs real data, real-time FPS (30/60/90+ by domain), memory footprint, time-to-first-render, rate-distortion curves

3.2 Scalability

  • Scene splitting: octree/voxel grid, ~1M Gaussians/cell, overlap zones for seams
  • LOD: multi-resolution hierarchy, distance-based switching, view-dependent refinement
  • Streaming: camera pose → spatial index → LOD + frustum culling → compress → transfer → decompress & render
Scenario Compression Ratio Quality
Prototyping None 1x None
Desktop GETA-3DGS 5x Minimal
Mobile MobileGS / CAGS 10–50x Moderate
Web MesonGS++ + .splat/SPZ 30–50x Acceptable
Large-scale HAC + progressive / CAGS 50–100x Significant

3.3 Cross-Platform

Platform Backend Fallback Max Scene Real-time?
Desktop (NVIDIA) CUDA Vulkan 10M+ 60 FPS
Desktop (AMD/Intel) VkSplat GSeurat 5M+ 30 FPS
Desktop (CPU) tortuise (Rust) 500K No
macOS (Apple) msplat (Metal) 3M 20 FPS
iOS Metal 1M 15 FPS
Android Vulkan WebGPU 1M 15 FPS
Web WebGPU WebGL2 500K–2M Varies
VR (Quest 3) Vulkan (OpenXR) 2M 72 Hz
VR (Vision Pro) Metal 3M 90 Hz

Checklist: target GPU family, VRAM fallback to lower LOD, color space (sRGB/linear/HDR), min-spec hardware, memory leak testing over extended sessions

3.4 Data Pipeline Automation

CI/CD: Data validation → COLMAP SfM+MVS → 3DGS training → quality gate (PSNR/F-Score) → compression → deploy to CDN → alert on regression

Quality gates: PSNR < 28 dB = flag; geometric drift > 5mm = flag; coverage gaps; floater/needle artifacts

Versioning: Raw images + COLMAP in git; checkpoints (.ply) in git LFS/DVC; semantic versioning; changelog per version

Monitoring: FPS P50/P95/P99, Gaussian count, file size, data freshness, user engagement metrics


4. Decision Trees

4.1 By Use Case

  • AD simulation → aiSim 6 / CARLA + 3DGS plugin + OpenDRIVE + ROS2
  • Digital twin / Smart city → SuperMap GIS + LCC streaming / S3M
  • Cultural heritage → Polycam (capture) + COLMAP + 3DGS; Luma AI (preview)
  • E-commerce → gsplat.js / three.js + compression
  • Film / Game → UE5 plugin + SuGaR (mesh) + material separation
  • Industrial inspection → DJI + COLMAP + 3DGS + YOLO/SAM
  • Robotics → GS-Playground (sim) + ROS2
  • Avatar / MoCap → FreeMoCap + GaussianAvatar/EmoTaG + SMPL/FLAME
  • BIM / Architecture → LCC + IFC alignment + as-built verification
  • Research → original 3DGS + gsplat + custom extensions

4.2 By Platform

  • Desktop (NVIDIA) → CUDA backend
  • Desktop (AMD/Intel) → VkSplat / GSeurat
  • Mobile (iOS/Android) → VkSplat / msplat (Metal) / WebGPU
  • Web → gsplat.js / three.js / PlayCanvas Engine + @playcanvas/react
  • VR headset → OpenXR+Vulkan (Quest) / Metal (Vision Pro)

4.3 By Scene Scale

  • < 100K Gaussians → original 3DGS, 5–15 min on RTX 3070+
  • < 10M → Scaffold-GS + GETA-3DGS (5x), 30 min–2h on RTX 4090
  • < 100M → Spatial partitioning + MesonGS++ (34x), 2–7h on A100
  • > 1B → LCC + S3M + HAC (100x), distributed 12–48h on GPU cluster

5. Common Engineering Pitfalls

  • Over-fitting to training views: Artifacts at novel viewpoints. Fix: more viewpoints at different elevations, depth/opacity regularization, validate on held-out views.
  • Floating artifacts: Semi-transparent blobs in empty space. Fix: depth regularization, opacity pruning (α < threshold), post-processing depth filter.
  • Memory explosion at scale: GPU OOM > 10M Gaussians. Fix: spatial partitioning from day one, Scaffold-GS anchors, streaming for > 10M.
  • Sensor sim fidelity ignored: High PSNR but inaccurate LiDAR/Radar. Fix: validate sensor outputs vs real data; opaque surface Gaussians for LiDAR; calibrate Radar cross-section.
  • CUDA lock-in: Cannot deploy to AMD/Intel/Mobile. Fix: VkSplat/GSeurat (Vulkan), msplat (Metal), tortuise (Rust CPU), brush (Rust/WebGPU/Burn, most complete cross-platform: Win/Mac/Linux/Android/Web, 4.3k stars, faster than gsplat); abstract CUDA behind interface.
  • Sorting bottleneck for semi-transparent scenes: Alpha-compositing requires depth sort, which becomes the bottleneck for scenes with many overlapping semi-transparent Gaussians. Fix: DP-GES (Depth Peeling for sort-free surfel rendering) eliminates sorting entirely by using layered depth peeling; applicable when surfel representation is acceptable.
  • No version control for 3DGS: Cannot reproduce/track changes. Fix: git LFS or DVC; separate metadata (YAML) from binary; semantic versioning.
  • Static lighting assumption: Breaks under different lighting. Fix: plan relighting upfront; GOR-IS/SSD-GS decomposition; GS³/GaRe SH-based relighting; F-RNG for feed-forward relighting at ~25× the speed of optimization-based approaches.
  • Temporal inconsistency: Video flicker, object jumping. Fix: 4DGS (GauFRe, DeformGS, ScubeGS); temporal smoothness loss.
  • Under-estimated compression artifacts: Visible holes, color shifts. Fix: rate-distortion benchmarks first; domain-specific metrics (not just PSNR); uncompressed reference for comparison.
  • Hierarchical tile partitioning/rasterization scale mismatch: Can break exact alpha compositing. Fix: use HiGS-style dual-scale architecture with conservative coverage test.

6. Reference Papers

Domain Methods
AD Simulation GSDrive, GS-Playground (RSS 2026), GS-Surrogate, FieryGS, GS-SCNet, Ground4D, ULF-Loc (CVPR 2026), Nighttime AD, Real2Sim, ConFixGS (+3.68 dB Waymo), StreetNVS (multi-sensor NVS, arXiv:2606.01590)
World Models GWM, FlashWorld, GS-World, Visionary, RAD, DLWM, X-World
Digital Twin DiffSoup, Street Gaussians, GlobalSplat, Large-Scale HQ Head
Volumetric Medical GaussianPile (CVPR 2026, slice-aware PSF projection for CT/cBCT/ABUS/LSM; focus-aware physical model with finite-thickness sensitivity map; additive rasterization (not alpha-blending) for volumetric intensity accumulation; ~16-26× compression over voxel grids; 11× faster than NeRF; 8min avg convergence; supports ultrasound/microscopy/MRI)
Dynamic SLAM Flow4DGS-SLAM (optical flow-guided 4DGS temporal consistency), GGD-SLAM (ICRA 2026, generalizable motion model for dynamic SLAM)
Inspection EnerGS, RGS, E2EGS
Simulation PhysGaussian, Gaussian Splashing, GS-Playground, SAM3D-Phys [2605.30239] (generative 3D priors + physics for simulatable objects), RAF (CVPR 2026 Findings, representation abstraction framework bridging 3DGS and physics engines; 3-stage pipeline: (1) asset abstraction—static world via Gaussian segmentation→collision mesh, dynamic world via opacity field sampling→physics particles; (2) unified simulation kernel—MPM/SPH/PBD/rigid-body/articulated-body multi-solver coupling for heterogeneous interaction (fluid-soft body, cloth-complex geometry, robot-rigid body); (3) visual recoupling—physics state→3DGS center+covariance update, mesh barycentric binding, UE5 Lumen+ray-traced rendering; 5 demo scenarios: SPH fluid+3DGS garden, SPH-MPM fluid+soft donut, robot arm+rigid objects, PBD cloth+statue draping, rigid fruits+3DGS container), FreeForm (CVPR 2026, particle-skinned eigenmodes for elastic deformation on 3DGS; enables soft-body simulation without mesh proxies)
Relighting GS³, GaRe, SSD-GS, LumiMotion, GOR-IS, Ambient-Robust Inverse Rendering [2605.30250] (active RGB-NIR for material decomposition)
Cross-platform VkSplat, GSeurat (Vulkan C++23), msplat (Metal), tortuise (Rust CPU), brush (Rust/WebGPU, 4.3k stars), AdaGScale, BlitzGS (distributed)
Feed-Forward SplatWeaver [2605.07287] (expert-routing, 30% budget reduction, 301 FPS, no calibration; code: github.com/yecongwan/SplatWeaver), ZPressor [2505.23734] (100+ input-view scalability via bottleneck-aware compression), VolSplat [2509.19297] (voxel-aligned prediction for multi-view consistency), PM-Loss [2506.05327] (pointmap loss for feed-forward depth quality), DéjàView [2605.30215] (looped transformer, inference-time compute knob K), HeadsUp [2605.04035] (UV-parameterized head, 10K+ subjects, Apple), Z-Order GS [2605.13465] (CVPR 2026 Oral, Z-order Morton curve spatial indexing for cache-coherent Gaussian traversal; sparse attention (grouped+top-k) reduces O(N²)→O(N log N); 1000× faster than per-scene optimization; 2-3× fewer Gaussians vs DepthSplat/AnySplat), ZipSplat [2606.05102] (token-based feed-forward; k-means clustering decouples Gaussian count from pixel grid; additive rasterization for volumetric rendering; ~6× fewer Gaussians, +2.1 dB PSNR SOTA on DL3DV/RealEstate10K; pose-free; DA3-Giant backbone; coupling init+progressive view training+单向Chamfer geometry loss for stability)
BIM/CAD BrepGaussian, CADFS, GS-CAD, GaussCAD, KDH-CAD (knowledge-data hybrid, arXiv:2606.01702)
Editing GaussianEditor, ObjectMorpher, TransSplat, AlbedoEdit (video-level albedo editing, arXiv:2606.01362)
Security GuardMarkGS (watermarking + edit deterrence)
Rendering CoherentRaster (subpixel, light field), 3DGEER (exact ray, ICLR 2026), SparseOIT (order-independent transparency), DP-GES (sort-free surfel rendering via depth peeling, ArXiv 2605.25345), View-Dependent Splatting Kernels [2605.25426] (learned view-dependent kernels, SIGGRAPH 2026), DDF-GS (ray-query GI via Gaussian field, arXiv:2606.00817), D4RT (CVPR 2026 Best Paper, differentiable rasterization milestone for 3DGS rendering pipeline)
Streaming CAGS (~7x VQ+LoD), AV1-3DGS (63% training reduction), PD-4DGS (progressive 4D streaming), MGS [2603.19234] (Matryoshka continuous LoD, single model multi-fidelity)
Acceleration AdpSplit [2605.06876] (error-driven adaptive split, drop-in for 9-22% training speedup), HiGS (NVIDIA, 15.8x rendering speedup, arXiv:2606.00352)
Generative Optimization CAdam (SIGGRAPH 2026, context-adaptive densification for generative distillation pipelines)
Compression HAC (100x), MobileGS (CPU), GETA-3DGS (5x), MesonGS++ (34x), AdaGScale, CodecSplat (ultra-compact latent coding, 20–108 KiB/scene, ArXiv 2605.25563)
Relighting GS³, GaRe, SSD-GS, LumiMotion, GOR-IS, F-RNG (feed-forward, ~25× faster, ArXiv 2605.25975)

See knowledge base: references/3dgs-methods-overview.md, references/methods-core.md, references/methods-semantic-editing.md, references/methods-systems-apps.md


7. Terminology

  • Cardinality Gaussian Expert Routing: Routing mechanism where discrete experts predict different numbers of Gaussians per pixel based on scene complexity (cf. SplatWeaver)
  • Bottleneck-Aware Multi-View Compression: Compressing redundant multi-view latent tokens before Gaussian prediction to keep feed-forward 3DGS tractable as input view count grows (cf. ZPressor)
  • Voxel-Aligned Prediction: Predicting Gaussians in a shared voxel-space reference frame instead of independently from pixels, reducing duplicate or inconsistent splats across views (cf. VolSplat)
  • Pointmap Loss: Supervising depth-derived geometry in 3D point coordinates rather than only pixel-wise depth values, improving boundary smoothness without inference overhead (cf. PM-Loss)
  • Skew-Normal Splatting: Using Azzalini skew-normal distribution instead of symmetric Gaussian for asymmetric boundary representation
  • Stochastic Budget Training: Training strategy that randomly samples Gaussian budget each iteration to learn ordered, LoD-compatible representations (cf. MGS)

Part of Awesome-Gaussian-Skills

Related Skills

  • 3dgs-method-compare — Method comparison (use for selecting methods for deployment)
  • 3dgs-code-reviewer — Code review (use for detecting deployment-critical bugs)
  • 3dgs-mcp-renderer — MCP rendering protocol (use for real-time rendering integration)
  • 3dgs-spatial-agent — Spatial intelligence (use for agent-driven deployment scenarios)

Guardrail: Do Not Apply From Memory

Do NOT try to apply the logic, method data, bug patterns, or technical details described in this skill from memory. Always read the SKILL.md and referenced files from disk before producing any output. The knowledge base is updated frequently; stale memory may produce outdated, inaccurate, or fabricated results.

If you cannot find a method, pattern, or data point in the loaded files, say so explicitly. Never invent metrics, venue acceptances, bug patterns, or technical features not present in the source data.

Install via CLI
npx skills add https://github.com/jaccen/Awesome-Gaussian-Skills --skill 3dgs-engineering-guide
Repository Details
star Stars 105
call_split Forks 9
navigation Branch main
article Path SKILL.md
More from Creator