术语库 · 印刻万物 TOP3DGS

Glossary

A Map of Gaussian Splatting Concepts

52 cross-linked terms layered for beginners, practitioners, and researchers—same entry, your choice of depth.

View

54 entries

TechniqueBeginner

3D Gaussian Splatting (3DGS)

Also known as：Gaussian Splatting / 3DGS

A revolutionary 3D scene representation and real-time rendering technique. It explicitly models environments using millions of anisotropic Gaussian ellipsoids, each parameterized with covariance, opacity, and spherical harmonics. Coupled with a highly efficient differentiable rasterization pipeline, it achieves photorealistic novel view synthesis at exceptional frame rates.

Primarily utilized during high-fidelity 3D reconstruction and real-time rendering phases, serving as a fundamental pillar for modern spatial computing.

#Training#Real-time#Paper

Further reading

3DGS 原论文

ConceptBeginner

Splat

Also known as：Gaussian / Splatting Primitive

The fundamental 3D primitive comprising a 3DGS scene. It acts conceptually like a semi-transparent droplet of paint suspended in 3D space. Shaped as an anisotropic ellipsoid, its color and opacity dynamically shift depending on the viewing angle to simulate realistic lighting.

Creators and algorithms continuously add, remove, or modify these primitives during the optimization phase to closely approximate real-world environments.

#Optimization#Viewing

Further reading

原始概念出处 (EWA Splatting)

TechniqueIntermediate

Structure-from-Motion (SfM)

Also known as：SfM

A computer vision technique that automatically estimates camera poses and extracts a sparse 3D point cloud from a sequence of 2D images. In the 3DGS workflow, SfM provides the crucial initial camera parameters and base structure required for subsequent initialization.

Deployed during the pre-processing stage immediately after data capture. Calculations are typically automated by open-source libraries like COLMAP.

#Capture#Optimization

Further reading

COLMAP 官方文档

TechniqueIntermediate

Multi-View Stereo (MVS)

Also known as：MVS

A 3D reconstruction algorithm operating downstream of SfM. It utilizes stereo matching techniques on image sets with known camera parameters to generate dense point clouds. While traditional pipelines rely on MVS for mesh extraction, 3DGS efficiently bypasses this expensive step.

The core step in traditional mesh modeling pipelines, though standard Gaussian workflows typically omit it or use it solely for benchmarking purposes.

#Training#Case Study

Further reading

MVS 综述

TechniqueBeginner

Neural Radiance Fields (NeRF)

Also known as：NeRF

A technique utilizing Multilayer Perceptrons (MLPs) to implicitly represent continuous 3D scenes. Given a viewing direction and spatial coordinates, the neural network outputs color and volume density. NeRF catalyzed the novel view synthesis revolution, acting as the direct predecessor to 3DGS. Radiance Fields' overview ties the original 2020 Berkeley work to later acceleration projects such as Instant-NGP, which shortened practical training times for many scenes.

Frequently utilized by researchers as a baseline metric. Due to sluggish rendering speeds, it is progressively being replaced by 3DGS in real-time interactive scenarios.

#Paper#Real-time

Further reading

NeRF 原始项目页

FormatBeginner

Point Cloud

Also known as：Points

A collection of discrete data points in three-dimensional space, typically containing coordinate geometry alongside attributes like color or reflectance. Within 3DGS, the sparse point cloud generated by SfM serves as the initial seeding ground for Gaussian ellipsoids.

Present throughout the initial stages of the spatial computing pipeline. It is also the direct output format for various LiDAR sensors and scanning devices.

#Capture#Cultural Heritage

Further reading

点云库 (PCL) GitHub

FormatBeginner

Polygon Mesh

Also known as：Mesh / 网格

The dominant model representation in traditional 3D engines, composed of vertices, edges, and polygonal faces (typically triangles) forming a continuous surface shell. Unlike the volumetric and scattered nature of 3DGS, meshes demand strict surface topology, struggling with hair or translucency.

The native language of mainstream game engines and modeling software like Blender or Unreal. Seamlessly bridging meshes with Gaussian splats remains a critical industry objective.

#Viewing#Editing

Further reading

SuGaR (GS 转 Mesh)

ConceptExpertPending verification

Covariance Matrix

Also known as：Covariance / Σ

The mathematical expression controlling the spatial morphology of an individual Gaussian primitive. In 3DGS context, it dictates the scaling and rotation of the ellipsoid across X, Y, and Z axes, allowing it to conform to shapes ranging from minute dust to flat walls.

Operates within the foundational gradient calculations of model training, meticulously ensuring that ellipsoids maintain mathematical validity (positive semi-definite) during transformations.

#Optimization#Training

MetricBeginnerPending verification

Opacity (Alpha)

Also known as：Alpha / 透明度

A scalar value defining the physical solidness of a Gaussian primitive, ranging from 0 (completely transparent) to 1 (fully opaque). During rasterization, colors from overlapping splats are blended based on their alpha values. Superfluous, highly transparent splats are pruned out.

Determines the visual clarity of the scene and serves as the primary metric for the algorithm to auto-regulate primitive count by pruning redundant volumes.

#Optimization#Viewing

TechniqueIntermediate

Spherical Harmonics (SH)

Also known as：SH

A set of orthogonal basis functions defined on the surface of a sphere, utilized to faithfully approximate view-dependent color variations. In 3DGS, each splat holds SH coefficients rather than a single color, enabling realistic specular highlights based on the observer's angle.

Responsible for high-fidelity lighting and reflections. Typically divided into degrees; higher degrees yield richer chromatic detail but significantly increase memory consumption.

#Optimization#Real-time

Further reading

球谐函数在图形学中的应用

WorkflowIntermediatePending verification

Densification

Also known as：克隆与分裂 / Cloning and Splitting

The 'growth' phase of the Adaptive Density Control strategy. When the model detects missing details or oversized splats with high variance during training, densification triggers. It clones existing splats to fill empty spaces or splits massive ones, thereby dynamically enhancing geometric resolution.

Occurs strictly within the core training loop. It acts as the critical automated replication mechanism granting 3DGS its extraordinary visual clarity.

#Training#Optimization

TechniqueExpert

3D Gaussian Splatting as Markov Chain Monte Carlo (3DGS-MCMC)

Also known as：3DGS as MCMC / Gaussian Splatting MCMC

In plain wordsReframes splat optimization as stochastic search over discrete Gaussian sets via MCMC-style proposals and accept/reject moves, with explicit noise learning rates plus scale and opacity regularizers atop the Inria reference trainer.

DetailsNeurIPS 2024 Spotlight work by UBC Vision reframes 3DGS optimization as Markov Chain Monte Carlo to stabilize adaptive density control. The public codebase extends the Inria reference trainer with CLI flags such as cap_max, scale_reg, opacity_reg, noise_lr, and init_type while inheriting the bundled non-commercial license and Ubuntu 20.04 test matrix.

Introduced

2024 · Shakiba Kheradmand et al. (UBC Vision)

"3D Gaussian Splatting as Markov Chain Monte Carlo"[src]

#Training#Optimization#Paper

↓ Prerequisites

↑ Go deeper

Related tools

inria-gaussian-splatting

Key papers

arxiv.org/abs/2404.09591

Further reading

WorkflowIntermediatePending verification

Pruning

Also known as：裁剪 / Culling

The 'culling' phase of the Adaptive Density Control strategy. To prevent infinite primitive inflation from overwhelming VRAM, the algorithm periodically inspects the scene during training. It forcibly deletes invisible splats with ultra-low opacity or overly massive, ineffective ones, ensuring model efficiency.

Working symbiotically with densification throughout the training timeline, it serves as the safeguard mechanism for controlling final file sizes and rendering frame rates.

#Training#Optimization

TechniqueExpertPending verification

Adaptive Density Control (ADC)

Also known as：ADC

The macro-regulatory algorithm overseeing both densification and pruning behaviors. By calculating the average magnitude of positional gradients, it diagnoses which 3D spatial regions are under-reconstructed or over-reconstructed, dynamically managing splat distribution to intelligently resolve scene clarity without human intervention.

Functions as the brain of the 3DGS training engine, directly dictating the evolutionary speed and qualitative leap from sparse point clouds to perfect scenes.

#Training#Optimization

TechniqueExpert

Differentiable Rasterization

Also known as：Tile-based Rasterizer

The secret weapon behind 3DGS's blistering rendering speeds. It partitions the screen into 16×16 tiles, rapidly sorting and culling visible splats. Because this rasterization process is mathematically differentiable, pixel-level errors effortlessly propagate backward, instructing underlying splat properties on how to correct themselves.

The bridge connecting forward visual rendering with backward gradient optimization, leveraging traditional computer graphics ingenuity to shatter AI computational bottlenecks.

#Real-time#Training

Further reading

CUDA 核心实现库

ConceptExpertPending verification

Anisotropy

Also known as：Anisotropic

The property of exhibiting different values when measured in different directions. In 3DGS, primitives are not perfect spheres; they are anisotropic ellipsoids capable of independent scaling across X, Y, and Z axes. This allows them to flatten drastically to simulate complex structural surfaces efficiently.

Represents the massive leap in representational efficiency for 3DGS when compared against traditional voxels or isotropic point cloud models.

#Optimization#Paper

ConceptIntermediate

Anchor Point

Also known as：Neural Anchor

Feature mounting nodes introduced in advanced models like Scaffold-GS. Instead of isolating millions of splats, the system deploys sparse anchors via voxel grids. Each anchor controls attached local splats through a neural network, creating a hierarchical structure that dramatically enhances noise resistance.

An advanced network architecture tailored for complex indoor/outdoor scenes, effectively addressing the artifact flaws foundational 3DGS exhibits during abrupt viewing angle shifts.

#Paper#Indoor#Outdoor

Further reading

Scaffold-GS 论文

TechniqueIntermediate

4D Gaussian Splatting (4DGS)

Also known as：Dynamic GS / 4DGS

A technological branch integrating the temporal axis (T) into 3D Gaussians. It typically employs deformation field neural networks to capture the motion trajectories, deformations, and life cycles of splats over time. This enables ultra-high framerate reconstruction and rendering of dynamic real-world scenes.

The next-generation technological solution for spatial video, holographic performances, and motion capture, currently undergoing intense industrial research and development.

#Spatial Media#Paper

Further reading

4DGS 代表性论文

TechniqueExpert

Mip-Splatting

Also known as：Mip-Splat

An improved algorithm designed to eliminate high-frequency flickering and artifacts produced by 3DGS during camera zooming or movement. By introducing low-pass filters in both 2D and 3D dimensions, it constrains frequency sampling rates, ensuring reconstructions remain perfectly smooth and anti-aliased at any scale.

Massively elevates the visual experience during free-roaming on Web platforms. It is a critical optimization required for deploying commercial-grade applications.

#Viewing#Optimization

Further reading

Mip-Splatting GitHub

TechniqueExpert

Scaffold-GS

An improved Gaussian model employing a structured hierarchical design. It abandons completely free-growing splat clusters, instead utilizing SfM point clouds to build voxel grids that spawn 'neural anchors'. These anchors dynamically predict local Gaussian attributes, reducing view-overfitting and massively compressing file size.

Frequently leveraged by developers and geeks for large-scale outdoor scene reconstructions or highly storage-sensitive mobile exhibition projects.

#Paper#Outdoor

Further reading

Scaffold-GS 官方页

TechniqueIntermediate

Compressed GS

Also known as：压缩高斯

A lineage of compression algorithms (like C3DGS) engineered to mitigate the massive file size flaws (often hundreds of megabytes) of base 3DGS models. By synthesizing vector quantization, sensitivity-aware culling, and entropy coding, file sizes can be reduced 10× to 30× with negligible visual degradation.

The core pipeline defining whether content can be seamlessly distributed across web platforms, acting as a strict prerequisite for recommendations within communities.

#Publishing#Web Rendering

Further reading

Compact 3D Gaussian 论文

ToolBeginner

SuperSplat

An open-source, web-based 3DGS editing and processing tool developed by the PlayCanvas team. It provides an intuitive browser interface, empowering creators to import ply files and execute post-processing tasks such as selecting, trimming, merging, and recoloring, filling a crucial gap in the editing pipeline.

The premier tool of choice for 3D artists and no-code creators looking to polish raw splat models and eliminate stray floaters.

#Editing#Web Rendering

Further reading

SuperSplat GitHub

ToolBeginner

Splat Viewer

Also known as：Splat Web Viewer / Viewer

An application utilized on end-user devices (typically web browsers) to parse, load, and real-time rasterize Gaussian .ply or compressed formats. Premium viewers heavily leverage GPU acceleration while encapsulating camera roaming, anti-aliasing, and basic lighting interactions.

The direct portal through which end-users experience Gaussian content. The prosperity of this viewer ecosystem determines the dissemination breadth of 3DGS content.

#Viewing#Web Rendering

Further reading

Aras P 的知名 WebGL 实现

ConceptBeginnerPending verification

Real-time Rendering

Also known as：RTR

A computer graphics process that generates images rapidly enough (typically >30 FPS) to create a visually latency-free experience. 3DGS's explosive industrial impact stems entirely from its ability to maintain NeRF's offline-grade quality while unlocking blistering real-time rendering framerates at 1080P or 4K resolutions.

Distinguished from cinematic mesh raytracing requiring hours per frame, it serves as the foundational requirement for VR/AR interactions and game development.

#Real-time#Viewing

TechniqueIntermediate

WebGL / WebGPU API

Also known as：Web API

Graphics APIs enabling web browsers to directly leverage underlying GPU hardware acceleration. While WebGL offers historical stability and broad compatibility, the next-gen WebGPU provides low-level control and robust Compute Shader capabilities. This makes executing massive sorting calculations for millions of splats on pure web clients entirely feasible.

The indispensable underlying infrastructure powering lightweight Gaussian spatial distribution applications, such as plugin-free online galleries and interactive H5 exhibitions.

#Web Rendering#Viewing

Further reading

WebGPU 官方规范

WorkflowExpert

Simultaneous Localization and Mapping (SLAM)

Also known as：SLAM

A technique where a robot or device navigates an unknown environment, concurrently estimating its own location while constructing an environmental map. Recent studies like SplaTAM or MonoGS integrate 3DGS with SLAM, enabling phones or drones to scan and generate photorealistic Gaussian 3D maps dynamically while moving.

Massively streamlines the protracted 'Photography -> Offline SfM -> Training' pipeline, propelling Gaussian technology toward real-time mobile surveying.

#SLAM#Capture

Further reading

SplaTAM 论文

ConceptBeginnerPending verification

3D Reconstruction

Also known as：Reconstruction / 逆向工程

The reverse-engineering process of recreating the 3D morphology and surface properties of real-world physical objects using multi-view photos, videos, or depth sensor data. As one of the most potent contemporary paradigms, 3DGS is rapidly elevating the field's precision ceiling via its exceptional material reproduction and speed.

Broadly utilized across digital cultural heritage scanning, medical aesthetics facial mapping, digital human creation, and spatial cognition for embodied AI.

#Capture#Cultural Heritage

FormatIntermediatePending verification

Spatial Video

Also known as：Immersive Video / Volumetric Video

A media format supporting Six Degrees of Freedom (6DoF) viewing. Viewers can not only watch dynamic footage but physically shift their heads during playback to observe object flanks from varying angles. Spatial video powered by 4DGS compression is heavily tipped as the premier native content format for upcoming headsets like Apple Vision Pro.

The ultimate evolutionary leap from traditional 2D footage to stereoscopic narrative, currently facing immense challenges regarding massive data storage and streaming delivery.

#Spatial Media#Viewing

ConceptBeginnerPending verification

Digital Twin

Also known as：数字镜像

The one-to-one high-fidelity mapping of real-world physical entities (such as buildings, factories, or gallery spaces) into a digital ecosystem. Because it exceptionally replicates authentic lighting and materials, 3DGS is recognized as the most cost-effective visual foundational technology for constructing next-generation visceral digital twins.

Frequently operating in tandem with IoT data streams, it acts as the ultimate business container for smart cities, virtual exhibition halls, and industrial simulations.

#Indoor#Outdoor

TechniqueIntermediatePending verification

Voxelization

Also known as：Voxel Grid

The process of converting continuous 3D space or models into discrete microscopic 3D grids (voxels, or 3D pixels). While 3DGS relies on scattered continuous primitives, many spatial clipping tools and underlying collision detection mechanisms temporarily 'voxelize' the Gaussian field to leverage traditional bounding box algorithms efficiently.

A highly common intermediate transition state encountered when editing Gaussian models, physically slicing scenes, or attempting fusion with legacy physical collision engines.

#Editing#Optimization

MetricIntermediate

Evaluation Metrics (PSNR/SSIM/LPIPS)

Also known as：评测标准 / Metrics

Scientific benchmarks used to quantitatively compare image generation quality between 3DGS and rival algorithms. PSNR evaluates pixel-level color distortion; SSIM focuses on the structural integrity of edge features; and LPIPS utilizes neural networks to extract deep features, aligning much closer with actual human visual perception.

The absolute currency within academic papers validating model superiority. It constitutes the critical experimental data charts developers must monitor during hyperparameter tuning.

#Paper#Optimization

Further reading

LPIPS 原理论文

ConceptBeginner

Photogrammetry

In plain wordsPicture a shoebox full of holiday snapshots of one building—photogrammetry is the workflow where those flat prints are lined up so software can recover real-world distances and spit out a solid model you can spin; splatting is only one newer branch of the same "photos → 3D" family.

DetailsPhotogrammetry stitches overlapping photographs into consistent 3D geometry by estimating camera poses and 3D tie points through least-squares adjustment. Structure-from-motion and multi-view stereo pipelines are modern instantiations: they output sparse or dense reconstructions that downstream tools may convert into meshes, point clouds, or splatted Gaussians. The discipline predates neural fields; its focus is metrology-grade correspondence and calibration rather than a particular differentiable renderer.

↓ Prerequisites

↑ Go deeper

Related tools

polycam luma-ai kiri-engine

TechniqueIntermediate

Floaters (informal 3DGS artifact)

Also known as：stray Gaussians

In plain wordsAfter training, a splat scene can sprout tiny colored smudges that hang in mid-air like pollen—people in the field just call them "floaters." They often trace back to motion blur, bad coverage, or moving subjects, and editors such as SuperSplat exist mainly to lasso and delete them.

Details"Floaters" is informal jargon for stray Gaussian primitives that survive optimization away from true surfaces, producing visually floating specks. Papers may say "floating artifacts" instead. Mitigation spans interactive cleanup in tools, pruning heuristics, or training-time regularizers. Treat the word as practitioner slang, not a canonical keyword from a single paper title.

↓ Prerequisites

↑ Go deeper

Related tools

supersplat

ConceptExpert

Path Tracing

Also known as：path tracing

In plain wordsThink of path tracing as sending a pinball of light through the scene: each sample bounces randomly until it hits a light, you average thousands of those stories per pixel, and the photo slowly converges. That Monte Carlo story differs from rasterization, which streams triangles through a fast pipeline but usually needs extra tricks for soft shadows or color bleeding.

DetailsPath tracing estimates the rendering equation by Monte Carlo sampling complete light paths from the camera, averaging many independent samples per pixel to reduce variance. Rasterization projects primitives and shades fragments in a single forward pass—fast, but indirect lighting needs approximations or separate passes. Path tracing trades cost for physical consistency (global illumination, complex materials). When Gaussian splats are represented in Omniverse RTX or OctaneRender 2026-style pipelines, documentation and release notes describe them participating in lighting, shadows, reflections, and refractions alongside meshes.

Introduced

1986-08 · James T. Kajiya

"We present an integral equation which generalizes a variety of known rendering algorithms."[src]

↓ Prerequisites

↑ Go deeper

Related engines

nvidia-omniverse

Key papers

doi.org/10.1145/15922.15902

FormatIntermediate

SPZ format

Also known as：SPZ / Scaniverse PLY zipped

In plain wordsTreat SPZ like vacuum-sealing a closet full of coats: it is Niantic's packed format for Gaussian splats, often shrinking the same scene to a fraction of the PLY size (marketing copy cites ~90% savings) so mobile capture and web delivery hurt less. The bitstream is open source on GitHub.

DetailsThe reference library describes .spz as gzip-compressed streams with a fixed header followed by Gaussian attributes (positions, alphas, colors, scales, rotations, spherical harmonics). Quantization knobs trade size for fidelity. Interop demands explicit coordinate-frame metadata because SPZ stores data in an RUB system aligned with OpenGL/three.js, unlike many PLY files. Engines such as Babylon.js have added loaders, while Scaniverse remains the flagship consumer of the format.

Introduced

Niantic

".spz is a file format for compressed 3D gaussian splats."[src]

↓ Prerequisites

↑ Go deeper

Related tools

scaniverse

Related engines

niantic-scaniverse babylonjs

TechniqueExpert

SuGaR (Surface-Aligned Gaussian Splatting)

Also known as：SuGaR / Surface-Aligned Gaussian Splatting

In plain wordsSuGaR behaves like nudging a bowl of loose marbles—each Gaussian—until they hug the true surfaces; once they stick, Poisson reconstruction can spit out an editable triangle mesh in minutes, so artists can sculpt or relight through a mesh instead of hand-waving millions of splats.

DetailsSuGaR adds a regularizer that aligns Gaussians with the underlying surface, then samples the level set efficiently and runs Poisson surface reconstruction to obtain meshes without relying on marching cubes over a near-zero density field. An optional binding stage co-optimizes mesh and Gaussians for high-quality splat rendering while enabling mesh-based editing. The method targets fast mesh extraction from splats rather than real-time training.

Introduced

2023-11 · Antoine Guédon, Vincent Lepetit

"We propose a method to allow precise and extremely fast mesh extraction from 3D Gaussian Splatting."[src]

↓ Prerequisites

Key papers

arxiv.org/abs/2311.12775

ConceptBeginner

Novel View Synthesis

Also known as：view synthesis

In plain wordsImagine you only have vacation photos of a statue from the left and right—novel view synthesis is the trick of asking the computer to snap a picture from a viewpoint no camera ever held, while keeping lighting and geometry believable.

DetailsNovel view synthesis generates images from camera poses that were not in the training set. Image-based rendering, light-field methods, NeRF-style neural fields, and explicit Gaussian splats all target the same user-facing goal with different internal representations. Metrics such as PSNR, SSIM, and LPIPS quantify faithfulness to held-out captures.

↓ Prerequisites

↑ Go deeper

Related tools

polycam luma-ai

Related engines

threejs

ConceptIntermediate

Radiance Field

In plain wordsThink of a radiance field as a giant lookup table: at every 3D spot and every outward direction it tells you how bright light should be. Neural nets can approximate that table (NeRF); Gaussian splats approximate it with millions of tinted ellipsoids you can render fast.

DetailsRadiance measures directional light intensity at a point in space. A radiance field assigns such values across positions and directions. NeRF parameterizes the field with networks; 3DGS approximates it with explicit Gaussians and differentiable splatting. Practitioners often say "radiance field" when they mean either the continuous function or its discrete surrogate.

Introduced

2020-08 · Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, Ren Ng

"Our algorithm represents a scene using a fully-connected (non-convolutional) deep network, whose input is a single continuous 5D coordinate (spatial location (x,y,z) and viewing direction (theta,phi)) and whose output is the volume density and view-dependent emitted radiance at that spatial location."[src]

↓ Prerequisites

↑ Go deeper

Related tools

inria-gaussian-splatting gsplat

Related engines

threejs

ConceptBeginner

Spatial Computing

In plain wordsSpatial computing is the shift from peering into flat screens to letting computers reason about real rooms, hands, and headsets: sensors map the world around you, then graphics and audio glue themselves to that map instead of living only inside a monitor bezel.

DetailsIndustry usage bundles AR/VR/MR, embodied UI, and device ecosystems that track pose in physical space. For splat workflows it matters because capture devices and headsets define how reconstructions are gathered and consumed. Interop often routes through OpenXR or WebXR plus game-engine runtimes.

↓ Prerequisites

↑ Go deeper

Related engines

apple-vision-pro meta-quest

ConceptIntermediate

Rasterization

Also known as：rasterisation

In plain wordsRasterization is the assembly-line step that stamps triangles—or splats—onto a pixel grid: geometry becomes screen-space coverage, then shaders pick colors. Real-time engines lean on it because it is predictable and fast compared with brute-force light simulation.

DetailsRasterization determines which pixels a primitive covers and shades those fragments. It powers most real-time graphics APIs. Differentiable splat rasterizers extend the same idea: sort Gaussians per tile, alpha-blend, and backpropagate screen-space error—still a rasterization mindset, not path tracing.

↓ Prerequisites

↑ Go deeper

Related engines

webgl-webgpu threejs

ConceptIntermediate

Gradient Descent

In plain wordsGradient descent is blind hiking: each step walks opposite the slope of the loss landscape. For splat training, that loss compares rendered pixels to photos; autograd tells every Gaussian which way to nudge position, color, or covariance.

DetailsIterative first-order optimization updates parameters by stepping along negative gradients of a scalar loss. In 3DGS pipelines the loss blends photometric terms (e.g., L1 plus D-SSIM) and interacts with densification schedules. Stochastic minibatches and GPU kernels make large-scale updates practical.

↓ Prerequisites

↑ Go deeper

Related tools

inria-gaussian-splatting gsplat

Related engines

playcanvas

WorkflowIntermediate

COLMAP

In plain wordsCOLMAP is the open-source detective for unordered photo sets: it runs structure-from-motion to recover cameras, then multi-view stereo to grow dense geometry—most splat pipelines start by exporting its poses and sparse points.

DetailsCOLMAP bundles established CVPR/ECCV algorithms behind a unified CLI and GUI. Typical splat workflows import images.bin/points3D.bin or textured meshes as initialization. Global SfM variants (e.g., GLOMAP) require extra citations. Licensing is BSD. Version 3.12 added native sensor-rig support, latitude/longitude to UTM conversion, and new pose files, improving panoramic, multi-camera, and drone reconstruction workflows.

↓ Prerequisites

↑ Go deeper

Related tools

inria-gaussian-splatting gsplat

WorkflowIntermediate

gsplat library (CUDA splat utilities)

Also known as：nerfstudio gsplat

In plain wordsThe gsplat project is a CUDA-accelerated Gaussian rasterizer with Python bindings: same splat idea as the Inria reference, but tuned for memory, speed, and extra research features when hooked into PyTorch training loops.

DetailsMaintainers publish evaluation scripts matching PSNR/SSIM/LPIPS and converged Gaussian counts against the official trainer while advertising lower VRAM and faster runtime. Developers integrate it inside Nerfstudio-style stacks or standalone COLMAP examples. Not to be confused with the gsplat.js viewer.

↓ Prerequisites

↑ Go deeper

Related tools

gsplat inria-gaussian-splatting

Key papers

arxiv.org/abs/2409.06765

WorkflowIntermediate

Training loss (3D Gaussian Splatting)

In plain wordsThe training loss scores how far splat renders drift from captured photos; optimizers push Gaussians to lower that score. The canonical 3DGS recipe blends an L1 photometric term with a D-SSIM structural term, weighted by a λ you pick once and reuse.

DetailsKerbl et al. combine L1 and D-SSIM (with λ=0.2) to encourage pixel accuracy and structural agreement. Other projects append regularizers for anti-aliasing, opacity control, or mesh-friendly alignment. Always relate scalar loss trends to perceptual metrics and qualitative renders.

↓ Prerequisites

↑ Go deeper

Related tools

inria-gaussian-splatting gsplat

Related engines

playcanvas

Key papers

arxiv.org/abs/2308.04079

WorkflowIntermediate

Mesh Extraction

Also known as：surface extraction

In plain wordsMesh extraction turns unstructured splats or points into a triangle soup you can boolean, rig, or ship to game engines. For Gaussian splats, researchers often sample implicit surfaces or aligned Gaussians, then run Poisson reconstruction instead of marching through noisy densities.

DetailsThe task spans Poisson reconstruction, marching cubes on SDFs, and splat-specific pipelines such as SuGaR's level-set sampling. Triangle meshes remain the interchange currency for DCC tools even when rendering stays on splats. Radiance Fields' overview also names 2DGS, RaDe-GS, Gaussian Frosting, and Texture-GS as active routes for pulling editable surfaces from Gaussian or radiance-field representations.

↓ Prerequisites

↑ Go deeper

Related tools

blender-3dgs-addon

Related engines

blender

Key papers

arxiv.org/abs/2311.12775

FormatBeginner

PLY file format

Also known as：PLY / Polygon File Format / Stanford Triangle Format

In plain wordsPLY is a text-or-binary recipe card: a header lists how many vertices exist and which properties (x, y, z, normals, SH bands…) follow, then the data rows stream underneath—exactly why splat trainers love it for rich Gaussian attributes.

DetailsThe Stanford PLY spec couples an element/property header with vertex or face records. Gaussian splat exports piggyback on that extensibility for SH coefficients and covariance data. Always verify endianness and property naming when swapping between trainers and viewers.

Introduced

1994 · Greg Turk; Stanford University

"PLY is a computer file format known as the Polygon File Format or the Stanford Triangle Format."[src]

↓ Prerequisites

↑ Go deeper

Related tools

supersplat inria-gaussian-splatting

Related engines

supersplat

FormatIntermediate

.splat binary format

Also known as：splat file

In plain wordsA .splat file packs Gaussian parameters into a tight binary layout tuned for fast loading in web viewers—think shipping foam peanuts in a labeled crate instead of a verbose text manifest. Vendors differ, so always verify headers when hopping between exporters.

DetailsBinary splat containers strip textual overhead from PLY while preserving the data needed for real-time rasterizers. SuperSplat and other editors import/export these blobs for publishing. Interoperability demands checking SH ordering, endianness, and optional compression flags per tool. OctaneRender 2026's PLY/SPZ support and its 2027 roadmap for exporting CG scenes into splats show Gaussian formats expanding from web viewers into offline rendering and DCC pipelines.

↓ Prerequisites

↑ Go deeper

Related tools

supersplat three-gaussian-splatting

Related engines

playcanvas

FormatIntermediate

KSPLAT (.ksplat) format

Also known as：KSPLAT / .ksplat

In plain wordsThe .ksplat container is Mark Kellogg's trimmed, compressed splat package for the GaussianSplats3D viewer—great for faster loads, but it is a project-specific binary rather than a formal Khronos-style spec, so treat interoperability as "try it and see."

DetailsKSplatLoader APIs export converted buffers for download. Because the format evolves, shipping cross-engine assets often stays on PLY/SPZ while using .ksplat inside a known web stack.

↓ Prerequisites

↑ Go deeper

Related tools

three-gaussian-splatting supersplat

Related engines

threejs

FormatExpert

USD ParticleField (incl. 3D Gaussian Splats)

Also known as：OpenUSD ParticleField

In plain wordsUSD's ParticleField schema is the paperwork that tells a renderer how to interpret volumetric particles: positions, kernels, radiance, optional scales and orientations. Derived types such as ParticleField3DGaussianSplat let pipelines store splats as first-class USD prims instead of ad-hoc blobs.

DetailsParticleField generalizes multiple field types, including 3D Gaussian splats. Tooling converts trained PLY assets into USD stages referencing these prims so RTX can path-trace them alongside meshes. Custom exporters must inherit the schema expectations for validation.

↓ Prerequisites

↑ Go deeper

Related engines

nvidia-omniverse

MetricBeginner

Peak Signal-to-Noise Ratio (PSNR)

Also known as：PSNR

In plain wordsPSNR compares the max possible signal power to the noise energy between two images, expressed in decibels—higher usually means closer pixels, but it can disagree with human perception when blur or structure breaks.

DetailsPSNR rewards pixel-wise fidelity. Teams report it alongside SSIM and LPIPS because high PSNR does not guarantee perceptual quality. Use the same train/test splits and bit depth when comparing numbers across papers.

↓ Prerequisites

↑ Go deeper

Related tools

gsplat

MetricBeginner

Structural Similarity Index Measure (SSIM)

Also known as：SSIM / structural similarity index

In plain wordsSSIM scores how well luminance, contrast, and structure match between patches—closer to human annoyance with blur than raw MSE, though it still misses every failure mode.

DetailsSSIM generalizes Wang–Bovik's framework; differentiable variants plug into splat optimization. Teams usually log SSIM alongside PSNR and LPIPS on held-out views.

↓ Prerequisites

↑ Go deeper

Related tools

gsplat

MetricIntermediate

Learned Perceptual Image Patch Similarity (LPIPS)

Also known as：LPIPS / perceptual metric LPIPS

In plain wordsLPIPS feeds paired images through a pretrained CNN, compares deep features (optionally calibrated), and returns a distance where higher means more different—useful when PSNR looks fine but pictures still feel wrong.

DetailsThe reference implementation exposes AlexNet/VGG/SqueezeNet variants. Gaussian splat papers cite LPIPS↓ on validation views. Unlike PSNR, it rewards semantic similarity but costs more compute.

↓ Prerequisites

↑ Go deeper

Related tools

gsplat

Key papers

arxiv.org/abs/1801.03924

MetricBeginner

Frames Per Second (FPS)

Also known as：FPS / frame rate

In plain wordsFPS counts how many full images a pipeline produces each second—either realtime splat rendering in a browser or the throughput numbers quoted beside PSNR tables. Higher feels smoother, but always ask which stage was timed.

DetailsBenchmarks should specify resolution, MSAA settings, and whether WebGPU/WebGL paths were used. Academic tables often list FPS next to memory footprint to compare explicit radiance methods.

↓ Prerequisites

↑ Go deeper

Related engines

playcanvas godot

Key papers

arxiv.org/abs/2308.04079

MetricIntermediate

Gaussian / splat count

Also known as：number of Gaussians

In plain wordsSplat count tracks how many Gaussian primitives survive optimization—think counting marbles in a jar. More splats can capture detail but inflate VRAM, storage, and sort cost; pruning and compression fight the same number.

DetailsDensification schedules grow or split splats when view-space gradients spike; opacity pruning removes low-alpha Gaussians. gsplat reproduces official Gaussian counts when benchmarking. SPZ/KSplat formats also hinge on how many primitives remain. The Infinite Realities report describes about 1,800 trained PLY files and 300K input images for a 30-second dynamic clip, showing that 4D capture must evaluate splat count together with time sequence and storage budget.

↓ Prerequisites

↑ Go deeper

Related tools

gsplat

Related engines

supersplat

Key papers

arxiv.org/abs/2308.04079