Understanding Gaussian Splatting
Beginner · 7 modules · ~2.5h
Concepts, pipeline, NeRF comparison, first hands-on demo.
Enter Path →Learn Hub
Four structured paths from first principles to spatial narrative experiments, followed by a tools index (capture → training → editing → viewing → publishing) and an in-site glossary.
Learning Paths
Beginner · 7 modules · ~2.5h
Concepts, pipeline, NeRF comparison, first hands-on demo.
Enter Path →Intermediate · 8 modules · ~4h
Shooting strategy, SfM, COLMAP, local & cloud training, QC and export.
Enter Path →Intermediate · 6 modules · ~3h
Three.js / PlayCanvas / Babylon.js, formats, LOD and WebXR.
Enter Path →General · 5 modules · ~2h
Milestone papers, 4D dynamics, avatars, text-driven gen and community works.
Enter Path →Tools Index
Organized along the five-stage pipeline (capture → training → editing → viewing → publishing) with ratings, ramp-up cost, and public sources.
Capture
26Turn real-world spaces into trainable image or video data.
26 tools →Training
20Convert captured data into Gaussian Splatting models.
20 tools →Editing
15Clean up, crop, recolor, and blend with traditional assets.
15 tools →Viewing
17View Gaussian models on the web, apps, or spatial devices.
17 tools →Publishing
14Distribute works to cloud, social, maps, or commercial platforms.
14 tools →Extended notes
Forked from legacy links, rewritten for on-site reading.
How Gaussian Splatting finds equilibrium among polygon meshes, point clouds, and NeRF implicit neural fields — a rethinking of how we record reality.
Read →Imagine filling space with countless translucent colored drops. We deconstruct the core parameters of a Gaussian and see how shape, color, and orientation recreate real-world light and shadow.
Read →How Gaussians self-organize like biological cells — splitting, cloning, and dying — to grow from a sparse initial point cloud into a detailed 3D scene.
Read →The technical internals of 3DGS real-time rendering — how tile-based rendering, CUDA optimization, and differentiable backpropagation render millions of Gaussians in milliseconds.
Read →A complete practical guide from capture to rendering — mastering COLMAP, Nerfstudio, Luma AI, and more to create your first Gaussian Splatting model.
Read →Exploring the boundaries of 3DGS — current limitations, ongoing breakthroughs, and how 4D Gaussian Splatting extends static scenes to dynamic worlds.
Read →Curated learning resources, key papers, open-source projects and active communities — your 3DGS deep-learning roadmap.
Read →Why photogrammetry-style pipelines need stable lighting, sharp frames, and overlap; a pre-shoot checklist aligned with public COLMAP guidance.
Read →Picture the space as an onion: shoot three concentric height rings and keep a looped route so SfM has enough overlapping data to converge.
Read →Orbit on the main wide camera across multiple heights; control total duration; optional background removal on supported platforms.
Read →Lock camera pose and focal length; shoot RAW, normalize WB/exposure; turntable stepping with high overlap plus bottom coverage.
Read →Use wide FOV and higher frame rates for redundant views; SDR + AWB defaults; prefer stills in dark venues; avoid nadir/zenith-only framing.
Read →Local training marks the crossover from hobbyist to practitioner. Anchored on three accessible desktop tools, this article spells out VRAM budgets, step counts and convergence heuristics — including a what-to-do-when-it-crashes routine.
Read →When local hardware buckles, the cloud is the other leg. This article benchmarks Luma's minimalist uploads, Zhitianxia's enterprise-leaning platform, and Pointcosm's package pricing, with a selection matrix indexed by scene scale.
Read →Training done is just round one of quality control. This article covers deleting floaters in SuperSplat, format conversion with splat-transform, and a simple PSNR/SSIM/LPIPS self-evaluation routine.
Read →SOG favors small payloads and fast first paint but trades geometric/texture fidelity; keep PLY/splat masters for archival handoff.
Read →A rewritten Volinga article explaining how NeRF can reduce time, budget, and camera-motion limits in virtual production environment creation, while still facing real-time rendering and Unreal Engine integration constraints.
Read →A rewritten Radiance Fields guide comparing Scaniverse, Polycam, Luma AI, KIRI Engine, and Postshot across onboarding, cloud dependence, export capability, and local training trade-offs.
Read →Judge whether frames carry enough texture, then cover volume with three heights and a looped path; avoid macro modes and too-close distances.
Read →Cull frames, fix upside-down shots, mild color balance, then auto-register in RC; if alignment collapses, reshoot for overlap instead of manual control-point hacks.
Read →Per-image/per-minute tiers and pro capacity; compare subscription SaaS on exports, queueing, and whether denoising is bundled.
Read →Separate the open-source SuperSplat editor from superspl.at composition; list official repos and editor URLs.
Read →Further Reading · Research Milestones
Want to read the source? Here are the key papers and open-source tools defining today's Gaussian Splatting engineering.
The foundational paper for 3DGS, breaking the barrier between high-quality view synthesis and real-time rendering.
View →Bridges the gap between Gaussian point clouds and traditional polygon mesh pipelines via surface-alignment regularization.
View →A high-performance 3DGS viewer and editor running entirely in the browser, drastically lowering the hardware barrier for spatial data cleanup.
View →Resolves intense high-frequency aliasing and dilation artifacts when changing viewing distances by introducing a 2D low-pass filter.
View →Flattens volumetric Gaussians into surface-aligned 2D disks with perspective-correct splatting and depth or normal cues for cleaner geometry.
View →Models dynamic scenes with 4D neural voxels and a deformation field over a single canonical Gaussian set for real-time training and storage efficiency.
View →Combines learned masking, grid-based view-dependent color, and codebook-quantized geometry to shrink Gaussian count and attribute storage while preserving quality.
View →Optimizes 3D Gaussians in canonical space with a deformation field for monocular dynamics and uses annealed smoothing to reduce temporal jitter from pose noise.
View →Combines 3D Gaussians with score distillation and mesh or UV refinement to cut per-asset 3D generation time sharply.
View →Models humans with animatable 3D Gaussians, learns pose-dependent appearance, and jointly refines motion and look for realistic monocular avatars.
View →Uses Gaussians as the sole map representation for monocular SLAM, jointly optimizing poses and primitives for incremental reconstruction and NVS.
View →Jointly learns static scene and animatable human Gaussians from a short monocular clip, optimizing skinning weights to reduce articulation artifacts at high FPS.
View →Stores distilled language features on Gaussians and splats them for open-vocabulary 3D grounding, avoiding costly NeRF volume rendering.
View →Organizes neural Gaussians on anchors and predicts attributes from the viewpoint to cut redundancy and improve generalization on complex scenes.
View →Glossary
52 cross-linked terms layered for beginners, practitioners, and researchers—same entry, your choice of depth.
54 entries
Also known as:Gaussian Splatting / 3DGS
A revolutionary 3D scene representation and real-time rendering technique. It explicitly models environments using millions of anisotropic Gaussian ellipsoids, each parameterized with covariance, opacity, and spherical harmonics. Coupled with a highly efficient differentiable rasterization pipeline, it achieves photorealistic novel view synthesis at exceptional frame rates.
Primarily utilized during high-fidelity 3D reconstruction and real-time rendering phases, serving as a fundamental pillar for modern spatial computing.
Related
Further reading
Also known as:Gaussian / Splatting Primitive
The fundamental 3D primitive comprising a 3DGS scene. It acts conceptually like a semi-transparent droplet of paint suspended in 3D space. Shaped as an anisotropic ellipsoid, its color and opacity dynamically shift depending on the viewing angle to simulate realistic lighting.
Creators and algorithms continuously add, remove, or modify these primitives during the optimization phase to closely approximate real-world environments.
Related
Further reading
Also known as:SfM
A computer vision technique that automatically estimates camera poses and extracts a sparse 3D point cloud from a sequence of 2D images. In the 3DGS workflow, SfM provides the crucial initial camera parameters and base structure required for subsequent initialization.
Deployed during the pre-processing stage immediately after data capture. Calculations are typically automated by open-source libraries like COLMAP.
Related
Further reading
Also known as:MVS
A 3D reconstruction algorithm operating downstream of SfM. It utilizes stereo matching techniques on image sets with known camera parameters to generate dense point clouds. While traditional pipelines rely on MVS for mesh extraction, 3DGS efficiently bypasses this expensive step.
The core step in traditional mesh modeling pipelines, though standard Gaussian workflows typically omit it or use it solely for benchmarking purposes.
Related
Further reading
Also known as:NeRF
A technique utilizing Multilayer Perceptrons (MLPs) to implicitly represent continuous 3D scenes. Given a viewing direction and spatial coordinates, the neural network outputs color and volume density. NeRF catalyzed the novel view synthesis revolution, acting as the direct predecessor to 3DGS. Radiance Fields' overview ties the original 2020 Berkeley work to later acceleration projects such as Instant-NGP, which shortened practical training times for many scenes.
Frequently utilized by researchers as a baseline metric. Due to sluggish rendering speeds, it is progressively being replaced by 3DGS in real-time interactive scenarios.
Related
Further reading
Also known as:Points
A collection of discrete data points in three-dimensional space, typically containing coordinate geometry alongside attributes like color or reflectance. Within 3DGS, the sparse point cloud generated by SfM serves as the initial seeding ground for Gaussian ellipsoids.
Present throughout the initial stages of the spatial computing pipeline. It is also the direct output format for various LiDAR sensors and scanning devices.
Related
Further reading
Also known as:Mesh / 网格
The dominant model representation in traditional 3D engines, composed of vertices, edges, and polygonal faces (typically triangles) forming a continuous surface shell. Unlike the volumetric and scattered nature of 3DGS, meshes demand strict surface topology, struggling with hair or translucency.
The native language of mainstream game engines and modeling software like Blender or Unreal. Seamlessly bridging meshes with Gaussian splats remains a critical industry objective.
Related
Further reading
Also known as:Covariance / Σ
The mathematical expression controlling the spatial morphology of an individual Gaussian primitive. In 3DGS context, it dictates the scaling and rotation of the ellipsoid across X, Y, and Z axes, allowing it to conform to shapes ranging from minute dust to flat walls.
Operates within the foundational gradient calculations of model training, meticulously ensuring that ellipsoids maintain mathematical validity (positive semi-definite) during transformations.
Related
Also known as:Alpha / 透明度
A scalar value defining the physical solidness of a Gaussian primitive, ranging from 0 (completely transparent) to 1 (fully opaque). During rasterization, colors from overlapping splats are blended based on their alpha values. Superfluous, highly transparent splats are pruned out.
Determines the visual clarity of the scene and serves as the primary metric for the algorithm to auto-regulate primitive count by pruning redundant volumes.
Related
Also known as:SH
A set of orthogonal basis functions defined on the surface of a sphere, utilized to faithfully approximate view-dependent color variations. In 3DGS, each splat holds SH coefficients rather than a single color, enabling realistic specular highlights based on the observer's angle.
Responsible for high-fidelity lighting and reflections. Typically divided into degrees; higher degrees yield richer chromatic detail but significantly increase memory consumption.
Related
Further reading
Also known as:克隆与分裂 / Cloning and Splitting
The 'growth' phase of the Adaptive Density Control strategy. When the model detects missing details or oversized splats with high variance during training, densification triggers. It clones existing splats to fill empty spaces or splits massive ones, thereby dynamically enhancing geometric resolution.
Occurs strictly within the core training loop. It acts as the critical automated replication mechanism granting 3DGS its extraordinary visual clarity.
Related
Also known as:3DGS as MCMC / Gaussian Splatting MCMC
In plain wordsReframes splat optimization as stochastic search over discrete Gaussian sets via MCMC-style proposals and accept/reject moves, with explicit noise learning rates plus scale and opacity regularizers atop the Inria reference trainer.
DetailsNeurIPS 2024 Spotlight work by UBC Vision reframes 3DGS optimization as Markov Chain Monte Carlo to stabilize adaptive density control. The public codebase extends the Inria reference trainer with CLI flags such as cap_max, scale_reg, opacity_reg, noise_lr, and init_type while inheriting the bundled non-commercial license and Ubuntu 20.04 test matrix.
Introduced
2024 · Shakiba Kheradmand et al. (UBC Vision)
"3D Gaussian Splatting as Markov Chain Monte Carlo"[src]
↓ Prerequisites
↑ Go deeper
Related
Related tools
Key papers
Also known as:裁剪 / Culling
The 'culling' phase of the Adaptive Density Control strategy. To prevent infinite primitive inflation from overwhelming VRAM, the algorithm periodically inspects the scene during training. It forcibly deletes invisible splats with ultra-low opacity or overly massive, ineffective ones, ensuring model efficiency.
Working symbiotically with densification throughout the training timeline, it serves as the safeguard mechanism for controlling final file sizes and rendering frame rates.
Related
Also known as:ADC
The macro-regulatory algorithm overseeing both densification and pruning behaviors. By calculating the average magnitude of positional gradients, it diagnoses which 3D spatial regions are under-reconstructed or over-reconstructed, dynamically managing splat distribution to intelligently resolve scene clarity without human intervention.
Functions as the brain of the 3DGS training engine, directly dictating the evolutionary speed and qualitative leap from sparse point clouds to perfect scenes.
Related
Also known as:Tile-based Rasterizer
The secret weapon behind 3DGS's blistering rendering speeds. It partitions the screen into 16×16 tiles, rapidly sorting and culling visible splats. Because this rasterization process is mathematically differentiable, pixel-level errors effortlessly propagate backward, instructing underlying splat properties on how to correct themselves.
The bridge connecting forward visual rendering with backward gradient optimization, leveraging traditional computer graphics ingenuity to shatter AI computational bottlenecks.
Related
Further reading
Also known as:Anisotropic
The property of exhibiting different values when measured in different directions. In 3DGS, primitives are not perfect spheres; they are anisotropic ellipsoids capable of independent scaling across X, Y, and Z axes. This allows them to flatten drastically to simulate complex structural surfaces efficiently.
Represents the massive leap in representational efficiency for 3DGS when compared against traditional voxels or isotropic point cloud models.
Related
Also known as:Neural Anchor
Feature mounting nodes introduced in advanced models like Scaffold-GS. Instead of isolating millions of splats, the system deploys sparse anchors via voxel grids. Each anchor controls attached local splats through a neural network, creating a hierarchical structure that dramatically enhances noise resistance.
An advanced network architecture tailored for complex indoor/outdoor scenes, effectively addressing the artifact flaws foundational 3DGS exhibits during abrupt viewing angle shifts.
Related
Further reading
Also known as:Dynamic GS / 4DGS
A technological branch integrating the temporal axis (T) into 3D Gaussians. It typically employs deformation field neural networks to capture the motion trajectories, deformations, and life cycles of splats over time. This enables ultra-high framerate reconstruction and rendering of dynamic real-world scenes.
The next-generation technological solution for spatial video, holographic performances, and motion capture, currently undergoing intense industrial research and development.
Related
Further reading
Also known as:Mip-Splat
An improved algorithm designed to eliminate high-frequency flickering and artifacts produced by 3DGS during camera zooming or movement. By introducing low-pass filters in both 2D and 3D dimensions, it constrains frequency sampling rates, ensuring reconstructions remain perfectly smooth and anti-aliased at any scale.
Massively elevates the visual experience during free-roaming on Web platforms. It is a critical optimization required for deploying commercial-grade applications.
Related
Further reading
An improved Gaussian model employing a structured hierarchical design. It abandons completely free-growing splat clusters, instead utilizing SfM point clouds to build voxel grids that spawn 'neural anchors'. These anchors dynamically predict local Gaussian attributes, reducing view-overfitting and massively compressing file size.
Frequently leveraged by developers and geeks for large-scale outdoor scene reconstructions or highly storage-sensitive mobile exhibition projects.
Related
Further reading
Also known as:压缩高斯
A lineage of compression algorithms (like C3DGS) engineered to mitigate the massive file size flaws (often hundreds of megabytes) of base 3DGS models. By synthesizing vector quantization, sensitivity-aware culling, and entropy coding, file sizes can be reduced 10× to 30× with negligible visual degradation.
The core pipeline defining whether content can be seamlessly distributed across web platforms, acting as a strict prerequisite for recommendations within communities.
Related
Further reading
An open-source, web-based 3DGS editing and processing tool developed by the PlayCanvas team. It provides an intuitive browser interface, empowering creators to import ply files and execute post-processing tasks such as selecting, trimming, merging, and recoloring, filling a crucial gap in the editing pipeline.
The premier tool of choice for 3D artists and no-code creators looking to polish raw splat models and eliminate stray floaters.
Related
Further reading
Also known as:Splat Web Viewer / Viewer
An application utilized on end-user devices (typically web browsers) to parse, load, and real-time rasterize Gaussian .ply or compressed formats. Premium viewers heavily leverage GPU acceleration while encapsulating camera roaming, anti-aliasing, and basic lighting interactions.
The direct portal through which end-users experience Gaussian content. The prosperity of this viewer ecosystem determines the dissemination breadth of 3DGS content.
Related
Further reading
Also known as:RTR
A computer graphics process that generates images rapidly enough (typically >30 FPS) to create a visually latency-free experience. 3DGS's explosive industrial impact stems entirely from its ability to maintain NeRF's offline-grade quality while unlocking blistering real-time rendering framerates at 1080P or 4K resolutions.
Distinguished from cinematic mesh raytracing requiring hours per frame, it serves as the foundational requirement for VR/AR interactions and game development.
Related
Also known as:Web API
Graphics APIs enabling web browsers to directly leverage underlying GPU hardware acceleration. While WebGL offers historical stability and broad compatibility, the next-gen WebGPU provides low-level control and robust Compute Shader capabilities. This makes executing massive sorting calculations for millions of splats on pure web clients entirely feasible.
The indispensable underlying infrastructure powering lightweight Gaussian spatial distribution applications, such as plugin-free online galleries and interactive H5 exhibitions.
Related
Further reading
Also known as:SLAM
A technique where a robot or device navigates an unknown environment, concurrently estimating its own location while constructing an environmental map. Recent studies like SplaTAM or MonoGS integrate 3DGS with SLAM, enabling phones or drones to scan and generate photorealistic Gaussian 3D maps dynamically while moving.
Massively streamlines the protracted 'Photography -> Offline SfM -> Training' pipeline, propelling Gaussian technology toward real-time mobile surveying.
Related
Further reading
Also known as:Reconstruction / 逆向工程
The reverse-engineering process of recreating the 3D morphology and surface properties of real-world physical objects using multi-view photos, videos, or depth sensor data. As one of the most potent contemporary paradigms, 3DGS is rapidly elevating the field's precision ceiling via its exceptional material reproduction and speed.
Broadly utilized across digital cultural heritage scanning, medical aesthetics facial mapping, digital human creation, and spatial cognition for embodied AI.
Related
Also known as:Immersive Video / Volumetric Video
A media format supporting Six Degrees of Freedom (6DoF) viewing. Viewers can not only watch dynamic footage but physically shift their heads during playback to observe object flanks from varying angles. Spatial video powered by 4DGS compression is heavily tipped as the premier native content format for upcoming headsets like Apple Vision Pro.
The ultimate evolutionary leap from traditional 2D footage to stereoscopic narrative, currently facing immense challenges regarding massive data storage and streaming delivery.
Related
Also known as:数字镜像
The one-to-one high-fidelity mapping of real-world physical entities (such as buildings, factories, or gallery spaces) into a digital ecosystem. Because it exceptionally replicates authentic lighting and materials, 3DGS is recognized as the most cost-effective visual foundational technology for constructing next-generation visceral digital twins.
Frequently operating in tandem with IoT data streams, it acts as the ultimate business container for smart cities, virtual exhibition halls, and industrial simulations.
Related
Also known as:Voxel Grid
The process of converting continuous 3D space or models into discrete microscopic 3D grids (voxels, or 3D pixels). While 3DGS relies on scattered continuous primitives, many spatial clipping tools and underlying collision detection mechanisms temporarily 'voxelize' the Gaussian field to leverage traditional bounding box algorithms efficiently.
A highly common intermediate transition state encountered when editing Gaussian models, physically slicing scenes, or attempting fusion with legacy physical collision engines.
Related
Also known as:评测标准 / Metrics
Scientific benchmarks used to quantitatively compare image generation quality between 3DGS and rival algorithms. PSNR evaluates pixel-level color distortion; SSIM focuses on the structural integrity of edge features; and LPIPS utilizes neural networks to extract deep features, aligning much closer with actual human visual perception.
The absolute currency within academic papers validating model superiority. It constitutes the critical experimental data charts developers must monitor during hyperparameter tuning.
Related
Further reading
In plain wordsPicture a shoebox full of holiday snapshots of one building—photogrammetry is the workflow where those flat prints are lined up so software can recover real-world distances and spit out a solid model you can spin; splatting is only one newer branch of the same "photos → 3D" family.
DetailsPhotogrammetry stitches overlapping photographs into consistent 3D geometry by estimating camera poses and 3D tie points through least-squares adjustment. Structure-from-motion and multi-view stereo pipelines are modern instantiations: they output sparse or dense reconstructions that downstream tools may convert into meshes, point clouds, or splatted Gaussians. The discipline predates neural fields; its focus is metrology-grade correspondence and calibration rather than a particular differentiable renderer.
↓ Prerequisites
↑ Go deeper
Related tools
Also known as:stray Gaussians
In plain wordsAfter training, a splat scene can sprout tiny colored smudges that hang in mid-air like pollen—people in the field just call them "floaters." They often trace back to motion blur, bad coverage, or moving subjects, and editors such as SuperSplat exist mainly to lasso and delete them.
Details"Floaters" is informal jargon for stray Gaussian primitives that survive optimization away from true surfaces, producing visually floating specks. Papers may say "floating artifacts" instead. Mitigation spans interactive cleanup in tools, pruning heuristics, or training-time regularizers. Treat the word as practitioner slang, not a canonical keyword from a single paper title.
↓ Prerequisites
↑ Go deeper
Related tools
Also known as:path tracing
In plain wordsThink of path tracing as sending a pinball of light through the scene: each sample bounces randomly until it hits a light, you average thousands of those stories per pixel, and the photo slowly converges. That Monte Carlo story differs from rasterization, which streams triangles through a fast pipeline but usually needs extra tricks for soft shadows or color bleeding.
DetailsPath tracing estimates the rendering equation by Monte Carlo sampling complete light paths from the camera, averaging many independent samples per pixel to reduce variance. Rasterization projects primitives and shades fragments in a single forward pass—fast, but indirect lighting needs approximations or separate passes. Path tracing trades cost for physical consistency (global illumination, complex materials). When Gaussian splats are represented in Omniverse RTX or OctaneRender 2026-style pipelines, documentation and release notes describe them participating in lighting, shadows, reflections, and refractions alongside meshes.
Introduced
1986-08 · James T. Kajiya
"We present an integral equation which generalizes a variety of known rendering algorithms."[src]
↓ Prerequisites
↑ Go deeper
Related engines
Key papers
Also known as:SPZ / Scaniverse PLY zipped
In plain wordsTreat SPZ like vacuum-sealing a closet full of coats: it is Niantic's packed format for Gaussian splats, often shrinking the same scene to a fraction of the PLY size (marketing copy cites ~90% savings) so mobile capture and web delivery hurt less. The bitstream is open source on GitHub.
DetailsThe reference library describes .spz as gzip-compressed streams with a fixed header followed by Gaussian attributes (positions, alphas, colors, scales, rotations, spherical harmonics). Quantization knobs trade size for fidelity. Interop demands explicit coordinate-frame metadata because SPZ stores data in an RUB system aligned with OpenGL/three.js, unlike many PLY files. Engines such as Babylon.js have added loaders, while Scaniverse remains the flagship consumer of the format.
↓ Prerequisites
↑ Go deeper
Related tools
Related engines
Also known as:SuGaR / Surface-Aligned Gaussian Splatting
In plain wordsSuGaR behaves like nudging a bowl of loose marbles—each Gaussian—until they hug the true surfaces; once they stick, Poisson reconstruction can spit out an editable triangle mesh in minutes, so artists can sculpt or relight through a mesh instead of hand-waving millions of splats.
DetailsSuGaR adds a regularizer that aligns Gaussians with the underlying surface, then samples the level set efficiently and runs Poisson surface reconstruction to obtain meshes without relying on marching cubes over a near-zero density field. An optional binding stage co-optimizes mesh and Gaussians for high-quality splat rendering while enabling mesh-based editing. The method targets fast mesh extraction from splats rather than real-time training.
Introduced
2023-11 · Antoine Guédon, Vincent Lepetit
"We propose a method to allow precise and extremely fast mesh extraction from 3D Gaussian Splatting."[src]
↓ Prerequisites
Key papers
Also known as:view synthesis
In plain wordsImagine you only have vacation photos of a statue from the left and right—novel view synthesis is the trick of asking the computer to snap a picture from a viewpoint no camera ever held, while keeping lighting and geometry believable.
DetailsNovel view synthesis generates images from camera poses that were not in the training set. Image-based rendering, light-field methods, NeRF-style neural fields, and explicit Gaussian splats all target the same user-facing goal with different internal representations. Metrics such as PSNR, SSIM, and LPIPS quantify faithfulness to held-out captures.
↓ Prerequisites
↑ Go deeper
Related engines
In plain wordsThink of a radiance field as a giant lookup table: at every 3D spot and every outward direction it tells you how bright light should be. Neural nets can approximate that table (NeRF); Gaussian splats approximate it with millions of tinted ellipsoids you can render fast.
DetailsRadiance measures directional light intensity at a point in space. A radiance field assigns such values across positions and directions. NeRF parameterizes the field with networks; 3DGS approximates it with explicit Gaussians and differentiable splatting. Practitioners often say "radiance field" when they mean either the continuous function or its discrete surrogate.
Introduced
2020-08 · Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, Ren Ng
"Our algorithm represents a scene using a fully-connected (non-convolutional) deep network, whose input is a single continuous 5D coordinate (spatial location (x,y,z) and viewing direction (theta,phi)) and whose output is the volume density and view-dependent emitted radiance at that spatial location."[src]
↓ Prerequisites
↑ Go deeper
Related tools
Related engines
In plain wordsSpatial computing is the shift from peering into flat screens to letting computers reason about real rooms, hands, and headsets: sensors map the world around you, then graphics and audio glue themselves to that map instead of living only inside a monitor bezel.
DetailsIndustry usage bundles AR/VR/MR, embodied UI, and device ecosystems that track pose in physical space. For splat workflows it matters because capture devices and headsets define how reconstructions are gathered and consumed. Interop often routes through OpenXR or WebXR plus game-engine runtimes.
↓ Prerequisites
↑ Go deeper
Related engines
Also known as:rasterisation
In plain wordsRasterization is the assembly-line step that stamps triangles—or splats—onto a pixel grid: geometry becomes screen-space coverage, then shaders pick colors. Real-time engines lean on it because it is predictable and fast compared with brute-force light simulation.
DetailsRasterization determines which pixels a primitive covers and shades those fragments. It powers most real-time graphics APIs. Differentiable splat rasterizers extend the same idea: sort Gaussians per tile, alpha-blend, and backpropagate screen-space error—still a rasterization mindset, not path tracing.
↓ Prerequisites
↑ Go deeper
Related engines
In plain wordsGradient descent is blind hiking: each step walks opposite the slope of the loss landscape. For splat training, that loss compares rendered pixels to photos; autograd tells every Gaussian which way to nudge position, color, or covariance.
DetailsIterative first-order optimization updates parameters by stepping along negative gradients of a scalar loss. In 3DGS pipelines the loss blends photometric terms (e.g., L1 plus D-SSIM) and interacts with densification schedules. Stochastic minibatches and GPU kernels make large-scale updates practical.
↓ Prerequisites
↑ Go deeper
Related tools
Related engines
In plain wordsCOLMAP is the open-source detective for unordered photo sets: it runs structure-from-motion to recover cameras, then multi-view stereo to grow dense geometry—most splat pipelines start by exporting its poses and sparse points.
DetailsCOLMAP bundles established CVPR/ECCV algorithms behind a unified CLI and GUI. Typical splat workflows import images.bin/points3D.bin or textured meshes as initialization. Global SfM variants (e.g., GLOMAP) require extra citations. Licensing is BSD. Version 3.12 added native sensor-rig support, latitude/longitude to UTM conversion, and new pose files, improving panoramic, multi-camera, and drone reconstruction workflows.
↓ Prerequisites
↑ Go deeper
Related tools
Also known as:nerfstudio gsplat
In plain wordsThe gsplat project is a CUDA-accelerated Gaussian rasterizer with Python bindings: same splat idea as the Inria reference, but tuned for memory, speed, and extra research features when hooked into PyTorch training loops.
DetailsMaintainers publish evaluation scripts matching PSNR/SSIM/LPIPS and converged Gaussian counts against the official trainer while advertising lower VRAM and faster runtime. Developers integrate it inside Nerfstudio-style stacks or standalone COLMAP examples. Not to be confused with the gsplat.js viewer.
↓ Prerequisites
↑ Go deeper
Related tools
Key papers
In plain wordsThe training loss scores how far splat renders drift from captured photos; optimizers push Gaussians to lower that score. The canonical 3DGS recipe blends an L1 photometric term with a D-SSIM structural term, weighted by a λ you pick once and reuse.
DetailsKerbl et al. combine L1 and D-SSIM (with λ=0.2) to encourage pixel accuracy and structural agreement. Other projects append regularizers for anti-aliasing, opacity control, or mesh-friendly alignment. Always relate scalar loss trends to perceptual metrics and qualitative renders.
↓ Prerequisites
↑ Go deeper
Related tools
Related engines
Key papers
Also known as:surface extraction
In plain wordsMesh extraction turns unstructured splats or points into a triangle soup you can boolean, rig, or ship to game engines. For Gaussian splats, researchers often sample implicit surfaces or aligned Gaussians, then run Poisson reconstruction instead of marching through noisy densities.
DetailsThe task spans Poisson reconstruction, marching cubes on SDFs, and splat-specific pipelines such as SuGaR's level-set sampling. Triangle meshes remain the interchange currency for DCC tools even when rendering stays on splats. Radiance Fields' overview also names 2DGS, RaDe-GS, Gaussian Frosting, and Texture-GS as active routes for pulling editable surfaces from Gaussian or radiance-field representations.
↓ Prerequisites
↑ Go deeper
Related tools
Related engines
Key papers
Also known as:PLY / Polygon File Format / Stanford Triangle Format
In plain wordsPLY is a text-or-binary recipe card: a header lists how many vertices exist and which properties (x, y, z, normals, SH bands…) follow, then the data rows stream underneath—exactly why splat trainers love it for rich Gaussian attributes.
DetailsThe Stanford PLY spec couples an element/property header with vertex or face records. Gaussian splat exports piggyback on that extensibility for SH coefficients and covariance data. Always verify endianness and property naming when swapping between trainers and viewers.
Introduced
1994 · Greg Turk; Stanford University
"PLY is a computer file format known as the Polygon File Format or the Stanford Triangle Format."[src]
↓ Prerequisites
↑ Go deeper
Related tools
Related engines
Also known as:splat file
In plain wordsA .splat file packs Gaussian parameters into a tight binary layout tuned for fast loading in web viewers—think shipping foam peanuts in a labeled crate instead of a verbose text manifest. Vendors differ, so always verify headers when hopping between exporters.
DetailsBinary splat containers strip textual overhead from PLY while preserving the data needed for real-time rasterizers. SuperSplat and other editors import/export these blobs for publishing. Interoperability demands checking SH ordering, endianness, and optional compression flags per tool. OctaneRender 2026's PLY/SPZ support and its 2027 roadmap for exporting CG scenes into splats show Gaussian formats expanding from web viewers into offline rendering and DCC pipelines.
↓ Prerequisites
↑ Go deeper
Related tools
Related engines
Also known as:KSPLAT / .ksplat
In plain wordsThe .ksplat container is Mark Kellogg's trimmed, compressed splat package for the GaussianSplats3D viewer—great for faster loads, but it is a project-specific binary rather than a formal Khronos-style spec, so treat interoperability as "try it and see."
DetailsKSplatLoader APIs export converted buffers for download. Because the format evolves, shipping cross-engine assets often stays on PLY/SPZ while using .ksplat inside a known web stack.
↓ Prerequisites
↑ Go deeper
Related tools
Related engines
Also known as:OpenUSD ParticleField
In plain wordsUSD's ParticleField schema is the paperwork that tells a renderer how to interpret volumetric particles: positions, kernels, radiance, optional scales and orientations. Derived types such as ParticleField3DGaussianSplat let pipelines store splats as first-class USD prims instead of ad-hoc blobs.
DetailsParticleField generalizes multiple field types, including 3D Gaussian splats. Tooling converts trained PLY assets into USD stages referencing these prims so RTX can path-trace them alongside meshes. Custom exporters must inherit the schema expectations for validation.
↓ Prerequisites
↑ Go deeper
Related engines
Also known as:PSNR
In plain wordsPSNR compares the max possible signal power to the noise energy between two images, expressed in decibels—higher usually means closer pixels, but it can disagree with human perception when blur or structure breaks.
DetailsPSNR rewards pixel-wise fidelity. Teams report it alongside SSIM and LPIPS because high PSNR does not guarantee perceptual quality. Use the same train/test splits and bit depth when comparing numbers across papers.
↓ Prerequisites
↑ Go deeper
Related tools
Also known as:SSIM / structural similarity index
In plain wordsSSIM scores how well luminance, contrast, and structure match between patches—closer to human annoyance with blur than raw MSE, though it still misses every failure mode.
DetailsSSIM generalizes Wang–Bovik's framework; differentiable variants plug into splat optimization. Teams usually log SSIM alongside PSNR and LPIPS on held-out views.
↓ Prerequisites
↑ Go deeper
Related tools
Also known as:LPIPS / perceptual metric LPIPS
In plain wordsLPIPS feeds paired images through a pretrained CNN, compares deep features (optionally calibrated), and returns a distance where higher means more different—useful when PSNR looks fine but pictures still feel wrong.
DetailsThe reference implementation exposes AlexNet/VGG/SqueezeNet variants. Gaussian splat papers cite LPIPS↓ on validation views. Unlike PSNR, it rewards semantic similarity but costs more compute.
↓ Prerequisites
↑ Go deeper
Related tools
Key papers
Also known as:FPS / frame rate
In plain wordsFPS counts how many full images a pipeline produces each second—either realtime splat rendering in a browser or the throughput numbers quoted beside PSNR tables. Higher feels smoother, but always ask which stage was timed.
DetailsBenchmarks should specify resolution, MSAA settings, and whether WebGPU/WebGL paths were used. Academic tables often list FPS next to memory footprint to compare explicit radiance methods.
↓ Prerequisites
↑ Go deeper
Related engines
Key papers
Also known as:number of Gaussians
In plain wordsSplat count tracks how many Gaussian primitives survive optimization—think counting marbles in a jar. More splats can capture detail but inflate VRAM, storage, and sort cost; pruning and compression fight the same number.
DetailsDensification schedules grow or split splats when view-space gradients spike; opacity pruning removes low-alpha Gaussians. gsplat reproduces official Gaussian counts when benchmarking. SPZ/KSplat formats also hinge on how many primitives remain. The Infinite Realities report describes about 1,800 trained PLY files and 300K input images for a 30-second dynamic clip, showing that 4D capture must evaluate splat count together with time sequence and storage budget.
↓ Prerequisites
↑ Go deeper
Related tools
Related engines
Key papers