印刻万物 TOP3DGS印刻万物TOP3DGS

Learn Hub

Paths, tools, and glossary — one place to ramp up on 3DGS.

Four structured paths from first principles to spatial narrative experiments, followed by a tools index (capture → training → editing → viewing → publishing) and an in-site glossary.

Learning Paths

Four paths, one map

Tools Index

Tools mapped along the 3DGS production pipeline

Organized along the five-stage pipeline (capture → training → editing → viewing → publishing) with ratings, ramp-up cost, and public sources.

Extended notes

Short reads aligned to paths

Forked from legacy links, rewritten for on-site reading.

Further Reading · Research Milestones

Papers & tools that shaped the 3DGS paradigm

Want to read the source? Here are the key papers and open-source tools defining today's Gaussian Splatting engineering.

2023Bernhard Kerbl & Georgios Kopanas

3D Gaussian Splatting for Real-Time Radiance Field Rendering

The foundational paper for 3DGS, breaking the barrier between high-quality view synthesis and real-time rendering.

View
2023Antoine Guédon

SuGaR: Surface-Aligned Gaussian Splatting

Bridges the gap between Gaussian point clouds and traditional polygon mesh pipelines via surface-alignment regularization.

View
2023PlayCanvas Team

SuperSplat: Open-Source Web Gaussian Editor

A high-performance 3DGS viewer and editor running entirely in the browser, drastically lowering the hardware barrier for spatial data cleanup.

View
2023Zehao Yu

Mip-Splatting: Anti-Aliased 3D Gaussian Splatting

Resolves intense high-frequency aliasing and dilation artifacts when changing viewing distances by introducing a 2D low-pass filter.

View
2024Binbin Huang

2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Flattens volumetric Gaussians into surface-aligned 2D disks with perspective-correct splatting and depth or normal cues for cleaner geometry.

View
2024Guanjun Wu

4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Models dynamic scenes with 4D neural voxels and a deformation field over a single canonical Gaussian set for real-time training and storage efficiency.

View
2024Joo Chan Lee

Compact 3D Gaussian Representation for Radiance Field

Combines learned masking, grid-based view-dependent color, and codebook-quantized geometry to shrink Gaussian count and attribute storage while preserving quality.

View
2024Ziyi Yang

Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction

Optimizes 3D Gaussians in canonical space with a deformation field for monocular dynamics and uses annealed smoothing to reduce temporal jitter from pose noise.

View
2024Jiaxiang Tang

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

Combines 3D Gaussians with score distillation and mesh or UV refinement to cut per-asset 3D generation time sharply.

View
2024Liangxiao Hu

GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians

Models humans with animatable 3D Gaussians, learns pose-dependent appearance, and jointly refines motion and look for realistic monocular avatars.

View
2024Hidenobu Matsuki

Gaussian Splatting SLAM

Uses Gaussians as the sole map representation for monocular SLAM, jointly optimizing poses and primitives for incremental reconstruction and NVS.

View
2024Muhammed Kocabas

HUGS: Human Gaussian Splats

Jointly learns static scene and animatable human Gaussians from a short monocular clip, optimizing skinning weights to reduce articulation artifacts at high FPS.

View
2024Minghan Qin

LangSplat: 3D Language Gaussian Splatting

Stores distilled language features on Gaussians and splats them for open-vocabulary 3D grounding, avoiding costly NeRF volume rendering.

View
2024Tao Lu

Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering

Organizes neural Gaussians on anchors and predicts attributes from the viewpoint to cut redundancy and improve generalization on complex scenes.

View

Glossary

A Map of Gaussian Splatting Concepts

52 cross-linked terms layered for beginners, practitioners, and researchers—same entry, your choice of depth.

View

54 entries

TechniqueBeginner

3D Gaussian Splatting (3DGS)

Also known asGaussian Splatting / 3DGS

A revolutionary 3D scene representation and real-time rendering technique. It explicitly models environments using millions of anisotropic Gaussian ellipsoids, each parameterized with covariance, opacity, and spherical harmonics. Coupled with a highly efficient differentiable rasterization pipeline, it achieves photorealistic novel view synthesis at exceptional frame rates.

Primarily utilized during high-fidelity 3D reconstruction and real-time rendering phases, serving as a fundamental pillar for modern spatial computing.

#Training#Real-time#Paper

Related

Further reading

ConceptBeginner

Splat

Also known asGaussian / Splatting Primitive

The fundamental 3D primitive comprising a 3DGS scene. It acts conceptually like a semi-transparent droplet of paint suspended in 3D space. Shaped as an anisotropic ellipsoid, its color and opacity dynamically shift depending on the viewing angle to simulate realistic lighting.

Creators and algorithms continuously add, remove, or modify these primitives during the optimization phase to closely approximate real-world environments.

#Optimization#Viewing

Related

TechniqueIntermediate

Structure-from-Motion (SfM)

Also known asSfM

A computer vision technique that automatically estimates camera poses and extracts a sparse 3D point cloud from a sequence of 2D images. In the 3DGS workflow, SfM provides the crucial initial camera parameters and base structure required for subsequent initialization.

Deployed during the pre-processing stage immediately after data capture. Calculations are typically automated by open-source libraries like COLMAP.

#Capture#Optimization

Related

Further reading

TechniqueIntermediate

Multi-View Stereo (MVS)

Also known asMVS

A 3D reconstruction algorithm operating downstream of SfM. It utilizes stereo matching techniques on image sets with known camera parameters to generate dense point clouds. While traditional pipelines rely on MVS for mesh extraction, 3DGS efficiently bypasses this expensive step.

The core step in traditional mesh modeling pipelines, though standard Gaussian workflows typically omit it or use it solely for benchmarking purposes.

#Training#Case Study

Related

Further reading

TechniqueBeginner

Neural Radiance Fields (NeRF)

Also known asNeRF

A technique utilizing Multilayer Perceptrons (MLPs) to implicitly represent continuous 3D scenes. Given a viewing direction and spatial coordinates, the neural network outputs color and volume density. NeRF catalyzed the novel view synthesis revolution, acting as the direct predecessor to 3DGS. Radiance Fields' overview ties the original 2020 Berkeley work to later acceleration projects such as Instant-NGP, which shortened practical training times for many scenes.

Frequently utilized by researchers as a baseline metric. Due to sluggish rendering speeds, it is progressively being replaced by 3DGS in real-time interactive scenarios.

#Paper#Real-time

Related

Further reading

FormatBeginner

Point Cloud

Also known asPoints

A collection of discrete data points in three-dimensional space, typically containing coordinate geometry alongside attributes like color or reflectance. Within 3DGS, the sparse point cloud generated by SfM serves as the initial seeding ground for Gaussian ellipsoids.

Present throughout the initial stages of the spatial computing pipeline. It is also the direct output format for various LiDAR sensors and scanning devices.

#Capture#Cultural Heritage

Related

Further reading

FormatBeginner

Polygon Mesh

Also known asMesh / 网格

The dominant model representation in traditional 3D engines, composed of vertices, edges, and polygonal faces (typically triangles) forming a continuous surface shell. Unlike the volumetric and scattered nature of 3DGS, meshes demand strict surface topology, struggling with hair or translucency.

The native language of mainstream game engines and modeling software like Blender or Unreal. Seamlessly bridging meshes with Gaussian splats remains a critical industry objective.

#Viewing#Editing

Related

Further reading

ConceptExpertPending verification

Covariance Matrix

Also known asCovariance / Σ

The mathematical expression controlling the spatial morphology of an individual Gaussian primitive. In 3DGS context, it dictates the scaling and rotation of the ellipsoid across X, Y, and Z axes, allowing it to conform to shapes ranging from minute dust to flat walls.

Operates within the foundational gradient calculations of model training, meticulously ensuring that ellipsoids maintain mathematical validity (positive semi-definite) during transformations.

#Optimization#Training

Related

MetricBeginnerPending verification

Opacity (Alpha)

Also known asAlpha / 透明度

A scalar value defining the physical solidness of a Gaussian primitive, ranging from 0 (completely transparent) to 1 (fully opaque). During rasterization, colors from overlapping splats are blended based on their alpha values. Superfluous, highly transparent splats are pruned out.

Determines the visual clarity of the scene and serves as the primary metric for the algorithm to auto-regulate primitive count by pruning redundant volumes.

#Optimization#Viewing

Related

TechniqueIntermediate

Spherical Harmonics (SH)

Also known asSH

A set of orthogonal basis functions defined on the surface of a sphere, utilized to faithfully approximate view-dependent color variations. In 3DGS, each splat holds SH coefficients rather than a single color, enabling realistic specular highlights based on the observer's angle.

Responsible for high-fidelity lighting and reflections. Typically divided into degrees; higher degrees yield richer chromatic detail but significantly increase memory consumption.

#Optimization#Real-time

Related

WorkflowIntermediatePending verification

Densification

Also known as克隆与分裂 / Cloning and Splitting

The 'growth' phase of the Adaptive Density Control strategy. When the model detects missing details or oversized splats with high variance during training, densification triggers. It clones existing splats to fill empty spaces or splits massive ones, thereby dynamically enhancing geometric resolution.

Occurs strictly within the core training loop. It acts as the critical automated replication mechanism granting 3DGS its extraordinary visual clarity.

#Training#Optimization

Related

TechniqueExpert

3D Gaussian Splatting as Markov Chain Monte Carlo (3DGS-MCMC)

Also known as3DGS as MCMC / Gaussian Splatting MCMC

In plain wordsReframes splat optimization as stochastic search over discrete Gaussian sets via MCMC-style proposals and accept/reject moves, with explicit noise learning rates plus scale and opacity regularizers atop the Inria reference trainer.

DetailsNeurIPS 2024 Spotlight work by UBC Vision reframes 3DGS optimization as Markov Chain Monte Carlo to stabilize adaptive density control. The public codebase extends the Inria reference trainer with CLI flags such as cap_max, scale_reg, opacity_reg, noise_lr, and init_type while inheriting the bundled non-commercial license and Ubuntu 20.04 test matrix.

Introduced

2024 · Shakiba Kheradmand et al. (UBC Vision)

"3D Gaussian Splatting as Markov Chain Monte Carlo"[src]
#Training#Optimization#Paper

Prerequisites

Go deeper

Related

WorkflowIntermediatePending verification

Pruning

Also known as裁剪 / Culling

The 'culling' phase of the Adaptive Density Control strategy. To prevent infinite primitive inflation from overwhelming VRAM, the algorithm periodically inspects the scene during training. It forcibly deletes invisible splats with ultra-low opacity or overly massive, ineffective ones, ensuring model efficiency.

Working symbiotically with densification throughout the training timeline, it serves as the safeguard mechanism for controlling final file sizes and rendering frame rates.

#Training#Optimization

Related

TechniqueExpertPending verification

Adaptive Density Control (ADC)

Also known asADC

The macro-regulatory algorithm overseeing both densification and pruning behaviors. By calculating the average magnitude of positional gradients, it diagnoses which 3D spatial regions are under-reconstructed or over-reconstructed, dynamically managing splat distribution to intelligently resolve scene clarity without human intervention.

Functions as the brain of the 3DGS training engine, directly dictating the evolutionary speed and qualitative leap from sparse point clouds to perfect scenes.

#Training#Optimization

Related

TechniqueExpert

Differentiable Rasterization

Also known asTile-based Rasterizer

The secret weapon behind 3DGS's blistering rendering speeds. It partitions the screen into 16×16 tiles, rapidly sorting and culling visible splats. Because this rasterization process is mathematically differentiable, pixel-level errors effortlessly propagate backward, instructing underlying splat properties on how to correct themselves.

The bridge connecting forward visual rendering with backward gradient optimization, leveraging traditional computer graphics ingenuity to shatter AI computational bottlenecks.

#Real-time#Training

Related

Further reading

ConceptExpertPending verification

Anisotropy

Also known asAnisotropic

The property of exhibiting different values when measured in different directions. In 3DGS, primitives are not perfect spheres; they are anisotropic ellipsoids capable of independent scaling across X, Y, and Z axes. This allows them to flatten drastically to simulate complex structural surfaces efficiently.

Represents the massive leap in representational efficiency for 3DGS when compared against traditional voxels or isotropic point cloud models.

#Optimization#Paper

Related

ConceptIntermediate

Anchor Point

Also known asNeural Anchor

Feature mounting nodes introduced in advanced models like Scaffold-GS. Instead of isolating millions of splats, the system deploys sparse anchors via voxel grids. Each anchor controls attached local splats through a neural network, creating a hierarchical structure that dramatically enhances noise resistance.

An advanced network architecture tailored for complex indoor/outdoor scenes, effectively addressing the artifact flaws foundational 3DGS exhibits during abrupt viewing angle shifts.

#Paper#Indoor#Outdoor

Related

Further reading

TechniqueIntermediate

4D Gaussian Splatting (4DGS)

Also known asDynamic GS / 4DGS

A technological branch integrating the temporal axis (T) into 3D Gaussians. It typically employs deformation field neural networks to capture the motion trajectories, deformations, and life cycles of splats over time. This enables ultra-high framerate reconstruction and rendering of dynamic real-world scenes.

The next-generation technological solution for spatial video, holographic performances, and motion capture, currently undergoing intense industrial research and development.

#Spatial Media#Paper

Related

Further reading

TechniqueExpert

Mip-Splatting

Also known asMip-Splat

An improved algorithm designed to eliminate high-frequency flickering and artifacts produced by 3DGS during camera zooming or movement. By introducing low-pass filters in both 2D and 3D dimensions, it constrains frequency sampling rates, ensuring reconstructions remain perfectly smooth and anti-aliased at any scale.

Massively elevates the visual experience during free-roaming on Web platforms. It is a critical optimization required for deploying commercial-grade applications.

#Viewing#Optimization

Related

Further reading

TechniqueExpert

Scaffold-GS

An improved Gaussian model employing a structured hierarchical design. It abandons completely free-growing splat clusters, instead utilizing SfM point clouds to build voxel grids that spawn 'neural anchors'. These anchors dynamically predict local Gaussian attributes, reducing view-overfitting and massively compressing file size.

Frequently leveraged by developers and geeks for large-scale outdoor scene reconstructions or highly storage-sensitive mobile exhibition projects.

#Paper#Outdoor

Related

Further reading

TechniqueIntermediate

Compressed GS

Also known as压缩高斯

A lineage of compression algorithms (like C3DGS) engineered to mitigate the massive file size flaws (often hundreds of megabytes) of base 3DGS models. By synthesizing vector quantization, sensitivity-aware culling, and entropy coding, file sizes can be reduced 10× to 30× with negligible visual degradation.

The core pipeline defining whether content can be seamlessly distributed across web platforms, acting as a strict prerequisite for recommendations within communities.

#Publishing#Web Rendering

Related

ToolBeginner

SuperSplat

An open-source, web-based 3DGS editing and processing tool developed by the PlayCanvas team. It provides an intuitive browser interface, empowering creators to import ply files and execute post-processing tasks such as selecting, trimming, merging, and recoloring, filling a crucial gap in the editing pipeline.

The premier tool of choice for 3D artists and no-code creators looking to polish raw splat models and eliminate stray floaters.

#Editing#Web Rendering

Related

Further reading

ToolBeginner

Splat Viewer

Also known asSplat Web Viewer / Viewer

An application utilized on end-user devices (typically web browsers) to parse, load, and real-time rasterize Gaussian .ply or compressed formats. Premium viewers heavily leverage GPU acceleration while encapsulating camera roaming, anti-aliasing, and basic lighting interactions.

The direct portal through which end-users experience Gaussian content. The prosperity of this viewer ecosystem determines the dissemination breadth of 3DGS content.

#Viewing#Web Rendering

Related

ConceptBeginnerPending verification

Real-time Rendering

Also known asRTR

A computer graphics process that generates images rapidly enough (typically >30 FPS) to create a visually latency-free experience. 3DGS's explosive industrial impact stems entirely from its ability to maintain NeRF's offline-grade quality while unlocking blistering real-time rendering framerates at 1080P or 4K resolutions.

Distinguished from cinematic mesh raytracing requiring hours per frame, it serves as the foundational requirement for VR/AR interactions and game development.

#Real-time#Viewing

Related

TechniqueIntermediate

WebGL / WebGPU API

Also known asWeb API

Graphics APIs enabling web browsers to directly leverage underlying GPU hardware acceleration. While WebGL offers historical stability and broad compatibility, the next-gen WebGPU provides low-level control and robust Compute Shader capabilities. This makes executing massive sorting calculations for millions of splats on pure web clients entirely feasible.

The indispensable underlying infrastructure powering lightweight Gaussian spatial distribution applications, such as plugin-free online galleries and interactive H5 exhibitions.

#Web Rendering#Viewing

Related

Further reading

WorkflowExpert

Simultaneous Localization and Mapping (SLAM)

Also known asSLAM

A technique where a robot or device navigates an unknown environment, concurrently estimating its own location while constructing an environmental map. Recent studies like SplaTAM or MonoGS integrate 3DGS with SLAM, enabling phones or drones to scan and generate photorealistic Gaussian 3D maps dynamically while moving.

Massively streamlines the protracted 'Photography -> Offline SfM -> Training' pipeline, propelling Gaussian technology toward real-time mobile surveying.

#SLAM#Capture

Related

Further reading

ConceptBeginnerPending verification

3D Reconstruction

Also known asReconstruction / 逆向工程

The reverse-engineering process of recreating the 3D morphology and surface properties of real-world physical objects using multi-view photos, videos, or depth sensor data. As one of the most potent contemporary paradigms, 3DGS is rapidly elevating the field's precision ceiling via its exceptional material reproduction and speed.

Broadly utilized across digital cultural heritage scanning, medical aesthetics facial mapping, digital human creation, and spatial cognition for embodied AI.

#Capture#Cultural Heritage

Related

FormatIntermediatePending verification

Spatial Video

Also known asImmersive Video / Volumetric Video

A media format supporting Six Degrees of Freedom (6DoF) viewing. Viewers can not only watch dynamic footage but physically shift their heads during playback to observe object flanks from varying angles. Spatial video powered by 4DGS compression is heavily tipped as the premier native content format for upcoming headsets like Apple Vision Pro.

The ultimate evolutionary leap from traditional 2D footage to stereoscopic narrative, currently facing immense challenges regarding massive data storage and streaming delivery.

#Spatial Media#Viewing

Related

ConceptBeginnerPending verification

Digital Twin

Also known as数字镜像

The one-to-one high-fidelity mapping of real-world physical entities (such as buildings, factories, or gallery spaces) into a digital ecosystem. Because it exceptionally replicates authentic lighting and materials, 3DGS is recognized as the most cost-effective visual foundational technology for constructing next-generation visceral digital twins.

Frequently operating in tandem with IoT data streams, it acts as the ultimate business container for smart cities, virtual exhibition halls, and industrial simulations.

#Indoor#Outdoor

Related

TechniqueIntermediatePending verification

Voxelization

Also known asVoxel Grid

The process of converting continuous 3D space or models into discrete microscopic 3D grids (voxels, or 3D pixels). While 3DGS relies on scattered continuous primitives, many spatial clipping tools and underlying collision detection mechanisms temporarily 'voxelize' the Gaussian field to leverage traditional bounding box algorithms efficiently.

A highly common intermediate transition state encountered when editing Gaussian models, physically slicing scenes, or attempting fusion with legacy physical collision engines.

#Editing#Optimization

Related

MetricIntermediate

Evaluation Metrics (PSNR/SSIM/LPIPS)

Also known as评测标准 / Metrics

Scientific benchmarks used to quantitatively compare image generation quality between 3DGS and rival algorithms. PSNR evaluates pixel-level color distortion; SSIM focuses on the structural integrity of edge features; and LPIPS utilizes neural networks to extract deep features, aligning much closer with actual human visual perception.

The absolute currency within academic papers validating model superiority. It constitutes the critical experimental data charts developers must monitor during hyperparameter tuning.

#Paper#Optimization

Related

Further reading

ConceptBeginner

Photogrammetry

In plain wordsPicture a shoebox full of holiday snapshots of one building—photogrammetry is the workflow where those flat prints are lined up so software can recover real-world distances and spit out a solid model you can spin; splatting is only one newer branch of the same "photos → 3D" family.

DetailsPhotogrammetry stitches overlapping photographs into consistent 3D geometry by estimating camera poses and 3D tie points through least-squares adjustment. Structure-from-motion and multi-view stereo pipelines are modern instantiations: they output sparse or dense reconstructions that downstream tools may convert into meshes, point clouds, or splatted Gaussians. The discipline predates neural fields; its focus is metrology-grade correspondence and calibration rather than a particular differentiable renderer.

Prerequisites

Go deeper

TechniqueIntermediate

Floaters (informal 3DGS artifact)

Also known asstray Gaussians

In plain wordsAfter training, a splat scene can sprout tiny colored smudges that hang in mid-air like pollen—people in the field just call them "floaters." They often trace back to motion blur, bad coverage, or moving subjects, and editors such as SuperSplat exist mainly to lasso and delete them.

Details"Floaters" is informal jargon for stray Gaussian primitives that survive optimization away from true surfaces, producing visually floating specks. Papers may say "floating artifacts" instead. Mitigation spans interactive cleanup in tools, pruning heuristics, or training-time regularizers. Treat the word as practitioner slang, not a canonical keyword from a single paper title.

Prerequisites

Go deeper

Related tools

ConceptExpert

Path Tracing

Also known aspath tracing

In plain wordsThink of path tracing as sending a pinball of light through the scene: each sample bounces randomly until it hits a light, you average thousands of those stories per pixel, and the photo slowly converges. That Monte Carlo story differs from rasterization, which streams triangles through a fast pipeline but usually needs extra tricks for soft shadows or color bleeding.

DetailsPath tracing estimates the rendering equation by Monte Carlo sampling complete light paths from the camera, averaging many independent samples per pixel to reduce variance. Rasterization projects primitives and shades fragments in a single forward pass—fast, but indirect lighting needs approximations or separate passes. Path tracing trades cost for physical consistency (global illumination, complex materials). When Gaussian splats are represented in Omniverse RTX or OctaneRender 2026-style pipelines, documentation and release notes describe them participating in lighting, shadows, reflections, and refractions alongside meshes.

Introduced

1986-08 · James T. Kajiya

"We present an integral equation which generalizes a variety of known rendering algorithms."[src]

Prerequisites

Go deeper

Related engines

FormatIntermediate

SPZ format

Also known asSPZ / Scaniverse PLY zipped

In plain wordsTreat SPZ like vacuum-sealing a closet full of coats: it is Niantic's packed format for Gaussian splats, often shrinking the same scene to a fraction of the PLY size (marketing copy cites ~90% savings) so mobile capture and web delivery hurt less. The bitstream is open source on GitHub.

DetailsThe reference library describes .spz as gzip-compressed streams with a fixed header followed by Gaussian attributes (positions, alphas, colors, scales, rotations, spherical harmonics). Quantization knobs trade size for fidelity. Interop demands explicit coordinate-frame metadata because SPZ stores data in an RUB system aligned with OpenGL/three.js, unlike many PLY files. Engines such as Babylon.js have added loaders, while Scaniverse remains the flagship consumer of the format.

Introduced

Niantic

".spz is a file format for compressed 3D gaussian splats."[src]

Prerequisites

Go deeper

Related tools

TechniqueExpert

SuGaR (Surface-Aligned Gaussian Splatting)

Also known asSuGaR / Surface-Aligned Gaussian Splatting

In plain wordsSuGaR behaves like nudging a bowl of loose marbles—each Gaussian—until they hug the true surfaces; once they stick, Poisson reconstruction can spit out an editable triangle mesh in minutes, so artists can sculpt or relight through a mesh instead of hand-waving millions of splats.

DetailsSuGaR adds a regularizer that aligns Gaussians with the underlying surface, then samples the level set efficiently and runs Poisson surface reconstruction to obtain meshes without relying on marching cubes over a near-zero density field. An optional binding stage co-optimizes mesh and Gaussians for high-quality splat rendering while enabling mesh-based editing. The method targets fast mesh extraction from splats rather than real-time training.

Introduced

2023-11 · Antoine Guédon, Vincent Lepetit

"We propose a method to allow precise and extremely fast mesh extraction from 3D Gaussian Splatting."[src]

Prerequisites

ConceptBeginner

Novel View Synthesis

Also known asview synthesis

In plain wordsImagine you only have vacation photos of a statue from the left and right—novel view synthesis is the trick of asking the computer to snap a picture from a viewpoint no camera ever held, while keeping lighting and geometry believable.

DetailsNovel view synthesis generates images from camera poses that were not in the training set. Image-based rendering, light-field methods, NeRF-style neural fields, and explicit Gaussian splats all target the same user-facing goal with different internal representations. Metrics such as PSNR, SSIM, and LPIPS quantify faithfulness to held-out captures.

Prerequisites

Go deeper

Related tools

Related engines

ConceptIntermediate

Radiance Field

In plain wordsThink of a radiance field as a giant lookup table: at every 3D spot and every outward direction it tells you how bright light should be. Neural nets can approximate that table (NeRF); Gaussian splats approximate it with millions of tinted ellipsoids you can render fast.

DetailsRadiance measures directional light intensity at a point in space. A radiance field assigns such values across positions and directions. NeRF parameterizes the field with networks; 3DGS approximates it with explicit Gaussians and differentiable splatting. Practitioners often say "radiance field" when they mean either the continuous function or its discrete surrogate.

Introduced

2020-08 · Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, Ren Ng

"Our algorithm represents a scene using a fully-connected (non-convolutional) deep network, whose input is a single continuous 5D coordinate (spatial location (x,y,z) and viewing direction (theta,phi)) and whose output is the volume density and view-dependent emitted radiance at that spatial location."[src]

Prerequisites

Go deeper

Related engines

ConceptBeginner

Spatial Computing

In plain wordsSpatial computing is the shift from peering into flat screens to letting computers reason about real rooms, hands, and headsets: sensors map the world around you, then graphics and audio glue themselves to that map instead of living only inside a monitor bezel.

DetailsIndustry usage bundles AR/VR/MR, embodied UI, and device ecosystems that track pose in physical space. For splat workflows it matters because capture devices and headsets define how reconstructions are gathered and consumed. Interop often routes through OpenXR or WebXR plus game-engine runtimes.

Prerequisites

Go deeper

ConceptIntermediate

Rasterization

Also known asrasterisation

In plain wordsRasterization is the assembly-line step that stamps triangles—or splats—onto a pixel grid: geometry becomes screen-space coverage, then shaders pick colors. Real-time engines lean on it because it is predictable and fast compared with brute-force light simulation.

DetailsRasterization determines which pixels a primitive covers and shades those fragments. It powers most real-time graphics APIs. Differentiable splat rasterizers extend the same idea: sort Gaussians per tile, alpha-blend, and backpropagate screen-space error—still a rasterization mindset, not path tracing.

Prerequisites

Go deeper

Related engines

ConceptIntermediate

Gradient Descent

In plain wordsGradient descent is blind hiking: each step walks opposite the slope of the loss landscape. For splat training, that loss compares rendered pixels to photos; autograd tells every Gaussian which way to nudge position, color, or covariance.

DetailsIterative first-order optimization updates parameters by stepping along negative gradients of a scalar loss. In 3DGS pipelines the loss blends photometric terms (e.g., L1 plus D-SSIM) and interacts with densification schedules. Stochastic minibatches and GPU kernels make large-scale updates practical.

Prerequisites

Go deeper

Related engines

WorkflowIntermediate

COLMAP

In plain wordsCOLMAP is the open-source detective for unordered photo sets: it runs structure-from-motion to recover cameras, then multi-view stereo to grow dense geometry—most splat pipelines start by exporting its poses and sparse points.

DetailsCOLMAP bundles established CVPR/ECCV algorithms behind a unified CLI and GUI. Typical splat workflows import images.bin/points3D.bin or textured meshes as initialization. Global SfM variants (e.g., GLOMAP) require extra citations. Licensing is BSD. Version 3.12 added native sensor-rig support, latitude/longitude to UTM conversion, and new pose files, improving panoramic, multi-camera, and drone reconstruction workflows.

Prerequisites

Go deeper

WorkflowIntermediate

gsplat library (CUDA splat utilities)

Also known asnerfstudio gsplat

In plain wordsThe gsplat project is a CUDA-accelerated Gaussian rasterizer with Python bindings: same splat idea as the Inria reference, but tuned for memory, speed, and extra research features when hooked into PyTorch training loops.

DetailsMaintainers publish evaluation scripts matching PSNR/SSIM/LPIPS and converged Gaussian counts against the official trainer while advertising lower VRAM and faster runtime. Developers integrate it inside Nerfstudio-style stacks or standalone COLMAP examples. Not to be confused with the gsplat.js viewer.

Prerequisites

Go deeper

WorkflowIntermediate

Training loss (3D Gaussian Splatting)

In plain wordsThe training loss scores how far splat renders drift from captured photos; optimizers push Gaussians to lower that score. The canonical 3DGS recipe blends an L1 photometric term with a D-SSIM structural term, weighted by a λ you pick once and reuse.

DetailsKerbl et al. combine L1 and D-SSIM (with λ=0.2) to encourage pixel accuracy and structural agreement. Other projects append regularizers for anti-aliasing, opacity control, or mesh-friendly alignment. Always relate scalar loss trends to perceptual metrics and qualitative renders.

Prerequisites

Go deeper

Related engines

WorkflowIntermediate

Mesh Extraction

Also known assurface extraction

In plain wordsMesh extraction turns unstructured splats or points into a triangle soup you can boolean, rig, or ship to game engines. For Gaussian splats, researchers often sample implicit surfaces or aligned Gaussians, then run Poisson reconstruction instead of marching through noisy densities.

DetailsThe task spans Poisson reconstruction, marching cubes on SDFs, and splat-specific pipelines such as SuGaR's level-set sampling. Triangle meshes remain the interchange currency for DCC tools even when rendering stays on splats. Radiance Fields' overview also names 2DGS, RaDe-GS, Gaussian Frosting, and Texture-GS as active routes for pulling editable surfaces from Gaussian or radiance-field representations.

Prerequisites

Go deeper

Related tools

Related engines

FormatBeginner

PLY file format

Also known asPLY / Polygon File Format / Stanford Triangle Format

In plain wordsPLY is a text-or-binary recipe card: a header lists how many vertices exist and which properties (x, y, z, normals, SH bands…) follow, then the data rows stream underneath—exactly why splat trainers love it for rich Gaussian attributes.

DetailsThe Stanford PLY spec couples an element/property header with vertex or face records. Gaussian splat exports piggyback on that extensibility for SH coefficients and covariance data. Always verify endianness and property naming when swapping between trainers and viewers.

Introduced

1994 · Greg Turk; Stanford University

"PLY is a computer file format known as the Polygon File Format or the Stanford Triangle Format."[src]

Prerequisites

Go deeper

Related engines

FormatIntermediate

.splat binary format

Also known assplat file

In plain wordsA .splat file packs Gaussian parameters into a tight binary layout tuned for fast loading in web viewers—think shipping foam peanuts in a labeled crate instead of a verbose text manifest. Vendors differ, so always verify headers when hopping between exporters.

DetailsBinary splat containers strip textual overhead from PLY while preserving the data needed for real-time rasterizers. SuperSplat and other editors import/export these blobs for publishing. Interoperability demands checking SH ordering, endianness, and optional compression flags per tool. OctaneRender 2026's PLY/SPZ support and its 2027 roadmap for exporting CG scenes into splats show Gaussian formats expanding from web viewers into offline rendering and DCC pipelines.

Prerequisites

Go deeper

Related engines

FormatIntermediate

KSPLAT (.ksplat) format

Also known asKSPLAT / .ksplat

In plain wordsThe .ksplat container is Mark Kellogg's trimmed, compressed splat package for the GaussianSplats3D viewer—great for faster loads, but it is a project-specific binary rather than a formal Khronos-style spec, so treat interoperability as "try it and see."

DetailsKSplatLoader APIs export converted buffers for download. Because the format evolves, shipping cross-engine assets often stays on PLY/SPZ while using .ksplat inside a known web stack.

Prerequisites

Go deeper

Related engines

FormatExpert

USD ParticleField (incl. 3D Gaussian Splats)

Also known asOpenUSD ParticleField

In plain wordsUSD's ParticleField schema is the paperwork that tells a renderer how to interpret volumetric particles: positions, kernels, radiance, optional scales and orientations. Derived types such as ParticleField3DGaussianSplat let pipelines store splats as first-class USD prims instead of ad-hoc blobs.

DetailsParticleField generalizes multiple field types, including 3D Gaussian splats. Tooling converts trained PLY assets into USD stages referencing these prims so RTX can path-trace them alongside meshes. Custom exporters must inherit the schema expectations for validation.

Prerequisites

Go deeper

Related engines

MetricBeginner

Peak Signal-to-Noise Ratio (PSNR)

Also known asPSNR

In plain wordsPSNR compares the max possible signal power to the noise energy between two images, expressed in decibels—higher usually means closer pixels, but it can disagree with human perception when blur or structure breaks.

DetailsPSNR rewards pixel-wise fidelity. Teams report it alongside SSIM and LPIPS because high PSNR does not guarantee perceptual quality. Use the same train/test splits and bit depth when comparing numbers across papers.

Prerequisites

Go deeper

Related tools

MetricBeginner

Structural Similarity Index Measure (SSIM)

Also known asSSIM / structural similarity index

In plain wordsSSIM scores how well luminance, contrast, and structure match between patches—closer to human annoyance with blur than raw MSE, though it still misses every failure mode.

DetailsSSIM generalizes Wang–Bovik's framework; differentiable variants plug into splat optimization. Teams usually log SSIM alongside PSNR and LPIPS on held-out views.

Prerequisites

Go deeper

Related tools

MetricIntermediate

Learned Perceptual Image Patch Similarity (LPIPS)

Also known asLPIPS / perceptual metric LPIPS

In plain wordsLPIPS feeds paired images through a pretrained CNN, compares deep features (optionally calibrated), and returns a distance where higher means more different—useful when PSNR looks fine but pictures still feel wrong.

DetailsThe reference implementation exposes AlexNet/VGG/SqueezeNet variants. Gaussian splat papers cite LPIPS↓ on validation views. Unlike PSNR, it rewards semantic similarity but costs more compute.

Prerequisites

Go deeper

Related tools

MetricBeginner

Frames Per Second (FPS)

Also known asFPS / frame rate

In plain wordsFPS counts how many full images a pipeline produces each second—either realtime splat rendering in a browser or the throughput numbers quoted beside PSNR tables. Higher feels smoother, but always ask which stage was timed.

DetailsBenchmarks should specify resolution, MSAA settings, and whether WebGPU/WebGL paths were used. Academic tables often list FPS next to memory footprint to compare explicit radiance methods.

Prerequisites

Go deeper

Related engines

MetricIntermediate

Gaussian / splat count

Also known asnumber of Gaussians

In plain wordsSplat count tracks how many Gaussian primitives survive optimization—think counting marbles in a jar. More splats can capture detail but inflate VRAM, storage, and sort cost; pruning and compression fight the same number.

DetailsDensification schedules grow or split splats when view-space gradients spike; opacity pruning removes low-alpha Gaussians. gsplat reproduces official Gaussian counts when benchmarking. SPZ/KSplat formats also hinge on how many primitives remain. The Infinite Realities report describes about 1,800 trained PLY files and 300K input images for a 30-second dynamic clip, showing that 4D capture must evaluate splat count together with time sequence and storage budget.

Prerequisites

Go deeper

Related tools

Related engines