←

Read-only Review: 9.6

FS_3DGS_MED (Study on 3D Gaussian splats)

Meeting: TSGS4_135_India
Generated: 2026-04-07 09:47:54

Show columns:

TDoc Number	Title	Source	Summary	Proposals	Comments
S4-260088 (pdf)	[FS_3DGS_MED] pCR on 3D tiles, LOD and 3DGS delivery format requirements	Samsung Electronics Iberia SA	Summary of 3GPP Change Request S4-260088 Document Information Source: Samsung Electronics Co., Ltd. Title: pCR on 3D tiles, LOD and 3DGS delivery format requirements Specification: 3GPP TR 26.958 v0.1.1 (FS_3DGS_MED) Purpose: Agreement on text additions and modifications Overview This change request proposes comprehensive updates to TR 26.958 to address 3D Gaussian Splatting (3DGS) encapsulation and delivery format requirements, with particular focus on spatial random access and level of detail (LOD) mechanisms. The document introduces three main changes to improve clarity and technical accuracy. Main Technical Contributions 1. Terminology Updates (1st Change) New and Modified Definitions 3DGS tile (new definition): A spatial volume of the scene represented by a specific bounding volume, containing a set of 3D Gaussians for a given level of detail (LOD) Levels of detail (new definition): Multiple representations of a scene, each with a different set of data which represents different qualities of the scene for a compromise between visual detail and data size 3D tile (modified/removed): The previous generic definition ("discrete spatial partition of a massive geospatial dataset") is replaced with the more specific "3DGS tile" definition to better align with 3DGS-specific requirements Rationale: The original TR lacked an LOD definition and used a non-3DGS-specific definition for 3D tiles. The new terminology better reflects the technical requirements for 3DGS delivery. 2. Use Case Description Refinements (2nd Change) Updates to Clause 5.3 - Exploration of Large 3DGS Environment Terminology Harmonization: - Replaces generic "3D tiles" references with "3D Gaussians" in use case descriptions - Updates "3D tiles" to "3DGS tiles" in working assumptions with reference to new technical clause - Maintains LOD terminology as it is widely understood in 3D graphics Key Technical Aspects: - Adaptive delivery of 3D Gaussians at various LODs based on user pose and device capabilities - Selection process maintains constant number of displayed splats for quality consistency - Interactive delivery mechanism for 3D Gaussian sets at various detail levels - Constrained navigation within captured regions Working Assumptions Updates Compression and Packaging: - 3DGS tiles with different LODs serialized into delivery format - Signaling for spatial and LOD indices and dependencies - Editor's notes identify need for: - Workflow documentation for different uplink/downlink traffic profiles - Characterization of Gaussian parameters requiring signaling - Evaluation of existing 3GPP media delivery frameworks Transport and Delivery: - Interactive delivery with predictive prefetch - Edge-assisted content hosting for latency control - Buffering strategies to minimize latencies and visual artifacts Decoding and Rendering: - UE parses 3DGS tile indices and manages GPU residency - Real-time splat-based rendering with tile/LOD switching - Navigation constraints based on capture information and collision detection - Editor's note on expressing "allowed navigation volume" 3. New Clause on 3DGS Encapsulation and Delivery Formats (3rd Change) This represents the major technical contribution, introducing a comprehensive new clause (Clause X) covering: X.1 Introduction Core Concepts: - 3DGS scalability requires support for: 1. Position-based random access of 3D Gaussians 2. Delivery/rendering of different LODs Technical Relationship: - Spatial random access and LOD are non-orthogonal (same spatial volume can have different Gaussian sets for different LODs) - Viewing frustum (derived from user pose) determines required spatial volumes and LODs - Frustum includes: position, orientation, horizontal/vertical FoV, viewing distance - Mechanisms relevant to both rendering efficiency and delivery optimization X.2 Requirements Two Primary Requirements Identified: 3D Gaussian Set Identification: Method to identify 3DGS data into sets enabling association of 3D Gaussians at different LODs with spatial volumes Frustum-Based Access: Method to identify and access required 3D Gaussians at different LODs using user's frustum for efficient delivery, access, and rendering X.3 3DGS Tiles Technical Definition: - Method to associate spatial volumes with 3D Gaussians through 3DGS tiles - 3DGS tile = spatial volume with specific bounding volume + set of 3D Gaussians for given LOD Editor's Note: Indicates need for additional technical details X.4 Related Compression Aspects Compression Technology Requirements: LOD and Partial Delivery Support: Ability to support LOD and partial spatial data delivery without compression optimization dependency Optimized Compression: Real-time compression of 3DGS data optimized for LOD support and partial spatial delivery Technical Impact The changes establish a structured framework for: - Standardized terminology for 3DGS spatial organization - Clear requirements for encapsulation and delivery formats - Foundation for future work on compression, signaling, and delivery protocols - Alignment with interactive 6DoF streaming requirements from TR 26.928 The proposal moves from use-case-embedded technical concepts to a dedicated technical clause, providing clearer separation of concerns and enabling more detailed specification development.	Extracted Proposals Proposal 1: It is proposed to agree the following changes to 3GPP TR 26.958 v0.1.0.	manager: [Technical] Replacing the generic “3D tile” definition with “3DGS tile” risks breaking consistency with existing 3GPP usage (and external ecosystems like 3D Tiles) and may remove a useful generic concept; the CR should either retain “3D tile” as a generic term and add “3DGS tile” as a specialization, or clearly scope the term change to TR 26.958 only and update all dependent text accordingly. [Technical] The new “3DGS tile” definition (“spatial volume…containing a set of 3D Gaussians for a given LOD”) implicitly makes LOD part of the tile identity, but later requirements talk about associating “different LODs with spatial volumes”; this is internally inconsistent unless the spec defines whether a tile is (volume, LOD) or volume-only with multiple LOD representations. [Technical] Clause X.2 requirements are too high-level to be actionable (“method to identify…”, “method to identify and access…”) and do not translate into measurable encapsulation/delivery format requirements (e.g., required metadata fields, indexing granularity, dependency signaling, random access points, or constraints on bounding volumes). [Technical] The “frustum-based access” requirement introduces frustum parameters (FoV, viewing distance) but does not define how these map to requested spatial volumes/LOD selection (server-driven vs client-driven, deterministic mapping, hysteresis), risking non-interoperable implementations. [Technical] The use case claim that the selection process “maintains constant number of displayed splats for quality consistency” is not generally valid across devices and scenes and reads like a specific renderer policy; it should be framed as an example strategy or removed from normative-style assumptions. [Technical] The proposal mentions “signaling for spatial and LOD indices and dependencies” but does not specify what dependencies mean for 3DGS (e.g., inter-LOD prediction, progressive refinement, shared codebooks) and therefore cannot guide encapsulation design or compression choices. [Technical] Clause X.4 “LOD and partial delivery support…without compression optimization dependency” is unclear and potentially contradictory: if compression is not designed for partial delivery, random access may be infeasible; the CR should clarify whether this is a requirement on the delivery format (independent access units) versus on the compression tool. [Technical] The CR introduces “bounding volume” but does not constrain its type (AABB/OBB/sphere), coordinate system, or precision, which are essential for interoperable spatial indexing and frustum intersection across UE/server. [Technical] The navigation constraints discussion (“allowed navigation volume”, collision detection) is introduced but not tied to any encapsulation/delivery requirement; if it impacts delivery (e.g., prefetch region, validity of tiles), it needs explicit linkage or should remain purely informative. [Editorial] The change description says “replaces generic ‘3D tiles’ references with ‘3D Gaussians’” in Clause 5.3, but “tiles” and “gaussians” are not interchangeable concepts; the edits should consistently distinguish between the primitive (Gaussian) and the packaging/indexing unit (tile/set). [Editorial] The new clause numbering (“Clause X”) and subclauses (X.1–X.4) need proper integration into TR 26.958 structure (actual clause number, references from Clause 5.3 working assumptions), otherwise the document becomes hard to navigate and cross-reference. [Editorial] Multiple editor’s notes (“need for additional technical details”, workflow documentation, parameter characterization) indicate the text is not yet stable; the CR should either add the missing minimum content or clearly mark these as future work items in a dedicated “open issues” section rather than inline notes. 2026-02-09 04:35
S4-260089 (pdf)	[FS_3DGS_MED] pCR on editorial changes	Samsung Electronics Iberia SA	3GPP Document S4-260089 Summary Document Information Meeting: TSG-SA4 Meeting #135 (February 9-13, 2026, Goa, India) Source: Samsung Electronics Co., Ltd. Document Type: pCR (pseudo Change Request) Target Specification: 3GPP TR 26.958 v0.1.1 Study Item: FS_3DGS_MED (3D Gaussian Splatting for Media) Purpose: Agreement Overview This is an editorial change request for the 3D Gaussian Splatting for Media study item Technical Report. The document proposes non-technical corrections and clean-up modifications to improve the clarity and consistency of TR 26.958. Main Technical Contributions 1. Reason for Change The document identifies the need for editorial corrections and clean-up in the current version (v0.1.1) of TR 26.958. These changes are purely editorial in nature and do not affect the technical content or agreements previously made in the study. 2. Proposed Changes The contribution proposes to incorporate editorial modifications to TR 26.958 v0.1.1. The specific editorial changes are contained in an attached document (not visible in the provided content). Nature of Contribution This is a maintenance-type contribution focused on: - Editorial corrections - Document clean-up - Improving readability and consistency Note: The actual detailed editorial changes are not visible in the provided HTML document as they would be in the attachment referenced by the contribution.	Proposal: It is proposed to agree the changes in the attached document to 3GPP TR 26.958 v0.1.1.	manager: [Technical] The contribution provides no visibility of the actual tracked changes/CR text (attachment not included), so SA4 cannot verify the claim that the changes are “purely editorial” or ensure no normative/technical meaning is altered in TR 26.958 v0.1.1. [Technical] Because the specific edits are missing, it is impossible to assess whether any terminology changes (e.g., “shall/should/may”, “encoder/decoder”, “bitstream/syntax”) inadvertently change requirements or assumptions in the 3DGS study conclusions. [Technical] The pCR does not identify the impacted clauses/subclauses, figures, or tables in TR 26.958, preventing consistency checks (e.g., definitions vs. usage, abbreviations, and cross-references) across the document. [Technical] No change log or summary of edit categories is provided (e.g., reference updates, figure renumbering, equation fixes), which makes it hard to detect high-risk “editorial” edits such as corrected formulas, parameter names, or units that can materially affect interpretation. [Technical] The contribution does not state whether any references (external specs, codecs, file formats, rendering pipelines) are updated; reference changes can have technical impact if versions, titles, or scopes shift. [Editorial] The “Purpose: Agreement” is not supported by a concrete list of proposed corrections; for an agreement request, the document should at least enumerate the main edits or provide a diff excerpt in the main body. [Editorial] The document labels itself “editorial” but does not include the standard CR-style fields that help review (affected version, affected clauses, detailed change description), reducing reviewability even for purely editorial maintenance. [Editorial] The summary is generic (“clean-up modifications to improve clarity and consistency”) and does not justify urgency or priority; a brief rationale tied to specific recurring issues (typos, inconsistent naming, broken cross-references) would be expected. [Editorial] The contribution should explicitly confirm that no figures/tables are added/removed and no numbering changes affect cross-references; renumbering is a common source of residual inconsistencies if not carefully managed. [Editorial] As a pCR against TR 26.958 v0.1.1, it should clarify whether the edits are intended for the next draft (v0.1.2) and whether they align with SA4 drafting rules (e.g., consistent capitalization of defined terms, abbreviation introduction on first use). 2026-02-09 04:36
S4-260119 (pdf)	[FS_3DGS_MED] glTF-based Representation Formats for 3D Gaussian Splats	Qualcomm Atheros, Inc.	Summary of S4-260119: glTF-based Representation Formats for 3D Gaussian Splats Introduction and Scope This contribution addresses Objective 2c of the FS_3DGS_MED Study Item ("Determine relevant formats") by providing a comprehensive analysis of glTF-based representation formats for 3D Gaussian Splatting. The document identifies a gap in TR 26.958 V0.1.1, which currently only mentions PLY as a storage format without comparative analysis of the emerging glTF-based format ecosystem from Khronos and MPEG. The contribution proposes a two-layer architecture combining: - KHR_gaussian_splatting (Khronos) for canonical splat semantics - MPEG_gaussian_splatting_transport (MPEG-I Scene Description) for distribution and streaming capabilities KHR_gaussian_splatting (Khronos Layer) Core Attribute Semantics The Khronos extension (review draft published August 2025) defines Gaussian splats as POINTS primitives within standard glTF 2.0 with the following attributes: POSITION (VEC3, required): Splat center position using standard glTF base attribute ROTATION (VEC4, required): Quaternion (x,y,z,w) for local axes orientation SCALE (VEC3, required): Per-axis scale in log-space OPACITY (SCALAR, required): Opacity in range [0,1] SH_DEGREE_l_COEF_n (VEC3, conditional): Spherical harmonics coefficients organized by degree (0-3) and coefficient index for view-dependent lighting COLOR_0 (VEC3/VEC4, recommended): Baseline color for fallback point-cloud rendering Extensibility and Backward Compatibility Key design features: - Nested extensions mechanism inside the KHR_gaussian_splatting object allows other extensions to add compression, alternative encodings, or processing without duplicating semantics - Graceful degradation: Clients not recognizing the extension can still render as standard point cloud using POSITION and COLOR_0 - Provides strong anchor for MPEG and 3GPP work targeting interoperable distribution and streaming MPEG_gaussian_splatting_transport (MPEG Layer) Architecture Approach The MPEG extension is carried as a nested extension inside `KHR_gaussian_splatting.extensions`, avoiding semantic duplication and adding only transport-level features. Transport-Level Features Alternative SH Layouts Two MPEG-specific SH coefficient storage modes alongside Khronos default: mpegProgressive layout: Groups coefficients by SH degree (degree 1, 2, 3 as separate SCALAR accessors) Efficient for progressive refinement Receiver can render with only SH degree 0 data and incrementally fetch higher degrees DC (degree 0) term reconstructed from COLOR_0.rgb or carried via KHR SH_DEGREE_0_COEF_0 mpegPerChannel layout: Separates coefficients by color channel (R, G, B) More efficient for certain compression schemes Progressive Download Optional progressive ordering signaled by listing accessor indices in `progressive.stages` Ordered from lower to higher fidelity Receiver may initially fetch only first stage and progressively refine without re-decoding previous data Timed Delivery for 4D Splats Dynamic 4D Gaussian splat sequences supported using existing MPEG timed media mechanisms Accessor treated as time-varying if and only if it carries `MPEG_accessor_timed` extension Timed accessors backed by circular buffers as defined by MPEG-I Scene Description Two-Layer Architecture Benefits for 3GPP Architectural Summary Layer 1 (Khronos): Canonical splat semantics (geometry, appearance, SH lighting) and fallback point-cloud path Layer 2 (MPEG): Progressive download, timed delivery, and alternative SH layouts as nested extension 3GPP Service Integration Advantages Alignment with existing 3GPP specifications: glTF already adopted by TS 26.118 (Immersive teleconferencing) and TS 26.119 (MeCAR) 5GMS adaptive delivery mapping: Progressive download and timed delivery map naturally to 5G Media Streaming Bandwidth-adaptive quality: Progressive SH degree layout enables network/receiver control of SH levels to fetch, analogous to spatial/temporal layer selection in scalable video codecs Future-proof extensibility: Clear path for future compression extensions (e.g., from ongoing MPEG Gaussian Splat Coding exploration) and tiled spatial delivery without breaking backward compatibility Format Comparison PLY De facto training output format Raw float32 attributes without compression Very large files (typically 200+ MB for single scene at SH degree 3) Limitations: No extensibility mechanism, no progressive delivery support, no scene graph, no standard metadata support (camera parameters, animation) SPZ (Splat Zip) Developed by Niantic as compact binary container Applies quantization and packing (~90% size reduction vs PLY) Extension under development in Khronos Superior compression schemes (e.g., Qualcomm's L-GSC) also being considered glTF + KHR_gaussian_splatting + MPEG transport Full scene graph support (nodes, transforms, animations) Standard extensibility Backward-compatible fallback MPEG transport layer for progressive and timed delivery Signaling and usage of different compression schemes through proper extensions Recommended as primary format path for 3GPP Proposals for TR 26.958 The contribution proposes to include the following in TR 26.958 Section 4 and new subsection under Section 11: Document KHR_gaussian_splatting as emerging industry baseline for 3DGS representation in glTF, including: Attribute semantics SH coefficient organization Backward-compatible fallback via POINTS Extensibility mechanism Document MPEG_gaussian_splatting_transport being developed within MPEG-I Scene Description, including: Progressive download Timed delivery for dynamic 4D Gaussian splat sequences Alternative SH coefficient layouts (mpegProgressive and mpegPerChannel) Document two-layer architecture (Khronos semantics + MPEG transport) and its suitability for 3GPP service integration, noting alignment with glTF-based approach in TS 26.118 and TS 26.119	Extracted Proposals Proposal We propose to include the following information in TR 26.958 under Section 4 (and a new subsection for external format analysis under Section 11), addressing SID Objective 2c: Document the KHR_gaussian_splatting extension as the emerging industry baseline for 3DGS representation in glTF, including its attribute semantics, SH coefficient organization, backward-compatible fallback via POINTS, and extensibility mechanism. Document the MPEG_gaussian_splatting_transport extension being developed within MPEG-I Scene Description, including progressive download, timed delivery for dynamic 4D Gaussian splat sequences, and alternative SH coefficient layouts (mpegProgressive and mpegPerChannel). Document the two-layer architecture (Khronos semantics + MPEG transport) and its suitability for 3GPP service integration, noting the alignment with the glTF-based approach already used in TS 26.118 and TS 26.119.	manager: [Technical] The contribution treats MPEG_gaussian_splatting_transport as a “nested extension inside `KHR_gaussian_splatting.extensions`”, but glTF extension governance/namespacing and validation rules typically require explicit registration and clear JSON schema; without normative references to the actual MPEG/Khronos drafts and their JSON structures, the proposed two-layer nesting risks being non-interoperable or even invalid in strict glTF validators. [Technical] The stated graceful degradation (“render as standard point cloud using POSITION and COLOR_0”) is overstated: if splats rely on OPACITY/scale/rotation/SH for appearance, a POINTS fallback will not approximate splat rendering and may be misleading; TR text should qualify this as a debug/preview fallback and specify minimum attributes for meaningful fallback. [Technical] The attribute list claims SCALE is in log-space and ROTATION is required, but no rationale or interoperability impact is discussed (e.g., quantization, decoding, coordinate conventions, handedness); TR 26.958 would need to capture these conventions precisely or risk implementers producing incompatible decoders. [Technical] The SH signaling is described as `SH_DEGREE_l_COEF_n` with degree 0–3, but the mapping to actual coefficient counts, ordering, and whether coefficients are RGB triplets vs scalar per channel is unclear; the proposed “alternative layouts” (mpegProgressive/mpegPerChannel) need an unambiguous normative definition of coefficient indexing and reconstruction to avoid mismatched lighting. [Technical] The proposal says DC term reconstructed from `COLOR_0.rgb` or carried via KHR SH_DEGREE_0_COEF_0, which creates two possible sources of truth; this needs a strict precedence rule and constraints (e.g., must match within tolerance) or decoders will diverge. [Technical] “Progressive download” via `progressive.stages` listing accessor indices is underspecified: it doesn’t define whether stages are additive vs replacement, how partial accessors are fetched (byte ranges? separate buffers?), and how this maps to 5GMS segmenting; without a concrete packaging model, this is not actionable for 3GPP. [Technical] The “timed delivery for 4D splats” relies on `MPEG_accessor_timed` and “circular buffers as defined by MPEG-I Scene Description,” but no details are provided on timestamping, random access, buffering constraints, or synchronization with audio/video; TR 26.958 would need at least a clear reference model for timing and synchronization to be useful. [Technical] The document implies glTF is already “adopted” by TS 26.118/26.119, but does not specify which profiles/constraints (e.g., glTF 2.0 core vs specific extensions, binary GLB usage, buffer constraints); the integration argument is weak without aligning the proposed extensions to those existing 3GPP profiles. [Technical] The comparison against PLY focuses on file size and missing features, but omits that PLY is often used with external metadata and that many pipelines use custom binary formats; the TR should avoid implying PLY is inherently non-extensible without clarifying it’s a container lacking standardized extension mechanisms. [Technical] The contribution references SPZ and “Qualcomm’s L-GSC” as “being considered,” but does not clarify whether these are compatible with the proposed KHR/MPEG layering (e.g., as buffer compression extensions) or require different semantics; this weakens the recommendation of a single “primary format path.” [Editorial] The document cites a “review draft published August 2025” for KHR_gaussian_splatting, which is future-dated relative to typical 3GPP timelines and raises credibility/versioning issues; the TR proposal should reference stable, publicly accessible draft identifiers/URLs and revision dates. [Editorial] Proposed TR changes are described only at a high level (“include in Section 4 and new subsection under Section 11”) without concrete text, clause numbers, or proposed wording; for a 3GPP contribution, this makes it hard to assess exact impact and consistency with TR 26.958 structure and terminology. [Editorial] Several terms are used without definition in 3GPP context (e.g., “canonical splat semantics,” “nested extensions mechanism,” “circular buffers”), and the contribution mixes Khronos/MPEG terminology with 3GPP service language; the TR additions should introduce a short glossary or align terms to existing TR 26.958 definitions. 2026-02-10 11:09
S4-260140 (pdf)	[FS_3DGS_MED] Pseudo-CR on Sport Example for Dynamic 3DGS Content Use Case	Pengcheng Laboratory, China Mobile Com. Corporation	Summary of S4-260140: Sport Example for Dynamic 3DGS Content Use Case Document Overview This change request proposes adding a sports scenario example to TR 26.958 to illustrate the Dynamic 3DGS (3D Gaussian Splatting) content use case. The contribution is from Pengcheng Laboratory and China Mobile, targeting the FS_3DGS_MED study item. Main Technical Contributions Use Case Enhancement - Dynamic 3DGS Content (Section 5.4) Core Use Case Description (Section 5.4.1) The document enhances the existing Dynamic 3DGS content use case description with the following key characteristics: Content Type: Time-varying 3DGS content depicting dynamic subjects/scenes (performers, dancers, singers, exhibitions, bands, sport actions) Rendering Approach: Real-time rendering of 3DGS content sequences on the UE Network Support: Delivery and rendering may be assisted through: Partial delivery mechanisms Network-assisted rendering User Interaction: Viewpoint adjustment within a constrained navigation volume while the scene changes dynamically Rendering Primitive: 3D Gaussian splats (as opposed to textured meshes or voxels used in volumetric video) Scope Definition Primary Focus: Delivery, decoding, and real-time rendering of pre-recorded dynamic 3DGS sequences On-demand streaming File download scenarios Future Consideration: Live dynamic 3DGS capturing and delivery (feasibility-dependent, later stage) Alignment: Corresponds to TR 26.928 Use Case 3: Streaming of Immersive 6DoF (non-live/on-demand variant) Sports Action Example Scenario Description The CR introduces a basketball game segment as an illustrative example (Figure 5.1): Content Representation: Dynamic scene encoding both: Evolving motion of players Surrounding environment Represented as time-indexed sequence of 3D Gaussian splats Playback Characteristics Temporal Handling: UE receives successive temporal segments Continuous rendering to preserve temporal progression Spatial Navigation: User-controlled viewpoint adjustments including: Limited rotation Translation Zoom Enables observation from different perspectives without altering temporal sequence Navigation Constraints Temporal Navigation: Driven by playback timeline Spatial Navigation: User-controlled within permitted range Constrained to allowed-view volume derived from original capture configuration Ensures visual coherence and avoids out-of-distribution views Combined Interaction: Time-continuous playback with interactive viewpoint exploration Technical Significance This contribution provides a concrete, large-scale example for Dynamic 3DGS content use cases, specifically addressing: Wide-area environments with complex background dynamics Fast-moving subjects (athletes) Traffic analysis requirements for extensive 3DGS environments Requirement derivation for the FS_3DGS_MED study The sports scenario serves as a representative example for understanding delivery, rendering, and interaction requirements for dynamic 3DGS content in challenging real-world conditions.	Proposal: It is proposed to agree the following changes to the draft 3GPP TR 26.958.	manager: [Technical] The CR appears to add a “basketball game segment” example but does not state what new requirements/implications this example drives for Dynamic 3DGS (e.g., bitrate/latency bounds, segment duration, viewport-dependent delivery), so it risks being non-actionable narrative rather than a use-case clarification in TR 26.958 §5.4. [Technical] The text mixes “real-time rendering on the UE” with “network-assisted rendering” and “partial delivery mechanisms” without clarifying the assumed functional split (decode vs render vs compose) and whether the example targets client-side rendering, edge rendering, or hybrid—this can conflict with the intended scope of §5.4.1 if not explicitly bounded. [Technical] “Time-indexed sequence of 3D Gaussian splats” is underspecified: it is unclear whether this implies per-frame independent 3DGS, inter-frame prediction, or parameter updates/deltas; without that, the example cannot meaningfully inform delivery/decoding aspects that §5.4 is supposed to cover. [Technical] The “allowed-view volume derived from original capture configuration” introduces a key constraint but does not define how it is represented/signalled to the UE (metadata? scene description? per-segment constraints), which is essential if the example is meant to support requirement derivation. [Technical] The example claims “wide-area environments with complex background dynamics” and “fast-moving subjects” but does not discuss occlusions, motion blur, or temporal consistency artifacts specific to 3DGS; these are central technical challenges for dynamic sports scenes and should be acknowledged if the example is to be credible. [Technical] The contribution states alignment with TR 26.928 “Use Case 3: Streaming of Immersive 6DoF (non-live/on-demand variant)” but does not explain the mapping (e.g., 6DoF navigation limits, viewport-adaptive streaming, segmenting model), risking inconsistency with the referenced use case framing. [Technical] “UE receives successive temporal segments” is vague: it should clarify whether segments are GOP-like timed chunks, tiles/partitions, or layered representations, and whether “partial delivery” is temporal-only, spatial-only, or both. [Technical] The scope says “pre-recorded dynamic 3DGS sequences” yet the sports example implies potentially long-form content; without stating assumptions on duration, storage, and buffering, it’s hard to reconcile with “real-time rendering” and any implied latency constraints. [Technical] If the example is intended to support “traffic analysis requirements,” it should specify what traffic characteristics are being highlighted (e.g., peak vs average bitrate, burstiness due to viewpoint changes, uplink feedback frequency), otherwise the claim is unsubstantiated. [Editorial] The CR references “Figure 5.1” but the summary does not indicate whether the figure is newly added/updated and properly captioned/numbered consistent with TR 26.958; figure insertion often breaks numbering and cross-references if not carefully integrated. [Editorial] Terminology is inconsistent/ambiguous: “Dynamic 3DGS content,” “3DGS content sequences,” and “time-indexed sequence of 3D Gaussian splats” should be harmonized with existing definitions in TR 26.958 to avoid introducing parallel phrasing for the same concept. [Editorial] The list of dynamic subjects in §5.4.1 (“performers… exhibitions… sport actions”) mixes nouns and activities; consider normalizing wording (e.g., “sports events” rather than “sport actions”) to match TR style and improve clarity. 2026-02-09 04:36
S4-260145 (pdf)	Pseudo-CR on Dancer Example for Dynamic 3DGS Content Use Case	Pengcheng Laboratory, China Mobile Com. Corporation	Summary of S4-260145: Pseudo-CR on Dancer Example for Dynamic 3DGS Content Use Case Document Overview This contribution proposes adding a detailed dancer scenario example to TR 26.958 as an illustrative use case for Dynamic 3D Gaussian Splatting (3DGS) content. The document is submitted by Pengcheng Laboratory and China Mobile Com. Corporation for SA4 Meeting #135. Main Technical Contributions Dynamic 3DGS Content Use Case Enhancement (Section 5.4) General Description The contribution expands the existing Dynamic 3DGS content use case description with the following key characteristics: Content Type: Time-varying 3DGS content depicting dynamic subjects or scenes (performers, dancers, singers, exhibition moments, bands, sport actions) Delivery Model: Pre-recorded dynamic 3DGS sequences via on-demand streaming or file download Rendering: Real-time rendering on UE with potential network assistance (partial delivery or network-assisted rendering) User Interaction: Local viewpoint adjustment within constrained navigation volume while scene changes dynamically over time Rendering Primitive: 3D Gaussian splats (analogous to volumetric video but using splats instead of textured meshes or voxels) Alignment: Corresponds to 3GPP TR 26.928 Use Case 3: Streaming of Immersive 6DoF (non-live/on-demand variant) Dancer Scenario Example The contribution introduces a comprehensive dancer performance example with the following technical specifications: Scene Representation: - Dynamic 3DGS sequence representing dance performance captured over short temporal interval - Time-indexed sequence of 3D Gaussian splats encoding: - Continuous body motion - Pose transitions - Expressive gestures of one or multiple dancers - Relevant stage elements Playback Characteristics: - UE receives successive temporal segments - Real-time rendering preserving temporal continuity and rhythm - Motion evolution according to encoded timeline - Spatial structure coherence maintained across frames User Interaction Model: - Viewpoint Adjustment: Interactive control within constrained navigation volume derived from original capture setup - Permitted Operations: Limited rotation, translation, or zoom - Benefits: Enhanced perception of choreography, spatial relationships between performers, and fine-grained motion details - Constraints: Visual consistency ensured while avoiding out-of-distribution views Navigation Paradigm: - Temporal Navigation: Driven by playback timeline (time-continuous) - Spatial Navigation: User-controlled within permitted range - Combined Experience: Time-continuous playback with interactive viewpoint exploration Scope Limitations The contribution explicitly defines the following scope boundaries: In Scope: - Delivery of pre-recorded sequences - Decoding of dynamic 3DGS content - Real-time rendering on mobile devices - On-demand streaming or file download Out of Scope: - Live dynamic 3DGS capturing and delivery (may be considered later depending on feasibility) - Capture processes - Real-time communication Technical Focus The use case specifically targets: - Human-centric 3D Gaussian scene reconstruction - Capturing intricate details of human motion and dynamic appearance changes within confined volume - Reference implementation for evaluating 3DGS rendering performance on mobile devices - High-fidelity character rendering Visual Material The contribution includes Figure 5.x illustrating the dancer scenario, showing time-indexed dynamic 3DGS sequence playback with temporal progression preservation and user viewpoint adjustment capabilities within the allowed navigation volume.	Proposal: It is proposed to agree the following changes to the draft 3GPP TR 26.958.	manager: [Technical] The proposal does not define what constitutes a “dynamic 3DGS sequence” in spec terms (e.g., per-frame independent splat sets vs. temporally predicted updates/deltas), which is essential to avoid ambiguity in TR 26.958 Section 5.4 when later mapping to codec, packaging, and streaming implications. [Technical] “UE receives successive temporal segments” is underspecified: the contribution should clarify whether segments are CMAF chunks, file segments, or generic time slices, and how segment boundaries relate to random access, buffering, and timeline continuity for dynamic 3DGS. [Technical] The text implies “spatial structure coherence maintained across frames” but provides no mechanism/assumption (e.g., stable splat IDs, correspondence, motion fields); without this, the example risks implying requirements on representation/decoder behavior that may not be intended in a use-case TR. [Technical] The “constrained navigation volume derived from original capture setup” needs a concrete definition (e.g., 6DoF bounding volume, camera manifold, near/far limits) and how it is signaled/communicated to the client; otherwise it is not actionable for system design discussions in TR 26.958. [Technical] The contribution mentions “potential network assistance (partial delivery or network-assisted rendering)” but does not state whether this is in-scope for the example; this can conflict with the stated “real-time rendering on mobile devices” and should be clearly framed as optional/non-normative to avoid scope creep. [Technical] Alignment to TR 26.928 “Use Case 3: Streaming of Immersive 6DoF (non-live/on-demand variant)” is asserted but not demonstrated; the example should explicitly map the dancer scenario’s navigation, timing, and delivery assumptions to the corresponding TR 26.928 attributes to ensure consistency. [Technical] The example mixes “on-demand streaming” and “file download” without clarifying whether the same timing/navigation behavior is expected in both modes (e.g., progressive download vs. true streaming), which affects buffering and interactivity assumptions. [Technical] The statement “visual consistency ensured while avoiding out-of-distribution views” is more of a reconstruction/training limitation than a delivery use-case attribute; it should be reframed to avoid implying normative constraints on user navigation beyond what the system can signal/enforce. [Technical] The example references “real-time rendering preserving temporal continuity and rhythm” but does not identify the key performance/QoE parameters (target frame rate, motion-to-photon latency tolerance, acceptable stutter) that make the use case meaningful for SA4 evaluation. [Editorial] The contribution uses “UE” without expansion (Unreal Engine) and introduces it as if it were a normative component; TR text should either generalize to “client renderer” or define UE as an example implementation. [Editorial] Terminology alternates between “dynamic 3DGS,” “time-varying 3DGS,” and “time-indexed sequence of 3D Gaussian splats” without a consistent term; Section 5.4 should pick one primary term and define it once. [Editorial] Figure “5.x” is referenced but not anchored to an actual figure number/caption and the surrounding text does not state what the figure concretely illustrates (timeline, navigation volume, segmenting), reducing its value as an illustrative example. [Editorial] The “In scope / Out of scope” bullets partially repeat general TR boundaries (e.g., “capture processes”) and could be tightened to only what is specific to this dancer example, otherwise it reads like a generic disclaimer rather than a targeted use-case addition. 2026-02-09 04:37
S4-260147 (pdf)	[FS_3DGS_MED] Pseudo-CR on Enhanced Scenario for Avatar Communication Use Case	Pengcheng Laboratory, China Mobile Com. Corporation	Summary of 3GPP Change Request S4-260147 Document Information Source: Pengcheng Laboratory, China Mobile Com. Corporation Title: [FS_3DGS_MED] Pseudo-CR on Enhanced Scenario for Avatar Communication Use Case Specification: 3GPP Draft TR 26.958 v0.1.1 Meeting: TSG-SA4 Meeting #135, 9-13 February 2026, Goa, India Main Objective This contribution proposes an enhanced scenario for avatar-based communication that combines parametric human models with 3D Gaussian Splatting (3DGS) technology. The proposal aims to enable efficient real-time interactive communication by transmitting compact motion parameters to drive a deformable mesh while using 3DGS for high-fidelity appearance rendering. Technical Contributions Enhanced Avatar Communication Architecture The proposal introduces a hybrid representation approach consisting of: Deformable mesh representation driven by parametric human model parameters (e.g., SMPL-X for body/hands, FLAME for face) 3D Gaussian Splat representation for appearance enhancement and fine detail capture Separation of geometry and appearance to optimize transmission efficiency Technical Processing Pipeline Sender Side Processing Capture: User captured using one or more cameras Parameter Extraction: Geometric and animation parameters extracted using parametric models SMPL-X for body and hand motion FLAME for facial geometry and expression Representation Generation: Deformable human mesh reconstruction based on extracted parameters 3D Gaussian Splat generation for appearance details (fine surface detail, hair, clothing) Spatial Alignment: 3DGS representation aligned with deformable mesh Transmission Strategy Base Avatar: Transmitted once at session setup or updated occasionally Rigged mesh with skeletal structure and blendshapes Static 3DGS representation Animation Stream: Time-varying model parameters transmitted during session Compact parametric representation Low-latency transmission for interactive communication Update Frequency: 3DGS updated at lower frequency than animation parameters Receiver Side Processing Animation Application: Received parameters drive avatar motion Deformation Propagation: 3DGS follows mesh deformation Rendering: Composite approach combining: Mesh-based shading 3DGS-based appearance contributions Viewpoint Adaptation: Supported within application-defined constraints Working Assumptions The proposal defines several key working assumptions: Capture and Animation Extraction: - Real-time capture using one or more cameras - Real-time derivation of animation parameters from captured signals Representation: - Deformable mesh with associated rig - Associated 3DGS for appearance rendering - Static or low-frequency updated 3DGS representation Transmission: - One-time or occasional base avatar transmission - Continuous time-varying animation parameter transmission - Low-latency requirement for interactive communication Decoding and Rendering: - Animation parameter application at receiver - Combined mesh and 3DGS rendering - Constrained viewpoint adaptation support Key Innovation The main technical innovation is the separation of geometric animation (transmitted as compact parametric data) from appearance representation (using 3DGS), enabling photorealistic real-time avatar communication with efficient bandwidth utilization suitable for bidirectional interactive applications.	Proposal: It is proposed to agree the following changes to the draft 3GPP TR 26.958.	manager: [Technical] The proposal introduces a “static 3DGS representation” that “follows mesh deformation” at the receiver, but it does not specify a deformation model for Gaussians (e.g., per-Gaussian skinning weights, attachment to mesh surface, or a learned deformation field), making interoperability and feasibility unclear. [Technical] “Spatial alignment” between the deformable mesh and 3DGS is asserted without defining the coordinate frames, calibration requirements, and how alignment is maintained under pose/expression changes; this is a core missing element for a normative scenario description. [Technical] The transmission strategy lacks a concrete definition of what constitutes the “base avatar” payload versus “animation parameters” (parameter sets, units, ranges, timing model), so the claimed bandwidth/latency benefits cannot be evaluated or compared to other TR 26.958 scenarios. [Technical] The document assumes SMPL‑X/FLAME parameter extraction in real time but does not address model licensing/IP, standardization suitability, or whether the scenario is intended to be model-agnostic; referencing specific proprietary/de facto models may conflict with 3GPP’s technology-neutral TR positioning. [Technical] “3DGS updated at lower frequency than animation parameters” is underspecified: no triggers (appearance change, lighting change, topology change), update granularity (full set vs patches), or drift/consistency handling are described, which is critical for interactive bidirectional use. [Technical] The receiver rendering is described as “composite” (mesh shading + 3DGS appearance) but no compositing rules are given (occlusion, depth ordering, alpha blending, shadowing), risking ambiguous visual results and undermining the scenario’s reproducibility. [Technical] “Viewpoint adaptation supported within application-defined constraints” is too vague for a TR scenario; it should at least state whether free-viewpoint is expected, what baseline view range is assumed, and how artifacts are handled when extrapolating beyond capture coverage. [Technical] The capture assumptions (“one or more cameras”) omit key constraints that drive feasibility (mono vs multi-view, depth availability, required resolution/frame rate, lighting), which are necessary to justify real-time parameter extraction and 3DGS generation. [Technical] The proposal does not discuss error resilience and synchronization between the low-latency animation stream and the lower-rate 3DGS updates (e.g., timestamping, buffering, late/early update handling), which is essential for interactive communication scenarios. [Technical] There is no discussion of how identity personalization is handled (e.g., per-user mesh/3DGS creation, enrollment time, update cadence), yet “base avatar transmitted once” implies a prior creation pipeline that should be captured in the scenario. [Editorial] As a “Pseudo-CR,” the contribution summary does not indicate the exact TR 26.958 clause(s) to be updated, nor does it provide proposed text; without clause-level changes, SA4 cannot efficiently assess consistency with existing scenarios and terminology. [Editorial] Several terms are introduced without definition or alignment to TR terminology (e.g., “deformation propagation,” “appearance contributions,” “application-defined constraints”), which should be tightened to avoid multiple interpretations across implementers. [Editorial] The summary claims “efficient bandwidth utilization” but provides no qualitative comparison point (e.g., versus full 3DGS streaming or mesh+texture video), making the motivation read as aspirational rather than supported by scenario requirements. 2026-02-09 04:37
S4-260164 (pdf)	[FS_3DGS_MED] Pseudo-CR on objective metrics for 3DGS	Tencent Cloud	Summary of 3GPP Change Request S4-260164 Document Information Source: Tencent Title: Pseudo-CR on objective metrics for 3DGS Specification: 3GPP TR 26.958 v0.1.1 Meeting: TSG-SA4 Meeting #135, February 2026 Main Technical Contributions 1. Introduction of Objective Metrics Framework for 3DGS This change request proposes the adoption of a standardized objective quality evaluation methodology for 3D Gaussian Splatting (3DGS) content. The contribution addresses the current gap in TR 26.958, which contains only placeholders for metrics and reference implementations. The proposal leverages the mpeg-gsc-metrics software tool recently developed by MPEG for computing objective quality metrics. 2. Rationale for Standardization The CR identifies three key requirements for the study: Image-based evaluation: Enables calculation of objective image-based metrics for comparing source and decoded 3DGS content Industry-standard metrics: Supports commonly used image quality metrics (PSNR, SSIM, IVSSIM, etc.) Viewpoint management: Provides flexible handling of test views with exact camera parameter reuse or custom testing scenarios The proposed software is a fork of MPEG metrics software intended for storage in the 3GPP git repository to facilitate updates and future experiments. 3. Technical Changes to TR 26.958 3.1 New Section 6.4.1: Objective Metrics The CR introduces a comprehensive objective metrics section defining: Supported Metrics: - PSNR and MSE: Computed in both RGB and YUV color spaces with weighted averages - Object Masked (OM) Metrics: PSNR and SSIM variants computed only on valid pixels defined by union of object masks - Perceptual Metrics: SSIM and IVSSIM - Geometric Statistics: Occupancy rate measuring valid pixel coverage percentage Dual-Mode Rasterizer: - CPU rasterizer: Software-based implementation ensuring bit-exact rendering regardless of hardware/OS (recommended for normative results) - GPU rasterizer: OpenGL-based accelerated rendering for visual inspection and rapid experiments Evaluation Process: 1. Viewpoint generation from original PLY file or explicit definition 2. Rendering using standardized rasterizer (CPU or GPU) 3. Metric computation on rendered pairs 3.2 New Section 12.4: Objective Metrics Reference Implementation The CR adds detailed usage documentation for the 3DGS-Metrics command-line tool: 12.4.1 Basic Metric Computation: - Simple command-line interface for comparing source and decoded PLY files 12.4.2 Evaluation Using Embedded Camera Parameters: - `--useCameraPosition` option enables rendering using camera parameters stored in PLY header comments - Parameters typically inserted by content preparation tools using COLMAP photogrammetry data - Ensures exact camera intrinsics and extrinsics without external configuration 12.4.3 Evaluation with Loaded Viewpoints: - Support for external viewpoint files specifying camera poses - CPU rendering option for bit-exact results 12.4.4 Video Generation: - `-s` flag enables generation of rendered video sequences (Source, Decode, Butterfly comparison) - Facilitates visual inspection alongside metric computation 12.4.5 Output Results Format: - Detailed per-frame and global average statistics - Comprehensive reporting of MSE, PSNR, and SSIM in RGB and YUV color spaces - Example output provided for Bartender sequence at 1920x1080 resolution showing: - RGB PSNR (avg): 49.79 dB - YUV PSNR (avg): 53.96 dB - SSIM (avg): 0.998386 - 100% occupancy Conclusion The CR proposes adopting the 3DGS metrics software as the reference tool for objective quality evaluation to ensure all contributions are measured against the same baseline. This standardization will facilitate technical work by providing consistent, comparable, and reproducible results across different proponents within the study.	Proposal: It is proposed to agree the following changes to 3GPP TR 26.958 v0.1.1.	manager: [Technical] TR 26.958 is a Study Item TR; introducing “recommended for normative results” and “bit-exact rendering” language (CPU rasterizer) risks implying normative conformance where none exists—wording should be strictly informative and aligned with TR scope. [Technical] The proposal relies on a fork of MPEG’s mpeg-gsc-metrics but provides no evidence of licensing/IPR compatibility, long-term maintenance plan, or reproducibility guarantees within 3GPP Git (e.g., pinned commit, dependency versions), which is critical if it becomes the de facto reference. [Technical] “Bit-exact rendering regardless of hardware/OS” for a CPU rasterizer is a strong claim that is typically false without strict control of floating-point behavior, compiler flags, SIMD paths, and math libraries; the CR should specify determinism constraints and validation methodology. [Technical] The metric definitions are underspecified: PSNR/MSE “in RGB and YUV with weighted averages” needs explicit color conversion (matrix, range, primaries, transfer), bit depth, rounding, and weighting (e.g., 4:2:0 vs 4:4:4, luma/chroma weights) to ensure cross-implementation comparability. [Technical] “Object Masked (OM) metrics” based on “union of object masks” is ambiguous (union of source+decoded masks vs per-view mask generation); this choice materially affects scores and should be precisely defined, including how masks are generated from splats and how occlusions/background are handled. [Technical] Viewpoint generation “from original PLY or explicit definition” is not sufficiently constrained; without a standardized view sampling strategy (count, distribution, near/far planes, FOV, resolution), results across companies will remain non-comparable despite a common tool. [Technical] The `--useCameraPosition` approach depends on non-standard PLY header comments “typically inserted by tools”; without a defined schema/grammar and required fields (intrinsics/extrinsics conventions, coordinate system, units), interoperability will be poor and results non-repeatable. [Technical] The CR mixes evaluation of “source vs decoded PLY” but does not clarify whether “source” is the original capture point cloud, the encoder input 3DGS, or a rendered reference; for codec evaluation, the reference should be the encoder input representation, not necessarily the original reconstruction. [Technical] GPU rasterizer based on OpenGL is inherently non-deterministic across drivers and platforms; the CR should clearly restrict GPU mode to non-comparative visualization only and prevent accidental use in reported results (e.g., tool default, output labeling). [Technical] Inclusion of IVSSIM is mentioned but not defined (version, parameters, viewing conditions); perceptual metrics often have multiple variants—without parameter locking, reported numbers will not be comparable. [Technical] “Occupancy rate = valid pixel coverage percentage” needs a precise definition of “valid pixel” (alpha threshold? depth test? mask generation?) and whether it is computed on source, decoded, or combined; otherwise it can be gamed and misinterpreted. [Editorial] Section numbering (new 6.4.1 and 12.4) may conflict with existing TR 26.958 structure; the CR should show exact insertion points and ensure consistent cross-references, rather than describing “new sections” abstractly. [Editorial] The contribution reads like a “pseudo-CR” and tool user guide; it should clearly separate (a) metric definitions, (b) reference implementation description, and (c) example commands/outputs, and avoid promotional phrasing (“comprehensive”, “ensures exact”) unless substantiated. [Editorial] Example results (Bartender, 1920×1080, PSNR/SSIM, 100% occupancy) are not traceable without specifying dataset version, rendering settings, and tool version/commit; examples should be labeled as illustrative and reproducible inputs should be referenced. 2026-02-09 04:37
S4-260168 (pdf)	[FS_3DGS_MED] Pseudo-CR on 3DGS renderer and performance benchmarking	Tencent Cloud	Summary of 3GPP Change Request S4-260168 Document Information Source: Tencent Title: Pseudo-CR on 3DGS renderer and performance benchmarking Specification: 3GPP TR 26.958 v0.1.1 Study: FS_3DGS_MED (3D Gaussian Splats for mobile) Main Objective This change request proposes adding technical content to TR 26.958 regarding a reference implementation of a 3DGS player for mobile platforms, including mobile renderer features and preliminary experimental benchmark results obtained on commercial mobile devices. Technical Contributions 1. Mobile Renderer Architecture (Section 12.4.1) The document proposes a hybrid architecture for the 3DGS mobile player: Native Layer (C++): Implements core rendering using OpenGL ES 3.2 Tile-based rasterizer inspired by original 3DGS method CPU sorting or Compute Shaders for parallel sorting (e.g., Radix sort) Vertex and Fragment shaders for rendering Application Layer (Java/Kotlin): UI management AR runtime lifecycle for camera tracking Resource management Capabilities: Supports standard .ply file loading Real-time interaction (rotation, translation, scaling) Benchmarking mode with dynamic parameter variation 2. Rendering Process Details (Section 12.4.1 - second subsection) Key technical aspects of the mobile rendering pipeline: Depth Sorting: Critical back-to-front sorting performed by CPU each frame for proper alpha blending (unlike Z-buffer-based mesh rendering) Sorting Implementation: CPU-based Radix Sort preferred over GPU Compute Shaders on mobile for thermal balance and driver compatibility Data Management: Gaussian attributes loaded into VRAM at startup FP32 textures/buffers for precision in covariance and color calculations Only sorted indices transferred CPU→GPU per frame Vertex shader uses texelFetch for direct reads from persistent buffers Minimizes CPU-GPU bandwidth while maintaining visual fidelity 3. Benchmark Methodology (Section 12.4.2) Proposed benchmarking approach: Dynamic parameter modification during runtime Thermal management API usage for consistent clock speeds AR runtime disabled during benchmarking for fair comparison Variable parameters: Number of Gaussians: 5,000 to 485,436 points Spherical Harmonics degree: 0 (diffuse only) to 3 (full view-dependence) 4. Experimental Results (Section 12.4.3) Test Configuration Device: Google Pixel 9a (Tensor G4, mid-range, March 2025) Application: Tencent 3DGS mobile player Build: Release mode with optimizations Test duration: 30 seconds per configuration for thermal stability Model: bicycle.ply (485,436 points) Power measurement: Android Battery Manager API Impact of Number of Points (SH degree=3) Key findings from Table 1 and Figure 2: 5,000 points: 355 FPS, 24% CPU, 6% GPU, 1.45W 150,000 points: 56 FPS, 47% CPU, 88% GPU, 1.47W (approaching GPU saturation) 200,000 points: 45 FPS, 48% CPU, 99% GPU, 1.33W (GPU saturated) 485,436 points: 19 FPS, 55% CPU, 100% GPU, 1.22W Conclusion: GPU saturation occurs at ~150k points (87% load) and full saturation at 200k points. Beyond saturation, frame rate decreases linearly with point count. Impact of Spherical Harmonics Degree (485k points) Key findings from Table 2 and Figure 3: SH Degree 0: 20.41 FPS, 55% CPU, 100% GPU, 1.45W SH Degree 3: 18.05 FPS, 55% CPU, 100% GPU, 0.99W Performance impact: ~10.8% FPS reduction from degree 0 to 3 Conclusion: Moderate frame rate impact when increasing SH degree from 0 to 3. 5. Overall Analysis (Section 12.4.2.3) Key conclusions: Real-time rendering of complex 3DGS scenes is feasible on current-generation mobile hardware Scene complexity management required (< 200k visible points recommended) Performance variations observed between identical experiments due to: Background processes Dynamic power management Results should be considered as trends rather than fixed values Editor's note: Additional benchmarks planned to evaluate impact of other improvements (memory optimization, quantization, sorting algorithms, etc.) Rationale for Change Provides concrete data to validate real-time 3DGS feasibility on mobile hardware Identifies performance bottlenecks (CPU sorting, memory transfer, GPU rasterization, power consumption) Supports study objectives for reference implementations and performance characteristics Guides future specification work with empirical evidence	Proposal: It is proposed to include the following text in Section 12 (Reference Implementation) of TR 26.958.	manager: [Technical] The proposal mandates/assumes per-frame full back-to-front sorting “for proper alpha blending” without discussing established alternatives (e.g., approximate OIT, depth peeling limits, per-tile sorting, or k-buffer) or the correctness impact; TR text should not imply a single required method if multiple are viable for 3DGS. [Technical] Claiming “CPU-based Radix Sort preferred over GPU Compute Shaders on mobile for thermal balance and driver compatibility” is not substantiated with comparative data or conditions (SoC/GPU family, driver versions, dataset sizes), and risks being misleading guidance in a TR. [Technical] The renderer description mixes “tile-based rasterizer inspired by original 3DGS” with OpenGL ES vertex/fragment pipeline but does not specify where tile binning occurs (CPU vs GPU), how tiles map to screen space, or how overdraw/blending is handled; as written it is incomplete and hard to reproduce. [Technical] Stating “Gaussian attributes loaded into VRAM at startup” and “only sorted indices transferred CPU→GPU per frame” omits the memory footprint and bandwidth feasibility on mobile (e.g., FP32 covariance/color/SH for 485k points), and does not address what happens under memory pressure or when models exceed GPU memory. [Technical] Use of “FP32 textures/buffers for precision” is presented as a design choice but the TR should discuss precision/performance trade-offs (FP16/packed formats/quantization) since FP32 is often a major bottleneck on mobile and may contradict the study’s “mobile feasibility” narrative. [Technical] Benchmark methodology is under-specified: resolution, FOV, render target format, MSAA, vsync, fixed camera path, and whether AR pose updates are disabled are not clearly defined, making the FPS/power numbers non-comparable across implementations. [Technical] “Thermal management API usage for consistent clock speeds” is vague and potentially incorrect for Android devices (many controls are advisory and OEM-specific); the TR should specify exact APIs, permissions, and how effectiveness was verified, otherwise results may not be reproducible. [Technical] Power measurement via “Android Battery Manager API” is not sufficiently accurate/consistent across devices and often reports averaged or estimated values; the TR should either qualify the limitations strongly or recommend external power measurement for normative comparisons. [Technical] The conclusion “GPU saturation occurs at ~150k points (87% load)” relies on “GPU %” metrics that are not defined (source tool, sampling interval, what 100% means); without a standardized measurement method, the saturation point is not defensible. [Technical] The reported behavior “Beyond saturation, frame rate decreases linearly with point count” is an overreach given only a few data points and no confidence intervals; the TR should present it as an observed trend for this setup, not a general property. [Technical] SH degree impact results show lower power at higher SH degree (e.g., 1.45W at degree 0 vs 0.99W at degree 3) while GPU remains “100%”; this is internally inconsistent and suggests measurement artifacts or uncontrolled variables that must be explained before drawing conclusions. [Technical] Disabling “AR runtime” for benchmarking may invalidate the stated mobile player architecture use case (AR camera tracking + rendering); the TR should either benchmark both modes or clearly separate “renderer-only” performance from “end-to-end AR” performance. [Editorial] The contribution is described as a “Pseudo-CR” against TR 26.958 v0.1.1 but does not provide actual CR-style change markup, exact clause text, or proposed insertions/deletions; reviewers cannot verify consistency with existing Section 12.4.x wording. [Editorial] Several statements read like requirements (“critical,” “preferred,” “recommended <200k visible points”) but TRs should keep such guidance clearly non-normative and scoped (device class, resolution, quality targets), otherwise it may be misinterpreted as specification direction. 2026-02-09 04:38
S4-260169 (pdf)	[FS_3DGS_MED] Pseudo-CR on 3DGS delivery workflows based on capability negotiation	Tencent Cloud	Summary of S4-260169: Pseudo-CR on 3DGS Delivery Workflows Based on Capability Negotiation Document Overview This contribution from Tencent proposes updates to TR 26.958 v0.1.1 to define adaptive delivery workflows for 3D Gaussian Splats (3DGS) content in mobile environments. The document addresses the heterogeneity in both 3DGS scene complexity and UE capabilities through capability negotiation mechanisms. Motivation and Problem Statement The contribution identifies a critical gap in the current study: static delivery workflows for 3DGS content pose significant risks including: - Poor Quality of Experience (QoE) when content complexity exceeds UE rendering capabilities - Device overheating and thermal throttling - Inefficient resource utilization across diverse mobile devices The heterogeneity exists on two dimensions: - Content complexity: Ranging from simple objects (thousands of primitives) to massive scenes (millions of primitives) - Device capabilities: Significant variation in GPU power, thermal limits, memory, and battery constraints Main Technical Contributions Adaptive Delivery Framework (Clause 9.2) The contribution proposes updating clause 9.2 with a comprehensive adaptive workflow that introduces: Capability Reporting Mechanism: UEs report both static and dynamic capabilities to the server Static capabilities: Maximum visible Gaussians at target frame rate (e.g., 30fps), highest supported Spherical Harmonics (SH) degree (0-3), maximum memory, supported quantization/compression formats, GPU rendering capacity, CPU performance class, native screen resolution/frame rate, memory bandwidth Dynamic state: Current thermal status (throttling level), battery level, available GPU/CPU compute headroom, real-time battery charge Rendering Budget Concept: A negotiated constraint that ensures target frame rates and maximizes session duration based on device capabilities Two Negotiation Modes The contribution defines two distinct approaches aligned with TR 26.928 principles: Server-Centric Decision Mode (Clause 9.2.2.2) In this approach, the UE acts as a data provider while the server makes adaptation decisions: Workflow Steps: 1. Hardware assessment: UE evaluates capabilities via system checks (potentially using OpenXR APIs) 2. Capability reporting: UE transmits comprehensive capability report (CPU, GPU, Memory constraints) 3. Server decision: Server analyzes report and determines optimal delivery strategy 4. Content adaptation: Server processes 3DGS model through: - Pruning low-opacity or spatially insignificant splats - LOD selection from pre-generated levels - SH degree reduction (stripping high-order coefficients, transmitting only Direct Color components) - Quantization adjustments 5. Data delivery: Server streams optimized 3DGS payload 6. Local adaptation: UE performs final on-device optimizations (further pruning/merging) to fit runtime constraints 7. Rendering: UE executes rendering pipeline Key characteristics: - Server employs internal logic or lookup tables to map raw metrics to rendering budget - Server determines primitive count limits (e.g., N primitives for specific GPU under thermal stress) - Reduces both network bandwidth and client rendering load Client-Centric Decision Mode (Clause 9.2.2.3) In this approach, the UE determines its own requirements and explicitly requests specific content characteristics: Workflow Steps: 1. Hardware analysis: UE performs internal audit of hardware resources and API support 2. Format determination: UE calculates optimal 3DGS representation format (point budget, SH degree) based on continuous self-assessment of frame time, thermal headroom, and hardware capabilities 3. Content request: UE explicitly specifies required format parameters (quantization levels, SH orders, point budget) 4. Server-side adaptation: Server processes source content to match UE-specified constraints 5. Data delivery: Server streams optimized payload 6. Local refinement: UE applies final local adaptations 7. Rendering: UE executes rendering pipeline Key characteristics: - Decision-making responsibility delegated to UE - UE continuously monitors performance metrics - Server acts as content filter/selector fulfilling explicit UE requests Use Case Alignment The proposed workflows address requirements from: - Clause 5.2: Static 3DGS scene delivery - Clause 5.4: Dynamic 3DGS content delivery Technical Benefits The contribution ensures: - Frame rate stability through capability-aware content delivery - Thermal management by preventing device overheating - Prevention of application crashes and frame drops - Optimized battery consumption - Maximized session duration - Content complexity aligned with hardware processing limits Proposed Changes The document proposes modifications to Clause 9.2 of TR 26.958, specifically: - Adding new clause 9.2.1 (Overview) - Adding new clause 9.2.2 (Workflow with capability negotiation) - Adding new clause 9.2.2.1 (Objectives) - Adding new clause 9.2.2.2 (Server-centric 3DGS adaptation) with Figure 2 - Adding new clause 9.2.2.3 (Client-centric 3DGS adaptation) with Figure 3	Proposal: It is proposed to agree the following changes to 3GPP TR 26.958.	manager: [Technical] The proposal introduces “capability reporting” and “negotiation modes” in TR 26.958 Clause 9.2 but does not define where this negotiation occurs (e.g., 5GMS AF/AS, application layer, DASH/HTTP, MIV/scene protocol), nor the message/parameter set, so the workflow is not implementable or assessable for interoperability. [Technical] Several reported metrics are not well-defined or measurable in a consistent way across UEs (e.g., “maximum visible Gaussians at 30 fps”, “available GPU/CPU compute headroom”, “memory bandwidth”, “GPU rendering capacity”), risking non-comparable capability reports and unstable adaptation decisions. [Technical] The “dynamic state” items (thermal status, battery level, throttling level) raise privacy/policy and platform-access issues; the text should clarify optionality, granularity, and whether these are exposed via standardized APIs, otherwise the workflow assumes capabilities many OSes do not reliably provide. [Technical] The “rendering budget” concept is introduced but not normatively bounded (units, parameters, mapping to point budget/SH degree/LOD/bitrate), and it is unclear how it relates to existing 3GPP media adaptation constructs (e.g., representation selection, bitrate ladders), creating ambiguity and potential duplication. [Technical] Server-centric mode claims the server can map “raw metrics to rendering budget” via lookup tables, but no guidance is provided on required inputs/outputs or stability (hysteresis, oscillation control), which is critical when dynamic metrics fluctuate (thermal/battery) and could cause rapid quality switching. [Technical] Client-centric mode allows the UE to request “quantization levels, SH orders, point budget” but does not specify constraints/validation (e.g., server limits, content availability, security against abusive requests), nor how the server signals supported operating points back to the UE. [Technical] The adaptation operations listed (pruning, LOD selection, SH degree reduction, “Direct Color components”) can change visual appearance and potentially break authoring intent; the text should address objective quality targets/metrics and whether these transformations are reversible or require precomputed assets. [Technical] For dynamic 3DGS content (claimed alignment with Clause 5.4), the workflow omits latency and update-frequency considerations (e.g., incremental updates, delta coding, synchronization with pose/time), which are typically the dominant constraints for “dynamic” scenes. [Technical] The proposal mentions “supported quantization/compression formats” but does not align them with any referenced 3GPP/ISO codec or payload format for 3DGS; without identifying candidate formats and signaling, the negotiation cannot be tied to actual delivery mechanisms. [Technical] The UE “local adaptation” step (further pruning/merging) is underspecified and may invalidate server-side assumptions (e.g., bounding volumes, LOD selection, rate control), so the workflow should clarify whether UE-side changes are purely rendering-time culling or alter the delivered asset. [Editorial] References to “TR 26.928 principles” are vague; the contribution should cite the exact clauses/principles being reused and ensure terminology matches (e.g., “capability exchange”, “adaptation”, “representation”) to avoid inconsistent wording across TRs. [Editorial] The proposed new subclause structure (9.2.1/9.2.2/9.2.2.x) is described, but the summary does not indicate how it integrates with existing Clause 9.2 text (what is replaced vs. appended), risking duplication or contradictions with current workflows already in TR 26.958. [Editorial] Example values like “SH degree (0–3)” and “30 fps” are presented without stating whether they are informative examples or requirements; the text should clearly mark them as non-normative to avoid accidental specification of fixed operating points. [Editorial] Figures 2 and 3 are referenced but not described in enough detail to verify consistency (entities, interfaces, message directions); the contribution should ensure the figures use consistent naming with TR 26.958 architecture terms and clearly show negotiation signaling paths. 2026-02-09 04:38
S4-260186 (pdf)	[FS_3DGS_MED] On Software and Services	Nokia	Summary of S4-260186: 3DGS Software and Services Document Overview This contribution from Nokia provides an overview of consumer-facing 3D Gaussian Splatting (3DGS) software and services for inclusion in the draft TR for the study on 3DGS for Media (FS_3DGS-MED). The document proposes two main changes: addition of normative references and a new clause describing available 3DGS software products. Main Technical Contributions Addition of Normative References The contribution proposes adding multiple new references to support the technical content: Foundational 3DGS References: - Kerbl et al. foundational 3DGS paper (ACM TOG 2023) - Existing 3GPP references (TR 21.905, TR 26.928) Image Processing and Rendering References: - SSIM (Structural Similarity Index) - Wang et al. 2004 - GPU sorting algorithms - Satish et al. - Alpha compositing - Porter et al. (SIGGRAPH '84) Recent 3DGS Research (2024-2025): - VGGT: Visual Geometry Grounded Transformer - DepthSplat: Connecting Gaussian Splatting and Depth - AnySplat: Feed-forward 3DGS from unconstrained views - GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting - iLRM: Iterative Large 3D Reconstruction Model - MetaSapiens: Real-time neural rendering with efficiency-aware pruning - Hybrid Transparency Gaussian Splatting (HTGS) - Sort-free Gaussian Splatting via Weighted Sum Rendering Software and Service References: - KIRI Engine, Niantic Scaniverse, Polycam, Luma AI, Jawset Potshot, LichtFeld Studio, SuperSplat, Gauzilla New Clause: Software and Products (11.3) The contribution proposes adding a comprehensive overview of consumer 3DGS software categorized by platform and capabilities: Mobile Applications KIRI Engine: - Platform: iOS and Android - Processing: Cloud-based - Capabilities: Photogrammetry or LiDAR capture with 3DGS generation - Export: .ply and other formats - Limitations: Limited control over splat parameters, quality depends on capture device Niantic Scaniverse: - Platform: Smartphone (iOS/Android) - Processing: Local on-device - Pipeline: SfM for camera pose estimation + Gaussian optimization - Export: .ply, .spz formats - Limitations: Mobile GPU/thermal constraints limit scene size and density, no manual SH order adjustment or splat pruning Polycam: - Platform: Web, iOS, Android - Processing: Cloud-based - Capabilities: Photos/videos to Gaussian splats, also supports mesh/point cloud - Export: .ply for splats, standard formats for meshes - Limitations: No control over splat parameters, non-deterministic cloud processing results Desktop Applications Jawset Potshot: - Platform: Windows desktop - Processing: Local GPU-based - Workflow: Alignment, optimization, and visualization - Export: .ply format - Limitations: Limited parameter tuning compared to research tools, no low-level SH coefficient control LichtFeld Studio: - Platform: Linux and Windows desktop - Type: Open source - Processing: Local GPU-based - Input Requirements: Pre-computed SfM data (images, point clouds, camera locations) - Features: - 3D Unscented (3DGUT) transform for rendering - Background Modulation for black segments - Timelapse for intermittent quality checks - Masking support - Export: .ply format Web-Based Viewers/Editors SuperSplat and Gauzilla: - Platform: Browser-based - Rendering: Client-side via WebGL or WebGPU - Capabilities: Rendering, sharing, transformations, cropping, basic filtering - Limitations: No training or reconstruction support, lower rendering fidelity vs desktop GPU pipelines - Use Case: Post-processing and quick visualization Hybrid Platform Luma AI: - Platform: iOS and Web - Processing: Cloud-based - Input: Short handheld videos or image sets - Technology: Neural scene representations rendered as Gaussian splats or hybrid neural radiance fields - Pipeline: Pose estimation and scene normalization before splat optimization - Limitations: No raw Gaussian parameters or SH coefficients exposed, no export capability (as of February 2026), oriented toward visualization rather than pipeline integration Summary Table The contribution includes a comparative table summarizing: - Product name - Application type (Mobile/Desktop/Web) - Processing location (Cloud/Local) - Export format options This table provides a quick reference for understanding the landscape of available 3DGS tools and their capabilities. Key Observations The contribution demonstrates the rapid proliferation of 3DGS tools across different platforms and use cases, from mobile capture applications to desktop processing tools and web-based viewers. The tools vary significantly in: - Processing location (cloud vs. local) - User control over parameters - Export capabilities - Target use cases (capture, processing, viewing, sharing) This overview provides important context for standardization work by documenting the current state of consumer 3DGS software ecosystem.	Proposal: Include the following changes in the draft TR.	manager: [Technical] The proposal to add many normative references to academic papers and commercial products is not appropriate for a TR study item; these should be informative references (and even then, only those actually cited/needed), otherwise the TR becomes dependent on unstable, non-standard, and potentially inaccessible sources. [Technical] Several listed “2024–2025” references (e.g., VGGT, AnySplat, GS‑LRM, iLRM, MetaSapiens, HTGS, sort-free variants) are not clearly identified with stable bibliographic details (authors/venue/version/URL), making them unsuitable as normative references and hard to verify even as informative references. [Technical] Adding TR 21.905 as a normative reference is questionable unless the new clause introduces terms that explicitly rely on 21.905 definitions; otherwise it is unnecessary and inconsistent with typical TR referencing practice. [Technical] The new Clause 11.3 content reads like a market survey of specific vendors (KIRI, Luma, Polycam, etc.) rather than technical study material; 3GPP TRs generally avoid endorsing or cataloging commercial offerings unless there is a clear methodological purpose and neutral selection criteria. [Technical] Claims about product pipelines and limitations (e.g., “SfM + Gaussian optimization”, “non-deterministic cloud processing”, “no manual SH order adjustment”, “no export capability as of Feb 2026”) are not backed by citations and may quickly become outdated, risking incorrect statements in the TR. [Technical] The document introduces file formats (.ply, .spz) and parameter concepts (SH order, pruning, “raw Gaussian parameters”) without defining them or linking them to the TR’s terminology/model; this creates ambiguity and weak traceability to the study objectives. [Technical] The inclusion of SSIM, alpha compositing, and GPU sorting as normative references is not justified by the described Clause 11.3 (which is product overview); if the TR needs these topics, they should appear in technical clauses with clear normative dependency and consistent scope. [Technical] If the intent is to inform standardization, the clause should extract common functional capabilities and gaps (capture metadata, pose formats, splat parameter sets, compression/streaming needs, rendering profiles) rather than listing per-product features; as written it does not translate into requirements or candidate work items. [Technical] The “processing location (Cloud/Local)” categorization is oversimplified and may be misleading for hybrid pipelines (on-device pose + cloud optimize, progressive refinement, etc.); the TR should use more precise pipeline stage breakdown if included. [Editorial] The contribution summary suggests “addition of normative references and a new clause,” but does not indicate exact target TR clause numbers/titles beyond “11.3” nor provide the exact proposed text; reviewers cannot assess consistency with surrounding clauses or numbering. [Editorial] Product descriptions use inconsistent technical depth and terminology (e.g., “3D Unscented (3DGUT) transform,” “Background Modulation for black segments”) without explanation; this reads like vendor marketing terms and is not aligned with 3GPP neutral style. [Editorial] The table fields (“export format options”) should be harmonized with defined terms (e.g., “Gaussian splat interchange format”) and should avoid listing proprietary/unclear formats (e.g., .spz) without a reference and short description. [Editorial] Time-sensitive statements (“as of February 2026”) are inappropriate for a TR unless clearly framed as an observation at the time of study with a citation; otherwise it will age poorly and require frequent maintenance. 2026-02-09 04:39
S4-260187 (pdf)	[FS_3DGS_MED] On Mapping to 3GPP services	Nokia	Summary of S4-260187: On Mapping to 3DGS Services 1. Introduction and Background This contribution from Nokia addresses objectives 3 and 5 of the FS_3DGS_MED study (approved in SP-251190 at SA#109). The objectives include: Objective 5: Mapping relevant workflows to 3GPP services Objective 3: Studying content generation aspects including network-based processing and Edge/Cloud operations for 3DGS representations The document notes that static 3DGS content generation workflow (documented in draft TR 26.958v0.1.0) consists of: - Capture - Structure from Motion (SfM) estimation for sparse point cloud reconstruction - Gaussian initialization - Training and optimization The contribution emphasizes that SfM and training are compute-heavy operations requiring architectural consideration. The document reviews two SA4 media service architectures as potential frameworks: the 5G Media Delivery architecture (TS 26.501, TS 26.510) and IMS (TS 23.228, TS 26.114), noting precedents from R18/R19 services like split rendering, avatar communications, media messaging, and spatial computing. 2. Media Service Enabler (MSE) Framework Architecture Overview The contribution proposes leveraging the MSE framework (TR 26.857) which provides Application Providers with well-defined client and network-side functions. The reference architecture includes: Defined Functions: - Application: UE-resident function leveraging MSE - MSE Client: Logical internal UE function for specific MSE - MSE Application Function (AF): Dedicated application function for an MSE - MSE Application Server (AS): Dedicated application server for an MSE Defined Interfaces/APIs: - MSE-1: Provisioning API for Application Providers - MSE-2: Optional ingest/egest API for content processing - MSE-3: Inter MSE AF-MSE AS communication - MSE-4: User plane interface between MSE Client and Server - MSE-5: Control API for configuration and management - MSE-6: Client APIs for internal application communication - MSE-7: External device APIs for accessing device functions - MSE-8: Application APIs for information exchange between Application and Provider 3DGS Workflow Mapping to MSE The proposed mapping for 3DGS content generation and sharing: - AP provisions service through MSE-1 - Session handling control plane information exchanged between MSE AF and MSE client over MSE-5 - Media communication over MSE-4 - Application uses device functions (cameras) via MSE-7 to capture images/video - Captured media transmitted over MSE-4 to MSE AS for SfM and training - Generated 3DGS or rendered views shared back to UE over MSE-4 3. IMS Data Channel (IMS DC) Architecture Architecture Overview The contribution proposes IMS DC (TS 23.228 Annex AC) as an alternative architecture, noting IMS as the backbone for conversational media in 3GPP networks. New Functional Entities: - Data Channel Signalling Function (DCSF): Manages data channel control logic, determines service availability, manages bootstrap and application data channel resources at MF via IMS AS, handles interworking between application data channel media and audio/video media - Media Function (MF): Provides media resource management and forwarding of data channel media traffic, manages bootstrap and application data channel resources, anchors application data channels in P2P scenarios, relays traffic between UEs and DC-AS, handles transcoding. SA2 specifies MF supports rendering (S4-251420) but not AIML functionality (S4-260022) - DC Application Repository (DC-AR): Stores verified data channel applications for retrieval by DCSF and download to UE - DC Application Server (DC-AS): Interacts with DCSF for resource control and traffic forwarding, serves as endpoint for application data channels, communicates with UE through MF. DC-AS functionalities are not 3GPP-specified DC-Relevant Reference Points: - DC1: Between DCSF and IMS AS - DC2: Between IMS AS and MF - DC3: Between DCSF and NEF - DC4: Between DCSF and DC Application Server - DC5: Between DCSF and DCAR - N70/Cx/Dx: Between CSCF and HSS (updated for DC signalling) - N71/Sh: Between IMS AS and HSS (updated for DC signalling) Data Channel Media Handling Reference Points: - MDC1: Between MF and DCSF - MDC2: Between MF and DC-AS, between BAR and DC-AS, between MF and BAR - MDC3: Between DCSF and DC-AS 3DGS Workflow Mapping to IMS DC The proposed mapping for 3DGS generation over IMS DC: - Service provider provides IMS DC application to DCAR - Provisions and configures resources via NEF and DC4 - UE downloads IMS DC app - IMS DC app sets up application data channel with DC-AS for service configuration - Uses device camera(s) to capture images/video - Transmits captured media to DC-AS for SfM and 3DGS training - Generated 3DGS shared back to UE or sent to MF for view-based rendering 4. Conclusion and Proposal The contribution proposes to develop mappings for 3DGS content generation and sharing workflows to both an MSE framework and to IMS DC architecture, considering both frameworks appropriate for 3DGS service deployment.	Proposal: develop mappings for 3DGS content generation and sharing work flows to an MSE framework and to IMS DC architecture.	manager: [Technical] The mapping assumes raw captured images/video are sent over MSE-4 / IMS DC data channels for SfM and training, but it does not address feasibility (uplink bitrate, latency tolerance, session duration) nor identify any 3GPP mechanisms for large-volume bulk upload vs conversational transport. [Technical] The proposal treats “SfM and 3DGS training in MSE AS / DC-AS” as straightforward, but it does not specify where compute is anchored (edge vs central), how UE selects/steers to the compute instance, or how continuity is handled if the UE moves—key for Objective 3 (edge/cloud operations). [Technical] For IMS DC, the contribution relies on DC-AS “not 3GPP-specified” while also placing core 3DGS processing there; this weakens the mapping because no normative service capabilities, QoS, security, or interop behavior can be referenced for the essential function. [Technical] The IMS DC description mixes roles: MF is said to support rendering (per S4-251420) and also “handles transcoding,” but it is unclear how 3DGS-specific rendering (view synthesis) maps onto MF capabilities without defining media formats, processing primitives, or whether rendering is in MF vs DC-AS. [Technical] The MSE mapping uses MSE-7 for camera access, but it does not clarify whether MSE-7 is intended to expose such device sensor/camera control for high-rate capture workflows, nor how capture synchronization/metadata (intrinsics/extrinsics, timestamps) needed for SfM is conveyed. [Technical] Neither mapping identifies the 3DGS representation formats and transport encapsulation (e.g., how a trained 3DGS model or progressive updates are packaged, versioned, and delivered), making it hard to assess consistency with TS 26.501/26.510 media delivery assumptions. [Technical] Security/privacy aspects are missing despite uploading potentially sensitive raw imagery to network compute; the contribution does not map authentication/authorization, consent, data retention, or encryption to MSE/IMS DC procedures and interfaces. [Technical] The workflows omit any control-plane procedures for job orchestration (start/stop, progress, retries, partial results, failure handling) and do not state whether these are carried on MSE-5 / IMS DC application channel, which is essential for long-running training tasks. [Technical] The IMS DC provisioning text (“Provisions and configures resources via NEF and DC4”) is unclear/possibly incorrect: NEF exposure is typically for northbound API exposure, but the specific API set and how it relates to DC4/DCSF control is not described, risking architectural inconsistency. [Editorial] Several interface names appear with typos or inconsistent terminology (e.g., “ingest/egest” likely “ingest/egress”), which reduces clarity when referencing TR 26.857 interface definitions. [Editorial] The contribution cites “draft TR 26.958v0.1.0” and multiple S4 references, but it does not pinpoint the exact clauses being mapped; adding clause-level references would make the mapping verifiable and align with study objective tracking. [Editorial] The conclusion proposes to “develop mappings” but does not provide concrete proposed TR text, work item impact, or specific deliverable updates (e.g., which sections of TR 26.958/26.857/IMS DC annexes would be amended), limiting usefulness as a contribution beyond high-level discussion. 2026-02-09 04:40
S4-260191 (pdf)	Dynamic 3DGS complexity	InterDigital New York	Summary of S4-260191: Dynamic 3DGS Complexity Document Overview This contribution from InterDigital addresses an open editor's note in TR 26.958 regarding scene complexity impacts on Dynamic 3D Gaussian Splatting (3DGS) feasibility for mobile platforms. The document proposes text for Clause 6.3 (Complexity) which is currently empty. Main Technical Contributions Scene Complexity Impact on Mobile Platforms The contribution identifies that dynamic scene complexity significantly affects the feasibility of dynamic 3DGS content on mobile devices. Key complexity drivers include: Number of Gaussians: Direct impact on memory and processing requirements Magnitude of motion: Affects rendering load and temporal prediction efficiency Topology changes: Increases complexity when scene structure varies Variability of Gaussian attributes: Impacts both storage and processing These parameters directly constrain: - Achievable frame rate - Session duration - Visual quality FFS: Determining maximum scene complexity that representative UE categories can sustain. Compression Complexity Considerations The document highlights that highly dynamic content (multi-person scenes, self-occlusions, cloth/hair motion) presents specific compression challenges: Reduces benefits of temporal prediction Requires more frequent keyframes Weakens temporal coherence assumptions in coding algorithms Increases both encoding and decoding complexity proportional to scene intrinsic complexity and temporal variability Dynamic 3DGS Format Categories The contribution proposes categorizing dynamic 3DGS representations based on temporal association of Gaussian primitives: Tracked: Gaussians maintain temporal associations across frames Partially tracked: Some temporal associations maintained Untracked: No temporal associations These categories differ in: - Efficiency for temporal prediction - Robustness to motion/topology changes FFS: Comparison of these formats regarding bitrate efficiency, latency, UE processing, and visual quality. Limitations of Original 3DGS Format The document notes that the original INRIA 3DGS representation has inherent limitations for dynamic content: Designed for per-scene optimization Static topology assumption Frame-independent Gaussian attributes Does not exploit temporal redundancy Not optimized for dynamic content Recent academic developments (references [1]-[4]) explore alternatives addressing these limitations. FFS: Whether multiple dynamic-oriented 3DGS formats may coexist. Conclusion The contribution proposes adding the provided text to Section 6.3.X or another suitable section of TR 26.958 to address the open editor's note on complexity considerations for Dynamic 3DGS content.	Proposal: It is proposed to add the text above in section 6.3.X or other suitable section of TR 26.958.	manager: [Technical] The proposed Clause 6.3 text is largely qualitative and does not resolve the editor’s note intent (“scene complexity impacts on feasibility”) because it provides no measurable complexity metrics (e.g., Gaussian count ranges, per-Gaussian attribute sizes, update rates, target FPS/resolution) or any method to map “complexity drivers” to UE capability classes. [Technical] “Number of Gaussians” is cited as a primary driver, but the contribution does not distinguish between active/visible Gaussians per frame vs total stored Gaussians, nor does it account for view-dependent culling/LOD—key to actual mobile rendering and decoding complexity. [Technical] The “tracked / partially tracked / untracked” categorization is underspecified and risks being non-actionable: it does not define what constitutes an association (ID persistence? correspondence confidence? motion model?) or how partial tracking is signaled/quantified, making it hard to use consistently elsewhere in TR 26.958. [Technical] Compression statements (e.g., “requires more frequent keyframes”, “reduces benefits of temporal prediction”) are plausible but incomplete without identifying which coding tools/architectures are assumed (e.g., inter-frame prediction of attributes, topology delta coding, entropy coding), so the conclusions may not generalize across candidate dynamic 3DGS codecs. [Technical] The text implies decoding complexity scales with “intrinsic complexity and temporal variability” but does not separate encoder-side complexity (tracking, correspondence, optimization) from decoder-side complexity (parsing, reconstruction, rendering), which is critical for UE feasibility discussions in S4. [Technical] “Magnitude of motion” and “topology changes” are listed as drivers, but there is no proposal for how to measure them (e.g., per-Gaussian displacement statistics, birth/death rates, attribute change rates), leaving the clause unable to support later normative requirements or comparative evaluations. [Technical] The contribution discusses “session duration” constraints without clarifying whether this is due to thermal throttling, battery drain, memory pressure, or network bitrate; without tying to specific resource models, the statement is too vague to guide TR conclusions. [Technical] The claim that original INRIA 3DGS has “frame-independent Gaussian attributes” and “static topology assumption” is not carefully framed for dynamic extensions—readers may interpret it as a spec limitation rather than a property of a particular academic method; this needs clearer scoping to avoid misleading conclusions about what 3GPP could standardize. [Technical] The proposal introduces multiple FFS items (max scene complexity per UE category; comparison of tracked formats; coexistence of formats) but does not propose any evaluation plan, test conditions, or reporting parameters, which weakens its usefulness as TR text intended to close an editor’s note. [Editorial] The contribution references “Clause 6.3 (Complexity) currently empty” but does not provide the exact draft text with placement, numbering (e.g., 6.3.1/6.3.X), or integration points with existing clauses/terminology in TR 26.958, increasing editor burden and risk of inconsistency. [Editorial] Several terms are introduced without alignment to 3GPP style/definitions (e.g., “topology changes” for Gaussian sets, “temporal association”, “intrinsic complexity”), and should either be defined in a definitions clause or rephrased to match existing TR vocabulary. [Editorial] The summary cites references [1]-[4] but does not indicate how they map to the proposed categories or complexity claims; the clause should explicitly tie each key assertion to a reference or mark it as an observation to avoid appearing unsubstantiated. 2026-02-09 04:40
S4-260239 (pdf)	[FS_3DGS_MED] Pseudo-CR on 3DGS delivery workflows for large 3DGS scenes	Tencent Cloud	Summary of 3GPP Technical Document S4-260239 Document Overview This is a pseudo-CR to TR 26.958 v0.1.1 addressing viewport-adaptive delivery workflows for large-scale 3D Gaussian Splatting (3DGS) scenes in the context of FS_3DGS_MED study. The contribution focuses on enabling delivery of massive 3DGS environments (e.g., city-scale digital twins) to mobile devices with constrained resources. Problem Statement Large-scale 3DGS scenes (as defined in clause 5.4) cannot be fully loaded into mobile device memory due to: - Bandwidth limitations - Memory constraints - Rendering capacity restrictions Static delivery workflows would result in: - Excessive latency - Immediate resource saturation - Inability to deliver complete scenes Simple capability negotiation alone is insufficient for these use cases. Main Technical Contributions Viewport-Adaptive Workflow (Clause 9.2.3) The document proposes a new clause 9.2.3 introducing a viewport-adaptive workflow that extends existing capability negotiation mechanisms by incorporating continuous spatial feedback. Core Mechanism Dynamic Spatial Context: UE continuously transmits 6DoF pose and Field of View (FoV) to server Metadata Format: Adheres to formats defined in TR 26.928 (XR services) Rendering Budget Management: Server optimizes 3DGS stream relative to user's perspective while staying within negotiated rendering budget Spatial Optimization Strategies (Clause 9.2.3.2) Two approaches are defined: Tiled Environments with LOD Environment partitioned into spatial tiles Multiple levels of detail (LOD) per tile Server selects appropriate LOD based on: Proximity to user Visibility within frustum LOD Distribution: High-density tiles (e.g., LOD 4) for viewport center Lower-density tiles (e.g., LOD 1-3) for peripheral/distant areas Concentrates point budget where user is looking Unstructured Scenes Real-time frustum culling, pruning, and merging High point density in center of FoV Aggressive simplification in peripheral zones Dynamic primitive removal/merging for non-visible areas Server-Centric Decision Workflow (Clause 9.2.3.3) Two-Phase Approach: Static Initialization Phase Hardware Capabilities Assessment: UE evaluates resources via system APIs or OpenXR Capability Reporting: UE transmits comprehensive capability report to server Server-Side Capability Decision: Server defines global rendering budget (max point count, SH degree) for session Dynamic Delivery Phase Viewpoint and FoV Determination: UE calculates current 6DoF pose and camera frustum Viewpoint and FoV Information: UE sends spatial metadata to server Content Adaptation Based on FoV: Server selects visible spatial tiles and adapts content (pruning, merging, LOD selection, quantization) to fit budget and user's view Optimized 3DGS Data: Server streams adapted content payload (N points) to UE Local Adaptation: UE performs final on-device adjustments if necessary 3DGS Rendering: UE renders the scene Key Characteristic: Server maintains control over rendering budget throughout session based on initial capability assessment. Client-Centric Decision Workflow (Clause 9.2.3.4) UE-Driven Approach: Initialization Phase Hardware Assessment Analysis: UE performs internal audit of hardware capabilities Decision of Best Representation Format: UE selects optimal configuration (max point count, SH degree) 3DGS Format Request: UE requests content from server, specifying desired format parameters (point budget, SH degrees, quantization) Delivery Phase Viewpoint and FoV Determination: UE calculates current spatial position and FoV Viewpoint and FoV Information: UE sends spatial metadata to server Content Adaptation Based on FoV: Server filters scene spatially (frustum culling/tile selection) and adapts data to match format requested in step 3 Optimized 3DGS Data: Server delivers visible content conforming to requested parameters Local Adaptation: UE applies final local refinements for runtime stability 3DGS Rendering: UE renders received content Key Characteristic: UE explicitly requests specific representation format during initialization; server's role restricted to spatial operations while adhering to UE-imposed format constraints. Alignment with Existing Specifications Builds upon capability negotiation described in clause 9.2.2 Aligns with viewport-dependent streaming principles from TR 26.928 (XR services) Addresses use case defined in clause 5.4 (Large 3DGS scenes) Proposal The document proposes to agree the changes introducing clause 9.2.3 and its subclauses (9.2.3.1-9.2.3.4) to TR 26.958, including two workflow diagrams (Figures 5 and 6) and one illustration of tile/LOD selection (Figure 4).	Proposal 1: Agree the changes to 3GPP TR 26.958 as described in clause 4 of this contribution, introducing a new clause 9.2.3 on viewport-adaptive workflow with capability negotiation for large scenes.	manager: [Technical] The proposed clause 9.2.3 introduces continuous UE pose/FoV reporting but does not specify the transport/protocol binding (e.g., which 3GPP interface, message type, timing model), making the workflow non-actionable and potentially inconsistent with TR 26.958’s assumed delivery architecture. [Technical] Referencing “metadata format adheres to TR 26.928” is too vague: TR 26.928 contains multiple XR metadata constructs, and the contribution does not identify the exact fields (pose, projection, frustum, timestamps) nor how they map to 3DGS delivery, risking incompatible implementations. [Technical] The server-centric workflow claims the server “maintains control over rendering budget throughout session,” but it doesn’t define how budget enforcement is verified/updated when UE conditions change (thermal throttling, background load, network degradation), which is critical for mobile feasibility. [Technical] Both workflows omit latency/jitter handling for 6DoF feedback (prediction, time synchronization, stale pose handling); without this, viewport-adaptive selection can cause visible popping and incorrect tile selection, especially over cellular RTT. [Technical] The “tiled environments with LOD” approach does not define tile addressing, coordinate reference system, tile boundary handling, or LOD switching rules (hysteresis), which are necessary to avoid oscillation and seams when the user moves/rotates. [Technical] The “unstructured scenes” approach lists pruning/merging but does not define objective metrics (error bounds, screen-space density targets) or constraints to preserve visual fidelity; this reads as an algorithm sketch rather than spec-quality workflow guidance. [Technical] The contribution introduces parameters like “max point count” and “SH degree” but does not align them with any existing 3DGS representation constraints in TR 26.958 clause 5.4/9.2.2 (e.g., whether SH degree is mandatory/optional, per-point vs per-tile), risking internal inconsistency. [Technical] The client-centric workflow lets the UE request “quantization” and other format parameters, but there is no negotiation/error handling if the server cannot satisfy the request (fallback modes, rejection, partial compliance), which is essential for interoperability. [Technical] Step 8 “Local Adaptation” is underspecified and potentially contradicts the earlier “budget control” premise: if UE can further adapt, the spec should clarify what is allowed (dropping points, reducing SH) and how that interacts with negotiated constraints. [Technical] The workflows do not address caching/prefetching strategies (e.g., near-future tiles along motion vector) or continuity requirements; for large-scale scenes, purely reactive frustum streaming will likely fail bandwidth/latency targets. [Technical] Security/privacy implications of continuous pose streaming (user location/behavior leakage) are not mentioned; at minimum, the clause should note privacy considerations or reference relevant 3GPP security guidance. [Editorial] The document is described as a “pseudo-CR” but the proposed changes are not presented with explicit CR-style change markup, making it hard to review exact normative text deltas and to ensure clause numbering (9.2.3.1–9.2.3.4, Figures 4–6) is consistent with the current TR 26.958 v0.1.1 structure. [Editorial] Terminology is inconsistent: “UE,” “client,” and “mobile device” are used interchangeably, and “server” is not clearly distinguished from “application server/content server,” which can confuse the architecture assumed in clause 9.2.x. [Editorial] Several claims are absolute without qualification (e.g., “simple capability negotiation alone is insufficient”); the text should either provide supporting rationale/conditions (e.g., scene scale thresholds) or soften to “may be insufficient” to match TR study-item style. 2026-02-09 04:41
S4-260245 (pdf)	[FS_3DGS_MED] High level media data workflows for All-in-client configuration	Samsung Research America	Change Request Summary: High Level Media Data Workflows for All-in-Client Configuration CR Metadata Document: 3GPP TR 26.958 v0.1.1 CR Category: B (addition of feature) Release: Rel-20 Work Item: FS_3DGS_MED (3D Gaussian Splatting Media Study) Source: Samsung Electronics Co. Ltd. Purpose This CR adds high level media data workflows for the All-in-client configuration to the 3D Gaussian Splatting (3DGS) media study. The workflows describe how different 3DGS service use cases can be realized when processing steps primarily run on the UE. Technical Contributions New Clause 9.1: All-in-Client Configuration 9.1.1 Description Defines the All-in-client configuration as workflows where functionality primarily runs on the UE for the use cases described in clause 5 of the technical report. 9.1.2 Media Workflow Steps Workflow Description The CR identifies the following key workflow steps that can be executed on the UE: Scene Capture Utilizes UE cameras (rear or front) on mobile devices Supports multiple viewpoint capture for coverage and parallax Includes application-guided user interaction Collects auxiliary signals (references clause 5.2) 3DGS Model Generation Generation of static 3DGS models on UE (subject to device capability) Referenced to clause 5.2 Animation Stream Generation Creation of time-aligned animation streams for 3D Avatar animation Referenced to clause 5.5 Packaging and Distribution 3DGS assets (static objects, scenes, or dynamic sequences) and animation streams are packaged Distribution via multiple channels: MMS, OTT messaging, or download Supports UE-to-UE or UE-to-network device transmission Referenced to clauses 5.2 and 5.5 Asset Reception and Storage UE receives one or more 3DGS assets Storage in local memory or GPU memory Supports dynamic 3DGS scene content via file delivery (clause 5.4) Rendering Gaussian selection based on Level of Detail (LOD) LOD selection dependent on: User preferences UE device capabilities and characteristics Camera pose Display resolution Support for time-aligned animation streams Referenced to clauses 5.2 and 5.5 Configuration Characteristics The CR defines three key characteristics for the All-in-client configuration: Latency/Performance Dependent on UE device capabilities for capture, generation, and rendering operations Scalability Limited by UE device capabilities: Local memory GPU memory Decoding capabilities Network Usage Network only used for distribution/asset transfer No network interaction during playback All viewpoint updates and navigation handled locally after 3DGS data reception Impact Assessment The CR states that if not approved, the study would be incomplete, indicating this is a foundational contribution to the 3DGS media workflows study.	Extracted Proposals This document does not contain any proposals. The document describes technical specifications for high-level media data workflows for All-in-client configuration in 3DGS media delivery, but no sections are explicitly marked as "Proposal" in any of the standard formats.	manager: [Technical] The proposed “All-in-client” definition (“functionality primarily runs on the UE”) is too vague to be normative within a TR clause; it should explicitly scope which functions are assumed local (capture, 3DGS training/reconstruction, packaging, rendering) and which are out of scope (e.g., any server-side optimization, CDN delivery), otherwise it overlaps ambiguously with other configurations likely covered elsewhere in the report. [Technical] “3DGS Model Generation on UE” is asserted without stating feasibility constraints or alternatives (e.g., on-device incremental training vs. conversion from prebuilt assets); given 3DGS training is compute/memory intensive, the workflow should at least acknowledge optionality, minimum capability assumptions, or fallback paths to avoid an unrealistic baseline. [Technical] The “Packaging and Distribution” step introduces MMS as a distribution channel, which is atypical for large 3D assets and not aligned with 3GPP media delivery assumptions; if kept, it needs constraints (asset size limits, fragmentation, reliability) or should be generalized to “messaging/file transfer” without naming MMS. [Technical] The workflow omits any explicit handling of codec/format signaling and compatibility (e.g., how a UE knows the 3DGS asset format/version, animation stream format, and required rendering features), which is critical for interoperability even in a study report. [Technical] “Asset Reception and Storage” mentions “storage in GPU memory” as if it were persistent storage; GPU memory is transient and device/OS-managed, so the workflow should distinguish persistent storage vs. runtime upload/caching to GPU. [Technical] The “Network only used for distribution/asset transfer” and “No network interaction during playback” statements are too absolute and conflict with common needs like progressive download, adaptive LOD streaming, updates, rights/license checks, or telemetry; this should be phrased as a typical/optional characteristic rather than a defining property. [Technical] Rendering step lists LOD selection inputs (preferences, capabilities, pose, resolution) but does not mention performance targets (frame rate, thermal constraints) or how LOD relates to 3DGS-specific parameters (e.g., Gaussian count, SH degree, splat size), making the workflow incomplete for 3DGS-specific study value. [Technical] The “Animation Stream Generation” and “time-aligned animation streams” are referenced, but the workflow does not define the synchronization mechanism (timestamps, clock source, alignment to render frames) or whether animation is applied to Gaussians, camera, or avatar rig—key to understanding feasibility. [Technical] The contribution references clause 5.4 for “dynamic 3DGS scene content via file delivery,” but does not clarify whether dynamic sequences are precomputed 3DGS frames, parameter deltas, or hybrid representations; without this, “dynamic” is underspecified and may contradict other parts of the TR. [Editorial] Clause numbering “New Clause 9.1” is presented without confirming existing Clause 9 structure in TR 26.958 v0.1.1; the CR should ensure numbering, titles, and cross-references are consistent with the current document skeleton. [Editorial] Multiple bullets say “Referenced to clause 5.2/5.5” but do not specify which subclauses or exact concepts are being reused; tighter references (e.g., 5.2.x) would improve traceability and reduce ambiguity. [Editorial] The “Configuration Characteristics” section reads like conclusions rather than a workflow description and repeats “dependent on UE capabilities” without adding measurable or comparative insight; it would be stronger to relate characteristics to the steps (capture/training/rendering) and to other configurations in the TR. 2026-02-09 04:41
S4-260247 (pdf)	[FS_3DGS_MED] High level media data workflows for Client-Server configuration	Samsung Research America	Summary of 3GPP TR 26.501 Change Request Document Information Meeting: 3GPP TSG-S4 Meeting #135, Goa, India (9-13 February 2026) Document Number: S4-260247 CR Type: Category B (addition of feature) Release: Rel-20 Work Item: FS_3DGS_MED (3D Gaussian Splatting Media Study) Main Purpose This CR introduces high level media data workflows for Client-Server configuration in the context of 3D Gaussian Splatting (3DGS) service delivery. This complements the existing all-in-client configuration by defining workflows where functionality is split between the UE client and the network server. Technical Contributions New Clause 9.2: Client-Server Configuration 9.2.1 Description Defines media data workflows where functionality is split between client and server (network) Supports interactive navigation in large or dynamic 3DGS scenes Enables network-assisted processing for resource-intensive operations 9.2.2 Media Workflow Steps 9.2.2.1 Workflow Description The CR identifies the following workflow steps that execute in the Client-Server configuration: Server-Side Operations: - 3DGS Content Generation: Generation of 3DGS content from 2D captures of scenes (references clause 5.2) - Dynamic Content Generation: Creation of dynamic 3DGS content and region-based parts of 3DGS scenes based on the 3DGS model for adaptive delivery - Adaptive Selection: Selection of 3D tiles and their Level of Detail (LOD) based on: - User movement - UE device capabilities - Packaging and Distribution: 3DGS assets packaged in the network and delivered via: - MMS - OTT messaging - Download services Client-Side Operations: - Content Reception: UE receives one or more of: - 3DGS assets (clause 5.2) - 3D tiled LODs (clause 5.3) - Dynamic 3DGS scene content via file delivery, partial-delivery, or on-demand streaming (clause 5.4) - Storage: Content stored in local memory or GPU memory - Rendering: UE renders: - 3DGS assets by selecting Gaussians based on LODs dependent on: - User preferences - UE device capabilities and characteristics (camera pose, display resolution, etc.) - 3D tiled LODs fetched using adaptive delivery (clause 5.3) - 3D Avatars using time-aligned animation streams (clause 5.5) 9.2.2.2 Characteristics The CR defines key characteristics of the client-server configuration: Latency/Performance: - Dependent on network and application latency - Influenced by: - Capabilities of the network server generating 3DGS content - UE device rendering capabilities Scalability: - Enhanced scalability compared to all-in-client configuration - Leverages theoretically infinite network resources - Enables more use cases Network Usage: - Content generation - Rendering (full or partial) - Distribution Network Interaction During Playback: - Selection of 3D tiles and LODs - Sending user pose information - Temporal updates - Optional partial or full rendering support Affected Clauses Clause 9.2: New clause added for Client-Server configuration Relationship to Existing Work This CR builds upon: - Clause 5 use cases (referenced throughout) - Clause 9.1 All-in-client configuration (complementary approach) - Various delivery mechanisms already defined in the study (MMS, OTT messaging, file delivery, streaming)	Proposal 1: Capture the description and characteristics of the client-server configuration for 3DGS media data workflows as described in clause 9.2 in TR 26.958.	manager: [Technical] The proposed “Client-Server configuration” is underspecified as a workflow because it does not define any normative interfaces or information exchange (e.g., pose/viewport signaling, tile/LOD request/response, timing model), so it is unclear how interoperability would be achieved beyond a conceptual split. [Technical] “Packaging and Distribution … via MMS, OTT messaging, Download services” is not technically credible for interactive navigation and on-demand tile/LOD delivery due to latency, payload size, and session control constraints; the clause should either constrain these to non-interactive/offline use or align with appropriate 3GPP delivery frameworks (e.g., DASH/CMAF, 5GMS, MBS) already used for streaming. [Technical] The text mixes responsibilities inconsistently: server “Adaptive Selection” chooses tiles/LODs based on “UE device capabilities,” while the client also “select[s] Gaussians based on LODs”; the split of decision-making (server vs client) and the resulting data sent (selected tiles vs multi-LOD representations) needs to be made consistent. [Technical] “Optional partial or full rendering support” implies server-side rendering, but no rendering output format, transport, latency budget, or synchronization with local rendering is described; without defining whether this is video streaming, point/gaussian streaming, or hybrid composition, the workflow is incomplete. [Technical] “Dynamic Content Generation … region-based parts … for adaptive delivery” introduces new concepts (dynamic 3DGS, region-based parts) without defining how regions/tiles are partitioned, addressed, versioned, or updated over time, which is essential for any client-server adaptive scheme. [Technical] The clause claims “Enhanced scalability … leverages theoretically infinite network resources,” which is misleading and ignores bottlenecks (uplink pose signaling, per-user state, edge compute limits); scalability should be qualified with realistic assumptions and constraints. [Technical] “Network Usage: Content generation, Rendering (full or partial), Distribution” is too broad and conflates offline authoring with real-time session operations; the workflow should separate pre-processing (content generation) from runtime adaptation/streaming to avoid architectural ambiguity. [Technical] The client “Storage … in local memory or GPU memory” is implementation-specific and not appropriate even for a TR workflow description unless tied to a requirement/assumption (e.g., caching model, persistence, eviction), otherwise it adds noise without technical value. [Technical] Referencing “UE device capabilities and characteristics (camera pose, display resolution, etc.)” conflates static capabilities with dynamic state; if pose is used for adaptation, the clause should explicitly define update rate, coordinate system, and privacy/security considerations for transmitting pose to the network. [Technical] The workflow lists “3D Avatars using time-aligned animation streams (clause 5.5)” but does not explain how avatar streams synchronize with 3DGS scene updates/tiles (timestamps, clock model, buffering), risking inconsistency with any existing timing assumptions in clause 5. [Editorial] The CR summary repeatedly cites “clause 5.2/5.3/5.4/5.5” but does not indicate whether those clauses already define the needed primitives for client-server operation; add explicit cross-references to the exact subclauses and confirm no contradictions with clause 9.1 terminology. [Editorial] Terminology is inconsistent and sometimes vague (“network server,” “server (network),” “OTT messaging,” “download services,” “partial-delivery”); the clause should align with 3GPP-defined terms (e.g., AF/AS, 5GMS Application Server, file delivery over HTTP) and define any new terms. [Editorial] The “Characteristics” section reads like marketing statements rather than a technical study outcome; it should be rewritten to state measurable factors (latency contributors, bandwidth drivers, compute split options) and remove unsubstantiated claims. 2026-02-09 04:41
S4-260249 (pdf)	[FS_3DGS_MED] Mapping 3DGS to 3GPP services with All in UE configuration	Samsung Research America	Summary of 3GPP TR 26.501 Change Request Document Information CR Number: S4-260249 Specification: 3GPP TR 26.958 v0.1.1 Work Item: FS_3DGS_MED (3D Gaussian Splatting for Media) Category: B (addition of feature) Release: Rel-20 Purpose This CR addresses the mapping of 3D Gaussian Splatting (3DGS) services to 3GPP services and specifications, specifically for the "All in UE" configuration. Main Technical Contributions Clause 10: Mapping to 3GPP Services/Specifications This CR introduces a new clause (Clause 10) that maps high-level media data workflows for 3DGS to different 3GPP services. The mapping covers two configurations, with this CR specifically detailing the "All in UE" configuration. 10.1: All in UE Configuration Mapping In this configuration, 3DGS content is treated as downloadable or message-based assets. The CR provides a comprehensive mapping table that covers the following functional areas: Content Generation Functions: Scene capture, static 3DGS model generation, time-aligned animation stream generation for animating 3D Avatars 3GPP Mapping: 3DGS/XR Application on the UE Reference: Media-Aware Application of Media Delivery architecture (TS 26.501) 3DGS File Delivery 3GPP Mapping: MMS (TS 26.140, TS 26.143) RCS messaging HTTP file transfer Reference: Media Access Function of Media Delivery architecture (TS 26.501) Functionality: Provides upload/download function for sending and receiving 3DGS content Storage 3GPP Mapping: UE Local storage (no 3GPP-specific mapping) Rendering and Playback 3GPP Mapping: 3DGS/XR Application on the UE Reference: Media-Aware Application (TS 26.501) Key Technical Notes NOTE 1 - File-based Delivery Requirements: - 5G latency or jitter requirements do not apply (strict 5G QoS is not necessary) - Low Packet Error Rate and reliable delivery required - Standard 5G bearers specified in TS 23.501 are adequate to carry 3DGS content NOTE 2 - Storage: No 3GPP-specific mapping required Technical Implications The CR establishes that for the All in UE configuration: 1. 3DGS content follows a file-based delivery model rather than streaming 2. Existing 3GPP services (MMS, RCS, HTTP) are sufficient for content delivery 3. Standard 5G bearers are adequate without requiring enhanced QoS 4. The architecture aligns with the existing Media Delivery framework in TS 26.501	Extracted Proposals This document does not contain any proposals. The document is a Change Request (CR) for 3GPP TR 26.958 that describes the mapping of 3DGS (3D Gaussian Splatting) services to different 3GPP services/specifications for the All-in-UE configuration, but it does not include any sections explicitly marked as "Proposal" with the various formats mentioned.	manager: [Technical] The statement in NOTE 1 that “5G latency or jitter requirements do not apply” is too absolute; even file-based delivery can have user-experience constraints (e.g., time-to-first-render, progressive download), so the CR should qualify the conditions (offline vs interactive) and avoid implying no QoS considerations at all. [Technical] Claiming “Standard 5G bearers specified in TS 23.501 are adequate” is underspecified and potentially misleading because TS 23.501 defines QoS framework and 5QI characteristics; the CR should indicate which QoS treatment is assumed (e.g., default/non-GBR, TCP-based delivery) or explicitly state that no specific 5QI is mandated. [Technical] Mapping “3DGS File Delivery” to MMS (TS 26.140/26.143) and RCS messaging is questionable for typical 3DGS asset sizes; MMS/RCS have practical payload limits and store-and-forward behaviors that may not support large 3DGS datasets, so the CR should either constrain the use case (small assets/thumbnails) or prioritize HTTP-based delivery. [Technical] The CR asserts a “file-based delivery model rather than streaming,” but 3DGS can be delivered progressively or as time-aligned animation streams (as mentioned under content generation); the mapping should address whether timed delivery uses DASH/MPEG-based streaming, MBS, or other 3GPP media streaming services rather than only “download/message-based assets.” [Technical] The “time-aligned animation stream generation” function is mapped only to an on-UE application, but the CR does not explain how timing, synchronization, and clock/reference (e.g., with audio/video or pose streams) are handled within the 3GPP media framework; this is a gap if the clause is meant to be a workflow mapping. [Technical] The mapping to “Media Access Function of Media Delivery architecture (TS 26.501)” for MMS/RCS/HTTP is not clearly justified; TS 26.501’s Media Delivery architecture typically assumes HTTP-based media access, and messaging services are not obviously “Media Access Function” instances—this needs architectural consistency or a rationale. [Technical] “Low Packet Error Rate and reliable delivery required” is vague and somewhat contradictory with the earlier dismissal of QoS; if reliability is a requirement, the CR should clarify whether it relies on transport-layer reliability (TCP/QUIC) versus radio-layer QoS, and what failure/retry behavior is expected. [Technical] Storage is mapped to “UE local storage (no 3GPP-specific mapping),” but the clause should at least mention whether any 3GPP enablers are relevant for content management (e.g., caching, content hosting, or application-layer DRM/security) if the intent is a comprehensive mapping. [Editorial] The contribution references “3GPP TR 26.501” and “TS 26.501” interchangeably; TS 26.501 is a Technical Specification, so the document should use consistent and correct document type references. [Editorial] The CR summary says it introduces “a new clause (Clause 10)” but does not show the actual inserted text, table structure, or exact normative wording; for a CR review, the proposed clause text and precise edits are necessary to assess completeness and consistency. [Editorial] The term “All in UE configuration” is used without a clear definition or cross-reference to earlier clauses in TR 26.958; the clause should explicitly define the configuration assumptions (where capture, encoding, delivery, rendering occur) to avoid ambiguity. [Editorial] The mapping table items mix functions (“scene capture”) with deployment statements (“3DGS/XR Application on the UE”) and with 3GPP service names; the table would be clearer if it separated functional blocks, 3GPP enablers, and interfaces, and used consistent terminology aligned with TS 26.501 entities. 2026-02-09 04:42
S4-260250 (pdf)	[FS_3DGS_MED] Mapping 3DGS to 3GPP services with Client-Server configuration	Samsung Research America	3GPP TR 26.501 Change Request Summary Document Information CR Number: Pseudo CR for TR 26.958 v0.1.1 Category: B (addition of feature) Release: Rel-20 Work Item: FS_3DGS_MED Source: Samsung Electronics Co. Ltd. Purpose This CR addresses the mapping of 3D Gaussian Splatting (3DGS) services to 3GPP services and specifications for the Client-Server configuration. This complements the existing All-in-UE configuration mapping. Main Technical Contributions Clause 10.2: Client-Server Configuration Mapping The CR introduces a comprehensive mapping table that defines how different 3DGS workflow functions map to existing 3GPP services when operating in a Client-Server configuration. In this configuration, 3DGS is delivered as either an interactive XR service or 6DoF media streaming. Content Generation UE-side 2D Capture: Maps to Media-Aware Application (TS 26.501) and Split Rendering Client (TS 26.565) Network-side 3DGS Generation: Maps to (Edge) Media AS of Media Delivery architecture (TS 26.501), including: 3DGS scene generation from 2D capture Dynamic 3DGS content Region-based parts of 3DGS scenes Caching 3DGS Model and Tile Caching: Utilizes 5G Edge CDN infrastructure Delivery Streaming/Real-time Communication: Supports tiled LOD streaming using: Adaptive media delivery protocols (DASH, HLS, RTP, QUIC) Partial delivery or on-demand streaming 3GPP Mapping: (Edge) Media AS and Media Access Function (TS 26.501), Split Rendering Server (TS 26.565) Network Rendering Edge Rendering: Partial or full network edge rendering 3GPP Mapping: (Edge) Media AS (TS 26.501) and Split Rendering Server (TS 26.565) UE Rendering/Playback Client-side Rendering: Maps to Media-Aware Application (TS 26.501) and Split Rendering Client (TS 26.565) Pose/LOD Reporting Uplink Reporting: Maps to: Media-Aware Application (TS 26.501) and Split Rendering Client (TS 26.565) for pose/LOD capture Real-time/conversational service interfaces (TS 26.506) for transfer of pose and LOD information to generate view/LOD dependent 3DGS content 5G QoS Considerations (NOTE 1) The CR identifies key 5G QoS requirements for 3DGS service delivery (marked as FFS for final determination): Performance Requirements High bit rates/bandwidth/throughput: Leveraging 3GPP eMBB Low to Ultra Low Latencies: Utilizing 5G URLLC Low jitter delivery XR-Specific QoS Features New 5QI for XR services PDU Set based QoS with parameters: PDU Set Error Rate PDU Set Delay Budget PDU Set Size Alternative QoS Profiles Network Architecture Features Service Stability: Using 5G Network Slicing (e.g., dedicated network slice for 3DGS delivery) per TS 23.501 Edge Computing: Caching with edge CDNs and network processing (generation, rendering) per TS 26.501 and TS 23.558 Multicast/Broadcast: Distribution to multiple users per TS 26.502 Additional Considerations (NOTE 2) For Avatar-related 3DGS services, TS 26.264 may be applicable. Impact This CR completes the study on mapping 3DGS to 3GPP services by addressing the Client-Server configuration, which is essential for network-assisted 3DGS delivery scenarios including edge rendering and adaptive streaming use cases.	Proposal: The functions of different workflows for this configuration are described in clause 9.2 of the present document. Below is a mapping of those functions to 3GPP services. In this configuration, 3DGS is delivered as an interactive XR service or 6DoF media streaming.	manager: [Technical] The CR is internally inconsistent on the target spec: it is titled “TR 26.501 Change Request Summary” but states “Pseudo CR for TR 26.958 v0.1.1”; the exact target document, version, and clause numbering (e.g., “Clause 10.2”) must be aligned or the change cannot be applied. [Technical] The mapping heavily relies on TS 26.565 “Split Rendering Client/Server” without justifying applicability to 3DGS streaming/generation; split rendering is defined for specific XR rendering splits, so the CR should state which split(s) are assumed and what interfaces carry 3DGS-specific data. [Technical] “Network-side 3DGS generation from 2D capture” is mapped to “(Edge) Media AS” in TS 26.501, but TS 26.501 Media AS is primarily for media processing/delivery functions; the CR should clarify whether 3DGS reconstruction is in scope for Media AS or requires an application server outside the media architecture. [Technical] The delivery section lists “DASH, HLS, RTP, QUIC” as protocols, but TS 26.501/26.501-based architectures typically reference specific 3GPP media profiles (e.g., DASH in TS 26.247/26.244, RTP/RTCP in TS 26.114/26.237); “HLS” and generic “QUIC” are not clearly anchored to 3GPP normative specs here. [Technical] “Tiled LOD streaming” and “region-based parts of 3DGS scenes” are introduced without mapping to an existing 3GPP tiling/ROI mechanism (e.g., MCTS/viewport-dependent streaming concepts); the CR should specify how tiles/LOD are addressed, signaled, and synchronized with pose. [Technical] Pose/LOD reporting is mapped to “real-time/conversational service interfaces (TS 26.506)”, but TS 26.506 is not the typical anchor for XR pose transport; the CR needs to identify the exact service/API (and directionality, timing constraints) and how it interworks with media delivery (e.g., RTP header extensions, SEI, timed metadata). [Technical] QoS claims are problematic: “utilizing 5G URLLC” for 3DGS delivery is likely unrealistic for high-throughput media and conflicts with typical XR QoS handling (e.g., XR 5QI work); the CR should avoid implying URLLC unless it specifies which flows (pose vs media) and how they map to standardized QoS flows. [Technical] “New 5QI for XR services” and “PDU Set based QoS” are presented as if available, but in a TR mapping clause they should be referenced to the exact 3GPP stage-2/stage-3 specs and current Rel-20 status; otherwise this reads as speculative and may contradict agreed QoS frameworks. [Technical] “3DGS Model and Tile Caching: Utilizes 5G Edge CDN infrastructure” is vague and not mapped to a 3GPP-defined function (e.g., M4E/Edge enablers, 5GMS AF/AS roles); the CR should state whether caching is in 5GMSd, 5GMSu, or generic edge CDN outside 3GPP scope. [Technical] The CR mixes “interactive XR service” and “6DoF media streaming” but does not state which 3GPP service category (e.g., 5GMS, MTSI, XR conversational) is assumed per workflow; this ambiguity undermines the mapping table’s usefulness and may lead to contradictory function assignments. [Technical] Multicast/broadcast mapping to TS 26.502 is questionable: TS 26.502 is 5GMS architecture, but multicast/broadcast for media is typically tied to 5MBS/MBMS-related specs; the CR should reference the correct multicast/broadcast specifications and indicate whether 5GMS supports the intended distribution mode. [Editorial] The contribution claims to “introduce a comprehensive mapping table” in Clause 10.2, but no actual table content (rows/columns, exact mappings) is included in the CR summary; reviewers cannot verify completeness, consistency, or whether mappings duplicate/contradict existing clauses. [Editorial] Several references are imprecise or potentially wrong (e.g., “TS 26.501 Media-Aware Application” vs the actual 5GMS entities, “Edge Media AS” terminology); the CR should use exact entity names as defined in the target TR/TS to avoid misinterpretation. [Editorial] “NOTE 1/NOTE 2” content is largely FFS and reads like requirements rather than mapping; if kept, it should be clearly scoped as informative text and separated from normative mapping statements to avoid overstating conclusions in a TR. 2026-02-09 04:42
S4-260253 (pdf)	[FS_3DGS_MED] Mapping 3DGS to 5QI	Samsung Research America	Summary of 3GPP TR 26.501 Change Request Document Information CR Number: S4-260253 Specification: TR 26.958 v0.1.1 Work Item: FS_3DGS_MED (Study on 3D Gaussian Splatting for Media) Category: B (addition of feature) Release: Rel-20 Main Objective This CR proposes to add a new clause (6.X) to TR 26.958 addressing the mapping of 3D Gaussian Splatting (3DGS) services to 3GPP 5G QoS Identifier (5QI) parameters as specified in TS 23.501. Technical Contributions Background on 5QI (Clause 6.X.1) The CR provides a comprehensive table of relevant pre-defined 5QI values from TS 23.501 that have similar QoS characteristics to 3DGS services, including: GBR Resources: 5QI 1-4: Conversational voice/video, real-time gaming, buffered streaming (PDB: 50-300ms, PER: 10⁻² to 10⁻⁶) 5QI 71-76: Live uplink streaming (PDB: 300-500ms, PER: 10⁻⁴ to 10⁻⁸) 5QI 80: Low latency eMBB/AR applications (PDB: 10ms, PER: 10⁻⁶) Delay-Critical GBR: 5QI 88: Motion tracking data, split AI/ML inference (PDB: 10ms, PER: 10⁻³) 5QI 89-90: Visual content for cloud/edge/split rendering (PDB: 15-20ms, PER: 10⁻⁴) Non-GBR Resources: 5QI 5-10: IMS signaling, buffered streaming, interactive gaming (PDB: 100-1100ms) 3DGS Application Flows for 5QI Mapping (Clause 6.X.2) The CR identifies distinct application flows requiring QoS treatment: Delivery of 3DGS media content application flows Delivery from/to UE of static 3DGS scene content Delivery from network to UE of dynamic or view-based 3DGS content Delivery of user pose, gaze information, LOD information from UE to network for network-assisted rendering or delivery Recommendations (Clause 6.X.3) The CR establishes the following recommendations for the study outcome: Mapping Identification: A mapping of 3DGS application flows to appropriate 5QI values should be identified Reference 5QI Values: Existing 5QI values (from clause 6.X.1) with similar QoS/QoE expectations and RAN resource priority should be used as reference for determining appropriate 5QI values and QoS characteristics limits for 3DGS services New 5QI Definition: If existing 5QI values are insufficient, liaison with appropriate 3GPP groups (likely SA2) should be initiated to define new 5QI values and corresponding QoS characteristics specifically for 3DGS services Rationale The CR addresses a gap in the FS_3DGS_MED study by providing guidance on how 3DGS services should be treated within the 5G System QoS framework, ensuring proper traffic handling behavior through appropriate QoS Flow configuration.	Proposal 1: A mapping of application flows corresponding to 3DGS services to 5QI is identified. Proposal 2: The 5QI values of those services, listed in clause 6.X.1 of the present document, that have similar QoS/QoE expectations and resource priority at RAN level, as that of 3DGS services, may be used as a reference to identify 5QI values and the corresponding QoS characteristics limits for 3DGS services. Proposal 3: If required, liaise with necessary 3GPP groups to define the 5QI and corresponding QoS characteristics to 3DGS services	manager: [Technical] The CR proposes mapping “3DGS services” to pre-defined 5QIs but does not define concrete QoS requirements (PDB, PER, priority level, packet size/rate assumptions) per identified 3DGS flow in clause 6.X.2, so the mapping in clause 6.X.3 is not actionable or verifiable. [Technical] Several cited 5QI examples appear incorrect or misleading versus TS 23.501 tables: e.g., “5QI 1-4 … buffered streaming” is not aligned with typical standardized service examples (buffered streaming is commonly associated with non-GBR 5QI 6/8/9), risking wrong conclusions for 3DGS traffic treatment. [Technical] The CR mixes GBR, delay-critical GBR, and non-GBR 5QIs without stating which 3DGS flows are expected to require guaranteed bit rate versus non-GBR (e.g., static scene download vs pose/gaze uplink), which is fundamental to QoS Flow selection and admission control behavior. [Technical] The identified flows in clause 6.X.2 overlap and are not mutually exclusive (e.g., “delivery of 3DGS media content” vs “dynamic or view-based 3DGS content”), making it unclear what traffic classification rules the UE/AF/SMF would apply. [Technical] The CR does not address that 5QI selection in 5GS is typically driven by AF/PCF policy (TS 23.503) and QoS profiles, not by application-layer “recommendations” alone; the proposal should clarify whether it targets AF signaling (e.g., via NEF) or is purely informative. [Technical] The recommendation “use existing 5QI values … as reference for determining appropriate 5QI values and QoS characteristics limits” is problematic because pre-defined 5QIs already have fixed standardized characteristics; if different limits are needed, the correct mechanism is a standardized new 5QI or a non-standardized QoS profile, which should be explicitly discussed. [Technical] The CR suggests liaison “likely SA2” to define new 5QIs, but 5QI standardization and QoS characteristics are specified in SA2 (TS 23.501) with strong SA5/CT involvement for management/signaling impacts; the liaison scope and target groups are underspecified. [Technical] No consideration is given to uplink pose/gaze/LOD traffic being small-packet, high-rate, jitter-sensitive control traffic; mapping it to generic “motion tracking” 5QI 88 without discussing jitter, periodicity, and reliability trade-offs may be technically unsound. [Technical] The CR does not discuss whether 3DGS delivery is downlink-heavy eMBB-like (throughput-driven) versus interactive XR-like (latency/jitter-driven), yet it cites 5QI 80/89/90; without a traffic model, selecting among these is speculative. [Editorial] The contribution claims a “comprehensive table” of relevant 5QIs but only lists selected ranges and examples; if clause 6.X.1 is intended to be normative study text, it should either reproduce the exact TS 23.501 entries with correct service examples or clearly state it is a non-exhaustive subset. [Editorial] The document references “TS 23.501 clause 6.X.1” style content but does not provide exact TS 23.501 table numbers/versions; precise references are needed to avoid mismatches across releases and revisions. [Editorial] Terminology is inconsistent: “3DGS services,” “3DGS media content,” “scene content,” and “view-based content” are used without definitions; TR 26.958 should define these terms or reference a definitions clause to prevent ambiguous interpretation. [Editorial] The proposed clause numbering “6.X” is placeholder-like; for a CR it should include final clause numbers and indicate insertion points relative to existing TR 26.958 structure to ensure consistency with the document’s table of contents and cross-references. 2026-02-09 04:43
S4-260321 (pdf)	[FS_3DGS_MED] LS on mpeg-gsc-metrics for 3DGS objective quality evaluation	Tencent Cloud	No summary available	No proposals available	No comments
S4-260322 (pdf)	[FS_3DGS_MED] Draft TR 26.958 v0.2.0	Tencent Cloud	No summary available	No proposals available	No comments
S4-260363	Dynamic 3DGS complexity	InterDigital New York	No summary available	No proposals available	No comments
S4-260377 (pdf)	[FS_3DGS_MED] Mapping 3DGS to 5QI	Samsung Research America	No summary available	No proposals available	No comments
S4-260379 (pdf)	[FS_3DGS_MED] glTF-based Representation Formats for 3D Gaussian Splats	Qualcomm Atheros, Inc.	No summary available	No proposals available	No comments
S4-260380 (pdf)	[FS_3DGS_MED] pCR on 3D tiles, LOD and 3DGS delivery format requirements	Samsung Electronics Iberia SA	No summary available	No proposals available	No comments
S4-260385 (pdf)	[FS_3DGS_MED] Pseudo-CR on 3DGS renderer and performance benchmarking	Tencent Cloud	No summary available	No proposals available	No comments
S4-260387 (pdf)	[FS_3DGS_MED] Pseudo-CR on 3DGS delivery workflows based on capability negotiation	Tencent Cloud	No summary available	No proposals available	No comments
S4-260388 (pdf)	[FS_3DGS_MED] High level media data workflows for All-in-client configuration	Samsung Research America	No summary available	No proposals available	No comments
S4-260389 (pdf)	[FS_3DGS_MED] High level media data workflows for Client-Server configuration	Samsung Research America	No summary available	No proposals available	No comments
S4-260390 (pdf)	[FS_3DGS_MED] Pseudo-CR on 3DGS delivery workflows for large 3DGS scenes	Tencent Cloud	No summary available	No proposals available	No comments
S4-260392 (pdf)	[FS_3DGS_MED] Mapping 3DGS to 3GPP services with All in UE configuration	Samsung Research America	No summary available	No proposals available	No comments
S4-260393 (pdf)	[FS_3DGS_MED] Mapping 3DGS to 3GPP services with Client-Server configuration	Samsung Research America	No summary available	No proposals available	No comments
S4-260396 (pdf)	[FS_3DGS_MED] Work Plan v0.2	Tencent Cloud	No summary available	No proposals available	No comments
S4-260468 (pdf)	[FS_3DGS_MED] Work Plan v0.2	Tencent Cloud	No summary available	No proposals available	No comments
S4-260469 (pdf)	[FS_3DGS_MED] LS on mpeg-gsc-metrics for 3DGS objective quality evaluation	Tencent Cloud	No summary available	No proposals available	No comments

Total TDocs: 34 | PDFs: 33 | Comments: 18

Read-only Review: 9.6

Summary of 3GPP Change Request S4-260088

Document Information

Overview

Main Technical Contributions

1. Terminology Updates (1st Change)

New and Modified Definitions

2. Use Case Description Refinements (2nd Change)

Updates to Clause 5.3 - Exploration of Large 3DGS Environment

Working Assumptions Updates

3. New Clause on 3DGS Encapsulation and Delivery Formats (3rd Change)

X.1 Introduction

X.2 Requirements

X.3 3DGS Tiles

X.4 Related Compression Aspects

Technical Impact

Extracted Proposals

3GPP Document S4-260089 Summary

Document Information

Overview

Main Technical Contributions

1. Reason for Change

2. Proposed Changes

Nature of Contribution

Summary of S4-260119: glTF-based Representation Formats for 3D Gaussian Splats

Introduction and Scope

KHR_gaussian_splatting (Khronos Layer)

Core Attribute Semantics

Extensibility and Backward Compatibility

MPEG_gaussian_splatting_transport (MPEG Layer)

Architecture Approach

Transport-Level Features

Alternative SH Layouts

Progressive Download

Timed Delivery for 4D Splats

Two-Layer Architecture Benefits for 3GPP

Architectural Summary

3GPP Service Integration Advantages

Format Comparison

PLY

SPZ (Splat Zip)

glTF + KHR_gaussian_splatting + MPEG transport

Proposals for TR 26.958

Extracted Proposals

Summary of S4-260140: Sport Example for Dynamic 3DGS Content Use Case

Document Overview

Main Technical Contributions

Use Case Enhancement - Dynamic 3DGS Content (Section 5.4)

Core Use Case Description (Section 5.4.1)

Scope Definition

Sports Action Example

Scenario Description

Playback Characteristics

Navigation Constraints

Technical Significance

Summary of S4-260145: Pseudo-CR on Dancer Example for Dynamic 3DGS Content Use Case

Document Overview

Main Technical Contributions

Dynamic 3DGS Content Use Case Enhancement (Section 5.4)

General Description

Dancer Scenario Example

Scope Limitations

Technical Focus

Visual Material

Summary of 3GPP Change Request S4-260147

Document Information

Main Objective

Technical Contributions

Enhanced Avatar Communication Architecture

Technical Processing Pipeline

Sender Side Processing

Transmission Strategy

Receiver Side Processing

Working Assumptions

Key Innovation

Summary of 3GPP Change Request S4-260164

Document Information

Main Technical Contributions

1. Introduction of Objective Metrics Framework for 3DGS

2. Rationale for Standardization