TDoc: S4-260080

Meeting: TSGS4_135_India | Agenda Item: 11.1

Back to Agenda
Document Information
Title

pCR [FS_6G_MED] Considerations on Work Topic 1: Media Delivery requirements for intelligent immersive calling

Source

HuaWei Technologies Co., Ltd

Type

pCR

For

Agreement

Release

Rel-20

Specification

26.87

3GPP Document
View on 3GPP
TDoc S4-260080
Title pCR [FS_6G_MED] Considerations on Work Topic 1: Media Delivery requirements for intelligent immersive calling
Source HuaWei Technologies Co., Ltd
Agenda item 11.1
Agenda item description FS_6G_MED (Study on Media aspects for 6G System)
Doc type pCR
For action Agreement
Release Rel-20
Specification 26.87
Version 0.0.1
Related WIs FS_6G_MED
download_url https://www.3gpp.org/ftp/tsg_sa/WG4_CODEC/TSGS4_135_India/Docs/S4-260080.zip
For Agreement
Spec 26.87
Type pCR
Contact Shuxin Ouyang
Uploaded 2026-02-03T09:05:25.327000
Contact ID 103366
Revised to S4-260133
TDoc Status revised
Reservation date 02/02/2026 09:05:08
Agenda item sort order 60
Comments
Previous Comments:
manager
2026-02-09 04:24:05


  1. [Technical] The contribution proposes adding “IMS extension support” and “review protocol for extensions” as requirements, but TR 26.870 clause 4.2 should state concrete media/system requirements (e.g., session setup, media negotiation, synchronization) rather than vague process statements; specify what IMS/SIP/SDP capabilities are actually needed (new SDP attributes, new media types, MSRP/data channel, etc.).




  2. [Technical] Several requirements assume “network performs transcoding from video to immersive media codecs” and “network-based rendering/face rendering,” which is a strong architectural choice; TR 26.870 requirements should be phrased technology/placement-neutral (edge/device/network) unless SA4 has agreed on normative functional split assumptions.




  3. [Technical] “Support for 4K + HDR uplink video” is underspecified and potentially unrealistic for uplink in many deployments; it needs frame rate, chroma, bit depth, HDR format (HDR10/HLG/Dolby Vision), latency/jitter targets, and whether it is mandatory or optional/tiered.




  4. [Technical] Eye tracking is introduced as a requirement but no performance constraints are given (sampling rate, end-to-end latency budget, accuracy, coordinate system, time-stamping); without these, it is not actionable for codec/transport/synchronization work in SA4.




  5. [Technical] “User intention understanding capability (via voice/gesture)” is not a media delivery requirement unless mapped to explicit interfaces and data exchange (event streams, metadata formats, privacy controls, timing alignment with media); otherwise it reads as an AI feature requirement outside SA4 scope.




  6. [Technical] The use case mixes classic conversational media with sensor/biometric data (blood pressure, heart rate) but does not specify how such data is transported (RTP/RTCP extensions, data channel, HTTP-based side channel) nor how it is synchronized with audio/video/immersive rendering.




  7. [Technical] “Tiered QoE support taking device capabilities into account” needs definition of tiers and the adaptation mechanisms (scalable coding, simulcast, layered rendering, viewport-dependent delivery, bitrate/latency trade-offs); otherwise it duplicates generic QoE statements already present in many TRs.




  8. [Technical] The multi-device scenario (smart TV + cameras + watches + sensors) implies multi-stream capture and tight synchronization, but no requirement is stated for common time base, clock sync, or inter-stream lip-sync/scene-sync tolerances—key for immersive calling.




  9. [Technical] The “eye-contact enablement” story implies sending “facial picture” plus eye tracking to render a face; this raises identity/spoofing and consent requirements (authenticity of rendered representation, user control, watermarking/indication) that are not addressed but are critical for a calling service.




  10. [Technical] The text repeatedly references “6G system” and “AI technologies (e.g., LLM)” in a way that may conflict with TR 26.870’s 5G/3GPP system framing; requirements should be expressed in 3GPP-system-agnostic terms or aligned to the TR’s agreed scope and terminology.




  11. [Editorial] The contribution claims “updates to clause 4.2” but provides no proposed change text, no exact requirement wording, and no indication of how the new items integrate with existing 4.2 structure; reviewers cannot assess consistency or redundancy without the actual delta.




  12. [Editorial] Terminology is inconsistent/unclear (“immersive media codecs,” “split-rendering,” “spatial computing rendering,” “multi-media transporting”); these should be aligned with existing SA4 terms (e.g., MIV, V3C, RTP-based immersive media, split rendering definitions) or explicitly defined.




  13. [Editorial] Several statements are ambiguous or non-testable (e.g., “protocol extensions should be possible,” “review protocol for extensions,” “system fulfills user intention”); requirements in TR 26.870 should be phrased as verifiable capabilities with clear conditions and outcomes.



You must log in to post comment