TDoc: S4-260195

Meeting: TSGS4_135_India | Agenda Item: 10.5

Back to Agenda
Document Information
Title

CR on AIML processing in IMS calls

Source

Qualcomm Inc.

Type

CR

Release

Rel-20

Specification

26.114

3GPP Document
View on 3GPP
TDoc S4-260195
Title CR on AIML processing in IMS calls
Source Qualcomm Inc.
Agenda item 10.5
Agenda item description AI_IMS-MED (Media aspects for AI/ML in IMS services)
Doc type CR
Secretary remarks Title modified on 2/3/2026. Original title : [AIML_IMS-MED] CR on AIML processing in IMS calls<br/><br/>Source modified on 2/3/2026. Original source : Qualcomm Atheros, Inc.
Release Rel-20
Specification 26.114
Version 19.2.0
Related WIs AIML_IMS-MED
CR number 608.0
CR category B
Clauses affected Annex AD (new).
download_url https://www.3gpp.org/ftp/tsg_sa/WG4_CODEC/TSGS4_135_India/Docs/S4-260195.zip
CN True
CR 608.0
ME True
Spec 26.114
Type CR
Contact Imed Bouazizi
Uploaded 2026-02-03T21:49:01.107000
Contact ID 84417
TDoc Status noted
Clauses Affected Annex AD (new).
Reservation date 03/02/2026 17:07:55
Secretary Remarks Title modified on 2/3/2026. Original title : [AIML_IMS-MED] CR on AIML processing in IMS calls<br/><br/>Source modified on 2/3/2026. Original source : Qualcomm Atheros, Inc.
Agenda item sort order 52
Comments
Previous Comments:
manager
2026-02-09 04:13:30


  1. [Technical] The CR introduces extensive new normative behavior (task manifests, model cards, “3gpp-ai” subprotocol, time binding) but does not identify exact TS 26.114 clause numbers to be changed nor provide normative text deltas, making it impossible to verify consistency with existing DCMTSI procedures and requirements in clauses 6.2.10/6.2.13.




  2. [Technical] The call flow AD.4.1 is internally inconsistent (it claims a “14-step procedure” but lists 15 steps) and mixes BDC/ADC roles in a way that conflicts with the stated architecture (e.g., step 11 “BDC: HTTP GET with task/model URLs” implies UE informs MF via GET, which is semantically wrong for selection signaling).




  3. [Technical] The proposal assumes MF can fetch and relay large AI model artifacts (steps 12–13) while also stating “IMS Media Function does not perform inference or process RTP media”; it still requires MF to understand application/model selection and act as a distribution proxy, but no impact analysis is provided for MF behavior, caching, authorization, charging, or load.




  4. [Technical] Introducing a new ADC subprotocol “3gpp-ai” and JSON message types without defining a stable, interoperable schema (it says “detailed schema specified by AI/ML application”) undermines cross-vendor interoperability and contradicts the claim of “normative procedures, formats, and signaling.”




  5. [Technical] The time-binding mechanism in AD.8.2 is underspecified/incorrect for synchronization: “NTP-based timestamp associated with RTP stream” is not defined (RTP itself doesn’t carry NTP unless using RTCP SR mapping), and allowing either RTP timestamp or “NTP + durSamples” without a mandated mapping to RTP clock rate/RTCP SR risks ambiguous alignment at the receiver.




  6. [Technical] The use of SDP “mid” alone for binding is insufficient in multi-SSRC, simulcast, or stream-restart scenarios; the CR should specify whether SSRC, RID, or other identifiers are needed to uniquely bind metadata to a specific RTP source/encoding within a given mid.




  7. [Technical] The CR mandates ONNX 1.16.0 and opset ≥18 as “Mandatory Model Format” without justification or fallback; this is likely unrealistic for UE implementations and conflicts with the stated “WebNN-aligned runtime” (WebNN support is not equivalent to ONNX opset support), risking non-deployable requirements.




  8. [Technical] Security/integrity is incomplete: SHA-256 verification is mentioned, signatures are “optional,” but there is no trust model (certificate chain, key provisioning, revocation), no binding between model card and artifact, and no protection against downgrade/mix-and-match attacks across model_id/model_version_id/artifact variants.




  9. [Technical] The CR introduces “capability discovery” and “resource limits” exchange but does not define privacy constraints, user consent, or minimization; exposing detailed device accelerator/operator support to a DC AS can materially increase fingerprinting risk and may conflict with 3GPP privacy expectations.




  10. [Technical] The “split inference” concept is described but not normatively constrained: it is unclear how tasks are partitioned, how intermediate representations are transported (if any), and how this interacts with the statement “RTP media unchanged,” especially for tasks like noise suppression that typically require modifying the media stream.




  11. [Technical] The CR claims “DCMTSI client shall not transmit user media over BDC,” but then allows ADC metadata delivery without defining whether metadata may contain user content (e.g., STT text is user content); policy and confidentiality requirements (encryption end-to-end vs hop-by-hop) are not addressed.




  12. [Editorial] Several references appear incorrect or non-actionable: “TS 23.228, clause AC.7.1” and new clauses “AD.1…AD.9” are not existing TS 26.114 clause identifiers, so the contribution cannot be reviewed against the actual spec structure.




  13. [Editorial] Terminology is inconsistent and sometimes non-3GPP: “DCMTSI clients must support … web runtime,” “WebNN-aligned runtime,” “model card,” and “task manifest” are introduced as if standardized, but no alignment to existing 3GPP definitions or external normative references (W3C WebNN, ONNX spec) is provided.




  14. [Editorial] The example metadata uses fields like "ntpTs": 381245120 without defining epoch, units, wraparound, or relation to RTP/RTCP, and uses "mid": "audio" even though mid values are tokens negotiated in SDP (not semantic labels), which may mislead implementers.



You must log in to post comment