S4-260181 Metadata - 3GPP Contribution Reviewer

Document Information

Title

[AIML_IMS-MED] Negotiation messages

Source

InterDigital Finland Oy

Type

discussion

For

Agreement

Release

Rel-20

3GPP Document

View on 3GPP

TDoc	S4-260181
Title	[AIML_IMS-MED] Negotiation messages
Source	InterDigital Finland Oy
Agenda item	10.5
Agenda item description	AI_IMS-MED (Media aspects for AI/ML in IMS services)
Doc type	discussion
For action	Agreement
Release	Rel-20
download_url	https://www.3gpp.org/ftp/tsg_sa/WG4_CODEC/TSGS4_135_India/Docs/S4-260181.zip
For	Agreement
Type	discussion
Contact	Stephane Onno
Uploaded	2026-02-03T19:11:22.603000
Contact ID	84864
Revised to	S4-260450
TDoc Status	revised
Is revision of	S4aR260012
Reservation date	03/02/2026 16:29:12
Agenda item sort order	52

Comments

Previous Comments:

manager

2026-02-09 04:08:07

[Technical] The contribution introduces a “transport-protocol-independent” container while simultaneously mapping each negotiation message to HTTP GET/POST/RESPONSE in Table A4.2-1; the spec needs a clear normative statement on whether HTTP is mandatory, optional, or merely an example, otherwise implementers will diverge.

[Technical] Several metadata fields are not operationally well-defined for interoperability (e.g., minimumTaskInferenceAccuracy, taskAccuracy, outputAccuracy, energyEstimation): there is no mandated metric definition, dataset/benchmark reference, confidence interval, or measurement conditions, so endpoints cannot compare values consistently.

[Technical] The static/dynamic capability split is sensible, but the document does not define update triggers, validity timers, or how dynamic values (e.g., currentComputeLoad, acceleratorAvailability, batteryLevel) are refreshed and correlated to a specific decision point, risking stale negotiation decisions.

[Technical] Units and ranges are inconsistent or missing across key parameters (e.g., flopsProcessingCapabilities, macOpProcessingCapabilities, availableMemorySize, currentComputeLoad); without explicit units (GFLOPS vs FLOPS, bytes vs MB, load as % vs normalized), negotiation logic is ambiguous.

[Technical] The proposal returns “application binary data” and “model binary data” in responses but does not specify integrity/authenticity mechanisms (hash, signature), versioning, or licensing/authorization checks; this is a major gap given executable/model distribution security requirements.

[Technical] URN-based identifiers (applicationIdentifier, modelIdentifier, task identifiers) are introduced without defining the URN namespace, registration/ownership model, and collision handling; interoperability across vendors depends on a normative identifier scheme.

[Technical] The message type list in A.4.4 includes CANDIDATE_MODELS_REQUEST/RESPONSE, while the summary table uses CANDIDATE_MODELS_LIST_REQUEST/RESPONSE; this naming mismatch will cause implementers to treat them as different procedures unless aligned.

[Technical] The call-flow alignment claim (“agreed call flow from S4aR260014”) is not backed by explicit sequencing rules (e.g., whether discovery is mandatory before application request, whether model selection can be repeated, error handling); the procedure needs normative state machine/ordering constraints.

[Technical] sessionId is described as “multimedia session identifier” but the document does not specify which 3GPP identifier it maps to (IMS dialog identifiers, SDP session, MSRP/RTC data channel association, etc.), making correlation across signaling planes unclear.

[Technical] sendingAtTime uses “wall clock timestamp” without defining format (e.g., RFC 3339), time base, and clock synchronization assumptions; if used for ordering/latency, it will be unreliable across endpoints.

[Technical] The endpoint capability fields overlap and may be contradictory (accelerationSupported boolean vs supportedEngines including NPU/GPU, plus acceleratorAvailability dynamic); rules are needed to resolve inconsistencies and define what “acceleration” precisely means.

[Technical] Model I/O descriptors (inputType, inputShape, outputType, outputShape) lack a normative schema (tensor layout, channel order, sample rate for audio, language tags for ASR/TTS/translation), so two endpoints may “match” a model but still be incompatible.

[Editorial] The document mixes “AIML_IMS-MED”, “AI/ML-based media services”, and “local inferencing call flows” terminology without a stable definition section; consistent naming and abbreviations are needed to avoid scope confusion.

[Editorial] Several parameter names are verbose or inconsistent in style (maximumTaskInferenceLatency vs targetInferenceLatency, modelDataType values like Uint8 vs typical uint8), and the JSON examples should be aligned to a single naming convention and enumerated value casing.

<ol>
<li>
[Technical] The contribution introduces a “transport-protocol-independent” container while simultaneously mapping each negotiation message to HTTP GET/POST/RESPONSE in Table A4.2-1; the spec needs a clear normative statement on whether HTTP is mandatory, optional, or merely an example, otherwise implementers will diverge.
</li>
<li>
[Technical] Several metadata fields are not operationally well-defined for interoperability (e.g., <code>minimumTaskInferenceAccuracy</code>, <code>taskAccuracy</code>, <code>outputAccuracy</code>, <code>energyEstimation</code>): there is no mandated metric definition, dataset/benchmark reference, confidence interval, or measurement conditions, so endpoints cannot compare values consistently.
</li>
<li>
[Technical] The static/dynamic capability split is sensible, but the document does not define update triggers, validity timers, or how dynamic values (e.g., <code>currentComputeLoad</code>, <code>acceleratorAvailability</code>, <code>batteryLevel</code>) are refreshed and correlated to a specific decision point, risking stale negotiation decisions.
</li>
<li>
[Technical] Units and ranges are inconsistent or missing across key parameters (e.g., <code>flopsProcessingCapabilities</code>, <code>macOpProcessingCapabilities</code>, <code>availableMemorySize</code>, <code>currentComputeLoad</code>); without explicit units (GFLOPS vs FLOPS, bytes vs MB, load as % vs normalized), negotiation logic is ambiguous.
</li>
<li>
[Technical] The proposal returns “application binary data” and “model binary data” in responses but does not specify integrity/authenticity mechanisms (hash, signature), versioning, or licensing/authorization checks; this is a major gap given executable/model distribution security requirements.
</li>
<li>
[Technical] URN-based identifiers (<code>applicationIdentifier</code>, <code>modelIdentifier</code>, task identifiers) are introduced without defining the URN namespace, registration/ownership model, and collision handling; interoperability across vendors depends on a normative identifier scheme.
</li>
<li>
[Technical] The message type list in A.4.4 includes <code>CANDIDATE_MODELS_REQUEST/RESPONSE</code>, while the summary table uses <code>CANDIDATE_MODELS_LIST_REQUEST/RESPONSE</code>; this naming mismatch will cause implementers to treat them as different procedures unless aligned.
</li>
<li>
[Technical] The call-flow alignment claim (“agreed call flow from S4aR260014”) is not backed by explicit sequencing rules (e.g., whether discovery is mandatory before application request, whether model selection can be repeated, error handling); the procedure needs normative state machine/ordering constraints.
</li>
<li>
[Technical] <code>sessionId</code> is described as “multimedia session identifier” but the document does not specify which 3GPP identifier it maps to (IMS dialog identifiers, SDP session, MSRP/RTC data channel association, etc.), making correlation across signaling planes unclear.
</li>
<li>
[Technical] <code>sendingAtTime</code> uses “wall clock timestamp” without defining format (e.g., RFC 3339), time base, and clock synchronization assumptions; if used for ordering/latency, it will be unreliable across endpoints.
</li>
<li>
[Technical] The endpoint capability fields overlap and may be contradictory (<code>accelerationSupported</code> boolean vs <code>supportedEngines</code> including NPU/GPU, plus <code>acceleratorAvailability</code> dynamic); rules are needed to resolve inconsistencies and define what “acceleration” precisely means.
</li>
<li>
[Technical] Model I/O descriptors (<code>inputType</code>, <code>inputShape</code>, <code>outputType</code>, <code>outputShape</code>) lack a normative schema (tensor layout, channel order, sample rate for audio, language tags for ASR/TTS/translation), so two endpoints may “match” a model but still be incompatible.
</li>
<li>
[Editorial] The document mixes “AIML_IMS-MED”, “AI/ML-based media services”, and “local inferencing call flows” terminology without a stable definition section; consistent naming and abbreviations are needed to avoid scope confusion.
</li>
<li>
[Editorial] Several parameter names are verbose or inconsistent in style (<code>maximumTaskInferenceLatency</code> vs <code>targetInferenceLatency</code>, <code>modelDataType</code> values like <code>Uint8</code> vs typical <code>uint8</code>), and the JSON examples should be aligned to a single naming convention and enumerated value casing.
</li>
</ol>

You must log in to post comment

Log In

TDoc: S4-260181