TDoc: S4-260183

Meeting: TSGS4_135_India | Agenda Item: 10.5

Back to Agenda
Document Information
Title

[AIML_IMS-MED] Negotiation messages for split inferencing

Source

InterDigital Finland Oy

Type

discussion

For

Agreement

Release

Rel-20

3GPP Document
View on 3GPP
TDoc S4-260183
Title [AIML_IMS-MED] Negotiation messages for split inferencing
Source InterDigital Finland Oy
Agenda item 10.5
Agenda item description AI_IMS-MED (Media aspects for AI/ML in IMS services)
Doc type discussion
For action Agreement
Release Rel-20
download_url https://www.3gpp.org/ftp/tsg_sa/WG4_CODEC/TSGS4_135_India/Docs/S4-260183.zip
For Agreement
Type discussion
Contact Stephane Onno
Uploaded 2026-02-03T19:11:22.680000
Contact ID 84864
TDoc Status merged
Is revision of S4aR260010
Reservation date 03/02/2026 16:32:54
Agenda item sort order 52
Comments
Previous Comments:
manager
2026-02-09 04:09:26


  1. [Technical] The contribution mixes two transport paradigms without a clear normative mapping: messages are described as HTTP GET/POST in Table A4.2-1 while Section A.4.5 defines a generic “AI metadata exchange over data channels”; the spec needs an explicit statement whether these are alternative transports, layered (HTTP payload carried in data channel), or separate procedures, otherwise interoperability will break.




  2. [Technical] Message taxonomy appears internally inconsistent: Table A.4.2 introduces AI_SPLIT_INFERENCE_CONFIGURATION_REQUEST and SPLIT_INFERENCE_CONFIGURATION_AI_RESPONSE, while Section A.4.5 lists SPLIT_INFERENCE_CONFIGURATION_REQUEST (without “AI_”) and no matching ..._AI_RESPONSE; naming and pairing must be aligned and uniquely defined.




  3. [Technical] There is functional overlap/ambiguity between AI_MODEL_SELECTION_REQUEST/RESPONSE and AI_SPLIT_INFERENCE_CONFIGURATION_REQUEST/RESPONSE (both “carry URN(s) of selected models/submodels” and return binaries/metadata); the procedure needs a clear separation of purpose (e.g., selection vs partitioning vs download) and state machine/ordering constraints.




  4. [Technical] The proposal introduces AI_SERVER_CONFIGURATION_REQUEST/RESPONSE in Section A.4.5 but it is not described in the negotiation summary table nor in the earlier metadata sections; either add the missing procedure/metadata or remove it to avoid undefined behavior.




  5. [Technical] The “type” field is defined as a number (message subtype identifier) but no registry/enum values are provided for the listed message types; without normative numeric assignments and extensibility rules, independent implementations cannot interoperate.




  6. [Technical] sessionId is described as “multimedia session identifier” but there is no definition of which IMS identifier is used (SIP Call-ID, dialog identifiers, MSRP session, etc.) and how it binds to the data channel/HTTP exchange; this is critical for correlating negotiation to the correct media session.




  7. [Technical] The endpoint execution location values (UE, SERVER, EDGE, CLOUD, CUSTOM) are not tied to any 3GPP-defined entity (e.g., UE, IMS AS, MEC, DN) and “CUSTOM” is non-interoperable; the spec should either reference 3GPP architecture terms or define discovery/addressing and trust implications.




  8. [Technical] Partitioning metadata is underspecified for correctness: submodelType enumerates HEAD, INTERMEDIATE1, INTERMEDIATE2, TAIL, which hard-limits the number of partitions and cannot represent arbitrary N-way splits; it should be an ordered list with an index/graph structure rather than fixed labels.




  9. [Technical] Tensor metadata is inconsistent: subModelDataType uses Uint8, Float32, Float16 while tensorType is described as integer, float32, float16; the data type vocabulary must be unified and should include signedness/bitwidth and quantization parameters if Uint8 is allowed.




  10. [Technical] outputAccuracy as a single “trained accuracy percentage” is not meaningful across tasks/datasets and is not comparable between partitionings; if kept, it needs a defined metric, evaluation dataset identifier, and conditions, otherwise it risks misleading selection logic.




  11. [Technical] Capability metadata separation into static/dynamic is reasonable, but the proposal lacks update/refresh rules (e.g., when dynamic capabilities are reported, validity timers, thresholds) and lacks units for key fields (compute capacity, memory, load), making negotiation non-deterministic.




  12. [Technical] The messages that “return selected application binary and metadata” / “return selected models/submodels binary and metadata” do not specify integrity/authenticity mechanisms (hash, signature, provenance) or versioning; for executable model binaries this is a security and lifecycle gap.




  13. [Editorial] Several identifiers are inconsistent in casing and spelling (sendingAtTime vs typical sentAtTime; submodelsPartitioningIdentifier vs submodelPartitioningIdentifier; subModelDataType camel-case mismatch), which will cause implementer confusion in JSON schema.




  14. [Editorial] The document references “Table 5” and “Table 6” in Section A.4.5 while earlier it introduces “Table A4.2-1”; table numbering should be consistent with the annex/section numbering conventions of the target specification.




  15. [Editorial] The contribution repeatedly uses “HTTP RESPONSE” as a message name rather than a defined response message type; if the intent is to define application-layer messages, the response should be named consistently (e.g., ..._RESPONSE) and HTTP status/error handling should be specified separately.



You must log in to post comment