S4-260287 - AI Summary

overview of inputs to RAN2#133 on AI traffic characteristics

Back to Agenda Download Summary
AI-Generated Summary AI

Overview of RAN2#133 Inputs on AI Traffic Characteristics

Document Purpose and Context

This document provides a summary of contributions submitted to RAN2#133 regarding AI traffic characteristics. Following RAN#110 plenary's assignment of RAN-2 to lead AI traffic characteristics work in RAN and coordinate with SA WG4, this overview aims to align SA4 and RAN-2 work at an early stage. The document explicitly recommends prioritizing discussion around key dependencies identified by RAN-2.

Key Traffic Characteristics Identified Across Contributions

Common Traffic Patterns

Multiple contributions converge on the following AI traffic characteristics:

  • Bursty and aperiodic nature: Nearly universal observation across contributions
  • Uplink-heavy traffic: Particularly emphasized for mobile AI applications
  • Unpredictable bandwidth requirements: Dynamic and variable data rates
  • Small packet sizes: Frequent transmission of small data units
  • Multi-modal traffic: Synchronization requirements across different modalities
  • Asymmetrical traffic patterns: Different characteristics for UL vs DL
  • Error tolerance: Variable across different AI applications and data types
  • Token-based communication: Specific characteristics for tokenized AI traffic

Latency Characteristics

  • Delay-sensitive traffic: Strict end-to-end latency requirements
  • Low latency for initial packets: Critical for interactive applications
  • Variable packet delay budgets: Dependent on application type
  • Interactive with elastic latency: Some flexibility in certain scenarios

AI Traffic Categorization Approaches

By Real-Time Requirements

Several contributions propose categorization based on timing:
- Real-time vs Non-real-time: Most common distinction
- Interactive vs Non-interactive: Request/response patterns

By AI Codec Usage

Multiple contributions distinguish:
- AI codec traffic: Native AI representation formats
- Non-AI codec traffic: Traditional encoding methods
- Type 1: Real-time AI application with non-AI codec
- Type 2: Real-time AI application with AI codec
- Type 3: Non-real-time AI application

By Service Class

Peng Cheng Lab (R2-2600153) proposes detailed service classes:
- Service Class A: Generative AI and AI Agent Traffic (Token-Streaming Inference)
- Service Class B: Perception/Analytics AI (Uplink-Intensive Inference), including Split Inference
- Service Class C: Federated/Distributed Learning and Training Traffic (Bulk, Synchronized Uploads)
- Composite Class D: AI-Enhanced Immersive Communication (XR + Digital Twin + AI Components)

By Use Case

  • Agentic (continuous) vs Non-agentic (bursty): Meta/Qualcomm et al. (R2-2600480)
  • Chatbot, Live AI, AI assistant: Ericsson (R2-2600885)
  • Intermediate data type: From TR 26.927 (NEC R2-2600552)

By Data Type

  • Training data, Model data, Inference data: CATT (R2-2600242)
  • Token vs non-token: Multiple contributions
  • Modality-based importance vs sequence-based importance: Offino (R2-2600853)

Release Strategy: 5G Rel-20 vs 6G

5G Rel-20 Focus

Strong consensus on prioritizing:
- Uplink enhancements: Primary focus for Rel-20
- Non-real-time applications: Particularly chatbot/GenAI use cases
- Burstiness and unpredictability handling: Leveraging XR Phase 4 work
- AI traffic awareness in RAN: Enable service-aware handling

6G Scope

Broader scope proposed for 6G:
- Real-time uplink and downlink: Full bidirectional support
- Unified framework: Comprehensive AI traffic handling
- Native AI communication: AI-native RAN traffic support
- Flexible QoS: Dynamic adaptation to AI traffic patterns
- Downlink non-real-time: Extended coverage beyond Rel-20

QoS and RAN Enhancement Proposals

QoS Mechanisms

  • Dynamic QoS support: Constrained latency handling (ZTE R2-2600164)
  • Flexible QoS framework: 6G requirement for AI traffic adaptation
  • Context-aware traffic flow: Enable RAN awareness (Nvidia R2-2600925)
  • Enhanced reliability: Beyond current 5G capabilities (Samsung R2-2600389)
  • PDU Set concept reuse: Leverage XR mechanisms (vivo R2-2600074)

Uplink Enhancements

  • Irregular burst support: Handle unpredictable UL patterns
  • Delay-bound data bursts: Resource-efficient handling without over-provisioning
  • Small packet transmission in RRC inactive: Efficiency improvement
  • UE-assisted uplink reporting prediction: Proactive resource allocation
  • Multi-modal synchronization for uplink: Coordinate different data streams

Scheduling and Resource Management

  • Service awareness at L2: Enable intelligent scheduling decisions
  • UE-based coordination: Context and dependency awareness
  • Error-tolerant token transmission: Exploit AI traffic characteristics
  • Token importance differentiation: Priority-based handling
  • Downlink scheduling enhancement: Network-side optimizations

Multi-modality Support

  • Multi-modal synchronization: Beyond MMSID for QoS control
  • PDU Set binding for AI traffic: Token set/burst handling
  • Dependency structure handling: Inter-stream coordination

Power Efficiency

  • Energy savings for continuous agentic AI: Long-duration applications
  • Tethering and multi-device support: Multi-access scenarios
  • RRC state optimization: Balance latency and power consumption

Explicit SA4 Dependencies and Coordination Requests

Traffic Characteristics Clarification

Multiple contributions request SA4 input on:

  1. Token communication characteristics:
  2. Token importance levels and granularity
  3. Error tolerance properties
  4. Token-to-PDU mapping
  5. Dependency between tokens
  6. Whether tokenization increases/decreases data size
  7. Visibility of tokens to RAN

  8. Packet-level characteristics:

  9. Packet delay budget
  10. Packet size distributions
  11. Packet arrival rates and patterns (streaming vs bursts)
  12. Packet error rate tolerance
  13. Packet importance variability

  14. Data compression characteristics: Impact on traffic patterns

  15. Multi-modality aspects: Synchronization requirements and characteristics

AI Codec Study Coordination

Several contributions explicitly reference or request coordination on:

  • TR 26.847 alignment: Token communication definitions
  • AI representation format clarification: Scope and characteristics
  • AI codec vs non-AI codec traffic: Differentiation and handling
  • Timeline and scope of SA4 AI codec study: Critical for RAN-2 planning
  • Trace data provision: For derivation of packet size, arrival rate, delay budget, success rate

Service Type and Application Clarification

Requests for SA4 input on:

  • Service types definition: Categories and characteristics
  • Use case traffic patterns: Specific application behaviors
  • Intermediate data characteristics: From TR 26.927
  • End-to-end latency requirements: Impact on RAN design
  • Traffic encryption: Whether packets are encrypted at application layer

PDU Set and Annotation

  • PDU Set annotation: Importance and token information
  • PDU Set binding for AI traffic: Token set/burst definitions
  • PDU Set handling: AI-specific requirements

Specific Questions to SA4

  1. China Telecom (R2-2600685): More details on tokens and service types
  2. Spreadtrum/UNISOC (R2-2600673): Token-to-PDU mapping, importance granularity definition, UE processing requirements
  3. Panasonic (R2-2600757): Packet delay budget, packet size, error tolerance of token traffic
  4. Lenovo (R2-2600745): Confirm AI traffic characteristics including data compression, error tolerance, token importance, multimodality, burstiness, unpredictability
  5. CMCC et al. (R2-2600965): Whether token traffic characteristics align with TR 26.847
  6. Apple (R2-2600446): Input on token communication, delay budget, relative priority
  7. Nokia (R2-2600315): PDU Set annotation, importance and token information for AI traffic
  8. Fujitsu (R2-2600347): Tokenized AI feedback
  9. Samsung (R2-2600389): PDU Set handling of AI-related traffic
  10. HONOR (R2-2600515): Mobile AI arrival patterns (streaming or bursts) and corresponding characteristics
  11. NEC (R2-2600552): Intermediate data traffic characteristics from TR 26.927
  12. Peng Cheng Lab (R2-2600153): PDU Set binding for AI traffic, dependency structure, traffic model input
  13. OPPO et al. (R2-2600206): Timeline and scope of SA4 study, whether token/packet characteristics are in scope, AI representation format clarification
  14. Sharp (R2-2600183): Token traffic characteristics support
  15. CATT (R2-2600242): Burst traffic confirmation, end-to-end latency impact, encryption status, importance and error tolerance modeling, RAN visibility of tokens
  16. vivo (R2-2600074): Burst characteristics, end-to-end latency, traffic encryption, error tolerance, AI token characteristics, token visibility to RAN
  17. Huawei/HiSilicon (R2-2600148): Trace data for packet characteristics, whether AI codec apps have error tolerance and variable packet importance

Liaison and Coordination Proposals

Several contributions propose formal coordination:

  • OPPO et al. (R2-2600206): Inform SA4 about RAN-2 decisions and progress, get timeline/scope information
  • vivo (R2-2600074): Inform SA4 that RAN-2 leads AI traffic work in RAN
  • Peng Cheng Lab (R2-2600153): Send LS to SA2/SA4 to clarify service awareness points
  • Ericsson (R2-2600885): Coordinate with SA (not only SA4 but SA in general)

Divergent Views and Open Issues

Traffic Model Approach

  • Qualcomm et al. (R2-2600138): Adopt XR traffic models for real-time, MBB models for non-real-time
  • AT&T (R2-2600890): Proposes text-based conversational GenAI traffic model (suggests RAN1 scope)
  • Ericsson (R2-2600885): Cautions against optimizing for specific AI applications

Scope and Prioritization

  • MediaTek (R2-2600901): Stop referring to tokenizer, enhance UL, wait for AI codec study in SA4
  • CATT (R2-2600242): Prioritize network inference in RAN-2
  • Lenovo (R2-2600745): Rel-20 XR Phase 4 focus on uplink, unified framework in 6G
  • Hanbat Univ (R2-2600409): Include AI native RAN traffic and RedCap

Error Tolerance Determination

  • NTT/Docomo (R2-2600978): RAN-2 to proactively determine error tolerance based on AI task and source data type, with concrete test results provided
  • Multiple others: Request SA4 to define error tolerance characteristics

XR Relationship

  • Ericsson (R2-2600885): Need to understand difference between XR and AI traffic
  • Fujitsu (R2-2600347): Need gap analysis from XR
  • Multiple others: Propose reusing XR mechanisms (PDU Set, traffic models)

Technical Contributions Summary by Topic

Token Communication (17 contributions)

Nvidia, Offino, China Telecom, Spreadtrum/UNISOC, Panasonic, Lenovo, CMCC et al., Apple, Nokia, Fujitsu, Samsung, HONOR, Peng Cheng Lab, OPPO et al., Sharp, CATT, vivo

Key aspects: Importance differentiation, error tolerance, dependency, compression, RAN visibility, PDU mapping

Burst Traffic Handling (20+ contributions)

Nearly universal recognition of bursty, aperiodic traffic requiring specific RAN enhancements

Uplink Enhancement (15+ contributions)

Strong consensus on Rel-20 focus for uplink mobile AI traffic with burstiness, unpredictability, and interactive characteristics

Multi-modality (8 contributions)

Fraunhofer, Meta/Qualcomm et al., ZTE, Peng Cheng Lab, Samsung, Lenovo, HONOR, Nokia

Key aspects: Synchronization, MMSID usage, multi-device scenarios

Error Tolerance (12 contributions)

Offino, China Telecom, Spreadtrum/UNISOC, Panasonic, Lenovo, CMCC et al., NTT/Docomo, Fujitsu, Samsung, CATT, vivo, Huawei/HiSilicon

Key aspects: Variable tolerance, task-dependent, token-specific, importance-based

Service Awareness (6 contributions)

Nvidia, Nokia, ZTE, Peng Cheng Lab, Xiaomi, Huawei/HiSilicon

Key aspects: Context-aware flow, L2 scheduling, UE-assisted coordination

Dynamic QoS (7 contributions)

ZTE, Meta/Qualcomm et al., Samsung, Nokia, Apple, HONOR, Huawei/HiSilicon

Key aspects: Flexible adaptation, constrained latency, relative priorities

Recommendations

The document recommends taking into account the explicit dependencies from RAN-2 and prioritizing discussion around these key dependencies, particularly:

  1. Token communication characteristics and RAN visibility
  2. Packet-level traffic characteristics (size, arrival patterns, delay budgets)
  3. Error tolerance properties and importance differentiation
  4. AI codec study timeline and scope alignment
  5. PDU Set binding and annotation for AI traffic
  6. Multi-modality synchronization requirements
  7. Traffic model inputs and trace data provision
Document Information
Source:
Huawei Tech.(UK) Co.. Ltd
Type:
discussion
For:
Discussion
Original Document:
View on 3GPP
Title: overview of inputs to RAN2#133 on AI traffic characteristics
Agenda item: 11.1
Agenda item description: FS_6G_MED (Study on Media aspects for 6G System)
Doc type: discussion
For action: Discussion
Release: Rel-20
Related WIs: FS_6G_MED
Contact: Rufail Mekuria
Uploaded: 2026-02-05T15:10:17.387000
Contact ID: 104180
Revised to: S4aP260010
TDoc Status: noted
Reservation date: 05/02/2026 14:35:58
Agenda item sort order: 60