S4-260155 - AI Proposals

[FS_ULBC] Analysis of AI Codec Real-Time Performance (RTF) and Complexity Scaling

Back to Agenda Download Proposals
AI-Generated Proposals AI

Proposal: It is proposed to include the findings of this RTF analysis in TR 26.940 to inform the selection of complexity constraint for the ULBC candidate.

Document Information
Source:
vivo Mobile Communication Co., Xiaomi Technology, Spreadtrum, Bytedance
Type:
pCR
For:
Agreement
Original Document:
View on 3GPP
Title: [FS_ULBC] Analysis of AI Codec Real-Time Performance (RTF) and Complexity Scaling
Agenda item: 7.8
Agenda item description: FS_ULBC (Study on Ultra Low Bitrate Speech Codec)
Doc type: pCR
For action: Agreement
Abstract: As part of the study on the new Ultra Low Bitrate Speech Codec (ULBC) [1], it is necessary to establish complexity constraints that reflect real-world device capabilities. Previous contributions have analyzed theoretical complexity using static metrics such as FLOPs and WMOPS [2] [5]. However, static metrics often fail to capture system-level bottlenecks, such as memory bandwidth pressure and thermal constraints on mobile System-on-Chips (SoCs). This contribution presents a comprehensive performance analysis of a neural audio codec (based on the Descript Audio Codec architecture) running on a representative mid-range mobile platform. By sweeping across model sizes (1M to 74M parameters) and sample rates (8, 16, 32 kHz), we evaluate the correlation between theoretical complexity and the Real-Time Factor (RTF).
Release: Rel-20
Specification: 26.94
Version: 0.4.0
Related WIs: FS_ULBC
Spec: 26.94
Contact: Wang Dong
Uploaded: 2026-02-03T13:43:09.937000
Contact ID: 107237
Revised to: S4-260445
TDoc Status: revised
Reservation date: 03/02/2026 12:35:47
Agenda item sort order: 20