S4-260220 - AI Summary

Proposed design constraints for noise suppression, DTX, and non-speech inputs


Summary of S4-260220: Design Constraints for Noise Suppression, DTX, and Non-Speech Inputs

1. Background and Context

This contribution addresses design constraints for the ULBC (Ultra Low Bitrate Speech Codec) solution for GEO (geostationary orbit) satellite channels, building on earlier discussions in S4-251881 and S4-251786. The document focuses on three key areas:
- Noise suppression handling
- Discontinuous transmission (DTX) framework
- Robustness to non-speech inputs

Emergency Call Use Case

The contribution emphasizes that emergency calls represent a critical use case for ULBC over GEO, particularly when terrestrial network (TN) service coverage is unavailable. Key considerations include:
- Background signals may contain critical contextual information (e.g., voices, environmental sounds indicating danger)
- Post-call analysis requirements (ASR transcripts, emergency response evaluation, criminal investigations)
- Need for full situational awareness rather than aggressive noise suppression

2. Technical Analysis

2.1 Noise Suppression Trade-offs

The document identifies several technical challenges:

  • Performance requirements alone may be insufficient: Testing with background signals (even using ITU-T P.800 DCR methodology) may not prevent systems from employing aggressive noise suppression that removes critical background information
  • Ultra-low bit rate optimization: At very low bit rates, there is an unknown trade-off between:
      - Applying noise suppression
      - Accepting more coding artifacts
      - Potentially reduced intelligibility in the presence of background signals
  • Device-specific processing: Acknowledges that device-specific noise suppression is standard practice and will likely be applied before ULBC encoding

2.2 Updated Approach

The contribution updates the original proposal from S4-251881 by:
- Maintaining the requirement that any noise suppression within the codec can be disabled
- Adding specific SNR ranges for stationary (5-15 dB) and non-stationary (10-25 dB) noise
- Deferring specific noise type definitions for future discussion
- Linking noise suppression behavior primarily to performance requirements

3. Proposed Design Constraints

The document proposes updates to Table 6.2-1 in draft TR 26.940 with three new/modified constraint parameters:

3.1 Noise Suppression Constraint

Requirement: If noise suppression is supported as part of the candidate codec, it must be possible to disable it to preserve background signals.

Editor's Notes:
- EN1: Requirement to disable may be considered in connection with specific operating bit rate(s)
- EN2: Solution behavior w.r.t. potential noise suppression is primarily enforced via performance requirements; default operation for tests is with noise suppression disabled
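
The requirement above is a logical implication: a codec without built-in noise suppression satisfies it trivially, while a codec with noise suppression satisfies it only if the suppressor can be switched off. A minimal sketch of that check follows. The class and field names are hypothetical and do not come from the contribution or from TR 26.940.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NoiseSuppressionCapability:
    """Hypothetical description of a candidate codec's noise suppression."""
    supported: bool        # codec includes its own noise suppressor
    can_be_disabled: bool  # that suppressor can be switched off


def meets_ns_constraint(cap: NoiseSuppressionCapability) -> bool:
    # "If supported, it must be possible to disable it" is the
    # implication: supported -> can_be_disabled.
    return (not cap.supported) or cap.can_be_disabled


# A codec with no built-in suppressor passes trivially.
print(meets_ns_constraint(NoiseSuppressionCapability(False, False)))
# A codec with an always-on suppressor fails.
print(meets_ns_constraint(NoiseSuppressionCapability(True, False)))
# A codec with a switchable suppressor passes.
print(meets_ns_constraint(NoiseSuppressionCapability(True, True)))
```

Per EN2, tests would run with the suppressor disabled by default, so a check like this would only gate whether a candidate can enter that default test configuration.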

3.2 DTX Framework Constraint

Requirement: The candidate codec shall provide a framework for:
- Voice Activity Detection (VAD)
- Discontinuous Transmission (DTX)
- Comfort Noise Generation (CNG)
- Operation with DTX on or DTX off

Editor's Note: How DTX-on operation interacts with enabling or disabling a potential noise suppressor may need clarification
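
To illustrate how the VAD, DTX, and DTX on/off pieces of such a framework relate, the sketch below decides per frame what an encoder would transmit. It is only an illustration under stated assumptions: the energy-threshold VAD is a toy stand-in, and the frame type names (borrowed from the SID convention of existing 3GPP speech codecs), `vad_threshold`, and `sid_interval` are hypothetical values not taken from the contribution. Comfort noise synthesis at the decoder is not shown.

```python
from enum import Enum


class FrameType(Enum):
    SPEECH = "speech"    # fully encoded frame
    SID = "sid"          # comfort-noise parameter update
    NO_DATA = "no_data"  # nothing transmitted


def frame_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def classify_frames(frames, dtx_enabled, vad_threshold=1e-3, sid_interval=8):
    """Return the frame type an encoder would send for each input frame.

    With DTX off every frame is encoded. With DTX on, inactive frames
    produce one SID frame, then NO_DATA, with a SID refresh every
    `sid_interval` frames.
    """
    decisions = []
    frames_since_sid = None  # None: not currently in an inactive period
    for frame in frames:
        active = frame_energy(frame) >= vad_threshold  # toy VAD decision
        if not dtx_enabled or active:
            decisions.append(FrameType.SPEECH)
            frames_since_sid = None
        elif frames_since_sid is None or frames_since_sid >= sid_interval - 1:
            decisions.append(FrameType.SID)  # start or refresh comfort noise
            frames_since_sid = 0
        else:
            decisions.append(FrameType.NO_DATA)
            frames_since_sid += 1
    return decisions


loud = [0.5] * 160   # stand-in for an active speech frame
quiet = [0.0] * 160  # stand-in for an inactive frame
frames = [loud] * 2 + [quiet] * 10

print([d.value for d in classify_frames(frames, dtx_enabled=True)])
print([d.value for d in classify_frames(frames, dtx_enabled=False)])
```

With DTX on, the ten inactive frames cost two SID frames instead of ten encoded frames. With DTX off, the same input is encoded in full, which is the behavior the "DTX on or DTX off" requirement asks the framework to support.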

3.3 Robustness to Non-Speech Input

Requirement: The candidate codec shall be robust to:
- Noisy speech with stationary noise (5-15 dB SNR)
- Noisy speech with non-stationary noise (10-25 dB SNR)
- Background signals during and between speech segments
- Other non-speech input signals

Editor's Notes:
- EN1: May need to be in performance requirements
- EN2: Relevant background signals to be further defined as part of performance requirements, including both stationary and non-stationary types
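
The SNR ranges above imply test material in which noise is scaled against speech to hit a target ratio. The sketch below shows the underlying arithmetic, SNR_dB = 10 log10(P_speech / P_noise), using plain mean power. That is a simplification: actual test plans may measure speech level differently, and the function names and the sine-tone stand-in for speech are assumptions made for illustration.

```python
import math
import random


def power(signal):
    """Mean power (mean of squared samples)."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(speech, noise, snr_db):
    """Scale noise so speech-to-noise power ratio equals snr_db, then add."""
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")
    p_speech = power(speech)
    p_noise = power(noise)
    if p_speech == 0 or p_noise == 0:
        raise ValueError("speech and noise must both be non-silent")
    # SNR_dB = 10*log10(p_speech / (gain^2 * p_noise)), solved for gain.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))
    scaled_noise = [gain * n for n in noise]
    mixed = [s + n for s, n in zip(speech, scaled_noise)]
    return mixed, scaled_noise


def measured_snr_db(speech, scaled_noise):
    """SNR in dB computed back from the two components."""
    return 10 * math.log10(power(speech) / power(scaled_noise))


# SNR ranges quoted in the summary, in dB.
STATIONARY_SNR_DB = (5, 15)
NON_STATIONARY_SNR_DB = (10, 25)

rng = random.Random(0)
# One second at 16 kHz: a 200 Hz tone stands in for speech.
speech = [math.sin(2 * math.pi * 200 * n / 16000) for n in range(16000)]
noise = [rng.gauss(0, 1) for _ in range(16000)]

for target in STATIONARY_SNR_DB + NON_STATIONARY_SNR_DB:
    _, scaled = mix_at_snr(speech, noise, target)
    print(target, round(measured_snr_db(speech, scaled), 6))
```

Each printed pair shows the requested SNR next to the value measured back from the scaled components, which should agree to within floating-point rounding.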

4. Key Technical Contributions

  1. Balanced approach to noise suppression: Recognizes both the need for flexibility in noise suppression (for speech quality) and the critical requirement to preserve background signals (for emergency use cases)

  2. Mandatory DTX framework: Establishes VAD/DTX/CNG as a required feature rather than optional, with explicit on/off control

  3. Quantified robustness requirements: Provides specific SNR ranges for different noise conditions that the codec must handle

  4. Testing methodology guidance: Proposes default testing with noise suppression disabled, while allowing performance requirements to govern overall behavior

5. Open Issues

Several editor's notes indicate areas requiring further work:
- Specific operating bit rates where noise suppression disable requirement applies
- Clarification of DTX and noise suppression interaction
- Final placement of robustness requirements (design constraints vs. performance requirements)
- Definition of specific background signal types for testing
- Speech quality requirements (to be addressed separately in performance requirements)

Document Information
Source: Nokia
Type: discussion
For: Agreement
Title: Proposed design constraints for noise suppression, DTX, and non-speech inputs
Agenda item: 7.8
Agenda item description: FS_ULBC (Study on Ultra Low Bitrate Speech Codec)
Contact: Lasse Laaksonen
Uploaded: 2026-02-03T20:42:58.227000
Contact ID: 45550
TDoc Status: noted
Reservation date: 03/02/2026 19:19:04
Agenda item sort order: 20