All Tags
Browse through all available tags to find articles on topics that interest you.
Browse through all available tags to find articles on topics that interest you.
Showing 1 results for this tag.
UAF: A Unified Audio Front-end LLM for Full-Duplex Speech Interaction
This paper introduces UAF (Unified Audio Front-end LLM), a novel large language model that unifies critical audio front-end tasks like voice activity detection, speaker recognition, and automatic speech recognition into a single end-to-end generative framework. UAF aims to overcome the limitations of traditional cascaded pipelines and enhance full-duplex speech interaction by jointly modeling semantic content and interaction-level control signals.