Python PyQt Convert Speech Recognition CodeSource

Multimodal Speech Recognition Assisted by Slide Information in Classroom Scenes

Abstract: Multimodal automatic speech recognition (ASR) technology has attracted much attention because it improves the accuracy of speech recognition by adding other modal information. However, most ...

IEEE

Enhancing Speech Emotion Recognition with Explainable AI and Feature Importance Analysis

Abstract: This study proposes a lightweight 1D-CNN architecture that integrates Explainable AI (XAI) techniques to address the interpretability gap in Speech Emotion Recognition (SER). The model ...

GitHub

DePasqualeOrg/mlx-audio-plus

The best audio processing library built on Apple's MLX framework, providing fast and efficient text-to-speech (TTS), speech-to-text (STT), and speech-to-speech (STS) on Apple Silicon. Kokoro Fast, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Multimodal Speech Recognition Assisted by Slide Information in Classroom Scenes

Enhancing Speech Emotion Recognition with Explainable AI and Feature Importance Analysis

DePasqualeOrg/mlx-audio-plus

Trending now