Blog posts

Supervised Speech Enhancement with Self-Attention

28 minute read

Published: September 07, 2024

This article introduces a Deep Generative Speech Enhancement model that utilizes a hybrid architecture combining U-Net and Transformer models. The model is trained in a supervized manner to remove various types of noise from audio signals, enhancing the clarity and quality of speech. We have tested the model on several noise conditions, demonstrating its effectiveness across different environments.

Mouad Taleb

Blog posts

2024

Supervised Speech Enhancement with Self-Attention