Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...
Abstract: Respiratory sounds serve as early indicators of lung diseases. The development of computer-aided classification systems has become a key enabler for timely diagnosis and treatment. The ...
This repo is the implementation of a research project aimed at enhancing Acoustic Side-Channel Attacks (ASCAs) using a novel combination of Vision Transformers (VTs) and Large Language Models (LLMs).
An open-source framework for analyzing birdsong recordings through acoustic feature extraction, dimensionality reduction, and neural audio synthesis. Transform audio signals into interactive 3D ...