DeepXi
Visit ToolDeepXi is an Audio & Music tool that uses deep learning for a priori SNR estimation. It is implemented in TensorFlow 2/Keras for speech enhancement and robust ASR.
At a glance
Trending
Also listed in
DeepXi is an Audio & Music tool that uses deep learning for a priori SNR estimation. It is implemented in TensorFlow 2/Keras for speech enhancement and robust ASR.
Trending
Also listed in
About
DeepXi is a deep learning framework implemented in TensorFlow 2/Keras, designed for a priori Signal-to-Noise Ratio (SNR) estimation. This tool is primarily used for speech enhancement, noise estimation, and mask estimation, and can also serve as a front-end for robust Automatic Speech Recognition (ASR). It supports various deep neural network architectures, including MHANet, RDLNet, ResNet, ResLSTM, and ResBiLSTM, to efficiently model noisy speech. DeepXi offers both causal and non-causal versions of its models, providing flexibility for different application requirements. It operates on mono/single-channel audio at a standard sampling frequency of 16000 Hz, with configurable window duration and shift. The tool supports common audio codecs like .wav, .mp3, and .flac, and provides pre-trained models and datasets for research and development.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending