Tao Sun’s Post

Neuromorphic Algorithm Researcher focused on uncertainty estimation and speech processing

We are thrilled to announce our latest paper, now available on [arXiv](https://lnkd.in/grgvEftD)! 📚 🔬

Title: DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement

In our recent work, we tackle a crucial issue in SNN-based speech enhancement: latency. While neuromorphic hardware is designed for low latency, some algorithms and implementations can still introduce significant delays. In speech enhancement, the Short-Time Fourier Transform (STFT), a common preprocessing step in frequency-domain approaches, can be a significant source of latency. Inspired by the success of high-performance, low-latency deep learning models, we have developed a time-domain SNN framework that achieves the very low latency required for applications like hearing aids.

Key Contributions of Our Paper:

1. Innovative Solution: We introduce a novel two-phase time-domain streaming SNN framework that effectively addresses latency while ensuring high accuracy and power efficiency.
2. Latency Optimization: Traditional methods often suffer from latency due to long sampling windows, such as 32 ms. Our time-domain approach significantly reduces this latency, meeting the stringent requirements of real-time applications like hearing aids, which demand latencies under 5 ms.
3. Competitive Performance: Our framework not only reduces latency but also achieves competitive performance compared to current SNN models, pushing the boundaries of what’s possible in speech enhancement.

Explore the full details of our work on [arXiv](https://lnkd.in/grgvEftD) and discover how our innovations are advancing the practical applications of neuromorphic computing in this vital field. We look forward to your feedback and discussions!

#SpeechEnhancement #SNN #LatencyReduction #DeepLearning #NeuralNetworks #RealTimeProcessing #AI #TechInnovation #arXiv #HearingAids
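The window-length argument in the post can be checked with simple arithmetic. The sketch below is only an illustration, not the paper's method: it assumes a 16 kHz sample rate and a hypothetical 2 ms time-domain frame (neither figure comes from the post), and it counts only the buffering delay of waiting for one full window, ignoring compute and hardware delays.

```python
def samples_for_ms(duration_ms: float, sample_rate_hz: int) -> int:
    """Number of audio samples that span the given duration."""
    return round(duration_ms * sample_rate_hz / 1000.0)


def frame_latency_ms(window_samples: int, sample_rate_hz: int) -> float:
    """Buffering delay: a full window must arrive before it can be processed."""
    return 1000.0 * window_samples / sample_rate_hz


SAMPLE_RATE_HZ = 16_000        # assumption, not stated in the post
HEARING_AID_BUDGET_MS = 5.0    # latency limit quoted in the post

# 32 ms window, the example given in the post for frequency-domain methods.
stft_window = samples_for_ms(32, SAMPLE_RATE_HZ)
# Hypothetical short time-domain frame, chosen only for illustration.
short_window = samples_for_ms(2, SAMPLE_RATE_HZ)

for name, window in [("32 ms window", stft_window), ("2 ms frame", short_window)]:
    latency = frame_latency_ms(window, SAMPLE_RATE_HZ)
    verdict = "within" if latency < HEARING_AID_BUDGET_MS else "over"
    print(f"{name}: {window} samples, {latency:.1f} ms buffering, {verdict} budget")
```

Under these assumptions, the 32 ms window alone already exceeds the 5 ms budget before any processing happens, which is why shortening the analysis frame matters for this use case.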

DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement

arxiv.org

Tao Wang

Director of Machine Learning, PhD

6mo

Congratulations Tao Sun, Ph.D.!
