Publications
See Google Scholar or publications.
"_" denotes that the student was under the supervision of Prof. Wang while conducting the work.
Preprints
- P. Shen, K. Chen, S. He, P. Chen, S. Yuan, H. Kong, X. Zhang, and Z.-Q. Wang, "Listen to Extract: Onset-Prompted Target Speaker Extraction", in arxiv preprint arXiv:2505.05114, 2025. [Sound Demo]
- Z.-Q. Wang, "ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech Enhancement", in arxiv preprint arXiv:2407.19485, 2024. [Sound Demo]
2026
- Y. Masuyama, X. Chang, W. Zhang, S. Cornell, Z.-Q. Wang, N. Ono,Y. Qian, and S. Watanabe, "An End-to-End Integration of Speech Separation and Recognition with Self-Supervised Learning Representation", in Computer Speech & Language (CSL), vol. 95, issue 101813, pp. 1-18, 2026.
2025
- Z.-Q. Wang, "SuperM2M: Supervised and Mixture-to-Mixture Co-Learning for Speech Enhancement and Noise-Robust ASR", in Neural Networks (NN), vol. 188, issue 107408, pp. 1-16, 2025. [Sound Demo]
- Z. Xu, X. Fu, Z.-Q. Wang, X. Jiang, and R. Roy Choudhury, "Unsupervised Blind Speech Separation with A Diffusion Prior", in International Conference on Machine Learning (ICML), accepted, 2025. [Sound Demo] [Code]
- S. Araki, N. Ito, R. Haeb-Umbach, G. Wichern, Z.-Q. Wang, and Y. Mitsufuji, "30+ Years of Source Separation Research: Achievements and Future Challenges", in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2025.
- P. Shen, X. Zhang, and Z.-Q. Wang, "ARiSE: Auto-Regressive Multi-Channel Speech Enhancement", in Interspeech, 2025.
- F. Zhao, X. Zhang, and Z.-Q. Wang, "Multi-Channel Acoustic Echo Cancellation Based on Direction-of-Arrival Estimation", in Interspeech, 2025.
- L. Fu, Y. Liu, Z. Liu, Z. Yang, Z.-Q. Wang, Y. Li, and H. Kong, "AuralNet: Hierarchical 3D Binaural Localization of Overlapping Speakers", in Interspeech, 2025.
2024
- Z.-Q. Wang, "USDnet: Unsupervised Speech Dereverberation via Neural Forward Filtering", in IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), vol. 32, pp. 3882-3895, 2024. [Sound Demo]
- Z.-Q. Wang, "Mixture to Mixture: Leveraging Close-talk Mixtures as Weak-supervision for Speech Separation", in IEEE Signal Processing Letters (IEEE SPL), vol. 31, pp. 1715-1719, 2024. [Sound Demo]
- Z.-Q. Wang, A. Kumar, and S. Watanabe, "Cross-Talk Reduction", in International Joint Conference on Artificial Intelligence (IJCAI), pp. 5171-5180, 2024. [Sound Demo] [Poster] [Slide]