Wang Kai, Huang Hao, Hu Ying, Huang Zhihua, Li Sheng. End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain[C]//Interspeech. 2021: 3046-3050.
Wang Kai, Peng Yizhou, Huang Hao, Hu Ying, Li Sheng. Mining hard samples locally and globally for improved speech separation[C]//International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022: 6037-6041.
Wang Kai, Yang Yuhang, Huang Hao, Hu Ying, Li Sheng. Speakeraugment: Data Augmentation for Generalizable Source Separation via Speaker Parameter Manipulation[C]//International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2023: 1-5.
Huang Hao, Wang Kai, Hu Ying, Li Sheng. Encoder-decoder based pitch tracking and joint model training for Mandarin tone classification[C]//International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021: 6943-6947.
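Wang Kai, Li Minghe, Huang Zhihua, Huang Hao. A time-domain pitch-aware speech separation method[J]. Journal of Xinjiang University (Natural Science Edition in Chinese and English), 2022, 39(2). (In Chinese)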
Wang K, Liu J, Huang P H. Neural RAPT: deep learning-based pitch tracking with prior algorithmic knowledge instillation[J]. International Journal of Speech Technology, 2023, 26(4): 999-1015.
Lai S, He M, Zhao Z, Wang K, Huang H, Yang J. Synthesizing Long-Form Speech merely from Sentence-Level Corpus with Content Extrapolation and LLM Contextual Enrichment[C]//Interspeech. 2024.
Wu D, Jiang L, Yin L, Wang K, Huang H. Dual Level Intent-Slot Interaction for Improved Multi-Intent Spoken Language Understanding[C]//ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2024: 12301-12305.