News

Abstract: There are three approaches to multilingual and crosslingual automatic speech recognition (MCL-ASR): supervised pretraining with phonetic or graphemic transcription, and self-supervised ...
Abstract: Speech emotion recognition (SER) technology analyzes speech signals to automatically identify the speaker’s emotional state. However, existing methods overlook feature extraction based on ...