Spoken Language Identification

SLID, Speech Signal Processing, Deep Learning, Jadavpur University, 2020

This aims at identifying language from speech data. Many Indian languages have similar phonemes, which makes it challenging to separate them. This project uses MFCC features to develop diffent algorithms to tackle this problem.

This contains the description using SVM classifier. This was considerably improved by using the fresh algorithm for feature extraction and selection from the MFCC timeseries. These hand-engineered features have semantic meaning associated with them, making the model explainable. These features were then used by a shallow MLP to achieve state-of-the-art performance. This is the associated document, and the corresponding code can be found here.

Share on

Twitter Facebook LinkedIn

Mainak Biswas

Share on