ZHANG Qing-qing, LIU Yong, PAN Jie-lin, YAN Yong-hong. Continuous speech recognition by convolutional neural networks[J]. Chinese Journal of Engineering, 2015, 37(9): 1212-1217. DOI: 10.13374/j.issn2095-9389.2015.09.015
Citation: ZHANG Qing-qing, LIU Yong, PAN Jie-lin, YAN Yong-hong. Continuous speech recognition by convolutional neural networks[J]. Chinese Journal of Engineering, 2015, 37(9): 1212-1217. DOI: 10.13374/j.issn2095-9389.2015.09.015

Continuous speech recognition by convolutional neural networks

  • Convolutional neural networks (CNNs), which show success in achieving translation invariance for many image processing tasks, were investigated for continuous speech recognition. Compared to deep neural networks (DNNs), which are proven to be successful in many speech recognition tasks nowadays, CNNs can reduce the neural network model sizes significantly, and at the same time achieve even a better recognition accuracy. Experiments on standard speech corpus TIMIT and conversational speech corpus show that CNNs outperform DNNs in terms of the accuracy and the generalization ability.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return