Regularized Urdu Speech Recognition with Semi-Supervised Deep Learning

Regularized Urdu Speech Recognition with Semi-Supervised Deep Learning

Mohammad Ali Humayun;Ibrahim A. Hameed;Syed Muslim Shah;Sohaib Hassan Khan;Irfan Zafar;Saad Bin Ahmed;Junaid Shuja;Ali Humayun, Mohammad;Hameed, Ibrahim A.;Muslim Shah, Syed;Hassan Khan, Sohaib;Zafar, Irfan;Bin Ahmed, Saad;Shuja, Junaid;
applied sciences 2019 Vol. 9 pp. 1956-
114
humayun2019appliedregularized

Abstract

Automatic Speech Recognition, (ASR) has achieved the best results for English, with end-to-end neural network based supervised models. These supervised models need huge amounts of labeled speech data for good generalization, which can be quite a challenge to obtain for low-resource languages like Urdu. Most models proposed for Urdu ASR are based on Hidden Markov Models (HMMs). This paper proposes an end-to-end neural network model, for Urdu ASR, regularized with dropout, ensemble averaging and Maxout units. Dropout and ensembles are averaging techniques over multiple neural network models while Maxout are units in a neural network which adapt their activation functions. Due to limited labeled data, Semi Supervised Learning (SSL) techniques are also incorporated to improve model generalization. Speech features are transformed into a lower dimensional manifold using an unsupervised dimensionality-reduction technique called Locally Linear Embedding (LLE). Transformed data along with higher dimensional features is used to train neural networks. The proposed model also utilizes label propagation-based self-training of initially trained models and achieves a Word Error Rate (WER) of 4% less than that reported as the benchmark on the same Urdu corpus using HMM. The decrease in WER after incorporating SSL is more significant with an increased validation data size.

Citation

ID: 268289
Ref Key: humayun2019appliedregularized
Use this key to autocite in SciMatic or Thesis Manager

References

Blockchain Verification

Account:
NFT Contract Address:
0x95644003c57E6F55A65596E3D9Eac6813e3566dA
Article ID:
268289
Unique Identifier:
10.3390/app9091956
Network:
Scimatic Chain (ID: 481)
Loading...
Blockchain Readiness Checklist
Authors
Abstract
Journal Name
Year
Title
5/5
Creates 1,000,000 NFT tokens for this article
Token Features:
  • ERC-1155 Standard NFT
  • 1 Million Supply per Article
  • Transferable via MetaMask
  • Permanent Blockchain Record
Blockchain QR Code
Scan with Saymatik Web3.0 Wallet

Saymatik Web3.0 Wallet