Optimized Neural Network Architecture for The Classification of Voice Signals
DOI:
https://doi.org/10.26438/ijcse/v6i9.502506Keywords:
Digital signal processing, Optimize neural network, Pattern classificationAbstract
In this paper, the performance to optimize feed-forward neural network has been evaluated for the classification of voice signals of English alphabets. There are various feed forward neural network models have been used earlier but the selection of optimize architecture is a challenge. In this paper we are implementing a optimize architecture which is best suitable for the classification of voice signals. Digital signal processing operations are applied on analog speech signals to convert them into digital form and then to make them suitable for further processing by neural network models.
References
[1] P. Rani, S. Kakkar and S. Rani, “Speech Recognition Using Neural Network”, In Proceedings of International Conference on Advancements in Engineering and Technology, International Journal of Computer Applications, pp. 11-14, 2015.
[2] M. A. Anusuya and S. K. Katti, “Speech Recognition by Machine: A Review”, International Journal of Computer Science and Information Security, pg. 181 – 205, Vol. 6, No. 3, 2009.
[3] X. Cui et.al., “A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition”, IEEE Transactions On Audio, Speech, And Language Processing, Vol. 15, No. 4, 2007.
[4] G.E. Dahl, M. Ranzato, A. Mohamed and G.E. Hinton, “Phone Recognition with the Mean-covariance Restricted Boltzmann Machine”, Adv. Neural Inf. Process. Syst., No. 23, 2010.
[5] D. Yu, L. Deng and G. Dahl, “Roles of Pre-training and Fine-tuning in Context-dependent DBN-HMMs for Real-world Speech Recognition”, In Proceedings of NIPS Workshop Deep Learn, Unsupervised Feature Learn, 2010.
[6] H. Bourland and C.J. Wellekens, “Multilayer Perceptrons and Automatic Speech Recognition”, IEEE First International Conference on Neural Networks, San Diego, California IV-407-IV-416, June 21-24, 1987.
[7] H. Yashwanth, H. Mahendrakar and S. David, “Automatic Speech Recognition Using Audio Visual Cues”, IEEE India Annual Conference, pp. 166-169, 2004.
[8] Robinson and F. Fallside, “A Recurrent Error Propagation Network Speech Recognizer System”, Computer, Speech and Language, Vol. 5, No. 3, 1991.
[9] L. Yang and Z. Yang, “Study on Audio Signal’s Classification Based on BP Neural Network”, IEEE Conference Publications on Artificial Intelligence, Management Science and Electronic Commerce, pp. 5153-5155, 2011.
[10] S. Balochian, E. A. Seidabadand S. Z. Rad, “ Neural Network Optimization Genetic Algorithms for the Audio Classification to Speech and Music”, International Journal of Signal Processing, Image Processing and Pattern Recognition, pg. 47-54, Vol. 6, No. 3, 2013.
[11] T. Lefteri H. and A. U. Robert, “Fuzzy and Neural Approaches in Engineering”, John Wiley and Sons Publications, 1997.
[12] R. Hecht-Nielsen, “Theory of Backpropagation Neural Network”, International Joint Conference on Neural Networks, pp. 593-605, Vol. 1, 1989.
[13] M.J.D. Powell, “Radial Basis Functions for Multivariate Interpolation: A Review”, In Algorithms for the Approximation of Functions and Data, J.C. Mason and M.G. Cox, eds., Clarendon Press, pp. 143-167, 1987.
[14] S. N. Parappa and M. P. Singh, “Conjugate Descent of Gradient Descent Radial Basis Function for Generalization of Feed-forward Neural Network”, International Journal of Advancements in Research & Technology, pg. 112-125, Vol. 2, Issue 12, 2013.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
