Comparison of Structure Based Models for Handwritten English Character Recognition
DOI:
https://doi.org/10.26438/ijcse/v5i8.126130Keywords:
Character recognition, stroke detector, codewords, spatially embedded dictionary, part based modelAbstract
Characters are the symbols made by man that are composed of different structure and strokes for easy communication. The intrinsic characteristics of the characters can be utilized to design the stroke and structure based models for handwritten character recognition. This paper focus to learn the part based and the stroke detector based models to recognize the characters by detecting the elastic strokes. The Tree Structured Model (TSM) and the Mixture of parts Tree Structured Model (MTSM) are the part based models that uses the trained part models on the images to recognize the characters. These models require manually labelled key points. In order to learn the discriminative stroke detectors automatically, the discriminative spatiality embedded dictionary learning-based representation (DSEDR) is used for character recognition. A comparative study is made on all the three models on the chars74k dataset to determine the model that shows the best performance.
References
Cun-Zhao Shi, Song Gao, Meng-Tao Liu, Cheng-Zuo Qi, Chun-Heng Wang, "Stroke detector and structure based models for Character Recognition: A Comparative Study" in IEEE transactions on image processing, volume 24, no. 12, Dec 2015.
C. Yao, X. Bai, B. Shi, and W. Liu, “Strokelets: A learned multi-scale representation for scene text recognition,” in the Proceedings CVPR, Jun. 2014, pp. 4042–4049.
S. Gao, C. Wang, B. Xiao, C. Shi, and Z. Zhang, “Stroke bank: A high level representation for scene character recognition,” in the Proceedings on 22nd International Conerence Pattern Recognition (ICPR), Aug. 2014, pp. 2909–2913.
C. Shi, C. Wang, B. Xiao, Y. Zhang, S. Gao, and Z. Zhang, “Scene text recognition using part-based tree-structured character detection,” In the Proceedings of IEEE Conference Computer Vision and Pattern Recognition (CVPR), Jun. 2013, pp. 2961–2968.
T. E. de Campos, B. R. Babu, and M. Varma, “Character recognition in natural images,” In the Proceedings of VISAPP, 2013, pp. 273–280.
S. Tian, S. Lu, B. Su, and C. L. Tan, “Scene text recognition using co-occurrence of histogram of oriented gradients,” in the Proceedings on 12th International Conference Document Analytical Recognition. (ICDAR), Aug. 2013, pp. 912–916.
L. Wang, M. Zeiler, S. Zhang, Y. L. Cun, and R. Fergus, “Regularization of Neural Networks using drop connect,” in Proceedings 30th International Conference Machine Learning (ICML), 2013, pp. 1058–1066.
X. Zhu and D. Ramanan, “Face detection, pose estimation, and landmark
localization in the wild,” in Proceedings CVPR, Jun. 2011, pp. 2879–2886.
D. L. Smith, J. Field, and E. Learned-Miller, “Enforcing similarity constraints with integer programming for better scene text recognition,” in Proceedings CVPR, Jun. 2011, pp. 73–80.
L. Neumann and J. Matas, “A method for text localization and recognition in real-world images,” in Proceedings Asian Conference Computer Vision, 2011, pp. 770–783.
A. Bissacco, M. Cummins, Y. Netzer, and H. Neven, “PhotoOCR: Reading text in uncontrolled conditions,” in Proceedings IEEE International Conference Computer Vision, Dec. 2011, pp. 785–792.
[ 12] Y. Yang and D. Ramanan, “Articulated pose estimation with flexible mixtures-of- parts,” in Proceedings CVPR, Jun. 2011, pp. 1385–1392.
T. E. de Campos, B. R. Babu, and M. Varma, “Character recognition in natural images,” in Proceedings VISAPP, 2009, pp. 273–280.
C. Yi, X. Yang, and Y. Tian, “Feature representations for scene text character recognition: A comparative study,” in Proceedings IEEE 12th International Conference Document Analytics Recognition Aug. 2011, pp. 907–911.
T. Wang, D. J. Wu, A. Coates, and A. Y. Ng, “End-to-end text recognition with convolutional neural networks,” in Proceedings 21st International Conference Pattern Recognition (ICPR), Nov. 2012, pp. 3304–3308.
P. F. Felzenszwalb and D. P. Huttenlocher, “Pictorial structures for object recognition” in Proceedings Computer Vision., vol. 61, no. 1, pp. 55–79, Jan. 2005.
N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” in Proceedings CVPR, vol. 1. Jun. 2005, pp. 886–893.
S. M. Lucas, A. Panaretos, L. Sosa, A. Tang, S. Wong, and R. Young, “ICDAR 2003 robust reading competitions,” in Proceedings ICDAR, vol. 2. Aug. 2003, pp. 682–687.
C.-L. Liu, K. Nakashima, H. Sako, and H. Fujisawa, “Handwritten digit recognition: Benchmarking of state-of-the-art techniques,” Pattern Recognition., volume 36, no. 10, pp. 2271–2285, Oct. 2003.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
