Comparison of Structure Based Models for Handwritten English Character Recognition

Authors

Bhavana Shastry M Dept. of CSE, Bapuji Institute of Engineering and Technology, Davanagere, India
Pradeep N Dept. of CSE, Bapuji Institute of Engineering and Technology, Davanagere, India

DOI:

https://doi.org/10.26438/ijcse/v5i8.126130

Keywords:

Character recognition, stroke detector, codewords, spatially embedded dictionary, part based model

Abstract

Characters are the symbols made by man that are composed of different structure and strokes for easy communication. The intrinsic characteristics of the characters can be utilized to design the stroke and structure based models for handwritten character recognition. This paper focus to learn the part based and the stroke detector based models to recognize the characters by detecting the elastic strokes. The Tree Structured Model (TSM) and the Mixture of parts Tree Structured Model (MTSM) are the part based models that uses the trained part models on the images to recognize the characters. These models require manually labelled key points. In order to learn the discriminative stroke detectors automatically, the discriminative spatiality embedded dictionary learning-based representation (DSEDR) is used for character recognition. A comparative study is made on all the three models on the chars74k dataset to determine the model that shows the best performance.

References

Cun-Zhao Shi, Song Gao, Meng-Tao Liu, Cheng-Zuo Qi, Chun-Heng Wang, "Stroke detector and structure based models for Character Recognition: A Comparative Study" in IEEE transactions on image processing, volume 24, no. 12, Dec 2015.

C. Yao, X. Bai, B. Shi, and W. Liu, “Strokelets: A learned multi-scale representation for scene text recognition,” in the Proceedings CVPR, Jun. 2014, pp. 4042–4049.

S. Gao, C. Wang, B. Xiao, C. Shi, and Z. Zhang, “Stroke bank: A high level representation for scene character recognition,” in the Proceedings on 22nd International Conerence Pattern Recognition (ICPR), Aug. 2014, pp. 2909–2913.

C. Shi, C. Wang, B. Xiao, Y. Zhang, S. Gao, and Z. Zhang, “Scene text recognition using part-based tree-structured character detection,” In the Proceedings of IEEE Conference Computer Vision and Pattern Recognition (CVPR), Jun. 2013, pp. 2961–2968.

T. E. de Campos, B. R. Babu, and M. Varma, “Character recognition in natural images,” In the Proceedings of VISAPP, 2013, pp. 273–280.

S. Tian, S. Lu, B. Su, and C. L. Tan, “Scene text recognition using co-occurrence of histogram of oriented gradients,” in the Proceedings on 12th International Conference Document Analytical Recognition. (ICDAR), Aug. 2013, pp. 912–916.

L. Wang, M. Zeiler, S. Zhang, Y. L. Cun, and R. Fergus, “Regularization of Neural Networks using drop connect,” in Proceedings 30th International Conference Machine Learning (ICML), 2013, pp. 1058–1066.

X. Zhu and D. Ramanan, “Face detection, pose estimation, and landmark

localization in the wild,” in Proceedings CVPR, Jun. 2011, pp. 2879–2886.

D. L. Smith, J. Field, and E. Learned-Miller, “Enforcing similarity constraints with integer programming for better scene text recognition,” in Proceedings CVPR, Jun. 2011, pp. 73–80.

L. Neumann and J. Matas, “A method for text localization and recognition in real-world images,” in Proceedings Asian Conference Computer Vision, 2011, pp. 770–783.

A. Bissacco, M. Cummins, Y. Netzer, and H. Neven, “PhotoOCR: Reading text in uncontrolled conditions,” in Proceedings IEEE International Conference Computer Vision, Dec. 2011, pp. 785–792.

[ 12] Y. Yang and D. Ramanan, “Articulated pose estimation with flexible mixtures-of- parts,” in Proceedings CVPR, Jun. 2011, pp. 1385–1392.

T. E. de Campos, B. R. Babu, and M. Varma, “Character recognition in natural images,” in Proceedings VISAPP, 2009, pp. 273–280.

C. Yi, X. Yang, and Y. Tian, “Feature representations for scene text character recognition: A comparative study,” in Proceedings IEEE 12th International Conference Document Analytics Recognition Aug. 2011, pp. 907–911.

T. Wang, D. J. Wu, A. Coates, and A. Y. Ng, “End-to-end text recognition with convolutional neural networks,” in Proceedings 21st International Conference Pattern Recognition (ICPR), Nov. 2012, pp. 3304–3308.

P. F. Felzenszwalb and D. P. Huttenlocher, “Pictorial structures for object recognition” in Proceedings Computer Vision., vol. 61, no. 1, pp. 55–79, Jan. 2005.

N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” in Proceedings CVPR, vol. 1. Jun. 2005, pp. 886–893.

S. M. Lucas, A. Panaretos, L. Sosa, A. Tang, S. Wong, and R. Young, “ICDAR 2003 robust reading competitions,” in Proceedings ICDAR, vol. 2. Aug. 2003, pp. 682–687.

C.-L. Liu, K. Nakashima, H. Sako, and H. Fujisawa, “Handwritten digit recognition: Benchmarking of state-of-the-art techniques,” Pattern Recognition., volume 36, no. 10, pp. 2271–2285, Oct. 2003.

Downloads

PDF ⁰

Published

2025-11-11

CITATION

DOI: 10.26438/ijcse/v5i8.126130

Published: 2025-11-11

How to Cite

[1]

M. Bhavana Shastry and N. Pradeep, “Comparison of Structure Based Models for Handwritten English Character Recognition”, Int. J. Comp. Sci. Eng., vol. 5, no. 8, pp. 126–130, Nov. 2025.

Download Citation

Issue

Vol. 5 No. 8 (2017): IJCSE August Edition

Section

Research Article

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.

Comparison of Structure Based Models for Handwritten English Character Recognition

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Journal Information

UGC Gazette Regulation

Join Editorial Board

Information

Current Issue

Keywords