Optical Music Recognition using Image Processing and Machine Learning

Authors

  • Mathew P School of Mathematics and Computer Science, Indian Institute of Technology, Goa, India
  • Vijayakumar R School of Mathematics and Computer Science, Heriot-Watt University, Dubai, UAE
  • Kuriakose AT Department of Computer Science and Engineering, Rajiv Gandhi Institute of Technology, Kerala, India
  • Sunny J Department of Computer Science and Engineering, Vidya Academy of Science and Technology, Kerala, India
  • Bai RV Department of Computer Science and Engineering, Vidya Academy of Science and Technology, Kerala, India

DOI:

https://doi.org/10.26438/ijcse/v6si10.1823

Keywords:

Optical Music Recognition, Image Processing, RLE, Classification, Machine Learning

Abstract

The ability to understand music score is a basic requirement for learning music. This paper proposes a mathematical method to find the pitch of a musical note from digital image of sheet music and a classification-based method for detecting the duration of a music note. In a sheet music, the horizontal direction can be associated with the notes starting time, whilst the vertical direction can be associated with pitch. The symbols used for a note represents its duration. Music scores sometimes need to be transposed or slightly modified, having the score in a digital format greatly reduces the time and effort required to do these. In this paper, we make use of techniques such as Run Length Encoding (RLE), Horizontal projection and Vertical Projection (X & Y projections) for Segmentation and attribute extraction. For note recognition, a classifier based system is used which returns the duration of the given input symbol. The pitch, duration and position of notes are finally given as input to a midi generation module, which generates a MIDI file corresponding to the given input music notation. There are several other applications to Optical Music Recognition (OMR) systems. Converting music scores in Braille code for the blind is yet another application of an OMR system

References

[1] A. F. Desaedeleer, “Reading sheet music – openomr”, Imperial College London,(University of London), http://sourceforge.net/projects/openomr/.

[2] R. J. Baugh Earl Gose, “Pattern Recognition and Image Analysis”, In Pattern Recognition and Image Analysis, volume 1, 2011.

[3] M. Hall, E. Frank , G. Holmes , B. Pfahringer , P. Reutemann , H. Ian : “The WEKA Data Mining Software: An Update”. SIGKDD Explorations 11, 2009.

[4] C.E. Shannon: “A Mathematical Theory of Communication”. The Bell System Technical Journal 27, 379–423, 623–656, July, October 1948.

[5] Online. “Note names, MIDI numbers and frequencies”. http://www.phys.unsw.edu.au/jw/notes.html, June 2005.

[6] Online. “Midiutil - A Python interface for writing multi-track MIDI Files”. https://code.google.com/p/midiutil/, December 2013.

[7] Online. “PythonInMusic”. https://wiki.python.org/moin/PythonInMusic,December 2013.

[8] Online. “Note value”. http://en.wikipedia.org/wiki/Note_value, March 2014.

[9] Online. “Attribute-Relation File Format (ARFF)” https://www.cs.waikato.ac.nz/ml/weka/arff.html, November 2008.

Downloads

Published

2025-11-17
CITATION
DOI: 10.26438/ijcse/v6si10.1823
Published: 2025-11-17

How to Cite

[1]
P. Mathew, R. Vijayakumar, A. T. Kuriakose, J. Sunny, and R. V. Bai, “Optical Music Recognition using Image Processing and Machine Learning”, Int. J. Comp. Sci. Eng., vol. 6, no. 10, pp. 18–23, Nov. 2025.