EFFICIENT IMAGE COMPRESSION AND DECOMPRESSION ALGORITHMS FOR OCR SYSTEMS

Boban Arizanović; Vladan Vučković

DOI Number

10.2298/FUEE1803461A

First page

461

Last page

485

Abstract

This paper presents efficient new image compression and decompression methods for document images, intended for use in the pre-processing stage of an OCR system designed for the needs of the “Nikola Tesla Museum” in Belgrade. The proposed image compression methods exploit the Run-Length Encoding (RLE) algorithm and an algorithm based on document character contour extraction, while an iterative scanline fill algorithm is used for image decompression. The compression and decompression methods are compared with the JBIG2 and JPEG2000 image compression standards. Segmentation accuracy results for ground-truth documents are obtained in order to evaluate the proposed methods. The results show that the proposed methods outperform JBIG2 compression in terms of time complexity, providing up to 25 times lower processing time at the expense of a worse compression ratio. They also outperform the JPEG2000 image compression standard, providing up to a 4-fold improvement in compression ratio. Finally, the time complexity results show that the presented methods are sufficiently fast for a real-time character segmentation system.
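
To make the two building blocks named in the abstract concrete, the following is a minimal sketch of run-length encoding for one row of a binarized document image, together with a stack-based iterative scanline fill that restores the interior of a region from its contour. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the run format or the fill variant, so the convention that runs alternate starting with background (0), the 4-connected fill, and the function names rle_encode_row, rle_decode_row and scanline_fill are all hypothetical.

```python
from typing import List


def rle_encode_row(row: List[int]) -> List[int]:
    """Encode a row of 0/1 pixels as alternating run lengths.

    Assumed convention: the first run always counts background (0) pixels,
    so a row that starts with foreground begins with a zero-length run.
    """
    runs: List[int] = []
    current, count = 0, 0
    for pixel in row:
        if pixel == current:
            count += 1
        else:
            runs.append(count)
            current, count = pixel, 1
    runs.append(count)
    return runs


def rle_decode_row(runs: List[int]) -> List[int]:
    """Rebuild the pixel row from alternating run lengths."""
    row: List[int] = []
    value = 0
    for length in runs:
        row.extend([value] * length)
        value = 1 - value
    return row


def scanline_fill(image: List[List[int]], seed_x: int, seed_y: int, fill: int = 1) -> None:
    """Fill the 4-connected region containing the seed, in place.

    Uses an explicit stack instead of recursion, filling one horizontal
    span at a time and pushing one seed per span found in adjacent rows.
    """
    height, width = len(image), len(image[0])
    target = image[seed_y][seed_x]
    if target == fill:
        return
    stack = [(seed_x, seed_y)]
    while stack:
        x, y = stack.pop()
        if image[y][x] != target:
            continue  # already filled through another span
        # Grow the span left and right from the seed pixel.
        left = x
        while left > 0 and image[y][left - 1] == target:
            left -= 1
        right = x
        while right < width - 1 and image[y][right + 1] == target:
            right += 1
        for i in range(left, right + 1):
            image[y][i] = fill
        # Queue one seed for each unfilled span in the rows above and below.
        for ny in (y - 1, y + 1):
            if 0 <= ny < height:
                i = left
                while i <= right:
                    if image[ny][i] == target:
                        stack.append((i, ny))
                        while i <= right and image[ny][i] == target:
                            i += 1
                    else:
                        i += 1
```

In this sketch a compressed character would be stored only as its contour, and calling scanline_fill with a seed pixel inside the contour restores the solid glyph. How the seed is chosen and how contours are stored are not described in the abstract and are left open here.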

Keywords

Image processing, image compression, image decompression, OCR, machine-typed documents, machine-printed documents

ISSN: 0353-3670 (Print)

ISSN: 2217-5997 (Online)

COBISS.SR-ID 12826626
