A Robust Method for Arabic Car Plates Recognition and Matching Using Chain Code

R.  F. Mansour

American Journal of Computational and Applied Mathematics

p-ISSN: 2165-8935 e-ISSN: 2165-8943

2012; 2(3): 105-111

doi: 10.5923/j.ajcam.20120203.07

A Robust Method for Arabic Car Plates Recognition and Matching Using Chain Code

R. F. Mansour

Department of Science and Mathematics, Faculty of Education, New Valley, Assiut University, EL kharaga, Egypt

Correspondence to: R. F. Mansour, Department of Science and Mathematics, Faculty of Education, New Valley, Assiut University, EL kharaga, Egypt.

Email:

Abstract

This paper provides a new and fast method for matching and recognition of characters in Arabic license plate images. For this purpose, various methods have been proposed in literature. However, most of them suffer from: sensitivity to non-uniform illumination distribution, existence of shade in license plate, license plate color and the need for receiving an exact image of the license plate. The main contributions of our work include (I) chain code use to bounded the shape and distinguishing similar characters by local structural features. The moving window matching algorithm has been implemented. The distance measure (squared Euclidean distance) technique has been used for measuring the similarities between the moving window and the plate image. (2) Developing a system architecture combining statistical and structural recognition methods. We tested the method with 300 of plate images captured in different environments from real applications. The result yield 93.93% recognition accuracy.

Keywords: License Plate Recognition, Template Matching, Moving Window, Segmentation and Chain Code

Article Outline

1. Introduction

2. Preprocessing

2.1. Skew Correction

2.2. Size Normalization

3. Feature Extraction Using Chain Code

3.1. Segmentation

3.2. Thinning

4. Recognizing Characters by Statistical Method

5. Moving Window with Template Matching

6. Experiments

7. Conclusions

1. Introduction

Automatic license plate recognition (ALPR) is one of the most important aspects of applying computer techniques towards intelligent transportation systems. These systems attempt to facilitate the problem of identification of cars, via various techniques which mainly rely on automated (rather than manual) algorithms.

Accurate license plate locating is very important for post process, because sub-image drop off disturb from non-plate image region and the post process use sub-image as input to recognition. There are many papers[1-4] discussed the locating methods, but it need to be improved for better application.

Image processing is one of these techniques which deals with images and/ or video sequences taken from vehicles. One unique property that can be taken into account for identifying all vehicles is their license plate number. Numerous applications, such as automatic toll collection[5], criminal pursuit[6] and traffic law enforcement[7], have been benefited from it[8-14]. Although some novel techniques, for example RFID (radio frequency identification), WSN (wireless sensor network), etc., have been proposed for car ID identification, LPR on image data is still an indispensable technique in current intelligent transportation systems for its convenience and low cost. Recognition algorithms reported in previous researches are generally composed of three major parts; license plate detection (LPD), character segmentation and character recognition. Segmentation and recognition of characters in Iranian license plates, which correspond to the second and third above mentioned stages, is the purpose of this research. Camera angle and different distances to the plate, non-uniform illumination in the image (such as light and shade), the plate different colors and availability of an inexact band, including the plate, are the major problems encountered. Some different methods are proposed for the segmentation of characters from license plates such as: global optimization procedure[8], image projection[15-17] and the Hough transform[18]. As most of the algorithms need a binary image of license plate, at first we explain the traditional methods of thresholding and binarization of an image and then review the segmentation and recognition techniques of the characters. The brightness distribution of various positions in a license plate image may vary because of the condition of the plate and the effect of lighting environment. Since the binarization with one global threshold cannot always produce useful results in such cases, adaptive local binarization methods are used[19,20]. In many local binarization methods, an image is divided into m x n blocks, and a threshold is chosen for each block.

In Arabic license plates has main features country name and numbers these features indicated in three rows or columns. Figure 1 show samples of plates from Egypt and kingdom of Saudi Arabia. Arabic license plates recognition based on the segmentation of plate and analyzing the segments. There are many factors that make the character segmentation task difficult such as image noise, inappropriate plate frame, rivets, space mark, shapes, plate rotation, mixed digitals and characters and various degrees of illumination.

The paper is organized as follows. In Section 2, the proposed method is described step by step. Sections 3 and 4 using chain code for feature extraction and using chain code and recognizing characters by statistical method respectively. Using the moving window for matching car plates is discuss in section 5. The experimental results and evaluation of the algorithms are given in section 6. Finally, the paper is concluded in Section 7.

Figure 1. show samples of plates from Egypt and kingdom of Saudi Arabia

In this research, we are proposing an efficient technique that recognizes Arabic car plates which contain mixed of characters and numbers will be used as testing images.

2. Preprocessing

In order to recognize characters accurately, preprocessing to the images, such as skew correction and normalization, has to be performed. In this section, we briefly introduce skew correction and normalization operations

2.1. Skew Correction

Character recognitions are generally very sensitive to skew. Therefore, skew detection and correction are critical. We propose here a least-square based skew detection method. Suppose the binary image

is defined as follows:

Step 1: Find out all the connected regions. Let the connected region sets be

, and

has a height

and width

Step 2: For each connected region, check if it is a "valid" region. A connected region

, is said to be "valid" if .

Where

are predefined values. As for a standard, the rate between width and height of each character ranges from 0.3 - 0.8 in a given Arabic car plate. Therefore, in our implementation,

are set to 0.3 and 1.0 respectively.

Step 3: For each "valid" connected region, calculate its centered

Step 4: Perform the skew correction by least-squares based on the centered

. Approximate

sets

by least-square, and compute the skew angle

. Given that

is the skew image and

the corrected image, the skew correction equation is defined as following:

Figure 2 show image before and after skew correction. After skew correction of the character images, characters are segmented from the corrected images.

Figure 2. show image before skew correction and after skew correction

2.2. Size Normalization

Characters segmented from different car plates have different sizes. A linear normalization algorithm is applied to the input image to adjust to a uniform size. Assume the horizontal and vertical projections of the original image F be h and v , respectively. The normalization position

is obtained by

where M, N is the height and width of normalized

Figure 3 shows some normalization results.

Figure 3. Character images before and after size normalization

3. Feature Extraction Using Chain Code

The first step of the construction of the chain code is to extract the boundary of the image. Chains can represent the boundaries or contours of any discrete shape composed of regular cells. In the content of this work, the length l of each side of cells is considered equal to one. These chains represent closed boundaries. Thus, all chains are closed. Extracting the contour depends on the connectivity. In the content of this paper we use pixels with four-connectivity. The simplest contour following algorithms were presented by Duda and Hart[21]. Thus using these algorithms it is possible to represent shape contours by only two states: left turn (represented by “1") and right turn (represented by “0"). The abovementioned process produces a chain composed of only binary elements. Figure 4 illustrates the contour following on an image composed of pixels. This contour was obtained according to the following algorithm:

Figure 4. Example of a contour following on a digital figure

Figure 5. Directions of the neighbors:(a) 4-connected; (b) 8-connected

Scan the image until a figure cell is encountered. Then: If you are in a figure cell turn left and take a step. If you are in a ground cell turn right and take a step. Terminate when you are within one cell of the starting point. In this paper we proposed a new algorithm to find the contour of a binary image and use this contour to obtain the chain code. Since we use pixels with 4-connectivity, the four neighbors of any point can be represented by directions as illustrated in figure (5a) .To find the contour of a binary image we apply the following algorithm:

Step 1. For all pixels with value 0 (black) in the image, set the pixel that has the direction 2 in 4-connected to 0.

Step 2. In the new image (i.e., image obtained from Step 1, also, for all pixels with value 0, set the pixel that has the direction 1 in 4-connected to 0.

Step 3. Remove the old pixels (in the original image) that have 8-connected as shown in figure (2b) and do not satisfy the conditions shown in figure 6.

Figure 6. Four conditions to remove the old pixels.

we can apply this algorithm to real images to obtain their contours. Figure 7 shows a original plate and its contour.

Figure 7. show the applied chain code method to real plate

3.1. Segmentation

The technique applied in this phase will be the partial segmentation technique. The threshold images of car plates will be segmented to several regions depending on how many characters consist in the car plates. For example, car plate with 7 characters will be segmented into 7 regions.

he first step in segmentation process is to cutoff the background from each character and number from the license plate. We use vertical scanning to detect first and last columns for each character and number before horizontal scanning as explained in algorithm .Vertical scanning is done before horizontal scanning because if skewness is present in the input image then its effect will be minimized. This improvement will reduce the error of first and last columns and rows. Vertical scanning (column by column) will be done to detect the first and last columns or each component and cut the area in between to separate the license information from background. The vertical segmentation and horizontal segmentation outputs are shown in Figure 8.

Figure 8. shown the output segmentation (a) vertical segmentation and (b) horizontal segmentation

3.2. Thinning

In this phase, each region will undergo a thin line formation in order to find the most successful-thin line of characters. The output from this process will be a standard size of thinning image regardless of its various sizes and font types and known as template. A traditional thinning technique, the Hilditch technique is proposed to be applied in this phase because of its capability in reducing processing time. Hilditch is a skeletonization method that used non-recursive, recursive and partially recursive neighborhood[22]. A process of resizing an image will be applied to the thinning image and here, the nearest neighbor method to make each character’s size constant will be used because it is the fastest technique compared to other.

4. Recognizing Characters by Statistical Method

After preprocessing, the input character image is first recognized by statistical methods. In our approach, four sub-classifiers recognize the character independently, and their recognition results are combined using the Bayes method[23,24]. To recognize similar characters in our car plate character recognition system, it is important to extract stable and representative structure features. Fortunately, different similarity sets have different structural features. Taken for example, we will discuss in this section how to distinguish most frequently occurring similarities: "8" and "B" by using left edge contour feature. First, we give the definition of edge point. A point

with

is called an edge point. The left edge sequence of the input image

is defined, which is a left edge point set

. For point

, the value k_i can be obtained by the following process: In the i -th row, the column index j moves from left to right until

is a left edge point, and k_i = j (the last value). Then the curve direction of edge point

is defined as follows:

Let the curve direction sequence be :

, if

and

; satisfy the following conditions:

Then the sequence is said to contain a curve point, The left edge contour feature is calculated as follows:

Step 1: Obtain the left edge sequence

of the input

image

Step 2: Compute the curve direction of the left edge sequence.

Step 3: Compute the total of the curve point set (denoted by total curve ) from the direction of the left edge sequence.

Step 4: Approximate the left edge sequence by using a least-square method. Compute the approximate error (denoted by error )

The two types of structural features are feed into a binary decision tree to distinguish "8" and "B". The decision tree doesn’t always give the precise result. If the decision rejects the character, the final recognize result is set back to preprocessing stage. In our system, several parameters(such as W) and decision parameters used in binary decision tree need to be predefined, and they can be obtained by some optimization algorithm, and we use genetic algorithm for optimization parameters.

5. Moving Window with Template Matching

Moving window using the template matching method (sum of squared differences) is a common and practical technique utilized in many pattern recognition applications[25,26]. The template matching method gives high recognition accuracy and reduces the processing time compared to other methods such as cross-correlation. The applied method computes the sum of squared differences in each position while the word image we want to recognize moves over the background template. The point where the sum of squared difference is less than a preset threshold will be considered as the point of matching. The proposed moving window template matching scheme is illustrated in figure 5. First a window containing an object with size smaller than that of the main image is defined. Only a portion of the image is visible through this window. The template matching function is performed between the object in the window and the corresponding area of the image. Then the window is shifted and the template matching function is carried out between the object in the window and the new part of the image visible through the window. Thus, the window is moved left to right and top to bottom in single pixel displacement steps until the entire image is covered and template matching is carried out for all different window positions. Mathematically, distance measure is a measure of the similarities or shared properties between two signals. The distance metric commonly used is the Murkowski metric d(x,y)as follow ;

where x, y are two N dimensional feature vectors, and r is a Minkowski factor. And when r is 2, it is actually Euclidean distance.

In our case there are two discrete signals f, t represent two images denoting the object to be searched and the template respectively. The object is of dimension I x J pixels and the template is of dimension M x N see Figure 9.

where the sum is over i, j under the window containing the feature positioned at x, y. To reduce the computing time, the above equation can be simplified to Manhattan distance:

Figure 9. Moving window template matching scheme

6. Experiments

We apply the above algorithm to our database of car license plates, all of which are real scene images acquired by CCD cameras. They contain cars in different conditions, such as different illumination and different visual angle. Figure.1 shows some test images in our experiment. Table. 1 shows the result of our experiment. From it we can see that, in most cases the car license plates can be detected effectively. Our algorithm Tailed in 16% cases. The failures region there is a dark shade. the algorithm can be applied in a certain range of the size of license plate which is according to the concrete situations. In different situations, we can adjust the size of window to coincide with it. The time spent to run the algorithm depends on the size of windows and the size of the image processed. Our experiment proposed algorithm has been implemented in Matlab software, A variety of country names, characters and numbers are used through this primary test. Figure 1 shows some of the characters and numbers images included in the database with two different fonts to match the fonts used in Egypt and Saudi Arabia license plates. The software program has been improved several times to reduce the processing time to the minimum value.

Table 1. The table of characters recognition results

CharacterDistrict	LettersDistrict	Letter NumeralMixing Zone
License PlateNumber	300	300	1500
The numberof correctlyidentified	290		287	1409
RecognitionRatio	96.6%		95.6%	93.93%

A large number of Egyptian license plates acquired in different environments have been used in the test phase to determine the most suitable threshold for similarity as shown in Figure 10(a). Another number of Saudi license plates have been acquired and processed in the test phase Figure 10(b). It can be easily noted that the distance measures for the Saudi plates have higher peaks than that for Egyptian ones. It can be referred to the fact that the Egyptian plates have colored background while the Saudi plates have white background. The size of the moving window is an important criterion in determining the system performance.

The threshold corresponding to minimal error has been determined. The relation between the minimum error (minimum distance between the object in the moving window and corresponding area in plat's image) and the standard deviation of the distance measure is control factor for determining the threshold. It has been found that R = 0.4 is safe threshold.

Where

and

are the minimal error and the average error respectively.

Figure 10. Illustration of the threshold determination

Figure 11. The comparison of the average ROC curves for our method and others

We evaluate the proposed method and compare with different methods[27,28] with respect to the criteria of the matching accuracy and efficiency at plat car image. Therefore, we report the ROC in Figure 11. Clearly, our method that based on features extraction using chain code and matching is observed to perform better than the other techniques.

7. Conclusions

In this paper, we propose a robust method for car plate character recognition. The main contributions of our work include (I) chain code used to bounded the shapes and distinguishing similar characters by local structural features. The moving window matching algorithm has been implemented. The distance measure (squared Euclidean distance) technique has been used for measuring the similarities between the moving window and the plate image. (2) Developing a system architecture combining statistical and structural recognition methods. We tested the method with huge number of plate images captured in different environments from real applications, and proven to be successfully in commercial car plate recognition. Compared with other methods , our method is more effective and robust. The method is applied on a test database of 300 samples of extracted license plate images captured in outdoor environment. The result yield 93.93% recognition accuracy. We believe that our method can be extended to other OCR application fields. The mixed characters of Arabic car plates is difficult , which sometimes may be very illegible even by human beings. How to recognize it is a challenging research topic.

References

[1]	Wei-jian Zhu, Liang-zheng Xia, “Practical and Fast Algorithm for Character Segmentation of License Plate,” JOURNAL OF NANJING UNIVERSITY OF SCIENCE AND TECHNOLOGY(NATURAL SCIENCE). vol. 29(z1), pp. 26–28, 2005.
[2]	Zhi-li Chen, Wei Zhao, “A fast and efficient location algorithm of dynamic vehicle license plate,” MACHINERY DESIGN & MANUFACTURE. vol.11, pp. 11–13,2008.
[3]	Ni-na Zhou, Min Wang and Xin-han Hang,and so on. “Pretreatment Algorithm of Auto Plate Characters' Recognition,” COMPUTER ENGINEERING AND APPLICATIONS. vol.39(15), pp. 220–221,2003.
[4]	Yong-li Ma, Qiu-hua Xia, “Algorithmic Research of License Plate Location Based on Mathematics Morphology,” CONTROL & AUTOMATION. vol.24(1), pp. 227–228,2008
[5]	Anagnostopoulos, C. N. E., Anagnostopoulos, I. E., Loumos, V., & Kayafas, E. (2006). A license plate-recognition algorithm for intelligent transportation system applications. IEEE Transactions on Intelligent Transportation Systems, 377–391.
[6]	Zhang, H., Jia, W., He, X., & Wu, Q. (2006). Learning-based license plate detection using global and local features. In International conference on pattern recognition (pp. 1102–1105).
[7]	Youssef, M., & AbdelRahman, B. (2008). A smart access control using an efficient license plate location and recognition approach. Expert Systems with Applications, 256–265.
[8]	Franc, V., & Hlava, V. (2005). License plate character segmentation using hidden Markov chains. In: Proceedings of the 27th DAGM symposium (Vol. 1, pp. 385–392).
[9]	Duan, T. D., Hong Du, T. L., Phuoc, T. V., & Hoang, N. V. (2005). Building an automatic vehicle license plate recognition system. In Proceedings of international conference on computer science RIVF (pp. 59–63).
[10]	Kamat, V., & Ganesan, S. (1995). Efficient implementation of the Hough transform for detecting vehicle license plates using DSP’S. In First IEEE real-time technology and applications symposium (pp. 58–61).
[11]	Wang, S. Z., & Lee, H. M. (2003). Detection and recognition of license plate characters with different appearances. In Proceedings of conference on intelligent transportation systems (Vol. 2, pp. 979–984).
[12]	Wang, T. H., Ni, F. C., Li, K. T., & Chen, Y. P. (2004). Robust license plate recognition based on dynamic projection warping. In Proceedings of IEEE international conference on network, sensing and control (pp. 784–788).
[13]	Zhang, H., Jia, W., He, X., & Wu, Q. (2006). Learning-based license plate detection using global and local features. In International conference on pattern recognition (pp. 1102–1105).
[14]	Zheng, D., Zhao, Y., & Wang, J. (2005). An efficient method of license plate location. Pattern Recognition Letters, 2431–2438.
[15]	Barroso, J., Rafel, A., Dagless, E. L., & BulasCruz, J. (1997). Number plate reading using computer vision. In IEEE international symposium on ind. electron. ISIE’97 (pp. 761–766).
[16]	Hegt, H. A., De la Haye, R. J., & Khan, N. A. (1998). A high performance license plate recognition system. In Proceedings of IEEE international conference on system, man and cybernetics (Vol. 5, pp. 4357–4362).
[17]	Xin N., and Lansun, S. (1999). Research on license plate recognition technology. Measurement & Control Technology, 14–17.
[18]	Brad, R. (2005). License plate recognition system. Computer Science Department, Lucian Blaga. Univ; Sibiu, Romania.
[19]	Lee, B. R., Park, K., Kang, H., Kim, H., & Kim, C. (2004). Adaptive local binarization method for recognition of vehicle license plates. In Lecture notes on computer science (Vol. 2004, pp. 646–655).
[20]	Naito, T., Tsukada, T., Yamada, K., Kozuka, K., & Yamamoto, S. (2000). Robust license plate recognition method for passing vehicles under outside environment. IEEE Transactions on Vehicular Technology, 49, 2309–2319.
[21]	Duda R.O. and P.E. Hart. Pattern classification and scene analysis. Wiley, New York, 1973
[22]	Lucas J. Van and Ben J.H. Verwer, "A Contour Processing Method for Fast Binary Neighborhood Operations ", Pattern Recognition Letters, Vol. 7, No. 1, pp. 27-36,1998
[23]	Zhang P., L. H. Chen, 2002. A Novel Feature Extraction Method and Hybrid Tree Classification for Handwritten Numeral Recognition, Pattern Recognition Letters. 23: 45-56
[24]	P., Y. Xiuzi and Z. Sanyuan, "A Hybrid Method for Robust Car Plate Character Recognition",2004 IEEE International Conference on Systems, Man and Cybernetics
[25]	Fairhurst M.C and M.S Hoque, Moving Window Classifier: a new approach to off-line image recognition. Electronics letters, 36(7): 628-630 : 2000.
[26]	Hoque M.S and M.C Fairhurst, Face recognition using the moving window classifier. In proc. Of 11th British Machine Vision Conference (BMVC2000), Bristol, UK,
[27]	Mohamed S. Mansoor Roomi, M. Anitha, R. Bhargavi, "Accurate License Plate Localization" International Conference on Computer, Communication & Electrical Technology ICCCET2011, March 2011
[28]	Li Li and Feng Gua ngli,"The License Plate Recognition System Based on Fuzzy Theory and BP Neural Network", Fourth International Conference on Intelligent Computation Technology and Automation, 2011.

Paper Information

Journal Information

A Robust Method for Arabic Car Plates Recognition and Matching Using Chain Code

Article Outline

1. Introduction

2. Preprocessing

2.1. Skew Correction

2.2. Size Normalization

3. Feature Extraction Using Chain Code

3.1. Segmentation

3.2. Thinning

4. Recognizing Characters by Statistical Method

5. Moving Window with Template Matching

6. Experiments

7. Conclusions

References