|
Published Articles >> Table of Contents >> Abstract
September 2003 (Vol. 25, No. 9)
pp. 1188-1192
Document Image Recognition Based on Template Matching of Component Block Projections
Hanchuan Peng, IEEE
Fuhui Long
Zheru Chi, IEEE
Full Article Text:
  
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TPAMI.2003.1227996
Send link to a friend
| Abstract |
|
Document Image Recognition (DIR), a very useful technique in office automation and digital library applications, is to find the most similar template for any input document image in a prestored template document image data set. Existing methods use both local features and global layout information. In this paper, we propose a novel algorithm based on the global matching of Component Block Projections (CBP), which are the concatenated directional projection vectors of the component blocks of a document image. Compared to those existing methods, CBP-based template-matching methods possess two major advantages: 1) The spatial relationship among the component blocks of a document image is better represented, hence a very high matching accuracy can be obtained even for a large template set and seriously distorted input images; and 2) the effective matching distance of each template and the triangle inequality are proposed to significantly reduce the computational cost. Our experimental results confirm these advantages and show that the CBP-based template-matching methods are very suitable for DIR applications.
|
References
|
[1] D.A. Adjeroh and M.C. Lee, On Ratio-Based Color Indexing IEEE Trans. Image Processing, vol. 10, no. 1, pp. 36-48, 2001.
[2] B. Braunmuller, M. Ester, H.-P. Kriegel, and J. Sander, Multiple Similarity Queries: A Basic DBMS Operation for Mining in Metric Databases IEEE Trans. Knowledge and Data Eng., vol. 13, no. 1, pp. 79-95, Jan./Feb. 2001.
[3] F. Cesarini, M. Gori, S. Marinai, and G. Soda, “INFORMys: A Flexible Invoice-Like Form-Reader System,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, no. 7, pp. 730-745, July 1998.
[4] J.-Y. Chen, C.A. Bouman, and J.C. Dalton, “Hierarchical Browsing and Search of Large Image Databases,” IEEE Trans. Image Processing, vol. 9, no. 3, pp. 442-455, Mar. 2000.
[5] D. Doermann, H. Li, and O. Kia, The Detection of Duplicates in Document Image Databases Proc. Fourth Int'l Conf. Document Analysis and Recognition, pp. 314-318, 1997.
[6] K. Fan and M. Chang, Form Document Identification Using Line Structure Based Features Proc. Fourth Int'l Conf. Pattern Recognition, vol. 2, pp. 1098-1100, 1998.
[7] K. Fukunaga and P.M. Narendra, A Branch and Bound Algorithm for Computing k-Nearest Neighbors IEEE Trans. Computers, vol. 24, no. 7, pp. 750-753, July 1975.
[8] J. Hafner, H.S. Sawhney, W. Equitz, M. Flickner, and W. Niblack, “Efficient Color Histogram Indexing for Quadratic Form Distance Functions,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 7, pp. 729-736, July 1995.
[9] J. Hu, R. Kashi, and G. Wilfong, Document Image Layout Comparison and Classification Proc. Sixth Int'l Conf. Document Analysis and Recognition, pp. 285-288, 1999.
[10] J.J. Hull, Document Image Similarity and Equivalence Detection Int'l J. Document Analysis and Recognition, vol. 1, no. 1, pp. 37-42, 1998.
[11] D.P. Lopresti, String Techniques for Detecting Duplicates in Document Databases Int'l J. Document Analysis and Recognition, vol. 2, no. 4, pp. 186-199, 2000.
[12] H. Peng and Q. Gan, SOCR 1.03: A Handwritten Data Form Producing and Reading System Proc. 2000 Int'l Workshop Multimedia Data Storage, Retrieval, Integration, and Applications, pp. 197-202, 2000.
[13] H. Peng, Z. Chi, W. Siu, and D. Feng, PageX: An Integrated Document Processing Software for Digital Libraries Proc. 2000 Int'l Workshop Multimedia Data Storage, Retrieval, Integration, and Applications, pp. 203-207, 2000.
[14] H. Peng, F. Long, Z. Chi, D. Feng, and W. Siu, Document Image Matching Based on Component Blocks Proc. Int'l Conf. Image Processing, pp. 601-604, Sept. 2000.
[15] H. Peng, F. Long, Z. Chi, and W. Siu, Document Template Matching Based on Component Block List Pattern Recognition Letters, vol. 22, no. 9, pp. 1033-1042, 2001.
[16] J. Puzicha, Y. Rubner, C. Tomasi, and J. Buhmann, Empirical Evaluation of Dissimilarity Measures for Color and Texture Proc. Int'l Conf. Computer Vision, 1999.
[17] R. Safari, N. Narasimhamurthi, M. Shridhar, and M. Ahmadi, Document Registration Using Projective Geometry IEEE Trans. Image Processing, vol. 6, no. 9, pp. 1337-1341, 1997.
[18] S. Shimotsuji and M. Asano, Form Identification Based on Cell Structure Proc. 13th Int'l Conf. Pattern Recognition, vol. 3, pp. 793-797, 1996.
[19] L. Tseng and R. Chen, The Recognition of Form Documents Based on Three Types of Line Segments Proc. Fourth Int'l Conf. Document Analysis and Recognition, vol. 1, pp. 71-75, 1997.
[20] T. Watanabe, Q. Luo, and N. Sugie, “Layout Recognition of Multi-Kinds of Table-Form Documents,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 4, pp. 432-445, Apr. 1995.
|
Additional Information
|
Index Terms- Document image recognition, template matching, component block projection.
Citation:
Hanchuan Peng, Fuhui Long, Zheru Chi,
"Document Image Recognition Based on Template Matching of Component Block Projections,"
IEEE Transactions on Pattern Analysis and Machine Intelligence,
vol. 25,
no. 9,
pp. 1188-1192,
Sept.,
2003
|
|