Advanced Search
CS Search Google Search
Subscribers, please login

Published Articles >> Table of Contents >> Abstract

Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 2   p. 829
Using tree-grammars for training set expansion in page classification

Full Article Text: Download PDF of full textBuy this articleGet full text from IEEE Xplore

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ICDAR.2003.1227778
Send link to a friend

Abstract
In this paper we describe a method for the expansion of training sets made by XY trees representing page layout. This approach is appropriate when dealing with page classification based on MXY tree page representations. The basic idea is the use of tree grammars to model the variations in the tree which are caused by segmentation algorithms. A set of general grammatical rules are defined and used to expand the training set. Pages are classified with a k - nn approach where the distance between pages is computed by means of tree-edit distance.
Additional Information

Citation:  Stefano Baldi, Simone Marinai, Giovanni Soda, "Using tree-grammars for training set expansion in page classification," icdar, p. 829,  Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 2,  2003

Similar Articles

Abstract Contents
Abstract
Citation




Free access to

  • Abstracts
  • Selected PDFs

Electronic subscribers login to:

  • Access HTML/PDFs of full text articles

Subscription information

Get a Web account

PDFs require Adobe Acrobat Reader.

Peer Review Notice

Give us Feedback