Advanced Search
CS Search Google Search
Subscribers, please login

Published Articles >> Table of Contents >> Abstract

Services Computing, 2004 IEEE International Conference on (SCC'04)   pp. 449-452
Segmenting the Web Document with Document Object Model

Full Article Text: Download PDF of full textBuy this articleGet full text from IEEE Xplore

DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/SCC.2004.1358040
Send link to a friend

Abstract
We present a model about DOM-based web document segmentation using the semi-structure information of web pages. This model builds DOM tree of the web page by parsing HTML tags which organize structure of the web page. By improving traditional plain text segmentation algorithms, we expand these algorithms to suit web text segmentation. Then, with the boundaries between the nodes in the DOM tree, precision of segmentation results can be increased further.
Additional Information

Citation:  Jianli Luo, Jie Shen, Cuihua Xie, "Segmenting the Web Document with Document Object Model," scc, pp. 449-452,  Services Computing, 2004 IEEE International Conference on (SCC'04),  2004

Similar Articles

Abstract Contents
Abstract
Citation




Free access to

  • Abstracts
  • Selected PDFs

Electronic subscribers login to:

  • Access HTML/PDFs of full text articles

Subscription information

Get a Web account

PDFs require Adobe Acrobat Reader.

Peer Review Notice

Give us Feedback