|
Published Articles >> Table of Contents >> Abstract
International Conference on Information Technology: Coding and Computing (ITCC'04) Volume 2
p. 337
Ontology-based Web Crawler
S. Ganesh, Pondicherry Engineering College, India
M. Jayaraj, Pondicherry Engineering College, India
V. Kalyan, Pondicherry Engineering College, India
Srinivasa Murthy, Pondicherry Engineering College, India
G. Aghila, Pondicherry Engineering College, India
Full Article Text:
 
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/ITCC.2004.1286658
Send link to a friend
| Abstract |
|
The requirement of a web Crawler that downloads
most relevant pages is still a major challenge in the field
of Information Retrieval Systems. The use of link analysis
algorithms like page rank and other Importance-metrics
have shed a new approach in prioritizing the URL queue
for downloading higher relevant pages. In this paper, the
combination of these metrics along with a new metric
called association-metric has been proposed. The
association-metric estimates the semantic content of the
URL based on the domain dependent ontology, which in
turn strengthens the metric that is used for prioritizing the
URL queue. In addition, after downloading the page, the
association metric plays important role in estimating the
relevancy of the links in that page. The proposed new
metric will solve the major problem of finding the
relevancy of the pages before the process of crawling, to
an optimal level.
|
Additional Information
|
Index Terms- Web Crawler, Ordering-metric, Importance-metrics, Association-metric, Ontology
Citation:
S. Ganesh, M. Jayaraj, V. Kalyan, Srinivasa Murthy, G. Aghila,
"Ontology-based Web Crawler,"
itcc,
p. 337,
International Conference on Information Technology: Coding and Computing (ITCC'04) Volume 2,
2004
|
|