|
IGI Global
Main Office
701 E. Chocolate Avenue
Hershey, PA 17033, USA
Tel: 717-533-8845 x100
Toll Free: 1-866-342-6657
Fax: 717-533-8661
or 717-533-7115
|
|
|
Service Class Driven Dynamic Data Source Discovery with DynaBot:
| Our Price: |
$30.00 US |
| Article #: |
ITJ3831 |
| Number of pages: |
26-48 pages |
| Source: |
International Journal of Web Services Research, Vol. 4, Issue 3 |
| Author(s): |
Rocco, Daniel; Caverlee, James; Liu, Ling; Critchlow, Terence |
| Affiliation(s): |
University of West Georgia, USA; Georgia Institute of Technology, USA; Georgia Institute of Technology, USA; Lawrence Livermore National Laboratory, USA |
Order Now!
This document will be delivered electronically. Terms of Delivery |
|
Description
Dynamic Web data sources on the Deep Web provide intuitive access to real-time information and large data repositories anywhere that Web access is available. Although recent studies suggest that the dynamic Web is larger and growing faster than static Web, dynamic content is often ignored by existing search engine indexers owing to technical challenges inherent in searching dynamic sources. To address these challenges, we present DynaBot, a service-centric crawler for discovering and clustering Deep Web sources. Dyna- Bot has three unique characteristics. First, DynaBot utilizes a service class model implemented through the construction of service class descriptions (SCDs). Second, DynaBot employs a modular architecture for focused crawling of the Deep Web. Third, DynaBot incorporates algorithms for efficiently probing, discovering, and clustering Deep Web sources through SCD-based service analysis. Experimental results demonstrate DynaBot's effectiveness and suggest techniques for efficiently managing service discovery given the immense scale of the Deep Web. |