BOOKS
BOOK SERIES
JOURNALS
PROCEEDINGS
TEACHING CASES
PAY-PER-VIEW
REFERENCE
E-RESOURCES
ABOUT IGI
BECOME AN AUTHOR/EDITOR  |   MAILING LIST  |   HOW TO ORDER  |   LIBRARY SUGGESTION | EXAMINATION REQUESTS/COURSE ADOPTION | DISTRIBUTORS
IGI Online Bookstore
Click here to PLAY Demo Click here to Start Search Search 30,000+ chapters, articles, and cases - available for download today!

IGI Global Online Symposium!



  Browse Our Bookstore
IGI Catalogs & Newsletters
Forthcoming Titles
Featured Book
By Category
Advanced Search

  Shop
My Profile
View My Cart

  Contact Us
IGI Global
Main Office
701 E. Chocolate Avenue
Hershey, PA 17033, USA
Tel: 717-533-8845 x100
Toll Free: 1-866-342-6657
Fax: 717-533-8661
    or 717-533-7115
 

Discovering Frequent Embedded Subtree Patterns from Large Databases of Unordered Labeled Trees:
Our Price:    $30.00 US
Article #:    ITJ2784
Pages:    70 - 92
Source:    International Journal of Data Warehousing and Mining, Vol. 1, Issue 2
Author(s):    Xiao, Yongqiao; Yao, J. F.; Yang, G.
Affiliation(s):    SAS Inc. & Georgia College and State University, USA; University at Buffalo, State University of New York, USA

Order Now! This document will be delivered electronically. Terms of Delivery
 

Description
Recent years have witnessed a surge of research interest in knowledge discovery from data domains with complex structures, such as trees and graphs. In this paper, we address the problem of mining maximal frequent embedded subtrees which is motivated by such important applications as mining “hot” spots of Web sites from Web usage logs and discovering significant “deep” structures from tree-like bioinformatic data. One major challenge arises due to the fact that embedded subtrees are no longer ordinary subtrees, but preserve only part of the ancestor-descendant relationships in the original trees. To solve the embedded subtree mining problem, in this article we propose a novel algorithm, called TreeGrow, which is optimized in two important respects. First, it obtains frequency counts of root-to-leaf paths through efficient compression of trees, thereby being able to quickly grow an embedded subtree pattern path by path instead of node by node. Second, candidate subtree generation is highly localized so as to avoid unnecessary computational overhead. Experimental results on benchmark synthetic data sets have shown that our algorithm can outperform unoptimized methods by up to 20 times.

 
Books  |  Book Series  |  Journals  |  Proceedings  |  Teaching Cases  |  Pay-Per-View  |  Reference  |  E-Resources  |  About IGI
Become An Author/Editor  |  Mailing List  |  How To Order  |  Library Suggestion  |  Examination Requests

IGI Global - All Rights Reserved ©2001-2010