CORC  > 北京大学  > 信息科学技术学院
Constructing cube blocks effectively for stream data analysis
Jiang, Lizheng ; Yang, Dongqing ; Tang, Shiwei ; Ma, Xiuli ; Zhang, Dehui
2006
英文摘要With the rapid growth of WWW, many applications based on web are generating tremendous amount of data. Analyzing and mining such data will be important for administrators and other users. Because these data are increasing continuously and rapidly, retrieving these data efficiently and effectively are challenging tasks. In this paper, we propose a new Cube Block model to organize multi-dimensional data efficiently for online analysis and mining. The main idea of Cube Block model is to split the whole data set into blocks by dividing time into exponential time frames. In each block, we omit individual time identifiers of data items and aggregate duplicate data items to one data cell. Cube Block model uses the approximating method to get historical data in any time interval. The approximation accuracy is guaranteed by an upper bound and a lower bound. For the implementation of Cube Block model, we develop the CB-Builder system to scan data and construct cube blocks. CB-Builder's algorithm complexity (runtime and space cost) is analyzed in this paper. Experiments on synthetic data sets demonstrate that Cube Block model with its implementation CB-Builder is an effective and efficient tool for web application data analysis and mining. ? 2006 IEEE.; EI; 0
语种英语
DOI标识10.1109/WAIMW.2006.10
内容类型其他
源URL[http://ir.pku.edu.cn/handle/20.500.11897/294990]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Jiang, Lizheng,Yang, Dongqing,Tang, Shiwei,et al. Constructing cube blocks effectively for stream data analysis. 2006-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace