在大型数据集上提高序列挖掘效率的统计方法

CORC > 清华大学

	在大型数据集上提高序列挖掘效率的统计方法
	陆甡 ; 李春平 ; LU Shen ; LI Chunping
	2010-06-09 ; 2010-06-09
关键词	数据挖掘序列模式挖掘最大频繁项目集统计模型 Data mining Sequential pattern mining Largest frequent item set Statistical model TP311.13
其他题名	A Statistical Method of Improving Efficiency on Sequential Patterns
中文摘要	分析了在大型事务集合发现频繁模式的特点,给出了一个改进算法(称为AprioriAdjust算法),同时,发展了两项技术:(1)提出了事务集合的压缩过程。(2)展示了一种基于统计模式的方法来评价序列的支持度,在该方法中,考虑了整个过程中每一趟的支持度均值的收敛性,以进行有效的候选频繁集的剪枝。此外,还讨论了实验结果。比较AprioriTID算法,研究展示,AprioriAdjust算法在进行大型事务集合的挖掘时更有效,可扩展性更强。; This paper analyzes some features of discovering frequent patterns on large set of transactions. On this basis, it gives an improvedalgorithm (called AprioriAdjust) with the development of two techniques: (1) applying the compressive processing of the transaction set for miningsequential patterns; (2) presenting a method based on the statistical mechanism to evaluate the sequence’s support, in which the convergence ofaverage value of the support in the whole procedure is considered, so as to effectively prune the candidate set of the frequent patterns. Furthermore, itdiscusses the results of the experiments. Compared to the AprioriTID algorithm, it shows that the AprioriAdjust algorithm is more efficient andscalable over the large sets of transactions.; 国家“863”计划基金资助项目
语种	中文 ; 中文
内容类型	期刊论文
源URL	[http://hdl.handle.net/123456789/56572]
专题	清华大学
推荐引用方式 GB/T 7714	陆甡,李春平,LU Shen,等. 在大型数据集上提高序列挖掘效率的统计方法[J],2010, 2010.
APA	陆甡,李春平,LU Shen,&LI Chunping.(2010).在大型数据集上提高序列挖掘效率的统计方法..
MLA	陆甡,et al."在大型数据集上提高序列挖掘效率的统计方法".(2010).

个性服务

查看访问统计

相关权益政策

暂无数据

收藏/分享

所有评论 (0)

[发表评论/异议/意见]

暂无评论

评论
权益异议
反馈意见

评注功能仅针对注册用户开放，请您登录

您对该条目有什么异议，请向管理员反馈。
内容：
Email：	*
单位:
验证码：	刷新

您在知识库使用过程中有什么好的想法或者建议可以反馈给我们。
标题：	*
内容：
Email：	*
验证码：	刷新

相关链接

CORC

联系我们