An Outlier Detection Method Based on Linear Programming (一种基于线性规划的孤立点检测方法)
Wang Tianran (王天然); Liu Weijun (刘伟军)
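Journal: 控制工程 (Control Engineering of China)
Year: 2013
Volume: 20; Issue: 6; Pages: 1123-1126, 1130
Keywords: linear programming; outlier detection; Markov model
ISSN: 1671-7848
Alternative title: A Linear Programming Framework for Outlier Detection
Ownership rank: 1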
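Chinese abstract (translated): Outlier detection is an important problem in data mining: it finds data that lack the general characteristics of the rest and thereby uncovers potentially useful information. Existing outlier detection algorithms generally fail to detect outliers correctly when the outliers form small clusters. To address this problem, a new outlier detection method based on linear programming is proposed. The method rests on a simple fact: two immediately adjacent data points must both be outliers or both be normal points. First, a graph model of the data points to be examined is built; by constructing a vertex energy model and an edge model, a Markov model of the outlier detection problem is established. The optimal solution of this model is then obtained by solving a linear programming problem, which yields the outlier detection result. Finally, experiments on one synthetic data set and three real data sets validate the proposed algorithm. The results show that it detects outliers correctly, with high detection accuracy, both on ordinary data sets and on data sets in which outliers form small clusters.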
English abstract: Outlier detection is an important step in many data-mining applications. It finds patterns in data that do not conform to expected behavior, and these nonconforming patterns can carry potentially useful information. A drawback of current methods is that when outliers form a small cluster, they fail to label them correctly. In this paper, we propose a new method for outlier detection. The essential idea is that two neighboring data points must be either both normal points or both outliers. The paper first creates a graph model of the data points to be examined. By constructing energy models for the vertices and edges, a Markov model of the outlier detection problem is established; solving a linear programming problem then gives the optimal solution of the model, from which the outlier detection results are obtained. Finally, the proposed algorithm is tested on one synthetic data set and three real data sets. The experimental results show that it detects outliers correctly, with high detection accuracy, both on ordinary data sets and on data sets containing small clusters of outliers.
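The pipeline outlined in the abstract (graph model, vertex and edge energies, linear program) can be illustrated with a minimal Python sketch. This record does not give the paper's actual energy definitions, so everything specific here is an assumption made for illustration: the k-nearest-neighbor graph, the vertex energy (distance to the k-th neighbor versus a threshold `tau`), the Potts-style edge penalty with weight `lam / (1 + distance)`, and the names `detect_outliers`, `k`, `lam`, and `tau_scale`. The sketch relies on NumPy and SciPy's `linprog`.

```python
import numpy as np
from scipy.optimize import linprog


def detect_outliers(X, k=4, lam=1.0, tau_scale=3.0):
    """Label points as outliers via an LP relaxation of a binary pairwise model.

    Hypothetical energies (not taken from the paper):
      vertex: cost d_i for the label "normal", cost tau for the label "outlier",
              where d_i is the distance from point i to its k-th nearest neighbor;
      edge:   cost w_ij when neighbors i and j receive different labels.
    """
    n = len(X)
    # Pairwise Euclidean distances; the diagonal is excluded from neighbor search.
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(D, np.inf)
    nbrs = np.argsort(D, axis=1)[:, :k]          # indices of the k nearest neighbors
    d = D[np.arange(n), nbrs[:, -1]]             # distance to the k-th neighbor
    tau = tau_scale * np.median(d)               # assumed outlier cost

    # Undirected k-NN graph edges, each stored once as (smaller index, larger index).
    edges = sorted({(min(i, int(j)), max(i, int(j)))
                    for i in range(n) for j in nbrs[i]})
    m = len(edges)
    w = np.array([lam / (1.0 + D[i, j]) for i, j in edges])

    # Variables: y_i in [0, 1] (1 = outlier) and z_e in [0, 1] with z_e >= |y_i - y_j|.
    # Energy = sum_i [d_i (1 - y_i) + tau y_i] + sum_e w_e z_e; the constant
    # sum_i d_i is dropped from the objective.
    c = np.concatenate([tau - d, w])
    A = np.zeros((2 * m, n + m))
    for e, (i, j) in enumerate(edges):
        A[2 * e, [i, j, n + e]] = [1, -1, -1]      #  y_i - y_j - z_e <= 0
        A[2 * e + 1, [i, j, n + e]] = [-1, 1, -1]  #  y_j - y_i - z_e <= 0
    res = linprog(c, A_ub=A, b_ub=np.zeros(2 * m), bounds=(0, 1), method="highs")
    return res.x[:n] > 0.5                       # round the relaxed labels


# Toy data: a 5x5 unit grid of normal points plus a tight cluster of three outliers.
grid = np.array([(i, j) for i in range(5) for j in range(5)], dtype=float)
cluster = np.array([[20.0, 20.0], [20.5, 20.0], [20.0, 20.5]])
X = np.vstack([grid, cluster])
labels = detect_outliers(X)
print(np.flatnonzero(labels))  # indices of the points labeled as outliers
```

The edge term is what encodes the abstract's "simple fact": giving two neighboring points different labels costs `w_ij`, so tightly connected points tend to be labeled together. For this two-label model with nonnegative edge weights the linear program has the structure of a minimum-cut problem, which is why rounding the relaxed solution at 0.5 is a reasonable choice in the sketch.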
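Indexed in: CSCD
Funding: National Basic Research Program of China (973 Program) (2011CB302400)
Language: Chinese
CSCD record number: CSCD:5010213
Content type: journal article
Source URL: [http://ir.sia.ac.cn/handle/173321/14634]
Collection: Shenyang Institute of Automation, Equipment Manufacturing Technology Research Laboratory
Recommended citation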
GB/T 7714
王天然,刘伟军. 一种基于线性规划的孤立点检测方法[J]. 控制工程,2013,20(6):1123-1126, 1130.
APA 王天然,&刘伟军.(2013).一种基于线性规划的孤立点检测方法.控制工程,20(6),1123-1126, 1130.
MLA 王天然,et al."一种基于线性规划的孤立点检测方法".控制工程 20.6(2013):1123-1126, 1130.
 
