CORC  > 北京大学  > 数学科学学院
Identifiability and estimation of probabilities from multiple databases with incomplete data and sampling selection
Jia, Jinzhu ; Geng, Zhi ; Wang, Mingfeng
2006
关键词EM ALGORITHM
英文摘要For an application problem, there may be multiple databases, and each database may not contain complete variables or attributes, that is, some variables are observed but some others are missing. Further, data of a database may be collected conditionally on some designed variables. In this paper, we discuss problems related to data mining from such multiple databases. We propose an approach for detecting identifiability of a joint distribution from multiple databases. For an identifiable joint distribution, we further present the expectation-maximization (EM) algorithm for calculating the maximum likelihood estimates (MLEs) of the joint distribution.; Computer Science, Artificial Intelligence; Computer Science, Theory & Methods; SCI(E); CPCI-S(ISTP); 0
语种英语
出处SCI
内容类型其他
源URL[http://hdl.handle.net/20.500.11897/314997]  
专题数学科学学院
推荐引用方式
GB/T 7714
Jia, Jinzhu,Geng, Zhi,Wang, Mingfeng. Identifiability and estimation of probabilities from multiple databases with incomplete data and sampling selection. 2006-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace