CORC  > 北京大学  > 数学科学学院
Direct regression modelling of high-order moments in big data
Xi, Ruibin ; Lin, Nan
2016
关键词Big data Higher-order moment U-statistics Estimating equation Divide-and-conquer Aggregation Consistency Asymptotic normality Data cube COPY NUMBER VARIATION AGGREGATION COMPRESSION CUBES
英文摘要Big data problems present great challenges to statistical analyses, especially from the computational side. In this paper, we consider regression estimation of high-order moments in big data problems based on the U-statistic-based Functional Regression Model (U-FRM) model. The U-FRM model is a nonparametric method that allows direct estimation of higher-order moments without imposing parametric assumptions on the high order-moments. Despite this modeling advantage, its estimation relies on a U-statistics based estimating equation whose computational complexity is generally too high for big data. In this paper, we propose using the "divide-and-conquer" strategy to construct a computationally more succinct surrogate estimating equation. Through both theoretical proof and simulations, we show that our method significantly reduces the computational time and meanwhile enjoys the same asymptotic behavior as the original estimation method. We then apply our method to a genomic problem to illustrate its performance on real data.; CPCI-S(ISTP); 4; 445-452; 9
语种英语
出处SCI
出版者Banff Workshop on Statistical and Computational Theory and Methodology for Big Data Analysis
内容类型其他
源URL[http://hdl.handle.net/20.500.11897/460185]  
专题数学科学学院
推荐引用方式
GB/T 7714
Xi, Ruibin,Lin, Nan. Direct regression modelling of high-order moments in big data. 2016-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace