耶鲁大学生物统计系教授,国际统计学会推选会员、美国统计学会会士。研究主要集中于生物统计、遗传流行病学、生存分析、高维数据分析等。担任JASA, AISM, Briefings in Bioinformatics等多个国际期刊副主编。已在Nature Genetics、JASA、The Annals of Statistics、Biometrika、Briefings in Bioinformatics等国际权威期刊发表论文数百篇。
吴梦云
上海财经大学统计与数据科学学院教授。2013年获得中山大学概率论与数理统计博士学位,并于2016年8月至2018年7月在耶鲁大学生物统计系进行博士后研究。主要研究方向为高维数据变量选择、网络模型及整合分析等。目前,已在The Annals of Applied Statistics、Biometrics、Biostatistics、Statistics in Medicine、Bioinformatics等期刊发表多篇学术论文。入选上海市晨光计划、浦江人才以及启明星计划,主持国家自然科学青年基金和面上项目,以及全国统计科学研究重大项目。
2
论文发表截图
3
论文题目
Joint identification of spatially variable genes via a network-assisted Bayesian regularization approach
4
论文摘要
Identifying genes that display spatial patterns is critical to investigating expression interactions within a spatial context and further dissecting biological understanding of complex mechanistic functionality. Despite the increase in statistical methods designed to identify spatially variable genes, they are mostly based on marginal analysis and share the limitation that the dependence (network) structures among genes are not well accommodated, where a biological process usually involves changes in multiple genes that interact in a complex network. In addition, the latent cellular composition within the spots can introduce confounding variations, negatively affecting the accuracy of the identification. In this study we develop a novel Bayesian regularization approach for spatial transcriptomic data, with confounding variations induced by varying cellular distributions effectively corrected. Significantly advancing from existing studies, a thresholded graph Laplacian regularization is proposed to simultaneously identify spatially variable genes and accommodate the network structure among genes. The proposed method is based on a zero-inflated negative binomial distribution, effectively accommodating the count nature, zero inflation, and overdispersion of spatial transcriptomic data. Extensive simulations and applications to real data demonstrate the competitive performance of the proposed method.