transparent gif


Ej inloggad.

Göteborgs universitets publikationer

Comparative network analysis of human cancer: sparse graphical models with modular constraints and sample size correction

Författare och institution:
José Sánchez (Institutionen för matematiska vetenskaper, Chalmers/GU)
Utgiven i serie vid Göteborgs universitet:
Preprint - Department of Mathematical Sciences, Chalmers University of Technology and Göteborg University, ISSN 1652-9715
Antal sidor:
Chalmers University of Technology
Datum för examination:
Tidpunkt för examination:
Pascal, Chalmers Tvärgata 3, Chalmers Tekniska Högskola
Docent Patrik Rydén, Matematik och Matematisk Statistik, Umeå Universitet, Sverige
Fulltextlänk (lokalt arkiv):
Sammanfattning (abstract):
In the study of transcriptional data for different groups (e.g. cancer types) it's reasonable to assume that some dependencies between genes on a transcriptional or genetic variants level are common across groups. Also, that this property is preserved locally, thus defining a modular structure in the model networks. For ease of interpretation, sparsity in the resulting model is also desirable. In this thesis we assume genomic data to have a multivariate normal distribution and estimate the networks by optimization of a penalized log-likelihood function for the corresponding inverse covariance matrices. We apply the fused elastic net penalty for sparsity and commonality. To achieve modular topology we propose a novel adaptive penalty. This adaptive penalty is computed from an initial zero-consistent solution. We also propose a generalization of the method which allows for fusion penalties defined by a graph. This method can be used to correct estimates when the groups have different sample sizes. It can also be use to correctly penalize in the presence of ordered variables such as survival. We optimize the penalized log-likelihood using the alternating directions method of multiplier (ADMM). Simulation studies show that our method more accurately identifies differential connectivity (network edges that differ between cancer classes) compared with standard methods. We also apply our method to the investigation of tumor data in glioblastoma, breast and ovarian cancer, integrating two types of data, mRNA (messenger RNA expression) and CNA (copy number aberration), by defining a prior distribution of the plausible links in the corresponding networks.
Ämne (baseras på Högskoleverkets indelning av forskningsämnen):
Matematik ->
Sannolikhetsteori och statistik ->
Matematisk statistik
Biologiska vetenskaper ->
Bioinformatik och systembiologi
Inverse covariance matrix, precision matrix, graphical models, high-dimension, low-sample, networks, sparsity, fused lasso, elastic net, cancer.
Chalmers fundament:
Grundläggande vetenskaper
Postens nummer:
Posten skapad:
2013-05-07 11:43
Posten ändrad:
2014-11-07 16:26

Visa i Endnote-format

Göteborgs universitet • Tel. 031-786 0000
© Göteborgs universitet 2007