Publication:
Integration Analysis of Three Omics Data Using Penalized Regression Methods: An Application to Bladder Cancer.

dc.contributor.authorPineda, Silvia
dc.contributor.authorKogevinas, Manolis
dc.contributor.authorCarrato, Alfredo
dc.contributor.authorChanock, Stephen J
dc.contributor.authorVan Steen, Kristel
dc.contributor.authorMalats, Nuria
dc.contributor.authorReal Arribas, Francisco
dc.contributor.funderInstituto de Salud Carlos III
dc.contributor.funderUnited States Department of Health and Human Services
dc.contributor.funderUnión Europea. European Cooperation in Science and Technology (COST)
dc.contributor.funderFundación La Caixa
dc.contributor.funderUnión Europea
dc.date.accessioned2020-06-09T17:15:42Z
dc.date.available2020-06-09T17:15:42Z
dc.date.issued2015-12
dc.description.abstractOmics data integration is becoming necessary to investigate the genomic mechanisms involved in complex diseases. During the integration process, many challenges arise such as data heterogeneity, the smaller number of individuals in comparison to the number of parameters, multicollinearity, and interpretation and validation of results due to their complexity and lack of knowledge about biological processes. To overcome some of these issues, innovative statistical approaches are being developed. In this work, we propose a permutation-based method to concomitantly assess significance and correct by multiple testing with the MaxT algorithm. This was applied with penalized regression methods (LASSO and ENET) when exploring relationships between common genetic variants, DNA methylation and gene expression measured in bladder tumor samples. The overall analysis flow consisted of three steps: (1) SNPs/CpGs were selected per each gene probe within 1Mb window upstream and downstream the gene; (2) LASSO and ENET were applied to assess the association between each expression probe and the selected SNPs/CpGs in three multivariable models (SNP, CPG, and Global models, the latter integrating SNPs and CPGs); and (3) the significance of each model was assessed using the permutation-based MaxT method. We identified 48 genes whose expression levels were significantly associated with both SNPs and CPGs. Importantly, 36 (75%) of them were replicated in an independent data set (TCGA) and the performance of the proposed method was checked with a simulation study. We further support our results with a biological interpretation based on an enrichment analysis. The approach we propose allows reducing computational time and is flexible and easy to implement when analyzing several types of omics data. Our results highlight the importance of integrating omics data by applying appropriate statistical strategies to discover new insights into the complex genetic mechanisms involved in disease conditions.es_ES
dc.description.peerreviewedes_ES
dc.description.sponsorshipThe work was partially supported by Fondo de Investigaciones Sanitarias (FIS), Instituto de Salud Carlos III, Spain (#PI06-1614 and #PI12-00815); Red Tematica de Investigacion Cooperativa en Cancer, Spain (#RD12/0036/0034 and #RD12/0036/0050); EU-FP7-HEALTH (F2-2008-201663-UROMOL and F2-2008-201333-DECanBio); USA-NIH (RO1-CA089715 and NO2-CP-11015); and European Cooperation in Science and Technology (COST Action #BM1204: EU_Pancreas). SP was funded by a Obra Social Fundacion "la Caixa". The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.es_ES
dc.format.number12es_ES
dc.format.pagee1005689es_ES
dc.format.volume11es_ES
dc.identifier.citationPLoS Genet .2015;11(12):e1005689.es_ES
dc.identifier.doi10.1371/journal.pgen.1005689es_ES
dc.identifier.e-issn1553-7404es_ES
dc.identifier.journalPLoS geneticses_ES
dc.identifier.pubmedID26646822es_ES
dc.identifier.urihttp://hdl.handle.net/20.500.12105/10314
dc.language.isoenges_ES
dc.publisherPublic Library of Science (PLOS)
dc.relation.projectIDinfo:eu_repo/grantAgreement/ES/PI12-00815es_ES
dc.relation.projectIDinfo:eu_repo/grantAgreement/ES/PI06-1614es_ES
dc.relation.projectIDinfo:eu_repo/grantAgreement/EC/FP7/201663es_ES
dc.relation.projectIDinfo:eu_repo/grantAgreement/EC/FP7/201333es_ES
dc.relation.publisherversionhttps://doi.org/10.1371/journal.pgen.1005689es_ES
dc.repisalud.institucionCNIOes_ES
dc.repisalud.orgCNIOCNIO::Grupos de investigación::Grupo de Epidemiología Genética y Moleculares_ES
dc.repisalud.orgCNIOCNIO::Grupos de investigación::Grupo de Carcinogénesis Epiteliales_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.licenseAtribución-NoComercial-CompartirIgual 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/*
dc.subject.meshGenetic Predisposition to Diseasees_ES
dc.subject.meshAlgorithmses_ES
dc.subject.meshCpG Islandses_ES
dc.subject.meshGenomicses_ES
dc.subject.meshHumanses_ES
dc.subject.meshNeoplasm Proteinses_ES
dc.subject.meshPolymorphism, Single Nucleotidees_ES
dc.subject.meshSoftwarees_ES
dc.subject.meshUrinary Bladder Neoplasmses_ES
dc.subject.meshDNA Methylationes_ES
dc.subject.meshELASTIC-NETes_ES
dc.subject.meshAssociationes_ES
dc.subject.meshSelectiones_ES
dc.subject.meshVariantses_ES
dc.subject.meshBiologyes_ES
dc.subject.meshReveales_ES
dc.subject.meshS100A9es_ES
dc.subject.meshLocies_ES
dc.titleIntegration Analysis of Three Omics Data Using Penalized Regression Methods: An Application to Bladder Cancer.es_ES
dc.typejournal articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublicatione90c1af5-4bfd-42de-af72-ef7ead52074c
relation.isAuthorOfPublication803aa5ee-1ebf-4697-a92a-5918e85b43c7
relation.isAuthorOfPublication62d13a40-e75d-49b1-bb0a-f54a4146ad3e
relation.isAuthorOfPublication.latestForDiscoverye90c1af5-4bfd-42de-af72-ef7ead52074c
relation.isFunderOfPublication7d739953-4b68-4675-b5bb-387a9ab74b66
relation.isFunderOfPublication6081a0d0-d423-4510-b1af-a52eac0c92e4
relation.isFunderOfPublicationcce72908-3c61-438a-bde7-a89d04434528
relation.isFunderOfPublicationf04d23ec-a91d-4242-88b0-cc436888f8a2
relation.isFunderOfPublicationb029ca7c-43c2-46be-af9e-b34b7f455d94
relation.isFunderOfPublication.latestForDiscovery7d739953-4b68-4675-b5bb-387a9ab74b66
relation.isPublisherOfPublicationa2759e3d-0d58-4e8a-9fcd-c6130ee333d1
relation.isPublisherOfPublication.latestForDiscoverya2759e3d-0d58-4e8a-9fcd-c6130ee333d1

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
IntegrationAnalysisofThreeOmics_2015.pdf
Size:
1.92 MB
Format:
Adobe Portable Document Format
Description:
Artículo principal
Loading...
Thumbnail Image
Name:
1621701.zip
Size:
611.21 KB
Format:
Unknown data format
Description:
Supporting material