Publication:
Variable selection for nonlinear dimensionality reduction of biological datasets through bootstrapping of correlation networks.

dc.contributor.authorAragones, David G
dc.contributor.authorPalomino-Segura, Miguel
dc.contributor.authorSicilia, Jon
dc.contributor.authorCrainiciuc, Georgiana
dc.contributor.authorBallesteros, Iván
dc.contributor.authorSanchez-Cabo, Fatima
dc.contributor.authorHidalgo, Andrés
dc.contributor.authorCalvo, Gabriel F
dc.contributor.funderMinisterio de Ciencia e Innovación (España)es_ES
dc.contributor.funderFundación La Caixaes_ES
dc.contributor.funderFondation Leducqes_ES
dc.contributor.funderUnión Europea. Comisión Europeaes_ES
dc.contributor.funderFederation of European Biochemical Societieses_ES
dc.contributor.funderEuropean Molecular Biology Organizationes_ES
dc.contributor.funderMinisterio de Ciencia, Innovación y Universidades (España)es_ES
dc.contributor.funderFundación ProCNICes_ES
dc.date.accessioned2024-05-07T12:24:14Z
dc.date.available2024-05-07T12:24:14Z
dc.date.issued2024-01
dc.description.abstractIdentifying the most relevant variables or features in massive datasets for dimensionality reduction can lead to improved and more informative display, faster computation times, and more explainable models of complex systems. Despite significant advances and available algorithms, this task generally remains challenging, especially in unsupervised settings. In this work, we propose a method that constructs correlation networks using all intervening variables and then selects the most informative ones based on network bootstrapping. The method can be applied in both supervised and unsupervised scenarios. We demonstrate its functionality by applying Uniform Manifold Approximation and Projection for dimensionality reduction to several high-dimensional biological datasets, derived from 4D live imaging recordings of hundreds of morpho-kinetic variables, describing the dynamics of thousands of individual leukocytes at sites of prominent inflammation. We compare our method with other standard ones in the field, such as Principal Component Analysis and Elastic Net, showing that it outperforms them. The proposed method can be employed in a wide range of applications, encompassing data analysis and machine learning.es_ES
dc.description.peerreviewedes_ES
dc.description.sponsorshipThis research has been supported by grants awarded to G.F.C. by the Spanish Ministerio de Ciencia e Innovación and the European Union NextGenerationEU/PRTR, MCIN/AEI/10.13039/501100011033 (grant numbers TED2021-132296B-C55, PDC2022-133520-I00 and PID2022- 142341OB-I00). D.G.A., Spain is supported by a research contract with reference 2023-CDT-11616 (from project with grant number TED2021- 132296B-C55). A.H. was supported by RTI2018-095497-B-I00 from Ministerio de Ciencia e Innovación (MCIN), Spain, HR17_00527 from Fundación La Caixa, Spain, Transatlantic Network of Excellence, Spain (TNE-18CVD04) from the Leducq Foundation, and FET-OPEN (no. 861878) from the European Comission. M.P.-S. is supported by a Federation of European Biochemical Societies, Spain, the EMBO ALTF (no. 1142–2020) long-term fellowship and from MICINN, Spain (RYC2021- 033511-I). J.S. is supported by a fellowship (PRE2019-089130) from MICINN, Spain. The CNIC is supported by the MCIN and the Pro-CNIC Foundation, Spain.es_ES
dc.format.page107827es_ES
dc.format.volume168es_ES
dc.identifier.citationComput Biol Med. 2024 Jan:168:107827.es_ES
dc.identifier.doi10.1016/j.compbiomed.2023.107827es_ES
dc.identifier.e-issn1879-0534es_ES
dc.identifier.journalComputers in biology and medicinees_ES
dc.identifier.pubmedID38086138es_ES
dc.identifier.urihttp://hdl.handle.net/20.500.12105/19277
dc.language.isoenges_ES
dc.publisherElsevieres_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/MCIN/AEI/10.13039/501100011033es_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/TED2021-132296B-C55es_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/PDC2022-133520-I00es_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/PID2022-142341OB-I00es_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/2023-CDT-11616es_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/TED2021-132296B-C55es_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/RTI2018-095497-B-I00es_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/HR17-00527es_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/TNE-18CVD04es_ES
dc.relation.projectFECYTinfo:eu-repo/grantAgreement/ES/PRE2019-089130es_ES
dc.relation.publisherversion10.1016/j.compbiomed.2023.107827es_ES
dc.repisalud.institucionCNICes_ES
dc.repisalud.orgCNICCNIC::Unidades técnicas::Bioinformáticaes_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.licenseAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subject.meshAlgorithmses_ES
dc.subject.meshMachine Learninges_ES
dc.subject.meshPrincipal Component Analysises_ES
dc.titleVariable selection for nonlinear dimensionality reduction of biological datasets through bootstrapping of correlation networks.es_ES
dc.typejournal articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication18d7e8cb-2cd8-4c07-8da8-81d3503c749f
relation.isAuthorOfPublicationecd7f1e7-2399-4c06-bbc6-d1a2e86c0fbe
relation.isAuthorOfPublication.latestForDiscovery18d7e8cb-2cd8-4c07-8da8-81d3503c749f

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Variable selection for nonlinear dimensionality_Comput Biol Med_2024.pdf
Size:
16.4 MB
Format:
Adobe Portable Document Format
Description: