Publication:
Unsupervised Machine Learning Organization of the Functional Dark Proteome of Gram-Negative "Superbugs": Six Protein Clusters Amenable for Distinct Scientific Applications

dc.contributor.authorSicilia, Carlos
dc.contributor.authorCorral-Lugo, Andres
dc.contributor.authorSmialowski, Pawel
dc.contributor.authorMcConnell, Michael J
dc.contributor.authorMartin-Galiano, Antonio Javier
dc.contributor.funderInstituto de Salud Carlos III
dc.contributor.funderComunidad de Madrid (España)
dc.date.accessioned2023-05-09T12:27:26Z
dc.date.available2023-05-09T12:27:26Z
dc.date.issued2022-12
dc.description.abstractUncharacterized proteins have been underutilized as targets for the development of novel therapeutics for difficult-to-treat bacterial infections. To facilitate the exploration of these proteins, 2819 predicted, uncharacterized proteins (19.1% of the total) from reference strains of multidrug Acinetobacter baumannii, Klebsiella pneumoniae, and Pseudomonas aeruginosa species were organized using an unsupervised k-means machine learning algorithm. Classification using normalized values for protein length, pI, hydrophobicity, degree of conservation, structural disorder, and %AT of the coding gene rendered six natural clusters. Cluster proteins showed different trends regarding operon membership, expression, presence of unknown function domains, and interactomic relevance. Clusters 2, 4, and 5 were enriched with highly disordered proteins, nonworkable membrane proteins, and likely spurious proteins, respectively. Clusters 1, 3, and 6 showed closer distances to known antigens, antibiotic targets, and virulence factors. Up to 21.8% of proteins in these clusters were structurally covered by modeling, which allowed assessment of druggability and discontinuous B-cell epitopes. Five proteins (4 in Cluster 1) were potential druggable targets for antibiotherapy. Eighteen proteins (11 in Cluster 6) were strong B-cell and T-cell immunogen candidates for vaccine development. Conclusively, we provide a feature-based schema to fractionate the functional dark proteome of critical pathogens for fundamental and biomedical purposes.es_ES
dc.description.peerreviewedes_ES
dc.description.sponsorshipThis research was supported by Acción Estratégica en Salud from the ISCIII, Grant MPY509/19, MPY380/18, and MPY516/19. A.C.-L. is the recipient of a Comunidad de Madrid contract by the ISCIII.es_ES
dc.format.number50es_ES
dc.format.page46131-46145es_ES
dc.format.volume7es_ES
dc.identifier.citationACS Omega. 2022 Dec 6;7(50):46131-46145.es_ES
dc.identifier.doi10.1021/acsomega.2c04076es_ES
dc.identifier.e-issn2470-1343es_ES
dc.identifier.journalACS omegaes_ES
dc.identifier.pubmedID36570227es_ES
dc.identifier.urihttp://hdl.handle.net/20.500.12105/16034
dc.language.isoenges_ES
dc.publisherACS Publications
dc.relation.projectFISinfo:eu-repo/grantAgreement/ES/MPY509/19es_ES
dc.relation.projectFISinfo:eu-repo/grantAgreement/ES/MPY380/18es_ES
dc.relation.projectFISinfo:eu-repo/grantAgreement/ES/MPY516/19es_ES
dc.relation.publisherversionhttps://doi.org/10.1021/acsomega.2c04076es_ES
dc.repisalud.centroISCIII::Centro Nacional de Microbiología (CNM)es_ES
dc.repisalud.institucionISCIIIes_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.licenseAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.titleUnsupervised Machine Learning Organization of the Functional Dark Proteome of Gram-Negative "Superbugs": Six Protein Clusters Amenable for Distinct Scientific Applicationses_ES
dc.typeresearch articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication1100a22a-780d-432e-956d-599aff664ba5
relation.isAuthorOfPublication1d2e9ea7-be75-4b13-a09e-656f8f304a14
relation.isAuthorOfPublicationa05be95c-83ed-4217-ab14-cbca92cd5279
relation.isAuthorOfPublication.latestForDiscovery1100a22a-780d-432e-956d-599aff664ba5
relation.isFunderOfPublication7d739953-4b68-4675-b5bb-387a9ab74b66
relation.isFunderOfPublicationc87c70a3-e023-4b6b-ac25-1b2d1b483786
relation.isFunderOfPublication.latestForDiscovery7d739953-4b68-4675-b5bb-387a9ab74b66
relation.isPublisherOfPublicationbbe00a75-ae1f-41b0-8a37-df183dabf093
relation.isPublisherOfPublication.latestForDiscoverybbe00a75-ae1f-41b0-8a37-df183dabf093

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
UnsupervisedMachineLearningOrganization_2022.pdf
Size:
7.71 MB
Format:
Adobe Portable Document Format
Description: