Publication:
Obtaining EHR-derived datasets for COVID-19 research within a short time: a flexible methodology based on Detailed Clinical Models

dc.contributor.authorPedrera-Jiménez, Miguel
dc.contributor.authorGarcía-Barrio, Noelia
dc.contributor.authorCruz-Rojo, Jaime
dc.contributor.authorTerriza-Torres, Ana Isabel
dc.contributor.authorLópez-Jiménez, Elena Ana
dc.contributor.authorCalvo-Boyero, Fernando
dc.contributor.authorJiménez-Cerezo, María Jesús
dc.contributor.authorBlanco-Martínez, Alvar Javier
dc.contributor.authorRoig-Domínguez, Gustavo
dc.contributor.authorCruz-Bermúdez, Juan Luis
dc.contributor.authorBernal-Sobrino, José Luis
dc.contributor.authorSerrano-Balazote, Pablo
dc.contributor.authorMuñoz Carrero, Adolfo
dc.contributor.funderInstituto de Salud Carlos III
dc.contributor.funderUnión Europea. Fondo Europeo de Desarrollo Regional (FEDER/ERDF)
dc.date.accessioned2021-04-16T11:24:52Z
dc.date.available2021-04-16T11:24:52Z
dc.date.issued2021
dc.description.abstractBackground: COVID-19 ranks as the single largest health incident worldwide in decades. In such a scenario, electronic health records (EHRs) should provide a timely response to healthcare needs and to data uses that go beyond direct medical care and are known as secondary uses, which include biomedical research. However, it is usual for each data analysis initiative to define its own information model in line with its requirements. These specifications share clinical concepts, but differ in format and recording criteria, something that creates data entry redundancy in multiple electronic data capture systems (EDCs) with the consequent investment of effort and time by the organization. Objective: This study sought to design and implement a flexible methodology based on detailed clinical models (DCM), which would enable EHRs generated in a tertiary hospital to be effectively reused without loss of meaning and within a short time. Material and methods: The proposed methodology comprises four stages: (1) specification of an initial set of relevant variables for COVID-19; (2) modeling and formalization of clinical concepts using ISO 13606 standard and SNOMED CT and LOINC terminologies; (3) definition of transformation rules to generate secondary use models from standardized EHRs and development of them using R language; and (4) implementation and validation of the methodology through the generation of the International Severe Acute Respiratory and emerging Infection Consortium (ISARIC-WHO) COVID-19 case report form. This process has been implemented into a 1300-bed tertiary Hospital for a cohort of 4489 patients hospitalized from 25 February 2020 to 10 September 2020. Results: An initial and expandable set of relevant concepts for COVID-19 was identified, modeled and formalized using ISO-13606 standard and SNOMED CT and LOINC terminologies. Similarly, an algorithm was designed and implemented with R and then applied to process EHRs in accordance with standardized concepts, transforming them into secondary use models. Lastly, these resources were applied to obtain a data extract conforming to the ISARIC-WHO COVID-19 case report form, without requiring manual data collection. The methodology allowed obtaining the observation domain of this model with a coverage of over 85% of patients in the majority of concepts. Conclusion: This study has furnished a solution to the difficulty of rapidly and efficiently obtaining EHR-derived data for secondary use in COVID-19, capable of adapting to changes in data specifications and applicable to other organizations and other health conditions. The conclusion to be drawn from this initial validation is that this DCM-based methodology allows the effective reuse of EHRs generated in a tertiary Hospital during COVID-19 pandemic, with no additional effort or time for the organization and with a greater data scope than that yielded by conventional manual data collection process in ad-hoc EDCs.es_ES
dc.description.peerreviewedes_ES
dc.description.sponsorshipHospital 12 de Octubre is supported by “Arquitectura normalizada de datos clínicos para la generación de infobancos y su uso secundario en investigación: caso de uso cáncer de mama, cérvix y útero, y evaluación” PI18/00981, “Infobanco para uso secundario de datos de salud basado en estándares de tecnología y conocimiento: evaluación de la calidad, validez y utilidad de la HCE como origen de datos para el estudio de la infección por VIH” PI18/01047 and Digital Health Research Department, Instituto de Salud Carlos III (ISCIII) is supported by PI18CIII/00019 “Arquitectura normalizada de datos clínicos para la generación de infobancos y su uso secundario en investigación: solución tecnológica”; funded by the Carlos III Health Institute from the Spanish National plan for Scientific and Technical Research and Innovation 2017-2020 and the European Regional Development Funds (FEDER).es_ES
dc.format.page103697es_ES
dc.format.volume115es_ES
dc.identifier.citationJ Biomed Inform . 2021 Mar;115:103697.es_ES
dc.identifier.doi10.1016/j.jbi.2021.103697es_ES
dc.identifier.e-issn1532-0480es_ES
dc.identifier.journalJournal of biomedical informaticses_ES
dc.identifier.pubmedID33548541es_ES
dc.identifier.urihttp://hdl.handle.net/20.500.12105/12672
dc.language.isoenges_ES
dc.publisherElsevier
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/PI18/00981es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/PI18/01047es_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/PI18CIII/00019es_ES
dc.relation.publisherversionhttps://doi.org/10.1016/j.jbi.2021.103697es_ES
dc.repisalud.centroISCIII::Unidad de Investigación en Salud Digital (UITeS)es_ES
dc.repisalud.institucionISCIIIes_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.licenseAttribution-NonCommercial-NoDerivatives 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectCOVID-19es_ES
dc.subjectDetailed clinical modelses_ES
dc.subjectElectronic health recordses_ES
dc.subjectReal world dataes_ES
dc.subjectSemanticses_ES
dc.subjectStandardses_ES
dc.subject.meshDatasets as Topices_ES
dc.subject.meshElectronic Health Recordses_ES
dc.subject.meshAlgorithmses_ES
dc.subject.meshCOVID-19es_ES
dc.subject.meshCohort Studieses_ES
dc.subject.meshHumanses_ES
dc.titleObtaining EHR-derived datasets for COVID-19 research within a short time: a flexible methodology based on Detailed Clinical Modelses_ES
dc.typeresearch articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublicationc62651ac-034c-4271-b51e-d82a428af13e
relation.isAuthorOfPublication.latestForDiscoveryc62651ac-034c-4271-b51e-d82a428af13e
relation.isFunderOfPublication7d739953-4b68-4675-b5bb-387a9ab74b66
relation.isFunderOfPublicationefa64f05-b985-4984-8f1e-5fc4ef21f502
relation.isFunderOfPublication.latestForDiscovery7d739953-4b68-4675-b5bb-387a9ab74b66
relation.isPublisherOfPublication7d471502-7bd5-4f7a-90a4-8274382509ef
relation.isPublisherOfPublication.latestForDiscovery7d471502-7bd5-4f7a-90a4-8274382509ef

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ObtainingEHRDerivedDatasets_2021.pdf
Size:
3.96 MB
Format:
Adobe Portable Document Format
Description:
Artículo principal