Publication:
Overview of BioASQ 2021-MESINESP track. Evaluation of advance hierarchical classification techniques for scientific literature, patents and clinical trials

dc.contributor.authorGasco, Luis
dc.contributor.authorNentidis, Anastasios
dc.contributor.authorKrithara, Anastasia
dc.contributor.authorEstrada-Zavala, Darryl
dc.contributor.authorMurasaki, Renato Toshiyuki
dc.contributor.authorPrimo-Peña, Elena
dc.contributor.authorBojo Canales, Cristina
dc.contributor.authorPaliouras, Georgios
dc.contributor.authorKrallinger, Martin
dc.contributor.funderPlan for advancement of Language Technologies (Plan TL)
dc.date.accessioned2021-08-25T11:29:37Z
dc.date.available2021-08-25T11:29:37Z
dc.date.issued2021-08
dc.descriptionCLEF 2021 – Conference and Labs of the Evaluation Forum, September 21–24, 2021, Bucharest, Romania,es_ES
dc.description.abstractThere is a pressing need to exploit recent advances in natural language processing technologies, in particular language models and deep learning approaches, to enable improved retrieval, classification and ultimately access to information contained in multiple, heterogeneous types of documents. This is particularly true for the field of biomedicine and clinical research, where medical experts and scientists need to carry out complex search queries against a variety of document collections, including literature, patents, clinical trials or other kind of content like EHRs. Indexing documents with structured controlled vocabularies used for semantic search engines and query expansion purposes is a critical task for enabling sophisticated user queries and even cross-language retrieval. Due to the complexity of the medical domain and the use of very large hierarchical indexing terminologies, implementing efficient automatic systems to aid manual indexing is extremely difficult. This paper provides a summary of the MESINESP task results on medical semantic indexing in Spanish (BioASQ/ CLEF 2021 Challenge). MESINESP was carried out in direct collaboration with literature content databases and medical indexing experts using the DeCS vocabulary, a similar resource as MeSH terms. Seven participating teams used advanced technologies including extreme multilabel classification and deep language models to solve this challenge which can be viewed as a multi-label classification problem. MESINESP resources, we have released a Gold Standard collection of 243,000 documents with a total of 2179 manual annotations divided in train, development and test subsets covering literature, patents as well as clinical trial summaries, under a cross-genre training and data labeling scenario. Manual indexing of the evaluation subsets was carried out by three independent experts using a specially developed indexing interface called ASIT. Additionally, we have published a collection of large-scale automatic semantic annotations based on NER systems of these documents with mentions of drugs/medications (170,000), symptoms (137,000), diseases (840,000) and clinical procedures (415,000). In addition to a summary of the used technologies by the teams, this paperes_ES
dc.description.peerreviewedes_ES
dc.identifier.citationCEUR Workshop Proceedings. 2021:2936 165-187es_ES
dc.identifier.issn1613-0073es_ES
dc.identifier.urihttp://hdl.handle.net/20.500.12105/13311
dc.language.isoenges_ES
dc.publisherCEUR Workshop Proceedingses_ES
dc.relation.publisherversionhttp://ceur-ws.org/Vol-2936/es_ES
dc.repisalud.centroISCIII::Biblioteca Nacional de Ciencias de la Saludes_ES
dc.repisalud.institucionISCIIIes_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.licenseAtribución 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectHierarchical Text Classificationes_ES
dc.subjectSemantic Indexinges_ES
dc.subjectInformation Retrievales_ES
dc.subjectQuestion Answeringes_ES
dc.subjectBiomedical knowledgees_ES
dc.subjectAutomatic Indexinges_ES
dc.subjectDeCSes_ES
dc.subjectSpanishes_ES
dc.titleOverview of BioASQ 2021-MESINESP track. Evaluation of advance hierarchical classification techniques for scientific literature, patents and clinical trialses_ES
dc.typeconference paperes_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublicationc211e6ef-c557-4f96-8f13-4b37aaae8122
relation.isAuthorOfPublication327d6acc-560a-4e52-b181-319a647caa38
relation.isAuthorOfPublicationb0708f2a-654d-40d7-9d09-38a400d87b21
relation.isAuthorOfPublication.latestForDiscoveryc211e6ef-c557-4f96-8f13-4b37aaae8122

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
OverviewOfBioASQ_2021.pdf
Size:
3.99 MB
Format:
Adobe Portable Document Format
Description: