Publication:
Next generation community assessment of biomedical entity recognition web servers: metrics, performance, interoperability aspects of BeCalm

dc.contributor.authorPérez-Pérez, Martin
dc.contributor.authorPérez-Rodríguez, Gael
dc.contributor.authorBlanco-Míguez, Aitor
dc.contributor.authorFdez-Riverola, Florentino
dc.contributor.authorValencia, Alfonso
dc.contributor.authorKrallinger, Martin
dc.contributor.authorLourenço, Anália
dc.contributor.funderUnión Europea. Fondo Europeo de Desarrollo Regional (FEDER/ERDF)
dc.contributor.funderXunta de Galicia (España)
dc.contributor.funderFundação para a Ciência e Tecnologia (Portugal)
dc.contributor.funderMinisterio de Energía, Turismo y Agenda Digital (España)
dc.contributor.funderUnión Europea. Comisión Europea. H2020
dc.contributor.funderCentro Nacional de Investigaciones Oncológicas Carlos III (España)
dc.date.accessioned2019-09-11T11:57:41Z
dc.date.available2019-09-11T11:57:41Z
dc.date.issued2019-06-24
dc.description.abstractBACKGROUND: Shared tasks and community challenges represent key instruments to promote research, collaboration and determine the state of the art of biomedical and chemical text mining technologies. Traditionally, such tasks relied on the comparison of automatically generated results against a so-called Gold Standard dataset of manually labelled textual data, regardless of efficiency and robustness of the underlying implementations. Due to the rapid growth of unstructured data collections, including patent databases and particularly the scientific literature, there is a pressing need to generate, assess and expose robust big data text mining solutions to semantically enrich documents in real time. To address this pressing need, a novel track called "Technical interoperability and performance of annotation servers" was launched under the umbrella of the BioCreative text mining evaluation effort. The aim of this track was to enable the continuous assessment of technical aspects of text annotation web servers, specifically of online biomedical named entity recognition systems of interest for medicinal chemistry applications. RESULTS: A total of 15 out of 26 registered teams successfully implemented online annotation servers. They returned predictions during a two-month period in predefined formats and were evaluated through the BeCalm evaluation platform, specifically developed for this track. The track encompassed three levels of evaluation, i.e. data format considerations, technical metrics and functional specifications. Participating annotation servers were implemented in seven different programming languages and covered 12 general entity types. The continuous evaluation of server responses accounted for testing periods of low activity and moderate to high activity, encompassing overall 4,092,502 requests from three different document provider settings. The median response time was below 3.74 s, with a median of 10 annotations/document. Most of the servers showed great reliability and stability, being able to process over 100,000 requests in a 5-day period. CONCLUSIONS: The presented track was a novel experimental task that systematically evaluated the technical performance aspects of online entity recognition systems. It raised the interest of a significant number of participants. Future editions of the competition will address the ability to process documents in bulk as well as to annotate full-text documents.es_ES
dc.description.peerreviewedes_ES
dc.description.sponsorshipThis project has received funding from the European Union’s Horizon 2020 research and innovation programme under Grant Agreement No. 654021 (OpenMinTeD), and the Encomienda MINETAD‑CNIO as part of the Plan for the Advancement of Language Technology for funding. This work was partially supported by the Consellería de Educación, Universidades e Formación Profesional (Xunta de Galicia), under the scope of the strategic funding of ED431C2018/55‑GRC Competitive Reference Group, and the Portuguese Foundation for Science and Technology (FCT ), under the scope of the strategic funding of UID/BIO/04469/2013 unit and COMPETE 2020(POCI‑01‑0145‑FEDER‑006684).es_ES
dc.format.number1es_ES
dc.format.page42es_ES
dc.format.volume11es_ES
dc.identifier.citationJ Cheminform. 2019;11(1):42es_ES
dc.identifier.doi10.1186/s13321-019-0363-6es_ES
dc.identifier.issn1758-2946es_ES
dc.identifier.journalJournal of cheminformaticses_ES
dc.identifier.pubmedID31236786es_ES
dc.identifier.urihttp://hdl.handle.net/20.500.12105/8338
dc.language.isoenges_ES
dc.publisherBioMed Central (BMC)
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/FH2020/654021es_ES
dc.relation.publisherversionhttps://doi.org/ 10.1186/s13321-019-0363-6.es_ES
dc.repisalud.institucionCNIOes_ES
dc.repisalud.orgCNIOCNIO::Grupos de investigación::Grupo de Biología Computacional Estructurales_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.licenseAtribución-NoComercial-CompartirIgual 4.0 Internacional*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/*
dc.subjectAnnotation serveres_ES
dc.subjectBeCalm metaserveres_ES
dc.subjectBioCreativees_ES
dc.subjectContinuous evaluationes_ES
dc.subjectNamed entity recognitiones_ES
dc.subjectPatent mininges_ES
dc.subjectREST-APIes_ES
dc.subjectShared taskes_ES
dc.subjectTIPSes_ES
dc.subjectText mininges_ES
dc.titleNext generation community assessment of biomedical entity recognition web servers: metrics, performance, interoperability aspects of BeCalmes_ES
dc.typejournal articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublicationd691c3d3-9e05-4217-a923-08e68ba16baa
relation.isAuthorOfPublicationb0708f2a-654d-40d7-9d09-38a400d87b21
relation.isAuthorOfPublication.latestForDiscoveryd691c3d3-9e05-4217-a923-08e68ba16baa
relation.isFunderOfPublicationefa64f05-b985-4984-8f1e-5fc4ef21f502
relation.isFunderOfPublication3d2eadbc-cf2c-4663-8d3d-220eb217da68
relation.isFunderOfPublication25f151af-8d71-4e41-b920-ca8474d267b7
relation.isFunderOfPublication4a2a1afc-e4d6-4c9f-af08-fc72d809c081
relation.isFunderOfPublicationa418cf3b-f427-4376-a75f-01dba9bb944f
relation.isFunderOfPublicationf0929bd2-7723-43d1-8105-2c615f11ef3a
relation.isFunderOfPublication.latestForDiscoveryefa64f05-b985-4984-8f1e-5fc4ef21f502
relation.isPublisherOfPublication4fe896aa-347b-437b-a45b-95f4b60d9fd3
relation.isPublisherOfPublication.latestForDiscovery4fe896aa-347b-437b-a45b-95f4b60d9fd3

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Nextgenerationcommunityassessmen_2019.pdf
Size:
2.11 MB
Format:
Adobe Portable Document Format
Description: