A deep learning framework to classify breast density with noisy labels regularization

Lopez-Almazan, Hector; Pérez-Benito, Francisco Javier; Larroza, Andrés; Perez-Cortes, Juan-Carlos; Pollan-Santamaria, Marina; Perez-Gomez, Beatriz; Salas Trejo, Dolores; Casals, María; Llobet, Rafael

Publication:
A deep learning framework to classify breast density with noisy labels regularization

dc.contributor.author	Lopez-Almazan, Hector
dc.contributor.author	Pérez-Benito, Francisco Javier
dc.contributor.author	Larroza, Andrés
dc.contributor.author	Perez-Cortes, Juan-Carlos
dc.contributor.author	Pollan-Santamaria, Marina
dc.contributor.author	Perez-Gomez, Beatriz
dc.contributor.author	Salas Trejo, Dolores
dc.contributor.author	Casals, María
dc.contributor.author	Llobet, Rafael
dc.contributor.funder	Generalitat Valenciana (España)
dc.date.accessioned	2022-11-17T08:58:21Z
dc.date.available	2022-11-17T08:58:21Z
dc.date.issued	2022-06
dc.description.abstract	Background and objective: Breast density assessed from digital mammograms is a biomarker for higher risk of developing breast cancer. Experienced radiologists assess breast density using the Breast Image and Data System (BI-RADS) categories. Supervised learning algorithms have been developed with this objective in mind, however, the performance of these algorithms depends on the quality of the ground-truth information which is usually labeled by expert readers. These labels are noisy approximations of the ground truth, as there is often intra- and inter-reader variability among labels. Thus, it is crucial to provide a reliable method to obtain digital mammograms matching BI-RADS categories. This paper presents RegL (Labels Regularizer), a methodology that includes different image pre-processes to allow both a correct breast segmentation and the enhancement of image quality through an intensity adjustment, thus allowing the use of deep learning to classify the mammograms into BI-RADS categories. The Confusion Matrix (CM) - CNN network used implements an architecture that models each radiologist's noisy label. The final methodology pipeline was determined after comparing the performance of image pre-processes combined with different DL architectures. Methods: A multi-center study composed of 1395 women whose mammograms were classified into the four BI-RADS categories by three experienced radiologists is presented. A total of 892 mammograms were used as the training corpus, 224 formed the validation corpus, and 279 the test corpus. Results: The combination of five networks implementing the RegL methodology achieved the best results among all the models in the test set. The ensemble model obtained an accuracy of (0.85) and a kappa index of 0.71. Conclusions: The proposed methodology has a similar performance to the experienced radiologists in the classification of digital mammograms into BI-RADS categories. This suggests that the pre-processing steps and modelling of each radiologist's label allows for a better estimation of the unknown ground truth labels.	es_ES
dc.description.peerreviewed	Sí	es_ES
dc.description.sponsorship	This work was partially funded by Generalitat Valenciana through IVACE (Valencian Institute of Business Competitiveness) distributed nominatively to Valencian technological innovation centres under project expedient IMAMCN/2021/1.	es_ES
dc.format.page	106885	es_ES
dc.format.volume	221	es_ES
dc.identifier.citation	Comput Methods Programs Biomed. 2022 Jun;221:106885.	es_ES
dc.identifier.doi	10.1016/j.cmpb.2022.106885	es_ES
dc.identifier.e-issn	1872-7565	es_ES
dc.identifier.journal	Computer methods and programs in biomedicine	es_ES
dc.identifier.pubmedID	35594581	es_ES
dc.identifier.uri	http://hdl.handle.net/20.500.12105/15173
dc.language.iso	eng	es_ES
dc.publisher	Elsevier
dc.relation.publisherversion	https://doi.org/10.1016/j.cmpb.2022.106885	es_ES
dc.repisalud.centro	ISCIII::Centro Nacional de Epidemiología	es_ES
dc.repisalud.institucion	ISCIII	es_ES
dc.rights.accessRights	open access	es_ES
dc.rights.license	Attribution-NonCommercial-NoDerivatives 4.0 Internacional	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Breast density	es_ES
dc.subject	Noisy labels	es_ES
dc.subject	Deep learning	es_ES
dc.subject	Dense tissue classification	es_ES
dc.subject	Mammography	es_ES
dc.subject.mesh	Breast Neoplasms	es_ES
dc.subject.mesh	Deep Learning	es_ES
dc.subject.mesh	Breast	es_ES
dc.subject.mesh	Breast Density	es_ES
dc.subject.mesh	Female	es_ES
dc.subject.mesh	Humans	es_ES
dc.subject.mesh	Mammography	es_ES
dc.title	A deep learning framework to classify breast density with noisy labels regularization	es_ES
dc.type	research article	es_ES
dc.type.hasVersion	VoR	es_ES
dspace.entity.type	Publication
relation.isAuthorOfPublication	cb3b77d8-c78c-4238-9b9d-c1171ff3ab51
relation.isAuthorOfPublication	4b7eabf1-df99-4684-9c92-42bc37de6a41
relation.isAuthorOfPublication.latestForDiscovery	cb3b77d8-c78c-4238-9b9d-c1171ff3ab51
relation.isFunderOfPublication	842e6617-7822-4501-a927-7e8cedda4aa6
relation.isFunderOfPublication.latestForDiscovery	842e6617-7822-4501-a927-7e8cedda4aa6
relation.isPublisherOfPublication	7d471502-7bd5-4f7a-90a4-8274382509ef
relation.isPublisherOfPublication.latestForDiscovery	7d471502-7bd5-4f7a-90a4-8274382509ef

Files

Original bundle

Now showing 1 - 1 of 1

Name:: ADeepLearningFramework_2022.pdf
Size:: 1.9 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Centro Nacional de Epidemiología (CNE)

Publication: A deep learning framework to classify breast density with noisy labels regularization

Files

Original bundle

Collections

Publication:
A deep learning framework to classify breast density with noisy labels regularization