Affiliations 

  • 1 Faculty of Computer Science and Information Technology, University of Malaya, 50603, Kuala Lumpur, Malaysia; Department of Computer Science, Sukkur IBA University, Sukkur, Pakistan. Electronic address: gmurtaza@iba-suk.edu.pk
  • 2 Faculty of Computer Science and Information Technology, University of Malaya, 50603, Kuala Lumpur, Malaysia. Electronic address: ainuddin@um.edu.my
  • 3 Our Lady of Lourdes Hospital Drogheda Ireland, Ireland. Electronic address: ghulam.raza@ymail.com
  • 4 Faculty of Computer Science and Information Technology, University of Malaya, 50603, Kuala Lumpur, Malaysia. Electronic address: liyanashuib@um.edu.my
Comput Med Imaging Graph, 2021 04;89:101870.
PMID: 33545489 DOI: 10.1016/j.compmedimag.2021.101870

Abstract

Worldwide, the burden of cancer is drastically increasing over the past few years. Among all types of cancers in women, breast cancer (BrC) is the main cause of unnatural deaths. For early diagnosis, histopathology (Hp) imaging is a gold standard for positive and detailed (at tissue level) diagnosis of breast tumor (BrT) compared to mammogram images. A large number of studies used BrT Hp images to solve binary or multiclassification problems using high computational resources. However, classification models' performance may be compromised due to the high correlation among various types of BrT in Hp images, which raises the misclassification rate. Thus, this paper aims to develop a tree-based BrT multiclassification model via deep learning (DL) to extract discriminative features to solve the multiclassification problem with better performance using less computational resources. The main contributions of this work are to create an ensemble, tree-based DL model that is pre-trained on the BreakHis dataset, and implementation of a misclassification reduction algorithm. The ensemble, tree-based DL model, extracts discriminative BrT features from Hp images. The target dataset (i.e., Bioimaging challenge 2015 breast histology) is small in size; thus, to avoid overfitting of the proposed model, pretraining is performed on the BreakHis dataset. Whereas, misclassification reduction algorithm is implemented to enhance the performance of the classification model. The experimental results show that the proposed model outperformed the existing state-of-the-art baseline studies. The achieved classification accuracy is ranging from 87.50 % to 100 % for four subtypes of BrT. Thus, the proposed model can assist doctors as the second opinion in any healthcare centre.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.