2: The model that classifies the breast image a ∈ A
3: Resize dataset images to 300 × 300 dimensions
4: Image data augmentation can be used to address the overfitting issue.
5: Image normalization
6: Set of CNN pre-trained models X = {Resnet50, DenseNet121, InceptionV3,}
7: for Xx ∈ x do
8: epochs = 1 to 30
9: for mini-batches (Ai, Bi) ∈ (Atrain, Btrain) do
10: Model parameters changed
11: if all over the previous five epochs, the accuracy of the validation is not increasing then
12: Training has to end.
13: end if
14: end for
15: end for
16: for all A ∈ Atest do
17: Combined outputs from all models should be fed into logistic regression for final classification.
18: end for
In this approach, we propose as shown in Figure 4 below, three novel CNN architectures—ResNet50, DenseNet121, and InceptionV3—ensembled in our proposed methodology. Neither of them employed the latest developments in the meta-learning technique to classify benign and malignant in the literature review. An optimizer is used to change the model’s learning rate. In this research, the Adam optimizer is utilized. The training accuracy score is evaluated using the accuracy measure. To identify the loss, binary cross-entropy is used. In binary class classification, this is the loss function that is most frequently utilized. The model performs better when the loss score goes down. Each model’s output is given to the meta-learner. We use a logistic regression classifier as a meta-learner to make our final prediction.
Architecture of the proposed meta-learning of CNN models.
We thoroughly assess and analyze the performance outcomes obtained using various model configurations to demonstrate the effectiveness of our meta ensemble model in screening benign and malignant breast cancer. We will now go on to the experimental conditions, performance indicators, quantitative and qualitative findings analysis, and discussion.
The distribution of samples in the dataset from both classes is shown in Table 1 . Presents the count distribution of images across all classes in the whole dataset. A split ratio of 70:10:20 is used to divide the total dataset into training, validation, and test sets. The images included in the dataset with the ratio mentioned above are then used to train and evaluate the meta ensemble model, as well as the individual sub-models. We use image augmentation to address the issue of a short dataset, improve training effectiveness, and guard against model overfitting. Additionally, image augmentation is thought to improve the generalizability of models. To address the issue of limited dataset size, data augmentation was used to expand the training dataset. Here is a summary of the augmentation features that were employed as well as other hyperparameters that were set.
Distribution of samples in the dataset from both classes, benign and malignant.
Class | Number of Samples | |||
---|---|---|---|---|
Training | Validation | Testing | Total Images | |
Benign | 3500 | 500 | 1000 | 5000 |
Malignant | 3500 | 500 | 1000 | 5000 |
For its implementation, we decided to leverage the TensorFlow and Keras functional APIs. Using Google Colab, which offers free GPU access, we train and evaluate our models. Model configuration and augmentation features are shown in Table 2 . For the training of models and model validation, we employ the Adam optimizer with momentum. For the Adam optimizer, we used an initial learning level of 0.0001. In addition, we employed the binary cross-entropy loss function for both training and validating the model. The binary cross-entropy loss function is an obvious option for a binary classification job, such as differentiating between malignant and benign breast cancer, as it speeds up model convergence. Additionally, we make use of the model checkpoint and reduce loss plateau decay (ReduceLROnPlateau) callbacks from Keras.
Model configuration and augmentation features.
Parameters | Value |
---|---|
Max epochs | 30 |
Size of batch | 32 |
Optimizer | Adam |
Loss function | Binary cross-entropy |
Learning rate | 0.0001 |
Range of rotation | Random with factor (0.5) |
Shuffling | Yes |
Flip | Nearest |
Table 3 shows the performance results for different CNN models and the proposed meta-model in classifying benign and malignant images for breast cancer diagnosis. Each CNN model was evaluated based on its ability to classify benign and malignant images accurately. The performance measures were accuracy, precision, recall, and F1 score. For Inception V3, an accuracy of 0.83 is achieved for benign and malignant images. The precision of 0.78 for benign and 0.91 for malignant. Recall 0.93 for benign and 0.74 for malignant images. F1 score of 0.85 for benign and 0.82 for malignant images.
The performance results obtained from both the CNN and the proposed meta-model.
Model | Class | Accuracy | Precision | Recall | F1 Score |
---|---|---|---|---|---|
Inception V3 | Benign | 0.83 | 0.78 | 0.93 | 0.85 |
Malignant | 0.91 | 0.74 | 0.82 | ||
ResNet50 | Benign | 0.88 | 0.84 | 0.94 | 0.89 |
Malignant | 0.93 | 0.82 | 0.88 | ||
DenseNet121 | Benign | 0.84 | 0.81 | 0.89 | 0.85 |
Malignant | 0.88 | 0.79 | 0.83 | ||
Ensemble Meta-Model | Benign | 0.90 | 0.86 | 0.95 | 0.90 |
Malignant | 0.94 | 0.84 | 0.89 |
In the case of ResNet50: The accuracy of 0.88 for benign and malignant images. The precision of 0.84 for benign and 0.93 for malignant. Recall 0.94 for benign and 0.82 for malignant images. The F1 score for benign images was 0.89, while the score for malignant images was 0.88.
In the case of DenseNet121: The accuracy of 0.84 for benign and malignant images. The precision of 0.81 for benign and 0.88 for malignant. Recall 0.89 for benign and 0.79 for malignant images. The model achieved an F1 score of 0.85 for benign and 0.83 for malignant images. In the DenseNet121 model case, the training and validation accuracies are relatively high and close to each other, which suggests that the model fits the data in its well-trained learning of the underlying patterns in the training data and generalizing well to unseen data.
In our proposed meta-model: The proposed meta-model outperformed the individual CNN models regarding accuracy, precision, recall, and F1 score. The results for the meta-model are as follows.
The model’s overall performance was evaluated using accuracy scores, and it achieved a consistent score of 0.90 for both benign and malignant images. Precision score for benign images was 0.86, while the score for malignant images was 0.84. Recall was 0.95 for benign and 0.89 for malignant images. The F1 score for benign images was 0.90, while the score for malignant images was 0.89.
The results suggest that the proposed meta-learning ensemble technique CNN could be a promising approach for improving the accuracy and reliability of breast cancer diagnosis. Accurately classifying cancer images for every category is crucial for an efficient diagnosis system. The meta-model does very well to classify benign instances clear of malignant moles. Additionally, using data augmentation and dropout regularization techniques has helped achieve good results.
Additionally, we keep updated on the learning curves for every model we have looked at. The models have a moderate learning trend throughout training while displaying a rather consistent decline in validation losses (as seen in Figure 5 ). Additionally, the initial training of the model was conducted on the BUSI dataset for both benign and malignant classes. Subsequently, the model was tested on breast cancer images, achieving accuracy after 30 epochs of training (as shown in Figure 6 ). The meta-model converges training and validation losses far more effectively than the CNN sub-models. Because our dataset only consists of a few events, learning curves generally do not overfit. The stacked ensemble model’s use of data augmentation and dropout regularization techniques has mostly been responsible for achieving this. Training the meta-model helps ensure that the final model generalizes well to new unseen data.
Training and validation accuracy was achieved using three sub-models and the meta-model.
Training and validation loss using three sub-models and the meta-model.
Figure 7 summarizes the performance of a CNN model and meta-model in classifying breast cancer as benign and malignant. A confusion matrix is a table used to evaluate the performance of a classification model on a dataset. It is also known as an error matrix or a contingency table. The confusion matrix summarizes the model’s predictions, including the true positive, true negative, false positive, and false negative rates. It is a useful tool for understanding the model’s performance and identifying areas where it may make mistakes.
Summarizes the performance of a CNN and meta-model in classifying breast cancer as benign and malignant.
To enhance our understanding of the class distinction in the investigated meta-models, we employ receiver operating characteristic (ROC) curves, as depicted in Figure 8 . An ROC curve plots the true positive rate (TPR) against the false positive rate (FPR), using a range of threshold values derived from the probability outcomes of deep learning models. TPR is indicative of the probability of accurately classifying benign images as malignant. In contrast, FPR represents the risk of false alarms, which is the scenario where a benign image is incorrectly classified as showing symptoms of malignancy.
ROC curves for the ensemble meta-model and different CNN sub-models.
This paper discusses a novel approach for breast cancer classification that achieved state-of-the-art results using the BUSI dataset. The approach presented in the paper achieved an accuracy of 90%. The use of multiple CNN models, including Inception V3, ResNet50, and DenseNet121, in a meta-learning framework allowed for better generalization and improved accuracy, particularly in detecting malignant tumors. The paper demonstrated the potential of meta-learning and ensemble techniques for improving the accuracy and efficiency of a breast cancer diagnosis. The approach in medical imaging datasets could be extended to other types of cancer or medical conditions. In terms of future work, the researcher suggested several avenues for further research. One potential direction is to explore the use of other meta-learning algorithms, such as model-agnostic meta-learning or reinforcement learning, and compare their performance with the approach presented in the paper. Another direction is to investigate the impact of incorporating clinical data, such as patient history or biopsy results, into the classification model. Furthermore, it noted that the dataset used in the study is limited in terms of the number of samples and the diversity of the cases. The proposed approach for breast cancer classification using meta-learning and ensemble techniques has demonstrated promising results and other medical imaging datasets.
Meta-learning involves many trainable parameters, leading to increase model complexity. In the future, designing novel architecture can help reduce the number of trainable parameters while maintaining or improving performance. The idea is to find simpler, more efficient structures that can capture the underlying patterns in the data with fewer parameters.
The authors sincerely appreciate the support from Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R235), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R235), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
Conceptualization, M.D.A., M.A.K. and M.M.Y.; methodology, U.F.K., A.S., H.E. and M.I.K.; software, M.D.A. and M.M.Y.; validation, A.A.-R., M.A.K., U.F.K. and A.S.; formal analysis, M.I.K. and H.E.; investigation, M.M.Y. and U.F.K.; resources, M.M.Y. and M.A.K.; data curation, A.A.-R. and M.I.K.; writing—original draft preparation, M.D.A., A.S. and U.F.K.; writing—review and editing, M.I.K., U.F.K., M.A.K. and M.M.Y.; visualization, A.S. and M.A.K. supervision, M.A.K.; project administration, U.F.K., A.A.-R. and M.A.K.; funding acquisition, A.A.-R. All authors have read and agreed to the published version of the manuscript.
Conflicts of interest.
The authors declare no conflict of interest.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.
Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
npj Breast Cancer volume 10 , Article number: 70 ( 2024 ) Cite this article
405 Accesses
4 Altmetric
Metrics details
This study was designed to determine the enrollment patterns in breast cancer clinical trials (CCTs) of patients with diverse backgrounds in an equal access setting and to evaluate the factors contributing to low rates of clinical trial accrual in patients of low socioeconomic status (SES). We performed a retrospective review of a prospectively maintained database of new patients seen at the Dan L. Duncan Comprehensive Cancer Center dating from 5/2015 to 9/2021, which included 3043 patients screened for breast CCTs. We compared the rate of CCT availability, eligibility, and enrollment between two patient populations: Smith Clinic, where most patients are of low SES and uninsured, and Baylor St. Luke’s Medical Center (BSLMC) with mostly predominantly insured, higher income patients. We performed logistic regression to evaluate whether differences in age, clinic, race, trial type, and primary language may be underlying the differences in CCT enrollment. More patients were eligible for CCTs at Smith Clinic (53.7% vs 44.7%, p < 0.001). However, Smith Clinic patients were more likely to decline CCT enrollment compared to BSLMC (61.3% declined vs 39.4%, p < 0.001). On multivariate analysis, Black patients had a significantly higher rate of CCT refusal overall (OR = 0.26, 95% CI 0.12–0.56, p < 0.001) and BSLMC only (OR = 0.20, 95% CI 0.060–0.60, p = 0.006). Our data shows that it is likely an oversimplification to assume that equal access will lead to the elimination of CCT disparities. Efforts to diversify CCTs must include consideration of structural and institutional inequities as well as social needs.
Introduction.
Black cancer patients in the United States have both increased overall cancer mortality and increased cancer-specific mortality 1 , 2 , 3 . In breast cancer, Black women have a 41% higher risk of dying from breast cancer when compared with White women and present on average at a later stage 2 , 4 , 5 . Structural inequities pertaining to access to care, diagnosis timing, and treatment delay affect Black women disproportionately 6 , 7 . While these are socioeconomic predictors of the observed poor outcomes, it is also well documented that Black women have a higher incidence of more aggressive breast cancer subtypes (i.e., triple negative breast cancer (TNBC)) than any other ethnic or racial groups 8 . It is critical to understand the biological basis of the observed poor outcomes of breast cancer among Black women. As we work to design precision-driven interventions for prevention, timely diagnosis, and treatment, achieving cancer health equity is not feasible without improving diversity in cancer clinical trials (CCTs).
In an ideal world, the populations studied in CCTs would be representative of the diversity of patients seen in clinic, and CCTs would be used as a tool to decrease inequity. Unfortunately, well-documented disparities exist within CCTs. The National Institutes of Health (NIH) Revitalization Act of 1993 aimed to increase the number of women and underrepresented racial groups in clinical research through mandated inclusion, yet numbers remain low 9 . In 2014, only approximately 1% of NCI-sponsored clinical trials were primarily focused on racial and ethnic minorities 10 . Studies have shown that the average enrollment of Black Americans in CCTs is at best between 5 and 7%, despite Black Americans making up more than 13% of the general population of the United States 11 , 12 , 13 .
When access to CCTs is not a barrier to enrollment, the rate of clinical trial participation by racial and ethnic minorities, especially those of low SES, has not been well studied. Data often comes from safety-net hospitals or private institutions, but rarely are both serving the same catchment area. The Dan L. Duncan Comprehensive Cancer Center (DLDCCC) in Houston, Texas provides access to breast CCTs at two clinical sites: Smith Clinic (SC), within the safety-net Harris Health System, and Baylor St. Luke’s Medical Center (BSLMC). We hypothesized that the racial and socioeconomic gap in clinical trial enrollment would be at least partially improved by similar access to breast CCTs at the two sites.
Of the 3043 patients screened for breast CCTs, 366 patients were found to be eligible for CCT, and some patients were eligible for multiple CCTs. There were 431 total offers to CCTs (Fig. 1 ).
We identified 3043 new patients seen at DLDCC clinical sites between 5/2015 and 9/2021, 366 of whom were eligible for CCT. The majority of patients at each site were eligible for neoadjuvant trials and patients were often eligible for more than one CCT.
The patient demographics of the 3043 new patients seen at the DLDCCC and screened for breast CCT eligibility from 5/2015 to 7/2021 are shown in Table 1 . Notably, 50% of these patients were White at BSLMC in comparison to 11% at SC, and 74% listed English as their primary language at BSLMC versus 47% at SC. Patients at SC were on average younger, and more frequently presenting with TNBC compared to BSLMC.
More patients at SC had a trial available to them (752/1400, 53.7%) versus at BSLMC (734/1643, 44.7%, p -value < 0.001) (Table 1 ). Patients at SC were also more likely to be eligible for CCTs (191/1400, 13.6%) than patients at BSLMC (175/1643, 10.7%, p -value = 0.011).
Despite higher eligibility, patients at SC were less likely to accept these CCT offers (74/191 accepted, 38.7%) than patients at BSLMC (106/175 accepted, 60.6%, p-value < 0.001) (Table 1 ). This difference in acceptance was significant on univariate but not multivariate analysis (Table 2 ). Age was not found to be significantly associated with trial enrollment.
Univariate analysis of the patients showed that Black patients, Hispanic/Latino patients, and Spanish speaking patients were significantly more likely to decline CCT participation. However, on overall multivariate analysis, only the Black patient category was associated with significantly higher rate of enrollment refusal (odds ratio (OR) = 0.26, 95% CI 0.12–0.56, p < 0.001). (Table 2 ) On the multivariate analyses across the two clinical sites, patients were significantly more likely to accept biobanking trials than other trial types at SC (OR = 16.90, 95% CI 2.13–363.77, p = 0.018) and at BSLMC (OR = 20.10, 95% CI 3.37–395.53, p -value = 0.007). Patients at SC were also more likely to enroll into preventive trials (OR = 7.88, 95% CI 1.53–59.39, p-value = 0.020). Primary language was not found to be a determining factor in trial enrollment or refusal at either site. Black patients at BSLMC were less likely to enroll in CCTs (OR = 0.20, 95% CI 0.060–0.60, p = 0.006) on multivariate analysis, however this was not significant on multivariate analysis in the SC subset (OR = 0.41, 95% CI 0.11–1.53, p = 0.180). (Supplemental Tables 1 and 2 ).
While patients at SC have equal opportunity when it comes to access to clinical trials, trial enrollment is only at 37% in this clinic site, compared to BSLMC clinic where over 61% of trial eligible patients consent to enrollment. Overall, Black patients were less likely to consent to trial enrollment, and the rate of trial refusal was lowest for biobanking trials across both sites and preventive trials at SC compared to other trials. Speaking a primary language other than English was not found to be a major barrier to enrollment in our population.
Our data shows that it is likely an oversimplification to assume that equal access will lead to a complete elimination of CCT disparities. At SC, which serves a more diverse population with a higher percentage of low income and uninsured patients, the patients were significantly more eligible for breast CCTs. As noted in Table 1 , patients at SC had a higher rate of TNBC (29.3% versus 14.3% at BSLMC), which we hypothesize may be one reason for the higher rate of eligibility. More SC patients were eligible for neoadjuvant trials and biobanking trials than at BSLMC (Fig. 1 ), possibly also due to the higher TNBC rates in this population.
Despite the higher rate of eligibility at SC, these patients were significantly more likely to decline the CCT. Our findings in a highly racially and ethnically diverse patient population supports the literature that shows that Black patients are less likely to agree to participate in clinical trials. The causes of discrepancy between eligibility and enrollment are multifactorial and complex. Studies have shown equal willingness among patients of different races to participate in clinical trials 14 , 15 , 16 , 17 , yet disparities in enrollment persist. In 2008, Ford 18 identified three categories of reasons for low accrual: awareness, opportunity, and acceptance/refusal barriers or promoters.
Awareness barriers include lack of knowledge about the purpose and availability of CCTs 13 , 19 . Cancer health literacy has been found in some studies to be significantly lower in Black patients 14 , 20 , 21 , though others found that the role of factual knowledge did not make a significant difference in accrual 22 . The FDA in 2020 published guidelines and potential approaches to increase the diversity of clinical trial populations, including making diversity of enrollment a priority, involving the community, and educating potential participants 23 . When CCTs do not recruit a diverse patient population and fail to be made available to racial or ethnic minorities 17 , the results cannot be assumed to be generalizable to the community at large.
Opportunity barriers include limitations due to socioeconomic status and ineligibility. Research has shown us that CCT participants are less likely to be Black and more likely to be of a higher socioeconomic status 24 , 25 , 26 , 27 , 28 . Black patients are more likely to be deemed ineligible for clinical trials 13 , 16 , 29 . This is partially due to a higher rate of comorbidities such as hypertension, vision loss, or diabetes, as well as benign neutropenia—a condition that has not been shown to increase risk of infection 16 . However, studies have also shown that Black Americans are more likely to be deemed ineligible due to perceived noncompliance or mental status, and that subjective judgements on eligibility more often favor White patients 29 .
Barriers to acceptance include an understandable mistrust in a medical system that has historically caused harm to people of color, perceived financial burden, logistical difficulties including transportation, and family or cultural pressures. When Black American patients are asked about their reasons for opting out of CCT, studies show us that a lack of trust is one of the most common factors influencing their decision 13 , 22 . Barriers relating to logistics or finances are seen more often in safety-net hospitals and clinics 19 .
At BSLMC, where the population is less diverse, we noted a difference in CCT enrollment by race in multivariate analysis. However, this finding was not significant at SC, which has a more diverse population (Supplemental Tables 1 and 2 ). This is an interesting exploratory finding that can be further elucidated in future studies but may point to more diverse clinic experiences encouraging CCT enrollment. This could be due to higher trust in the clinic, awareness of clinical trials, or physicians offering trials more equitably. A limitation of our finding is the low number of patients in each category, and future studies would need to clarify these findings with a larger patient population.
Although it is imperative that we continue to shine a light on these important issues, we must be ready to envision and enact both local and national policy changes. Moving forward, we are focusing on community engagement, patient education, and dialogue with our patients to explore specific interventions designed to improve our Black patient population’s views of trial enrollment. Interventions have been attempted around the country to varying levels of success, including patient navigation systems 30 , 31 , 32 , patient education videos 12 , 13 , the recent ACCURE trial which included multiple levels of intervention including electronic medical record changes and specific physician roles 33 , diversifying staff, ensuring trial resources are in multiple languages, and offering financial incentives 23 . The reason for trial refusal was unfortunately not uniformly captured in the clinical trial database nor in patients’ electronic medical records. This is a limitation of our study; we do not have specific patients’ refusal reasons. In a follow up study that is currently being conducted, we have designed a patient education intervention to collect specific information on patients’ attitudes towards clinical trial enrollment and refusal. This follow-up study will serve as a roadmap for designing patient and community targeted outreach programs to improve our trial enrollment.
Cancer clinical trials have maintained restrictive eligibility criteria that inevitably censor out a large population of patients 34 . It is crucial that efforts continue on all fronts to improve cancer clinical trial diversity, including clinical trial design and challenging long-standing beliefs on eligibility criteria. Prevention and treatment alike need to be considered when designing an equitable future for cancer care, and as others have shown, these efforts must include consideration of structural and institutional inequities as well as social needs. Research and data collection are only the first steps in a necessary journey toward equity in cancer care.
This is a retrospective cohort study of new patients seen from 5/2015 to 9/2021, which included 3043 patients screened for breast CCTs at DLDCCC clinical sites. The populations receiving care at the two clinical sites differ greatly. At SC, half of the patients earn less than $25,000 annually, 60% are uninsured and use a county financial assistance program known as the “Gold Card,” and 65% are not proficient in English. Fifty-nine percent of SC patients self-identify as Hispanic and 29% self-identify as Black, with White patients making up 10% of the population. At BSLMC, over 95% of the patients have federal and commercial insurance and 68% are White, 13% are Black, and 3% are Hispanic. We collected information on age at the time of screening for CCTs, patient-reported race, and primary spoken language.
The study was conducted according to the ethical guidelines set forth in the Declaration of Helsinki and in concordance with the Heath Insurance Portability and Accountability Act. The study was approved by the institutional review board (IRB) of Baylor College of Medicine. The requirement of patient informed consent was waived by the IRB as the data was deidentified prior to analysis.
DLDCCC is an active participant in several cooperative group consortia including the Translational Breast Cancer Research Consortium (TBCRC), Southwest Oncology Group (SWOG), and NRG oncology (from the parental organizations of NSABP, RTOG, and GOG), and it is frequently the leading site for national investigator initiated clinical trials (IIT). We collected information on whether there were trials available for the patients’ diagnosis, trial eligibility, and the type of trial the patients were screened for. We designated 5 categories of trials based on the intent of the trial (e.g., scalp cooling trial to prevent chemotherapy-associated hair loss) and the stage of therapy that the trial was offered (e.g., COMPASS RD, an adjuvant trial). The categories were preventive, neoadjuvant, adjuvant, metastatic, and biobanking. The same CCTs were open at both sites under the DLDCCC.
We first performed Chi-square tests to determine whether there were differences in trial availability, trial eligibility, and trial acceptance rate according to DLDCCC clinical sites. We then performed univariate and multivariate logistic regression to evaluate whether differences in age, clinic site, race, trial type, and primary language may be underlying the observed differences in CCT enrollment rates. We performed logistic regression on the overall dataset as well as by clinic. We calculated odds ratios with 95% confidence intervals to measure the strength of association between the predictors and enrollment. P-values less than 0.05 were considered statistically significant. Analysis was performed using R version 4.1.0.
The participants in this study did not give written consent for their data to be shared. Due to the clinical nature of the dataset, it is not available publicly.
Singh, G. K. & Jemal, A. Socioeconomic and racial/ethnic disparities in cancer mortality, incidence, and survival in the United States, 1950-2014: over six decades of changing patterns and widening inequalities. J. Environ. Public Health 2017 , 2819372 (2017).
Article PubMed PubMed Central Google Scholar
USCS Data Visualizations - CDC. https://gis.cdc.gov/cancer/USCS/#/AtAGlance/ .
Reports on Cancer - Cancer Statistics. https://seer.cancer.gov/statistics/reports.html .
Siegel, R. L., Miller, K. D., Wagle, N. S. & Jemal, A. Cancer statistics, 2023. CA Cancer J. Clin. 73 , 17–48 (2023).
Article PubMed Google Scholar
AACR Cancer Disparities Progress Report 2022 Cancer Progress Report. https://cancerprogressreport.aacr.org/disparities/ .
Hossain, F. et al. Neighborhood social determinants of triple negative breast cancer. Front Public Health 7 , 18 (2019).
Wang, F. et al. Racial/ethnic disparities in all-cause mortality among patients diagnosed with triple-negative breast cancer. Cancer Res. 81 , 1163–1170 (2021).
Article CAS PubMed Google Scholar
Clarke, C. A. et al. Age-specific incidence of breast cancer subtypes: understanding the black-white crossover. J. Natl Cancer Inst. 104 , 1094–1101 (2012).
Murthy, V. H., Krumholz, H. M. & Gross, C. P. Participation in cancer clinical trials: race-, sex-, and age-based disparities. JAMA 291 , 2720–2726 (2004).
Chen, M. S., Lara, P. N., Dang, J. H. T., Paterniti, D. A. & Kelly, K. Twenty years post-NIH Revitalization Act: enhancing minority participation in clinical trials (EMPaCT): laying the groundwork for improving minority clinical trial accrual: renewing the case for enhancing minority participation in cancer clinical trials. Cancer 120 , 1091–1096 (2014).
Al Hadidi, S., Mims, M., Miller-Chism, C. N. & Kamble, R. Participation of African American persons in clinical trials supporting U.S. food and drug administration approval of cancer drugs. Ann. Intern. Med. 173 , 320–321 (2020).
Awidi, M. & Al Hadidi, S. Participation of black Americans in cancer clinical trials: current challenges and proposed solutions. JCO Oncol. Pr. 17 , 265–271 (2021).
Article Google Scholar
Swaby, J., Kaninjing, E. & Ogunsanya, M. African American participation in cancer clinical trials. Ecancermedicalscience 15 , 1307 (2021).
Echeverri, M. et al. Cancer health literacy and willingness to participate in cancer research and donate bio-specimens. Int. J. Environ. Res. Public Health 15 , 2091 (2018).
Byrne, M. M., Tannenbaum, S. L., Glück, S., Hurley, J. & Antoni, M. Participation in cancer clinical trials: why are patients not participating? Med Decis. Mak. 34 , 116–126 (2014).
Langford, A. T. et al. Racial/ethnic differences in clinical trial enrollment, refusal rates, ineligibility, and reasons for decline among patients at sites in the National Cancer Institute’s Community Cancer Centers Program. Cancer 120 , 877–884 (2014).
Wendler, D. et al. Are racial and ethnic minorities less willing to participate in health research? PLoS Med. 3 , 0201–0210 (2006).
Google Scholar
Ford, J. G. et al. Barriers to recruiting underrepresented populations to cancer clinical trials: a systematic review. Cancer 112 , 228–242 (2008).
Hernandez, N. D. et al. African American cancer survivors’ perspectives on cancer Clinical trial participation in a safety-net hospital: considering the role of the social determinants of health. J. Cancer Educ. 37 , 1589–1597 (2022).
Wong, S. T. et al. Using visual displays to communicate risk of cancer to women from diverse race/ethnic backgrounds. Patient Educ. Couns. 87 , 327–335 (2012).
Watson, M. et al. Family history of breast cancer: what do women understand and recall about their genetic risk? J. Med. Genet. 35 , 731–738 (1998).
Article CAS PubMed PubMed Central Google Scholar
Meng, J., McLaughlin, M., Pariera, K. & Murphy, S. A comparison between Caucasians and African Americans in willingness to participate in cancer clinical trials: the roles of knowledge, distrust, information sources, and religiosity. J. Health Commun. 21 , 669–677 (2016).
Enhancing the Diversity of Clinical Trial Populations—Eligibility Criteria, Enrollment Practices, and Trial Designs Guidance for Industry | FDA. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/enhancing-diversity-clinical-trial-populations-eligibility-criteria-enrollment-practices-and-trial .
El-Rayes, B. F. et al. Impact of race, age, and socioeconomic status on participation in pancreatic cancer clinical trials. Pancreas 39 , 967–971 (2010).
Sharrocks, K., Spicer, J., Camidge, D. R. & Papa, S. The impact of socioeconomic status on access to cancer clinical trials. Br. J. Cancer 111 , 1684–1687 (2014).
Kwiatkowski, K., Coe, K., Bailar, J. C. & Swanson, G. M. Inclusion of minorities and women in cancer clinical trials, a decade later: Have we improved? Cancer 119 , 2956–2963 (2013).
Unger, J. M. et al. Patient income level and cancer clinical trial participation. J. Clin. Oncol. 31 , 536–542 (2013).
Noor, A. M. et al. Effect of patient socioeconomic status on access to early-phase cancer trials. J. Clin. Oncol. 31 , 224–230 (2013).
Penberthy, L. et al. Barriers to therapeutic clinical trials enrollment: differences between African-American and white cancer patients identified at the time of eligibility assessment. Clin. Trials 9 , 788–797 (2012).
Battaglia, T. A. et al. Translating research into practice: Protocol for a community-engaged, stepped wedge randomized trial to reduce disparities in breast cancer treatment through a regional patient navigation collaborative. Contemp. Clin. Trials 93 , 106007 (2020).
Cartmell, K. B. et al. Patient barriers to cancer clinical trial participation and navigator activities to assist. Adv. Cancer Res. 146 , 139–166 (2020).
Fouad, M. N. et al. Patient Navigation As a Model to Increase Participation of African Americans in Cancer Clinical Trials. J. Oncol. Pr. 12 , 556–563 (2016).
The Lancet Oncology. Racial disparities in cancer care: can we close the gap? Lancet Oncol. 22 , 1643 (2021).
Liu, R. et al. Evaluating eligibility criteria of oncology trials using real-world data and AI. Nature 592 , 629–633 (2021).
Download references
This work was not funded. This work has previously been presented in part in poster format at the San Antonio Breast Cancer Symposium in December 2021 and the ASCO QI symposium in October 2022, as well as in an online-only abstract at the 2022 ASCO meeting in June 2022. We would like to acknowledge and thank all the patients whose deidentified data was used in this study.
Authors and affiliations.
Baylor College of Medicine, Lester and Sue Smith Breast Center, Houston, TX, USA
Emily L. Podany, Shaun Bulsara, Katherine Sanchez, Kristen Otte, Matthew J. Ellis & Maryam Kinik
Washington University in St. Louis, St. Louis, MO, USA
Emily L. Podany
The Institute for Proteogenomic Discovery, Houston, TX, USA
Matthew J. Ellis
You can also search for this author in PubMed Google Scholar
E.P. performed data collection, data cleaning, data analysis, and wrote the manuscript. S.B. performed data analysis and reviewed the manuscript. K.S. performed data cleaning and reviewed the manuscript. K.O. performed data collection and reviewed the manuscript. M.E. performed data collection and reviewed the manuscript. M.K. performed data collection, data cleaning, study planning, IRB submission, data analysis, and reviewed the manuscript.
Correspondence to Emily L. Podany .
Competing interests.
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplemental tables 1 and 2, rights and permissions.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .
Reprints and permissions
Cite this article.
Podany, E.L., Bulsara, S., Sanchez, K. et al. Breast cancer clinical trial participation among diverse patients at a comprehensive cancer center. npj Breast Cancer 10 , 70 (2024). https://doi.org/10.1038/s41523-024-00672-0
Download citation
Received : 01 September 2023
Accepted : 10 July 2024
Published : 03 August 2024
DOI : https://doi.org/10.1038/s41523-024-00672-0
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Sign up for the Nature Briefing: Cancer newsletter — what matters in cancer research, free to your inbox weekly.
IMAGES
COMMENTS
The paper starts by reviewing public datasets related to breast cancer diagnosis. ... To this end, Patil et al. took a multi-instance learning approach in a weakly supervised manner for the classification of breast cancer histology images. As shown in Figure 7, each input image is partitioned into multiple smaller patches. Feeding these patches ...
Traditionally, classification of breast cancer patients into those with TNBC and non-TNB has been largely determined by immunohistochemical staining [16,17]. ... As noted above, the research content of this paper was based on a binary classification model with application to pattern recognition classification problem . Under this approach 90% ...
Breast cancer incidence and death rates have increased over the last three decades. Between 1990 and 2016 breast cancer incidence has more than doubled in 60/102 countries (e.g., Afghanistan, Philippines, Brazil, Argentina), whereas deaths have doubled in 43/102 countries (e.g., Yemen, Paraguay, Libya, Saudi Arabia) .
According to World Health Organization (WHO) reports, the cancer death ratio is as high as 9.2 million for lung cancer, 1.7 million for skin cancer, and 627,000 for breast cancer 3,4.
Breast cancer is the most common cancer in women. Classification of cancer/non-cancer patients with clinical records requires high sensitivity and specificity for an acceptable diagnosis test. The ...
Breast cancer classification using machine learning. Abstract: During their life, among 8% of women are diagnosed with Breast cancer (BC), after lung cancer, BC is the second popular cause of death in both developed and undeveloped worlds. BC is characterized by the mutation of genes, constant pain, changes in the size, color (redness), skin ...
1. Set learning rate to 10 −3 and train the last layer for 3 epochs. 2. Set learning rate to 10 −4, unfreeze the top layers and train for 10 epochs, where the top layer number is set to 46 for ...
Hamed et al. proposed using machine learning-based models in their research work for the classification task of breast cancer. They claimed that the physicians' average accuracy obtained in the detection and classification of breast cancer is around 79%, while the accuracy obtained by their proposed model is 91% [17]. Tiwari et al. used the ...
Breast cancer is the most common and deadly type of cancer in the world. Based on machine learning algorithms such as XGBoost, random forest, logistic regression, and K-nearest neighbor, this paper establishes different models to classify and predict breast cancer, so as to provide a reference for the early diagnosis of breast cancer.
The main objective of this research paper is to predict and diagnosis breast cancer, using machine-learning algorithms, and find out the most effective whit respect to confusion matrix, accuracy and precision. ... demonstrates the use of various supervised machine learning algorithms in classification of breast cancer from using 3D images and ...
Convolutional Neural Network (CNN) models are a type of deep learning architecture introduced to achieve the correct classification of breast cancer. This paper has a two-fold purpose. The first aim is to investigate the various deep learning models in classifying breast cancer histopathology images. This study identified the most accurate models in terms of the binary, four, and eight ...
The World Health Organization (WHO) series on the Classification of Tumours (also known as the WHO Blue Books) is regarded as the gold standard for the diagnosis of tumours and provide indispensable international standards for classification of breast tumours worldwide. 20 The 5 th edition (2019) WHO Classification of Breast Tumours continues ...
Breast cancer is the most common malignant tumor in women in the world. Breast cancer patients account for as much as 36% of oncological patients. An estimated 2.089 million women were diagnosed with breast cancer in 2018 [, ]. The incidence of this malignant tumor is increasing in all regions of the world, but the highest incidence occurs in ...
Therefore, the current pathological report of breast carcinoma should include the histopathological classi fication of the tumors [1] and their histopathological grade, and the immunohistochemical parameters (ER, PgR, HER2 and Ki67) that would allow the treating physicians to tailor properly the systemic interventions.
Breast cancer (BC) is a heterogeneous disease, encompassing a diverse spectrum of tumours with varying morphological, biological, and clinical phenotypes. Although tumours may show phenotypic overlap, they often display different biological behaviour and response to therapy. Advances in high-throughput molecular techniques and bioinformatics ...
Artificial intelligence (AI) has been utilized for diagnosis early, rapidly, and accurately breast tumors. The objective of this paper is to review recent studies for classifying these tumors ...
According to Global Cancer Statistics 2020, breast cancer is the most common malignancy and the primary cause of cancer-related mortalities in the female population worldwide 1.Specifically, 2.26 ...
To train this model, they used a learning rate of 0.01 and 60 epochs. Transfer learning is effective in detecting breast cancer by categorizing mammogram images of the breast with general accuracy, sensitivity, specificity, precision, F-score, and accuracy of 98.96, 97.83, 99.13, 97.35, 97.6.%, and 95%, respectively.
According to the world health organization (WHO) Breast cancer is the most frequent cancer among women, impacting 2.1 million women each year, and also causes the greatest number of cancer-related ...
Conclusion Breast cancer is the important field of research and technology helps to reduce mortality rate caused by breast cancer. ... we put forth a model for the classification of data relating to breast cancer. In this paper we applied six ML classification techniques: DT, KNN, SVM, RF, NB, and LR along with ensemble techniques on WDBC ...
Machine learning plays an important role in cancer prediction, this paper realizes the prediction of breast cancer classification by constructing the Extreme Gradient Boosting (XGBoost) algorithm, and compares and analyzes it with other algorithms, and comprehensively applies several indexes such as Accuracy, Precision, F1_Score, Hamming_Loss, etc., for the evaluation of the algorithm ...
The experiment demonstrates the effectiveness of the proposed multi-model scheme with meta-learning for breast cancer classification. Breast cancer is a severe health concern, and research efforts focus on improving diagnosis and treatment. ML and DL techniques have shown great potential in accurately diagnosing breast cancer [1,17]. The ...
Breast Cancer is more common hence, identification of BC and detection of region of breast affected is more important. Mammography screening images two views CC and MLO are widely use in diagnosis process. This paper presents the method to detect cancer region and classify normal and cancerous patient. Pre-processing operation perform on the input Mammogram image and undesirable part removed ...
Black cancer patients in the United States have both increased overall cancer mortality and increased cancer-specific mortality 1,2,3. In breast cancer, Black women have a 41% higher risk of dying ...