Optimized Decision Tree and Black Box Learners for Revealing Genetic Causes of Bladder Cancer

dc.authorid0000-0002-1030-3545en_US
dc.authorscopusid24491277000en_US
dc.authorwosidJAN-6981-2023en_US
dc.contributor.authorYücebaş, Sait Can
dc.date.accessioned2024-01-16T11:59:45Z
dc.date.available2024-01-16T11:59:45Z
dc.date.issued2023en_US
dc.departmentFakülteler, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
dc.description.abstractThe number of studies in the literature that diagnose cancer with machine learning using genome data is quite limited. These studies focus on the prediction performance, and the extraction of genomic factors that cause disease is often overlooked. However, finding underlying genetic causes is very important in terms of early diagnosis, development of diagnostic kits, preventive medicine, etc. The motivation of our study was to diagnose bladder cancer (BCa) based on genetic data and to reveal underlying genetic factors by using machine-learning models. In addition, conducting hyperparameter optimization to get the best performance from different models, which is overlooked in most studies, was another objective of the study.Within the framework of these motivations, C4.5, random forest (RF), artificial neural networks (ANN), and deep learning (DL) were used. In this way, the diagnostic performance of decision tree (DT)-based models and black box models on BCa was also compared. The most successful model, DL, yielded an area under the curve (AUC) of 0.985 and a mean square error (MSE) of 0.069. For each model, hyper-parameters were optimized by an evolutionary algorithm. On average, hyper-parameter optimization increased MSE, root mean square error (RMSE), LogLoss, and AUC by 30%, 17.5%, 13%, and 6.75%, respectively. The features causing BCa were extracted. For this purpose, entropy and Gini coefficients were used for DT-based methods, and the Gedeon variable importance was used for black box methods. The single nucleotide polymorphisms (SNPs) rs197412, rs2275928, rs12479919, rs798766 and rs2275928, whose BCa relations were proven in the literature, were found to be closely related to BCa. In addition, rs1994624 and rs2241766 susceptibility loci were proposed to be examined in future studies.en_US
dc.identifier.citationYücebaş, S.C. (2023). Optimized Decision Tree and Black Box Learners for Revealing Genetic Causes of Bladder Cancer. Intelligent Automation & Soft Computing, 37(1), 49–71. https://doi.org/10.32604/iasc.2023.036871en_US
dc.identifier.doi10.32604/iasc.2023.036871
dc.identifier.endpage71en_US
dc.identifier.issn1079-8587
dc.identifier.issn2326-005X
dc.identifier.issue1en_US
dc.identifier.scopus2-s2.0-85160436865
dc.identifier.startpage49en_US
dc.identifier.urihttps://doi.org/10.32604/iasc.2023.036871
dc.identifier.urihttps://hdl.handle.net/20.500.12428/5257
dc.identifier.volume37en_US
dc.identifier.wosWOS:000993115400004
dc.identifier.wosqualityQ3
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.institutionauthorYücebaş, Sait Can
dc.language.isoen
dc.publisherTech Science Pressen_US
dc.relation.ispartofIntelligent Automation and Soft Computingen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.rightsAttribution 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/us/*
dc.subjectBladder canceren_US
dc.subjectDeep learningen_US
dc.subjectHyper-parameter optimizationen_US
dc.subjectNeural networken_US
dc.subjectRandom foresten_US
dc.subjectSingle nucleotide polymorphismen_US
dc.titleOptimized Decision Tree and Black Box Learners for Revealing Genetic Causes of Bladder Cancer
dc.typeArticle

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
Sait_Can_Yucebas_Makale.pdf
Boyut:
735.81 KB
Biçim:
Adobe Portable Document Format
Açıklama:
Araştırma Makalesi
Lisans paketi
Listeleniyor 1 - 1 / 1
[ X ]
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: