×
Home Current Archive Editorial board
News Contact
Original scientific article

BILEVEL OPTIMIZED RECURSIVE FEATURE ELIMINATOR FOR CERVICAL CANCER FEATURE SELECTION PROCESS

By
S. Nandhinieswari Orcid logo ,
S. Nandhinieswari

Kongunadu Arts and Science College India

A. Indumathi Orcid logo
A. Indumathi

Kongunadu Arts and Science College (Autonomous) India

Abstract

An immense need has emerged in several areas of biological area for the development of prediction algorithms capable of managing the increasing complexity of high-dimensional information. In developing countries, Cervical Cancer (CC) kills more women than any other disease or accident, and it's the top cause of death among women worldwide. Early detection and treatment lead to improved results and longer patient survival, which in turn reduces cancer mortality. For the majority of real-world data science problems, not all dataset variables are useful for building models. The accuracy of a classifier and the model's ability to generalize are both reduced by repeated variables. Furthermore, adding more variables increases the overall complexity of a model. Deep learning's feature selection approach is a good fit for this issue. When it comes to selecting features for linear regression, our novel Bilevel Optimized Recursive Feature Eliminator (BORFE) method represents a revolutionary development. Finding the optimal fit for a model and eliminating its most undesirable features is the objective of this innovative feature selection method. This study presents a novel cross-validation approach using a Bilevel Optimization–Based -Recursive Feature Extractor (BORFE) to perform an in-depth analysis of the  hyper-parameters. When used with cross-validation, RFE finds the optimal number of features and the optimum selection of ranking features. According to the evaluation metrics, BORFE performs better than the other conventional algorithms when it comes to FS on cervical cancer datasets. For the cervical cancer dataset, this shows that BORFE can solve FS problems successfully

References

1.
Bi J, Bennett K, Embrechts M, Breneman C, Song M. Dimensionality reduction via sparse support vector machines. Journal of Machine Learning Research. 2003;3(Mar):1229–43.
2.
Chicco D, Rovelli C. Computational prediction of diagnosis and feature selection on mesothelioma patient health records. PLOS ONE. 14(1):e0208737.
3.
Rekha G, Tyagi AK, Reddy VK. A wide scale classification of class imbalance problem and its solutions: a systematic literature review. Journal of Computer Science. 2019;15(7):886–929.
4.
Fotouhi S, Asadi S, Kattan MW. A comprehensive data level analysis for cancer diagnosis on imbalanced data. Journal of Biomedical Informatics. 2019;90:103089.
5.
A Compact Circular Polarized Antenna For Fixed Communication Applications. National Journal Of Antennas And Propagation. 2019;01(01).
6.
Karegowda AG, Jayaram MA, Manjunath AS. Feature subset selection problem using wrapper approach in supervised learning. International journal of Computer applications. 2010;Feb;1(7):13-7.
7.
Tabatabay AM, Tabababay AS. STRATEGY OPTIMIZATION FOR RESPONDING TO PRIMARY, SECONDARY AND RESIDUAL RISKS CONSIDERING COST AND TIME DIMENSIONS IN PETROCHEMICAL PROJECTS. Archives for Technical Sciences. 2022;2(27):33–48.
8.
Efroymson M. Stepwise regression–a backward and forward look. In: Eastern Regional Meetings of the Institute of Mathematical Statistics 1966 Apr 27. p. 27–9.
9.
Abinaya R, Vidhya S, Vadivel S. Latent Palm Print Matching Based on Minutiae Features for Forensic Applications. International Journal of communication and computer Technologies. 2014;2(2):85–7.
10.
Parikh D, Menon V. Machine Learning Applied to Cervical Cancer Data. International Journal of Mathematical Sciences and Computing. 2019;5(1):53–64.
11.
Fernandes K, Cardoso JS, Fernandes J. Transfer Learning with Partial Observability Applied to Cervical Cancer Screening. Lecture Notes in Computer Science. 2017. p. 243–50.
12.
Chicco D. Ten quick tips for machine learning in computational biology. BioData Mining. 2017;10(1).
13.
Sihag V, Vardhan M, Singh P, Choudhary G, Son S. De-LADY: Deep learning based Android malware detection using Dynamic features. Journal of Internet Services and Information Security. 11(2):34–45.
14.
Lanckriet GR, Cristianini N, Bartlett P, Ghaoui LE, Jordan MI. Learning the kernel matrix with semidefinite programming. Journal of Machine learning research. 2004;5(Jan):27–72.
15.
Park M, Kim S, Kim J. Research on Note-Taking Apps with Security Features. Journal of Wireless Mobile Networks, Ubiquitous Computing, and Dependable Applications. 11(4):63–76.
16.
Nithya B, Ilango V. Evaluation of machine learning based optimized feature selection approaches and classification methods for cervical cancer prediction. SN Applied Sciences. 2019;1(6).
17.
CAMGÖZLÜ Y, KUTLU Y. Leaf Image Classification Based on Pre-trained Convolutional Neural Network Models. Natural and Engineering Sciences. 8(3):214–32.
18.
Choudhury A, Wesabi YM, Won D. Classification of cervical cancer dataset.
19.
Hastie T, Tibshirani R, Tibshirani RJ. Extended comparisons of best subset selection, forward stepwise selection, and the lasso. 2017;
20.
Bertsimas D, King A, Mazumder R. Best subset selection via a modern optimization lens. The Annals of Statistics. 2016;44(2).
21.
Meinshausen N. Relaxed Lasso. Computational Statistics & Data Analysis. 2007;52(1):374–93.
22.
Momma M, Bennett KP. A Pattern Search Method for Model Selection of Support Vector Regression. Proceedings of the 2002 SIAM International Conference on Data Mining. 2002.
23.
Evgeniou T, Pontil M. Regularized multi--task learning. Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. 2004. p. 109–17.
24.
Mustapa M, Rahmah U, Cakranegara PA, Firdaus W, Pratama D, Rahim R. Implementation of Feature Selection and Data Split using Brute Force to Improve Accuracy. Journal of Wireless Mobile Networks, Ubiquitous Computing, and Dependable Applications. 2023;14(1):50–9.
25.
Abdoh SF, Abo Rizka M, Maghraby FA. Cervical Cancer Diagnosis Using Random Forest Classifier With SMOTE and Feature Reduction Techniques. IEEE Access. 2018;6:59475–85.
26.
Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Computational and Structural Biotechnology Journal. 2015;13:8–17.
27.
Bischl B, Lang M, Kotthoff L, Schiffner J, Richter J, Studerus E, et al. Machine learning in RJ Mach. Learn Res. 2016;17:5938–42.
28.
Mohandas DR, Veena DS, Kirubasri G, Mary IT, Udayakumar DR. Federated Learning with Homomorphic Encryption for Ensuring Privacy in Medical Data. Indian Journal of Information Sources and Services. 2024;14(2):17–23.
29.
Guyon I, Weston J, Barnhill S, Vapnik V. Machine Learning. 2002;46(1/3):389–422.
30.
Tibshirani R. Regression Shrinkage and Selection Via the Lasso. Journal of the Royal Statistical Society Series B: Statistical Methodology. 1996;58(1):267–88.
31.
Elshrkawey M, Alalfi M, Al-Mahdi H. An enhanced intrusion detection system based on multi-layer feature reduction for probe and dos attacks. Journal of Internet Services and Information Security (JISIS. 2021;(v;11(4):40-57).
32.
Ong CS, Smola A, Williamson R. Learning the kernel with hyperkernels.
33.
Caruana R. Machine Learning. 1997;28(1):41–75.
34.
Organization WH. Fact sheet: human-papillomavirus. 2019;
35.
Hastie T, Rosset S, Tibshirani R, Zhu J. The entire regularization path for the support vector machine. Journal of Machine Learning Research. 2004;5(Oct):1391–415.
36.
Guyon I, Weston J, Barnhill S, Vapnik V. Machine Learning. 2002;46(1/3):389–422.
37.
Guyon I, Weston J, Barnhill S, Vapnik V. Machine Learning. 2002;46(1/3):389–422.
38.
Vapnik V. Statistical learning theory. Vol. 2. 1998.
39.
Trivedi J, Devi MS, Solanki B. STEP TOWARDS INTELLIGENT TRANSPORTATION  SYSTEM WITH VEHICLE CLASSIFICATION AND  RECOGNITION USING SPEEDED-UP ROBUST FEATURES. Archives for Technical Sciences. 2023;1(28):39–56.
40.
Lavanya D, Rani DK. Analysis of feature selection with classification: Breast cancer datasets. Indian Journal of Computer Science and Engineering (IJCSE. 2011;Oct;2(5):756-63.
41.
Latha DS, Lakshmi PV, Fathima S. Staging prediction in cervical cancer patients–a machine learning approach. International Journal of Innovative Research and Practices. 2014;2(2):14–23.
42.
Akyol K. A Study on Test Variable Selection and Balanced Data for Cervical Cancer Disease. International Journal of Information Engineering and Electronic Business. 2018;10(5):1–7.
43.
Devi G, Kavitha M, Surendar A. High Speed Image Searching for Human Gait Feature Selection. International Journal of communication and computer Technologies. 2016;4(2):88–95.
44.
Desing of VLSI Architecture for a flexible testbed of Artificial Neural Network for training and testing on FPGA. Journal of VLSI circuits and systems. 2024;6(1).

Citation

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. 

Article metrics

Google scholar: See link

The statements, opinions and data contained in the journal are solely those of the individual authors and contributors and not of the publisher and the editor(s). We stay neutral with regard to jurisdictional claims in published maps and institutional affiliations.