Genetic Algorithm for Biomarker Search Problem and Class Prediction

Full Text (PDF, 508KB), PP.47-55

Views: 0 Downloads: 0


Shabia Shabir Khan 1,* S.M.K. Quadri 2 M.A. Peer 2

1. Department of Computer Science, Research Scholar, University of Kashmir, Srinagar, India

2. Department of Computer Science, Faculty of Computer Science, University of Kashmir, Srinagar, India

* Corresponding author.


Received: 17 Nov. 2015 / Revised: 10 Feb. 2016 / Accepted: 18 Apr. 2016 / Published: 8 Sep. 2016

Index Terms

Genetic Algorithm (GA), Artificial Neural Network (ANN), Fitness Function, Feature Selection, Classification


In the field of optimization, Genetic Algorithm that incorporates the process of evolution plays an important role in finding the best solution to a problem. One of the main issues that arise in the medical field is to search a finite number of factors or features that actually affect or predict the survival of the patients especially with poor prognosis disease, thus helping them in early diagnosis. This paper discusses the various steps that are performed in genetic algorithm and how it is going to help in extracting knowledge out of high dimensional medical dataset. The more the attributes or features, the more difficult it is to correctly predict the class of that sample or instance. This is because of inefficient, useless, noisy attributes in the dataset. So, here the main aim is to search the strong features or genes that can strongly predict the class of subject (patient) i.e. healthy or cancerous and thus help in early detection and treatment.

Cite This Paper

Shabia Shabir Khan, S.M.K. Quadri, M.A. Peer, "Genetic Algorithm for Biomarker Search Problem and Class Prediction", International Journal of Intelligent Systems and Applications (IJISA), Vol.8, No.9, pp.47-55, 2016. DOI:10.5815/ijisa.2016.09.06


[1]Kampouropoulos, Konstantinos, et al. "A combined methodology of adaptive neuro-fuzzy inference system and genetic algorithm for short-term energy forecasting.Advances in Electrical and computer engineering”. Volume 14, number 1 (2014).
[2]Tahmasebi, Pejman, and Ardeshir Hezarkhani. "A hybrid neural networks-fuzzy logic-genetic algorithm for grade estimation." Computers & geosciences 42 (2012): 18-27.
[3]Fakhreddine O. Karray, Clarence De Silva, “Soft Computing and Intelligent Systems Design- Theory, Tools and Applications”, Pearson Education, 2009.
[4]S.N.Sivanandam, S.N.Deepa, “Principles of Soft Computing”, Wiley India Edition,2007
[5]Hanafy, Tharwat OS. "A modified algorithm to model highly nonlinear system."J Am Sci 6.12 (2010): 747-759.
[6]Ge, Shuzhi Sam, and Cong Wang. "Adaptive neural control of uncertain MIMO nonlinear systems." Neural Networks, IEEE Transactions on 15.3 (2004): 674-692.
[7]Hanafy, Tharwat OS. "A modified algorithm to model highly nonlinear system."J Am Sci 6.12 (2010): 747-759.
[8]Goldberg D.E., “Genetic Algorithms in Search, Optimisation, and Machine Learning”, Addison-Wesly, Reading, 1989.
[9]Michalewicz, Z., “Genetic Algorithms +Data Structures = Evolution Programs”, Springer, 1996.
[10]Vose M.D., “The Simple Genetic Algorithm: Foundations and Theory (Complex Adaptive Systems)”, Bradford Books, 1999.
[11]Matlab, “Global Optimization Toolbox User's Guide”, The MathWorks, Inc, Revised 2015
[12]Yvan Saeys,Inaki Inza and Pedro Larranaga,” A review of feature selection techniques in bioinformatics Bioinformatics” , BIOINFORMATICS REVIEW, Gene expression, Vol. 23 no. 19 2007, pages 2507–2517 , 2007
[13]Daelemans,W., et al., “Combined optimization of feature selection and algorithm parameter interaction in machine learning of language:A review of feature selection techniques”,Proceedings of the 14th European Conference on Machine Learning (ECML-2003), pp. 84–95
[14]Li,T., et al. (2004) A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression.Bioinformatics, 20, 2429–2437
[15]Petricoin,E., et al. (2002) Use of proteomics patterns in serum to identify ovarian cancer. The Lancet, 359, 572–577.
[16]Kim, Kyoung-jae, and Ingoo Han. "Genetic algorithms approach to feature discretization in artificial neural networks for the prediction of stock price index." Expert systems with Applications 19.2 (2000): 125-132.
[17]Dilip Kumar Choubey, Sanchita Paul, Joy Bhattacharjee “Soft Computing Approaches for Diabetes Disease Diagnosis: A Survey”, International Journal of Applied Engineering Research, Vol. 9, pp. 11715-11726, 2014
[18]Choubey, Dilip Kumar, and Sanchita Paul. "GA_MLP NN: A Hybrid Intelligent System for Diabetes Disease Diagnosis." (2016).
[19]V.S.R. Kumari, P.R. Kumar,” Classification of cardiac arrhythmia using hybrid genetic algorithm optimisation for multi-layer perceptron neural network”, International Journal of Biomedical Engineering and Technology, Volume 20, Issue 2, 2016
[20]Sudhakar, M., J. Albert Mayan, and N. Srinivasan. "Intelligent Data Prediction System Using Data Mining and Neural Networks." Proceedings of the International Conference on Soft Computing Systems. Springer India, 2016.
[21]Ahmadizar, Fardin, et al. "Artificial neural network development by means of a novel combination of grammatical evolution and genetic algorithm."Engineering Applications of Artificial Intelligence 39 (2015): 1-13.
[22]Melanie Mitchell(1996), An Introduction to Genetic Algorithms, A Bradford Book, The MIT Press, Cambridge, Massachusets Institute of Technology, 1996
[23]Ahmad, Fadzil, et al. "A GA-based feature selection and parameter optimization of an ANN in diagnosing breast cancer." Pattern Analysis and Applications 18.4 (2015): 861-870.
[24]Nianyi Chen, Wencong Lu, Jie Yang, Guozheng Li, “Support Vector Machine in Chemistry”, World Scientific ,Chap 4, pp.61, 2004
[25]Khan, Sheema, et al. "MicroRNA-145 targets MUC13 and suppresses growth and invasion of pancreatic cancer." Oncotarget 5.17 (2014): 7599.
[26]Moschopoulos, Charalampos, et al. "A genetic algorithm for pancreatic cancer diagnosis." Engineering Applications of Neural Networks. Springer Berlin Heidelberg, 2013. 222-230.
[27]Svetlana S. Aksenova, “Machine Learning with WEKA”, WEKA Explorer Tutorial., 2004
[28]Zhang L, Farrell JJ, Zhou H, Elashoff D et al. Salivary transcriptomic biomarkers for detection of resectable pancreatic cancer. Gastroenterology,138(3):949-57, Mar 2010
[29]SM Kalami Heris, H Khaloozadeh , “Non-dominated sorting genetic filter a multi-objective evolutionary particle filter”, Intelligent Systems (ICIS), Iranian Conference 2014
[30]Kumari, B., Swarnkar, T., “Filter versus Wrapper Feature Subset Selection in Large Dimensionality Micro array: A Review”, IJCSIT, Vol.2 (3), pp. 1048-1053, 2011
[31]Amato, F.,Lopez, A., Maria, E.P.M.,Vanhara, P.,Hampl, A., Havel, J., “Artificial neural networks in medical diagnosis”, J Appl Biomed, 11:47-58, 2013
[32]Erguzel, Turker Tekin, et al. "Feature Selection and Classification of Electroencephalographic Signals An Artificial Neural Network and Genetic Algorithm Based Approach." Clinical EEG and neuroscience 46.4 (2015): 321-326.