Multilayer Perceptron Network in HIV/AIDS Application


The paper discusses a new Multilayered Perceptron (MLP) network method to classify the HIV/AIDS infected and non-infected status of individuals. For this purpose, seven patient features, namely age, sex, weight, HB, CD4, CD8 and TB, were used as input data. To determine the applicability and best performance of the MLP network, three training algorithms, Back Propagation, Levenberg-Marquardt and Bayesian Rule, were employed to train the MLP networks. The findings show that the MLP network trained using the Back Propagation algorithm produced the best performance, with 89.80% accuracy, compared to the Levenberg-Marquardt and Bayesian Rule algorithms. The results also demonstrated the suitability of the MLP network for determining the HIV/AIDS positive/negative status of the patient.

Keywords--Multi Layered Perceptron (MLP), Back propagation, Levenberg-Marquardt, Bayesian, Regimens, HIV/AIDS.

I. INTRODUCTION

Acquired Immunodeficiency Syndrome (AIDS) was first defined in 1982 to describe the cases of unusual immune system failure due to an unknown and unidentified infection in the previous years. The Human Immunodeficiency Virus (HIV) was later identified as the cause of AIDS. In a large number of medical applications, classification is desired to differentiate a pattern of low frequency from a pattern of high frequency. The clinical trial system for HIV/AIDS is a complex one. HIV/AIDS is an incurable disease, and millions of people are HIV positive. Recent research shows that computational intelligence has been widely used in medical diagnosis to solve complex problems by developing decision support systems that apply Neural Network algorithms. Neural Networks are applicable to many medical problems and offer algorithms for classification, prediction, image processing, etc. Proper application of a Neural Network technique to a large-scale health services research dataset is one of the most difficult areas in the Neural Network field. Incorrectly defined and unstructured factors affecting the functional health status of HIV/AIDS patients complicate it further.
Many studies have applied Neural Network techniques to classify and predict a desired solution or to improve methodological aspects.

A multilayered perceptron (MLP) network trained using the back propagation (BP) algorithm is one of the most popular choices in neural network applications. The present study proposes the MLP network to classify HIV positive/negative individuals and compares the performance of three available training algorithms, namely back propagation, Levenberg-Marquardt and Bayesian rule.

II. REVIEW OF RELATED RESEARCH

The Human Immunodeficiency Virus (HIV) is one of the main causes of human death in the world. HIV is a human pathogen that infects certain types of lymphocytes called T-helper cells, which are important to the immune system. Without a sufficient number of T-helper cells, the immune system is unable to defend the body against infections, thereby making it vulnerable to various infections and diseases, to which it finally succumbs.

Acquired immunodeficiency syndrome (AIDS) was first defined [1] in 1982 to describe the first cases of unusual immune system failure that were identified in the previous year. The human immunodeficiency virus (HIV) was later identified as the cause of AIDS. As an indicator, risk factor epidemiology examines individual demographic and social characteristics and attempts to determine factors that place an individual at risk of acquiring a life-threatening disease [2].
The demographic and social characteristics of individuals and their behavior are used to determine the risk of HIV infection; this is referred to as biomedical individualism [2],[3]. By identifying the individual risk factors that lead to HIV infection, it is possible to modify the social conditions that give rise to the disease, and thus design effective HIV prevention policies. A model will be created and used to classify the HIV status of individuals based on demographic properties. In this study, the model is created using autoencoder neural networks and genetic algorithms, which have been applied for classification.

An artificial neural network (ANN) is an interconnected structure of processing elements. The ANN structure [4] used for this study consists of three main components (Fig. 1) [5]: the input layer, the hidden layer and the output layer.

Neural networks have been successfully used for medical informatics, for decision making, clinical diagnosis, prognosis, and prediction of outcomes [6]-[10], and also for classification. Marwala [11] used a probabilistic committee of neural networks to classify faults in a population of nominally identical cylindrical shells and obtained an accuracy of 95% in classifying eight classes of fault cases. Ohno-Machado [12] showed that a lack of data balance limits the accuracy of the neural network model and increased the accuracy by using sequential neural networks. Lisboa [13] assessed the evidence of healthcare benefits using neural networks. Fernandez and Caballero [14] used ANN to model the activity of cyclic urea HIV-1 protease inhibitors. They showed that ANN was capable of representing the nonlinearity in the HIV model.
Lee and Park [15] applied neural networks to classify and predict the symptomatic status of HIV/AIDS patients based on publicly available HIV/AIDS data. A study was also performed to predict the functional health status of HIV/AIDS patients, defined as 'in good health' or 'not in good health', using neural networks [16]. Laumann and Youm [17] used racial and ethnic group differences to model the prevalence of the disease and succeeded in relating demographic properties to the transmission of the disease. Poundstone and others [2] related demographic properties to the spread of HIV. Their work justified the use of such demographic properties in creating a model to predict the HIV status of individuals, as done in the present paper.

All the models referred to above concluded that ANN performs better in HIV classification problems. The methodology adopted here aims at using demographic and social factors to predict the HIV status of an individual, using autoencoder neural networks. The most common neural network architecture is the multilayer perceptron (MLP). An alternative network is the Radial Basis Function (RBF) [5]. The use of MLP over RBF can be attributed to the fact that the RBF usually requires the implementation of the pseudo-inverse of a matrix for training, which is often singular, while MLP uses conventional feedforward optimization methods, which are stable [5]. In the present case, preliminary design showed that the MLP outperformed the RBF.
This can be attributed to the fact that MLP networks, also known as universal approximators, are capable of modeling any complex relationship with one or two hidden layers and are thus most suited for this study. For a detailed analysis of neural networks and MLP, one can refer to the studies in [18]-[22].

For the purpose of the present paper, neural networks are used with genetic algorithms. A genetic algorithm (GA) is an optimization method deriving its behavior from processes of evolution in nature, inspired by Darwin's theory of natural evolution [23],[24]. This is done by creating a population of individuals within a machine/computer. In this study, the population of individuals represents the missing input entries. The individuals then go through the process of evolution. GA uses fitness-proportionate or tournament selection to probabilistically select the missing entries (individuals) that yield the right HIV status for the individuals. Although not guaranteed to provide the globally optimum solution, GA has been shown to be highly efficient at reaching a very near optimum solution in a computationally efficient manner [23],[24]. For more details on GA, one can refer to Davis and Michalewicz [25],[26].
In the literature reviewed, no method proposed thus far investigates the use of autoencoder networks for HIV modeling, which is based on autoassociative models [27], combined with GA to classify the HIV status of an individual based on demographic properties.

A multilayered perceptron (MLP) network trained using the back propagation (BP) algorithm is the most popular choice in neural network applications. The present study proposes the MLP network to predict HIV/AIDS Regimen specification and compares the performance of three available training algorithms, namely back propagation, Levenberg-Marquardt and Bayesian rule.

This paper is organized as follows. Section 3 discusses the basic concept of the MLP network. Section 4 outlines the training algorithms employed in this research. Section 5 discusses the methodology. Section 6 presents the results and discussion. Finally, Section 7 outlines the conclusion.

III. MULTILAYERED PERCEPTRON NETWORK

An MLP network is a feed forward neural network with one or more hidden layers. Cybenko and Funahashi [28], [29] have proved that the MLP network is a general function approximator and that an MLP network with one hidden layer (as shown in Fig. 1) is sufficient to approximate any continuous function. Based on Fig. 1, the input layer acts as an input data buffer that distributes the input to the hidden layer. The outputs from the hidden layer then become the inputs to the output layer, which provides the network output.

A hidden neuron performs two functions, i.e. the combining function and the activation function. Consider an MLP network with $n_i$ input nodes; the output of the j-th neuron of the hidden layer is given by:

$$v_j(t) = F\left(\sum_{i=1}^{n_i} w^{1}_{ji}\, x_i(t) + b^{1}_j\right), \quad \text{for } 1 \le j \le n_h \qquad (1)$$

where $w^{1}_{ji}$ denotes the weights that connect the input and hidden layers; $x_i$ and $b^{1}_j$ denote the inputs that are supplied to the input layer and the thresholds in the hidden nodes respectively; and $n_i$ and $n_h$ are the numbers of input nodes and hidden nodes respectively.

The output of the k-th output neuron, $\hat{y}_k$, in the output layer is given by:

$$\hat{y}_k(t) = \sum_{j=1}^{n_h} w^{2}_{kj}\, v_j(t), \quad \text{for } 1 \le k \le \ldots$$
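The hidden-layer and output-layer equations above can be sketched in a few lines of Python. This is only an illustrative sketch: the logistic activation, the function name `mlp_forward`, and all weight and input values are assumptions made for the example, not values taken from the study.

```python
import math

def sigmoid(a):
    """Logistic activation, assumed here as one possible choice for F."""
    return 1.0 / (1.0 + math.exp(-a))

def mlp_forward(x, w1, b1, w2, activation=sigmoid):
    """Return (hidden outputs v, network outputs y_hat) for one input pattern.

    x  : list of n_i inputs
    w1 : n_h rows of n_i input-to-hidden weights, w1[j][i]
    b1 : n_h hidden-node thresholds
    w2 : one row of n_h hidden-to-output weights per output neuron, w2[k][j]
    """
    # Hidden layer, Eq. (1): combine the inputs, add the threshold, apply F.
    v = [activation(sum(w * xi for w, xi in zip(row, x)) + b)
         for row, b in zip(w1, b1)]
    # Output layer: each linear output neuron sums its weighted hidden outputs.
    y_hat = [sum(w * vj for w, vj in zip(row, v)) for row in w2]
    return v, y_hat

# Hypothetical pattern with seven scaled inputs, two hidden nodes, one output.
x = [0.35, 1.0, 0.58, 0.11, 0.20, 0.45, 0.0]
w1 = [[0.1] * 7, [-0.2] * 7]
b1 = [0.0, 0.5]
w2 = [[1.0, -1.0]]
v, y_hat = mlp_forward(x, w1, b1, w2)
```

The output layer is kept linear, matching the linear output neuron assumed in the back propagation discussion; a different activation can be passed through the `activation` argument.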

the hidden layer which is back propagated in the network. Since the activation function of the output neuron is linear, the error signal at the output node is

$$\delta^{2}_k(t) = y_k(t) - \hat{y}_k(t) \qquad (8)$$

and for the neurons in the hidden layer

$$\delta^{1}_j(t) = F'(x_i(t)) \sum_{k} \delta^{2}_k(t)\, w^{2}_{kj}(t-1) \qquad (9)$$

where $F'(x_i(t))$ is the first derivative of $F(x_i(t))$ with respect to $x_i(t)$.

Since the back propagation algorithm is a steepest descent type algorithm, it suffers from a slow convergence rate. The search for the global minimum may be trapped at local minima, and the algorithm can be sensitive to the user selectable parameters (Mashor, M. Y. 2003).

B. Levenberg-Marquardt Algorithm

The Levenberg-Marquardt algorithm is a gradient-based, deterministic local optimization algorithm. It has an advantage over the traditional Back Propagation algorithm in that it can provide a faster (second-order) convergence rate while keeping relative stability [32],[33].

Like the quasi-Newton methods, the Levenberg-Marquardt algorithm was designed to approach second-order training speed without having to compute the Hessian matrix. When the performance function has the form of a sum of squares (as is typical in training feed forward networks), the Hessian matrix can be approximated as:

$$H = J^{T} J \qquad (10)$$

and the gradient can be computed as:

$$g = J^{T} e \qquad (11)$$

where J is the Jacobian matrix that contains the first derivatives of the network errors with respect to the weights and biases, and e is a vector of network errors. The Jacobian matrix can be computed through a standard back propagation technique that is much less complex than computing the Hessian matrix [34]. The Levenberg-Marquardt algorithm uses this approximation to the Hessian matrix in the following Newton-like update:

$$\Delta w_{ji} = -\left[J^{T} J + \mu I\right]^{-1} J^{T} e \qquad (12)$$

where $\Delta w_{ji}$ is the weight change and µ is a control parameter.

When the scalar µ is zero, this is just Newton's method, using the approximate Hessian matrix. When µ is large, it becomes gradient descent with a small step size.
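The two limits of µ can be checked numerically with a minimal sketch of the update in Eqs. (10)-(12). The sketch assumes a toy two-parameter linear model rather than an MLP, so that the 2x2 system can be solved by hand; the function name `lm_step` and the data are hypothetical, and the minus sign follows the convention that e holds the errors (target minus output) and J their derivatives.

```python
def lm_step(jacobian, errors, mu):
    """One Levenberg-Marquardt weight change for a two-parameter model.

    jacobian : rows [de_n/dw0, de_n/dw1], one per training sample
    errors   : network errors e_n (target minus output)
    mu       : control parameter
    """
    # Eq. (10): approximate Hessian H = J^T J (symmetric 2x2).
    h00 = sum(r[0] * r[0] for r in jacobian)
    h01 = sum(r[0] * r[1] for r in jacobian)
    h11 = sum(r[1] * r[1] for r in jacobian)
    # Eq. (11): gradient g = J^T e.
    g0 = sum(r[0] * e for r, e in zip(jacobian, errors))
    g1 = sum(r[1] * e for r, e in zip(jacobian, errors))
    # Eq. (12): solve (H + mu*I) dw = -g using the 2x2 inverse.
    a, b, d = h00 + mu, h01, h11 + mu
    det = a * d - b * b
    return [-(d * g0 - b * g1) / det, -(a * g1 - b * g0) / det]

# Hypothetical toy problem: output = w0*x + w1, targets on the line t = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
targets = [2.0 * x + 1.0 for x in xs]
w = [0.0, 0.0]

errors = [t - (w[0] * x + w[1]) for x, t in zip(xs, targets)]
jacobian = [[-x, -1.0] for x in xs]   # derivatives of e_n w.r.t. w0 and w1

newton_like = lm_step(jacobian, errors, mu=0.0)   # Newton-like limit
cautious = lm_step(jacobian, errors, mu=1e6)      # close to -g / mu
```

Because the toy model is linear, the µ = 0 step is [2.0, 1.0] and reaches the least-squares solution in one update, while the µ = 1e6 step points along the negative gradient but is roughly a million times shorter.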
Newton's method is faster and more accurate near an error minimum, so the aim is to shift towards Newton's method as quickly as possible. Thus, µ is decreased after each successful step (reduction in performance function) and is increased only when a tentative step would increase the performance function. In this way, the performance function is always reduced at each iteration of the algorithm [34].

C. Bayesian Rule Algorithm

Bayes' Rule is given as

$$P(\theta \mid D) = \frac{P(D \mid \theta)\, P(\theta)}{P(D)} \qquad (13)$$

where $P(\theta)$ is the prior probability of a parameter θ before having seen the data, and $P(D \mid \theta)$, called the likelihood, is the probability of the data D given θ.

Bayes' Rule is used to determine the posterior probability of θ given the data D [34]. In general this provides an entire distribution over possible values of θ. This process was applied to neural networks to obtain the probability distribution over the network weights, w, given the training data. The posterior distribution over the weights is

$$p(w \mid D) = \frac{p(D \mid w)\, p(w)}{p(D)} = \frac{p(D \mid w)\, p(w)}{\int p(D \mid w)\, p(w)\, dw} \qquad (14)$$

In the Bayesian formalism, learning the weights means changing our belief about the weights from the prior, p(w), to the posterior, p(w | D), as a consequence of seeing the data, as illustrated by Fig. 2.

Fig. 2 Changing prior weights to posterior weights
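Equation (14) can be made concrete with a small numerical sketch for a single weight. Everything specific here is an assumption for illustration: the one-weight model y = w x, the Gaussian noise level, the standard normal prior, the three data points, and the replacement of the integral by a sum over a grid of candidate weights. It is not the training routine used in the study.

```python
import math

def grid_posterior(grid, prior, log_likelihood):
    """Posterior over candidate weights on a grid, following Eq. (14)."""
    # Numerator of Eq. (14): likelihood times prior at each candidate weight.
    unnormalised = [p * math.exp(log_likelihood(w)) for w, p in zip(grid, prior)]
    # Denominator: the integral over w, approximated by a sum over the grid.
    evidence = sum(unnormalised)
    return [u / evidence for u in unnormalised]

# Hypothetical setup: model y = w * x with Gaussian noise on the targets.
data = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2)]   # (input, target) pairs
sigma = 0.5                                    # assumed noise level

def log_likelihood(w):
    """Log of p(D | w) up to a constant that cancels in Eq. (14)."""
    return -sum((t - w * x) ** 2 for x, t in data) / (2.0 * sigma ** 2)

grid = [i / 10.0 for i in range(-30, 31)]      # candidate weights -3.0 .. 3.0
raw_prior = [math.exp(-0.5 * w * w) for w in grid]
prior = [p / sum(raw_prior) for p in raw_prior]

posterior = grid_posterior(grid, prior, log_likelihood)
prior_mode = grid[prior.index(max(prior))]
posterior_mode = grid[posterior.index(max(posterior))]
```

In this toy setup the most probable weight moves from 0.0 under the prior to 2.0 under the posterior, which is the change of belief from p(w) to p(w | D) described above.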

V. METHODOLOGY AND DATA SAMPLES

As discussed earlier, the study focuses on classification of HIV/AIDS infected and non-infected individuals. To determine the applicability of the MLP network as an HIV/AIDS diagnosis technique, the MLP network needs to go through training and testing phases. During both phases, the optimum structure and diagnosis performance of the MLP networks are determined. The performance analysis of the MLP network is based on accuracy, defined as the percentage of overall correct determination of HIV/AIDS cases. The data are taken from the National AIDS Control Organization (NACO), India. Seven factors, namely age, sex, weight, HB, CD4, CD8 and TB, are used as input data to the MLP network. Medical information from 200 patients was used as training data, while the remaining 100 records were used as testing data. The data are fed randomly into the MLP network. The network was implemented using the MATLAB 7.10 neural network tool (nntool). An input layer represents the set of seven input variables: age, sex, weight, CD4 count, CD8 count, HB rate and TB.

VI. RESULT AND DISCUSSION

By performing the relevant training, the optimum numbers of hidden nodes and training epochs were obtained, that is, the values at which the MLP network achieves the highest performance. Fig. 3 (a) and (b) show the results of obtaining the optimum training epochs and number of hidden nodes for the back propagation algorithm respectively. The MLP network using the back propagation algorithm achieves its highest performance at 2200 training epochs and 14 hidden nodes. This is also the highest performance obtained with the parameters referred to earlier.

Fig. 3 Performance of the MLP network with Back propagation algorithm: (a) Training Epochs, (b) Hidden Node

Results for the Levenberg-Marquardt training algorithm are shown in Fig. 4. This training algorithm produced its best performance at 100 training epochs and 7 hidden nodes.

Fig. 4 Performance of the MLP network with Levenberg-Marquardt algorithm: (a) Training Epochs, (b) Hidden Node

Fig. 5 shows the result for the Bayesian Rule training algorithm. This training algorithm achieved an optimal result at 1000 training epochs and 3 hidden nodes.

Fig. 5 Performance of the MLP network with Bayesian Rule algorithm: (a) Training Epochs, (b) Hidden Node

After obtaining the optimum structure for the network, the performance of the MLP network was determined. Table I shows the performance comparison of the MLP network using the three training algorithms. A comparative analysis of the results shows that the back propagation training algorithm produces the highest accuracy, 89.80%, compared to the Levenberg-Marquardt and Bayesian Rule training algorithms, which produce accuracies of 87.35% and 87.76% respectively. This highest accuracy is obtained keeping all the other factors constant for the three training algorithms.

TABLE I
THE PERFORMANCE COMPARISON OF THE MLP NETWORK WITH THREE DIFFERENT TRAINING ALGORITHMS

Training Algorithm     Accuracy
Back propagation       89.80%
Levenberg-Marquardt    87.35%
Bayesian Rule          87.76%

VII. CONCLUSION

The research undertaken has been implemented using MLP. In this particular model, the patients were grouped into active and inactive based on the network output. The network was trained and the weight of the hidden layer got

adjusted. The final adjusted weight was the result of this model. The outcome of this research shows that the prediction model provides medical practitioners with a convenient decision support tool that can be used to predict cases of HIV/AIDS infected patients. The findings indicate that this prediction method is a promising method for identifying and treating HIV/AIDS patients. This system has the potential to improve the outcomes of health services and strengthen the accurate prediction of AIDS infected patients. The results obtained indicate that the MLP network trained with the back propagation algorithm produced the highest performance compared to the Levenberg-Marquardt and Bayesian Rule algorithms. The results also showed that the MLP network can be applied to HIV/AIDS infected cases based on the seven features taken for the purpose of the research (i.e. age, sex, weight, HB, CD4, CD8 and TB). There is also scope for further study applying different types of neural network architectures combined with other learning algorithms, in order to find the most appropriate network for classification of HIV/AIDS positive and negative individuals. HIV/AIDS is one of the major health challenges to the world health community. Thousands of people are infected with this virus and thousands die every day throughout the world. This problem is no longer limited to the under-developed countries but has spread to the developed countries also.
The scale of this problem is so large and pressing that the United Nations included it among its eight Millennium Development Goals. Every effort at every level should be made on a priority basis to control the menace of this infection. HIV/AIDS is no longer a concern only for medical and health professionals, but for people at all levels.