Background
Type: Article

Prediction of GABAA receptor proteins using the concept of Chou's pseudo-amino acid composition and support vector machine

Journal: Journal of Theoretical Biology (10958541)Year: 21 July 2011Volume: 281Issue: Pages: 18 - 23
Mohabatkar H.a Mohammad Beigi M.Esmaeili A.a
DOI:10.1016/j.jtbi.2011.04.017Language: English

Abstract

The amino acid gamma-aminobutyric-acid receptors (GABAARs) belong to the ligand-gated ion channels (LGICs) superfamily. GABAARs are highly diverse in the central nervous system. These channels play a key role in regulating behavior. As a result, the prediction of GABAARs from the amino acid sequence would be helpful for research on these receptors. We have developed a method to predict these proteins using the features obtained from Chou's pseudo-amino acid composition concept and support vector machine as a powerful machine learning approach. The predictor efficiency was assessed by five-fold cross-validation. This method achieved an overall accuracy and Matthew's correlation coefficient (MCC) of 94.12% and 0.88, respectively. Furthermore, to evaluate the effect and power of each feature, the minimum Redundancy and Maximum Relevance (mRMR) feature selection method was implemented. An interesting finding in this study is the presence of all six characters (hydrophobicity, hydrophilicity, side chain mass, pK1, pK2 and pI) or combination of the characters among the 5 higher ranked features (pk2 and pI, hydrophobicity and mass, pk1, hydrophilicity and mass) obtained from the mRMR feature selection method. The results show a biologically justifiable ranked attributes of pk2 and pI; hydrophobicity, hydrophilicity and mass; mass and pk1; pk2 and mass. Based on our results, using the concept of Chou's pseudo-amino acid composition and support vector machine is an effective approach for the prediction of GABAARs. © 2011.


Author Keywords

BioinformaticsMatthew's correlation coefficientMinimum Redundancy and Maximum RelevanceProtein family classification

Other Keywords

AlgorithmsAmino AcidsComputational BiologyReceptors, GABA-AReproducibility of Results4 aminobutyric acid A receptoralgorithmamino acidbioinformaticsclassificationcorrelationefficiency measurementligandpredictionproteinaccuracyamino acid compositionarticlecorrelation coefficienthydrophilicityhydrophobicitymolecular weightpriority journalprocess developmentsequence analysissupport vector machinevalidation process