Open Access Open Access  Restricted Access Subscription or Fee Access

Study of Protein Structure Based on Prediction Mining

Aishwarya Singh

Abstract


One of the most important unsolved challenges in biophysics and computational biology is predicting the 3D structure of a protein given its amino acid sequence. This work aims to provide a complete overview of recent protein structure prediction efforts and progress. Related topics and methodologies are provided and addressed after the general flowchart of structure prediction. In addition, concise overviews of many commonly used prediction methods and community-wide critical assessment of protein structure prediction experiments are provided. Proteins are complex structures that are normally fundamental types of molecules that can conduct a wide range of roles in cell biology. To gain a good understanding of these complicated structures, machine learning algorithms and other existing bioinformatics models must be used to obtain a comprehensive review of proteins. In order to create essential predictions, we used four machine learning methods on our protein dataset: bayes network, multilayer perceptron, One Rule, and decision stump, which were tested on a 10-fold cross validation basis. The classification results provided by these machine learning techniques were determined to be significant. Furthermore, the SWISS model was applied to a protein sequence for the purpose of predicting protein structure and examining the sequence. Proteins fold into three-dimensional configurations called conformations, which are defined by their amino acid sequence. A protein's entire structure can be classified into four stages of complexity: primary, secondary, tertiary, and quaternary.


Full Text:

PDF

References


Andreeva A, Howorth D, Chothia C, Kulesha E, Murzin AG. SCOP2 prototype: a new approach to protein structure mining. Nucleic acids research. 2014 Jan 1;42(D1): D310-4.

Huan J, Wang W, Bandyopadhyay D, Snoeyink J, Prins J, Tropsha A. Mining protein family specific residue packing patterns from protein structure graphs. In Proceedings of the eighth annual international conference on Research in computational molecular biology 2004 Mar 27 (pp. 308-315).

Vallejos-Vidal E, Reyes-Cerpa S, Rivas-Pardo JA, Maisey K, Yáñez JM, Valenzuela H, Cea PA, Castro-Fernandez V, Tort L, Sandino AM, Imarai M. Single-nucleotide polymorphisms (SNP) mining and their effect on the tridimensional protein structure prediction in a set of immunity-related expressed sequence tags (EST) in Atlantic salmon (Salmo salar). Frontiers in Genetics. 2020 Feb 27; 10:1406.

Azizi M, Abade MS. Protein structure prediction by means of sequential pattern mining. International Journal of Artificial Intelligence & Applications (IJAIA). 2015 Jul;6(4):31-42.

Huan J, Wang W, Bandyopadhyay D, Snoeyink J, Prins J, Tropsha A. Mining protein family specific residue packing patterns from protein structure graphs. In Proceedings of the eighth annual international conference on Research in computational molecular biology 2004 Mar 27 (pp. 308-315).

Weber BG, Mateas M. A data mining approach to strategy prediction. In2009 IEEE Symposium on Computational Intelligence and Games 2009 Sep 7 (pp. 140-147). IEEE.

Chou KC, Zhang CT. Prediction of protein structural classes. Critical reviews in biochemistry and molecular biology. 1995 Jan 1;30(4):275-349.

Chou PY, Fasman GD. Prediction of protein conformation. Biochemistry. 1974 Jan 1;13(2):222-45.

Fasman GD, editor. Prediction of protein structure and the principles of protein conformation. Springer Science & Business Media; 2012 Dec 6.

Cheng J, Sweredoski MJ, Baldi P. Accurate prediction of protein disordered regions by mining protein structure data. Data mining and knowledge discovery. 2005 Nov;11(3):213-22.


Refbacks

  • There are currently no refbacks.