Loading

Protein Structure Fingerprint Technology

News Letter | Open Access | Volume 3 | Issue 2

  • 1. Department of Medical Transformation Center, Shenzhen Institutes of Advanced Technology, China
+ Show More - Show Less
Corresponding Authors
Jiaan Yang, Department of Medical Transformation Center, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China
CITATION

Yang J (2018) Protein Structure Fingerprint Technology. J Bioinform, Genomics, Proteomics 3(2): 1036

NEWS LETTER

Knowledge of Protein Structures

Knowledge of protein structures is significant for biology research and drug discovery. The protein 3D structures may be obtained by experimental measurements or computational approaches. Experimentally, the protein structures can be measured by X-ray crystallography, Nuclear Magnetic Resonance (NMR) or Transmission Electron Cryomicroscopy (Cryo TEM) etc., and so far over 140,000 of 3D structure data are available in PDB (Protein Data Bank) (https://www.rcsb.org/). Experimental measurements may accurately determine atomic coordinates of protein structures which are only some of snapshots for stable states under specific conditions. Although rich data of protein 3D structures have been accumulated by PDB, it cannot keep up with the pace of rapid increase of knowledge of protein one-dimensional sequences. Up to now, over 21,000,000 genetic codes for Prokaryotes and Eukaryota are in NCBI (National Center for Biotechnology Information) (https://www.ncbi. nlm.nih.gov/) and over 120,000,000 protein sequences in Uni Prot (Universal Protein Resource) (http://www.uniprot. org/). Overall, less than 1% of proteins have the knowledge of 3D structures. Therefore, on the other hand, the computational approaches become an important methodology to predict the protein 3D structures. Various computational methodologies for investigation of protein folding have been developed [1-4]. The comparative protein structure modeling as template-based methods to predict protein structures, which search the multiple templates with sequence homology from PDB, and then process energy optimization to predict protein 3D structure [5-7]. The ab initio or de novo approaches as template-free methods to predict protein structures, which carry out through the molecular dynamics (MD) simulation calculations under various force fields [8-14]. One large-scale project of Folding @ Home was developed in the year 2000 at Stanford University to compute the protein folding structures involving contribution from thousands of personal computer clusters world widely [15]. The CASP (platform of Critical Assessment of Techniques for protein Structure Prediction) [16] has provided a worldwide platform to promote the development of protein structure predictions since 1994.Overall, the problem of protein structure prediction is unlikely to be perfectly solved in the near future [17] because it requires vast computational resources.

Challenges in Protein Structure Research

So far, the challenges in protein structure research are in several aspects. First, most of proteins are only know one-dimensional sequences without 3D structural knowledge. Second, even if a protein 3D structure is well known, it is hard to be interpreted and analyzed due to the complexity of topological folds. Third, the nature of proteins is intrinsically disordered, and so how to present this folding flexibility. Furthermore, it is a difficult task to acquire an astronomical number of folding conformations for each protein; moreover, million numbers of proteins are waiting for solutions. Therefore, the knowledge and understanding of protein structure research is a great challenge in the field of life science in the post gene era.

Protein Structure Fingerprint Technology

A novel approach, protein structure fingerprint technology, has been developed trying to overcome these hurdles. In this approach, a folden of 5 amino acid residues was taken as the folding element, and its complete folding description were mathematically acquired. A set of 27 folding vectors was set up to fully cover various folding patterns for 5 amino acid residues, and expressed by Protein Folding Shapes Code (PFSC) as alphabetical letters [18]. Any conformation for protein three-dimensional (3D) structure can be completely presented by a PFSC string in which one PFSC letter represents for a folding vector of 5 amino acid residues, two PFSC letters next each other for coupling two folding vectors overlapping four amino acids and so on. Each set of 5 amino acid residues actually has different folding shapes and flexibilities because of molecular constrain. A database of 5 AAPFSC is created together the possible folding shapes in PFSC for various arrangements for 5 amino acids. Consequently, the comprehensive local folding variations can be obtained according the order of amino acids in sequence, and assemble in Protein Folding Variation Matrix (PFVM). The PFVM contains local folding variations along sequence. It demonstrated how the protein folding variations are determined by the order of amino acids in sequence. Also, the patterns of protein folding variations along sequence may reveal the folding flexibility and proteins intrinsic disorder. In addition, an astronomical number of protein conformations and the most possible conformations can be constructed by PFSC letters in PFVM. The protein structure fingerprint technology is briefly presented in Figure (1), and it illustrates that any protein with known 3D structure can be converted into complete conformation with PFSC description while any protein with one-dimensional sequence can acquire the comprehensive protein folding variations in PFVM (Figure 1).

The protein structure fingerprint technology. 27 Protein Folding Shapes Code (PFSC) is presented in the cubic. The blue arrows indicate that  the complete conformation description with using PFSC is obtained from a protein 3D structure. The red arrows indicated that the comprehensive  protein folding variations in Protein Folding Variation Matrix (PFVM) is obtained from gene or protein sequence and expressed in PFSC description.

Figure 1: The protein structure fingerprint technology. 27 Protein Folding Shapes Code (PFSC) is presented in the cubic. The blue arrows indicate that the complete conformation description with using PFSC is obtained from a protein 3D structure. The red arrows indicated that the comprehensive protein folding variations in Protein Folding Variation Matrix (PFVM) is obtained from gene or protein sequence and expressed in PFSC description.

The protein structure fingerprint technology. 27 Protein Folding Shapes Code (PFSC) is presented in the cubic. The blue arrows indicate that the complete conformation description with using PFSC is obtained from a protein 3D structure. The red arrows indicated that the comprehensive protein folding variations in Protein Folding Variation Matrix (PFVM) is obtained from gene or protein sequence and expressed in PFSC description.

Web Server

A computational platform of protein structure fingerprint has been set up. If to input a protein 3D structure in PDB format, the output will present its complete conformation in PFSC string description, including secondary fragments as well as tertiary fragments. If to input an amino acid sequence, the output will provide the comprehensive protein folding variations in PFVM and further assemble the possible conformations in PFSC description. These functions can be freely accessed by web server www.micropht.com.

Potential Applications

The protein structure fingerprint technology can be applied to biology studies and drug discovery, such as gene protein mutation, protein recombination, protein differentiation analysis, protein / peptide design, antibody / antigen recognition, protein misfolding and protein stability etc. Also, it can provide abundant conformations for protein predictions.

REFERENCES

1. Compiani M, Capriotti E. Computational and theoretical methods for protein folding. Biochemistry. 2013; 52: 8601-8624.

2. Guo JT, Ellrott K, Xu Y. A historical perspective of template-based protein structure prediction. Methods Mol Biol. 2008; 413: 3-42.

3. Dorn M, E Silva MB, Buriol LS, Lamb LC. Three-dimensional protein structure prediction: methods and computational strategies. Comput Biol Chem. 2014; 53: 251-276.

4. Brylinski M. Is the growth rate of Protein Data Bank sufficient to solve the protein structure prediction problem using template-based modeling? Bio-Algo Med-Sys. 2015; 11: 1-7.

5. J Yang, Yan R, Roy A, Xu D, Poisson J, Zhang Y. The I-TASSER Suite: Protein structure and function prediction. Nature Methods. 2015; 12: 7-8.

6. Eswar N, Webb B, Marti-Renom MA, Madhusudhan MS, Eramian D, Shen MY, et al. Comparative protein structure modeling using Modeller. Curr Protoc Bioinformatics. 2006; 5: 5-6.

7. Liu T, Tang GW, Capriotti E. Comparative Modeling: The state of the art and protein drug target structure prediction. Comb Chem High Throughput Screen. 2011; 14: 532-537.

8. Yang L, Tan CH, Hsieh MJ, Wang J, Duan Y, Cieplak P, et al. New-generation amber united-atom force field. J Phys Chem B. 2006; 110: 13166-13176.

9. Brooks BR, Brooks C LIII, Mackerell AD, Nilsson L, Petrella RJ, Roux B, et al. CHARMM: The biomolecular simulation program. J Comput Chem. 2009; 30: 1545-1614.

10. Riniker S, Christ CD, Hansen HS, Hunenberger PH, Oostenbrink C, Steiner D, et al. Calculation of relative free energies for ligand-protein binding, solvation, and conformational transitions using the GROMOS software. J Phys Chem. 2011; 115: 13570-13577.

11. Honig B. Protein folding: from the levinthal paradox to structure prediction. J Mol Biol. 1999; 293: 283-293.

12. Onuchic JN, Wolynes PG. Theory of protein folding. Curr Opin Struct Biol. 2004; 14: 70-75.

13. Vallat B, Madrid-Aliste C, Fiser A. Modularity of Protein Folds as a Tool for Template-Free Modeling of Structures. PLoS Comput Biol. 2015; 11: 1004419.

14. Zhang J, Li W, Wang J, Qin M, Wu L, Yan Z, et al. Protein folding simulations: from coarse-grained model to all-atom model. IUBMB Life. 2009; 61: 627-643.

15. Zagrovic B, Snow CD, Shirts MR, Pande VS. Simulation of folding of a small α-helical protein in atomistic detail using worldwide-distributed computing. J Mol Biol. 2002; 323: 927-937.

16. Moult J, Fidelis K, Kryshtafovych A, Tramontano A. Critical assessment of methods of protein structure prediction (CASP)--round IX. Proteins. 2011; 79: 1-5.

17. Benedix A, Becker CM, de Groot BL, Caflisch A, Böckmann RA. Predicting free energy changes using structural ensembles. Nat Methods. 2009; 6: 3-4.

18. Yang J. Comprehensive description of protein structures using protein folding shape code. Proteins. 2008; 71. 3: 1497-1518

Yang J (2018) Protein Structure Fingerprint Technology. J Bioinform, Genomics, Proteomics 3(2): 1036.

Received : 22 Nov 2018
Accepted : 23 Nov 2018
Published : 25 Nov 2018
Journals
Annals of Otolaryngology and Rhinology
ISSN : 2379-948X
Launched : 2014
JSM Schizophrenia
Launched : 2016
Journal of Nausea
Launched : 2020
JSM Internal Medicine
Launched : 2016
JSM Hepatitis
Launched : 2016
JSM Oro Facial Surgeries
ISSN : 2578-3211
Launched : 2016
Journal of Human Nutrition and Food Science
ISSN : 2333-6706
Launched : 2013
JSM Regenerative Medicine and Bioengineering
ISSN : 2379-0490
Launched : 2013
JSM Spine
ISSN : 2578-3181
Launched : 2016
Archives of Palliative Care
ISSN : 2573-1165
Launched : 2016
JSM Nutritional Disorders
ISSN : 2578-3203
Launched : 2017
Annals of Neurodegenerative Disorders
ISSN : 2476-2032
Launched : 2016
Journal of Fever
ISSN : 2641-7782
Launched : 2017
JSM Bone Marrow Research
ISSN : 2578-3351
Launched : 2016
JSM Mathematics and Statistics
ISSN : 2578-3173
Launched : 2014
Journal of Autoimmunity and Research
ISSN : 2573-1173
Launched : 2014
JSM Arthritis
ISSN : 2475-9155
Launched : 2016
JSM Head and Neck Cancer-Cases and Reviews
ISSN : 2573-1610
Launched : 2016
JSM General Surgery Cases and Images
ISSN : 2573-1564
Launched : 2016
JSM Anatomy and Physiology
ISSN : 2573-1262
Launched : 2016
JSM Dental Surgery
ISSN : 2573-1548
Launched : 2016
Annals of Emergency Surgery
ISSN : 2573-1017
Launched : 2016
Annals of Mens Health and Wellness
ISSN : 2641-7707
Launched : 2017
Journal of Preventive Medicine and Health Care
ISSN : 2576-0084
Launched : 2018
Journal of Chronic Diseases and Management
ISSN : 2573-1300
Launched : 2016
Annals of Vaccines and Immunization
ISSN : 2378-9379
Launched : 2014
JSM Heart Surgery Cases and Images
ISSN : 2578-3157
Launched : 2016
Annals of Reproductive Medicine and Treatment
ISSN : 2573-1092
Launched : 2016
JSM Brain Science
ISSN : 2573-1289
Launched : 2016
JSM Biomarkers
ISSN : 2578-3815
Launched : 2014
JSM Biology
ISSN : 2475-9392
Launched : 2016
Archives of Stem Cell and Research
ISSN : 2578-3580
Launched : 2014
Annals of Clinical and Medical Microbiology
ISSN : 2578-3629
Launched : 2014
JSM Pediatric Surgery
ISSN : 2578-3149
Launched : 2017
Journal of Memory Disorder and Rehabilitation
ISSN : 2578-319X
Launched : 2016
JSM Tropical Medicine and Research
ISSN : 2578-3165
Launched : 2016
JSM Head and Face Medicine
ISSN : 2578-3793
Launched : 2016
JSM Cardiothoracic Surgery
ISSN : 2573-1297
Launched : 2016
JSM Bone and Joint Diseases
ISSN : 2578-3351
Launched : 2017
JSM Bioavailability and Bioequivalence
ISSN : 2641-7812
Launched : 2017
JSM Atherosclerosis
ISSN : 2573-1270
Launched : 2016
Journal of Genitourinary Disorders
ISSN : 2641-7790
Launched : 2017
Journal of Fractures and Sprains
ISSN : 2578-3831
Launched : 2016
Journal of Autism and Epilepsy
ISSN : 2641-7774
Launched : 2016
Annals of Marine Biology and Research
ISSN : 2573-105X
Launched : 2014
JSM Health Education & Primary Health Care
ISSN : 2578-3777
Launched : 2016
JSM Communication Disorders
ISSN : 2578-3807
Launched : 2016
Annals of Musculoskeletal Disorders
ISSN : 2578-3599
Launched : 2016
Annals of Virology and Research
ISSN : 2573-1122
Launched : 2014
JSM Renal Medicine
ISSN : 2573-1637
Launched : 2016
Journal of Muscle Health
ISSN : 2578-3823
Launched : 2016
JSM Genetics and Genomics
ISSN : 2334-1823
Launched : 2013
JSM Anxiety and Depression
ISSN : 2475-9139
Launched : 2016
Clinical Journal of Heart Diseases
ISSN : 2641-7766
Launched : 2016
Annals of Medicinal Chemistry and Research
ISSN : 2378-9336
Launched : 2014
JSM Pain and Management
ISSN : 2578-3378
Launched : 2016
JSM Women's Health
ISSN : 2578-3696
Launched : 2016
Clinical Research in HIV or AIDS
ISSN : 2374-0094
Launched : 2013
Journal of Endocrinology, Diabetes and Obesity
ISSN : 2333-6692
Launched : 2013
Journal of Substance Abuse and Alcoholism
ISSN : 2373-9363
Launched : 2013
JSM Neurosurgery and Spine
ISSN : 2373-9479
Launched : 2013
Journal of Liver and Clinical Research
ISSN : 2379-0830
Launched : 2014
Journal of Drug Design and Research
ISSN : 2379-089X
Launched : 2014
JSM Clinical Oncology and Research
ISSN : 2373-938X
Launched : 2013
JSM Bioinformatics, Genomics and Proteomics
ISSN : 2576-1102
Launched : 2014
JSM Chemistry
ISSN : 2334-1831
Launched : 2013
Journal of Trauma and Care
ISSN : 2573-1246
Launched : 2014
JSM Surgical Oncology and Research
ISSN : 2578-3688
Launched : 2016
Annals of Food Processing and Preservation
ISSN : 2573-1033
Launched : 2016
Journal of Radiology and Radiation Therapy
ISSN : 2333-7095
Launched : 2013
JSM Physical Medicine and Rehabilitation
ISSN : 2578-3572
Launched : 2016
Annals of Clinical Pathology
ISSN : 2373-9282
Launched : 2013
Annals of Cardiovascular Diseases
ISSN : 2641-7731
Launched : 2016
Journal of Behavior
ISSN : 2576-0076
Launched : 2016
Annals of Clinical and Experimental Metabolism
ISSN : 2572-2492
Launched : 2016
Clinical Research in Infectious Diseases
ISSN : 2379-0636
Launched : 2013
JSM Microbiology
ISSN : 2333-6455
Launched : 2013
Journal of Urology and Research
ISSN : 2379-951X
Launched : 2014
Journal of Family Medicine and Community Health
ISSN : 2379-0547
Launched : 2013
Annals of Pregnancy and Care
ISSN : 2578-336X
Launched : 2017
JSM Cell and Developmental Biology
ISSN : 2379-061X
Launched : 2013
Annals of Aquaculture and Research
ISSN : 2379-0881
Launched : 2014
Clinical Research in Pulmonology
ISSN : 2333-6625
Launched : 2013
Journal of Immunology and Clinical Research
ISSN : 2333-6714
Launched : 2013
Annals of Forensic Research and Analysis
ISSN : 2378-9476
Launched : 2014
JSM Biochemistry and Molecular Biology
ISSN : 2333-7109
Launched : 2013
Annals of Breast Cancer Research
ISSN : 2641-7685
Launched : 2016
Annals of Gerontology and Geriatric Research
ISSN : 2378-9409
Launched : 2014
Journal of Sleep Medicine and Disorders
ISSN : 2379-0822
Launched : 2014
JSM Burns and Trauma
ISSN : 2475-9406
Launched : 2016
Chemical Engineering and Process Techniques
ISSN : 2333-6633
Launched : 2013
Annals of Clinical Cytology and Pathology
ISSN : 2475-9430
Launched : 2014
JSM Allergy and Asthma
ISSN : 2573-1254
Launched : 2016
Journal of Neurological Disorders and Stroke
ISSN : 2334-2307
Launched : 2013
Annals of Sports Medicine and Research
ISSN : 2379-0571
Launched : 2014
JSM Sexual Medicine
ISSN : 2578-3718
Launched : 2016
Annals of Vascular Medicine and Research
ISSN : 2378-9344
Launched : 2014
JSM Biotechnology and Biomedical Engineering
ISSN : 2333-7117
Launched : 2013
Journal of Hematology and Transfusion
ISSN : 2333-6684
Launched : 2013
JSM Environmental Science and Ecology
ISSN : 2333-7141
Launched : 2013
Journal of Cardiology and Clinical Research
ISSN : 2333-6676
Launched : 2013
JSM Nanotechnology and Nanomedicine
ISSN : 2334-1815
Launched : 2013
Journal of Ear, Nose and Throat Disorders
ISSN : 2475-9473
Launched : 2016
JSM Ophthalmology
ISSN : 2333-6447
Launched : 2013
Journal of Pharmacology and Clinical Toxicology
ISSN : 2333-7079
Launched : 2013
Annals of Psychiatry and Mental Health
ISSN : 2374-0124
Launched : 2013
Medical Journal of Obstetrics and Gynecology
ISSN : 2333-6439
Launched : 2013
Annals of Pediatrics and Child Health
ISSN : 2373-9312
Launched : 2013
JSM Clinical Pharmaceutics
ISSN : 2379-9498
Launched : 2014
JSM Foot and Ankle
ISSN : 2475-9112
Launched : 2016
JSM Alzheimer's Disease and Related Dementia
ISSN : 2378-9565
Launched : 2014
Journal of Addiction Medicine and Therapy
ISSN : 2333-665X
Launched : 2013
Journal of Veterinary Medicine and Research
ISSN : 2378-931X
Launched : 2013
Annals of Public Health and Research
ISSN : 2378-9328
Launched : 2014
Annals of Orthopedics and Rheumatology
ISSN : 2373-9290
Launched : 2013
Journal of Clinical Nephrology and Research
ISSN : 2379-0652
Launched : 2014
Annals of Community Medicine and Practice
ISSN : 2475-9465
Launched : 2014
Annals of Biometrics and Biostatistics
ISSN : 2374-0116
Launched : 2013
JSM Clinical Case Reports
ISSN : 2373-9819
Launched : 2013
Journal of Cancer Biology and Research
ISSN : 2373-9436
Launched : 2013
Journal of Surgery and Transplantation Science
ISSN : 2379-0911
Launched : 2013
Journal of Dermatology and Clinical Research
ISSN : 2373-9371
Launched : 2013
JSM Gastroenterology and Hepatology
ISSN : 2373-9487
Launched : 2013
Annals of Nursing and Practice
ISSN : 2379-9501
Launched : 2014
JSM Dentistry
ISSN : 2333-7133
Launched : 2013
Author Information X