Protein quadratic indices of the "macromolecular pseudograph's α-carbon atom adjacency matrix" : 1. Prediction of arc repressor alanine-mutant's stability

This report describes a new set of macromolecular descriptors of relevance to protein QSAR/QSPR studies, protein's quadratic indices. These descriptors are calculated from the macromolecular pseudograph's α-carbon atom adjacency matrix. A study of the protein stability effects for a comple...

Descripción completa

Detalles Bibliográficos
Autores principales: Marrero Ponce, Yovani, Medina Marrero, Ricardo, Castro, Eduardo Alberto, Ramos de Armas, Ronal, González Díaz, Humberto, Romero Zaldívar, Vicente, Torrens, Francisco
Formato: Articulo
Lenguaje:inglés
Publicado: 2004
Materias:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/35111
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1862014487485218816
author Marrero Ponce, Yovani
Medina Marrero, Ricardo
Castro, Eduardo Alberto
Ramos de Armas, Ronal
González Díaz, Humberto
Romero Zaldívar, Vicente
Torrens, Francisco
author_facet Marrero Ponce, Yovani
Medina Marrero, Ricardo
Castro, Eduardo Alberto
Ramos de Armas, Ronal
González Díaz, Humberto
Romero Zaldívar, Vicente
Torrens, Francisco
author_sort Marrero Ponce, Yovani
collection Repositorio Sedici - Comunidad
description This report describes a new set of macromolecular descriptors of relevance to protein QSAR/QSPR studies, protein's quadratic indices. These descriptors are calculated from the macromolecular pseudograph's α-carbon atom adjacency matrix. A study of the protein stability effects for a complete set of alanine substitutions in Arc repressor illustrates this approach. Quantitative Structure-Stability Relationship (QSSR) models allow discriminating between near wild-type stability and reduced-stability A-mutants. A linear discriminant function gives rise to excellent discrimination between 85.4% (35/41) and 91.67% (11/12) of near wild-type stability/reduced stability mutants in training and test series, respectively. The model's overall predictability oscillates from 80.49 until 82.93, when n varies from 2 to 10 in leave-n-out cross validation procedures. This value stabilizes around 80.49% when n was > 6. Additionally, canonical regression analysis corroborates the statistical quality of the classification model (Rcanc = 0.72, p-level <0.0001). This analysis was also used to compute biological stability canonical scores for each Arc A-mutant. On the other hand, nonlinear piecewise regression model compares favorably with respect to linear regression one on predicting the melting temperature (t m) of the Arc A-mutants. The linear model explains almost 72% of the variance of the experimental tm (R = 0.85 and s = 5.64) and LOO press statistics evidenced its predictive ability (q2 = 0.55 and s cv = 6.24). However, this linear regression model falls to resolve tm predictions of Arc A-mutants in external prediction series. Therefore, the use of nonlinear piecewise models was required. The tm values of A-mutants in training (R = 0.94) and test (R = 0.91) sets are calculated by piecewise model with a high degree of precision. A break-point value of 51.32°C characterizes two mutants' clusters and coincides perfectly with the experimental scale. For this reason, we can use the linear discriminant analysis and piecewise models in combination to classify and predict the stability of the mutants' Arc homodimers. These models also permit the interpretation of the driving forces of such a folding process. The models include protein's quadratic indices accounting for hydrophobic (z1), bulk-steric (z2), and electronic (z3) features of the studied molecules. Preponderance of z1 and z3 over z 2 indicates the higher importance of the hydrophobic and electronic side chain terms in the folding of the Arc dimer. In this sense, developed equations involve short-reaching (k ≤ 3), middle- reaching (3 < k ≤ 7) and far-reaching (k = 8 or greater) z1, 2, 3-protein's quadratic indices. This situation points to topologic/topographic protein's backbone interactions control of the stability profile of wild-type Arc and its A-mutants. Consequently, the present approach represents a novel and very promising way to mathematical research in biology sciences.
format Articulo
id I89-R6-10915-35111
institution UNLP - Facultad de Arquitectura y Urbanismo
institution_str I-89
language English
publishDate 2004
record_format dspace
repository_str R-6
spelling I89-R6-10915-351112026-02-24T17:08:55Z http://sedici.unlp.edu.ar/handle/10915/35111 Protein quadratic indices of the "macromolecular pseudograph's α-carbon atom adjacency matrix" : 1. Prediction of arc repressor alanine-mutant's stability Marrero Ponce, Yovani Medina Marrero, Ricardo Castro, Eduardo Alberto Ramos de Armas, Ronal González Díaz, Humberto Romero Zaldívar, Vicente Torrens, Francisco 2004-12 2014-05-05T18:36:25Z en Ciencias Exactas Química Farmacia alanine-substitution mutant protein arc repressor amino acid substitution macromolecule protein quadratic indices mutant protein stability quantitative structure activity relation QSPR TOMOCOMD software alanine dimerization stereoisomerism This report describes a new set of macromolecular descriptors of relevance to protein QSAR/QSPR studies, protein's quadratic indices. These descriptors are calculated from the macromolecular pseudograph's α-carbon atom adjacency matrix. A study of the protein stability effects for a complete set of alanine substitutions in Arc repressor illustrates this approach. Quantitative Structure-Stability Relationship (QSSR) models allow discriminating between near wild-type stability and reduced-stability A-mutants. A linear discriminant function gives rise to excellent discrimination between 85.4% (35/41) and 91.67% (11/12) of near wild-type stability/reduced stability mutants in training and test series, respectively. The model's overall predictability oscillates from 80.49 until 82.93, when n varies from 2 to 10 in leave-n-out cross validation procedures. This value stabilizes around 80.49% when n was > 6. Additionally, canonical regression analysis corroborates the statistical quality of the classification model (Rcanc = 0.72, p-level <0.0001). This analysis was also used to compute biological stability canonical scores for each Arc A-mutant. On the other hand, nonlinear piecewise regression model compares favorably with respect to linear regression one on predicting the melting temperature (t m) of the Arc A-mutants. The linear model explains almost 72% of the variance of the experimental tm (R = 0.85 and s = 5.64) and LOO press statistics evidenced its predictive ability (q2 = 0.55 and s cv = 6.24). However, this linear regression model falls to resolve tm predictions of Arc A-mutants in external prediction series. Therefore, the use of nonlinear piecewise models was required. The tm values of A-mutants in training (R = 0.94) and test (R = 0.91) sets are calculated by piecewise model with a high degree of precision. A break-point value of 51.32°C characterizes two mutants' clusters and coincides perfectly with the experimental scale. For this reason, we can use the linear discriminant analysis and piecewise models in combination to classify and predict the stability of the mutants' Arc homodimers. These models also permit the interpretation of the driving forces of such a folding process. The models include protein's quadratic indices accounting for hydrophobic (z1), bulk-steric (z2), and electronic (z3) features of the studied molecules. Preponderance of z1 and z3 over z 2 indicates the higher importance of the hydrophobic and electronic side chain terms in the folding of the Arc dimer. In this sense, developed equations involve short-reaching (k ≤ 3), middle- reaching (3 < k ≤ 7) and far-reaching (k = 8 or greater) z1, 2, 3-protein's quadratic indices. This situation points to topologic/topographic protein's backbone interactions control of the stability profile of wild-type Arc and its A-mutants. Consequently, the present approach represents a novel and very promising way to mathematical research in biology sciences. Instituto de Investigaciones Fisicoquímicas Teóricas y Aplicadas (INIFTA) Articulo Articulo http://creativecommons.org/licenses/by/3.0/ Creative Commons Attribution 3.0 Unported (CC BY 3.0) application/pdf 1124-1147
spellingShingle Ciencias Exactas
Química
Farmacia
alanine-substitution mutant
protein
arc repressor
amino acid substitution
macromolecule
protein quadratic indices
mutant
protein stability
quantitative structure activity relation
QSPR
TOMOCOMD software
alanine
dimerization
stereoisomerism
Marrero Ponce, Yovani
Medina Marrero, Ricardo
Castro, Eduardo Alberto
Ramos de Armas, Ronal
González Díaz, Humberto
Romero Zaldívar, Vicente
Torrens, Francisco
Protein quadratic indices of the "macromolecular pseudograph's α-carbon atom adjacency matrix" : 1. Prediction of arc repressor alanine-mutant's stability
title Protein quadratic indices of the "macromolecular pseudograph's α-carbon atom adjacency matrix" : 1. Prediction of arc repressor alanine-mutant's stability
title_full Protein quadratic indices of the "macromolecular pseudograph's α-carbon atom adjacency matrix" : 1. Prediction of arc repressor alanine-mutant's stability
title_fullStr Protein quadratic indices of the "macromolecular pseudograph's α-carbon atom adjacency matrix" : 1. Prediction of arc repressor alanine-mutant's stability
title_full_unstemmed Protein quadratic indices of the "macromolecular pseudograph's α-carbon atom adjacency matrix" : 1. Prediction of arc repressor alanine-mutant's stability
title_short Protein quadratic indices of the "macromolecular pseudograph's α-carbon atom adjacency matrix" : 1. Prediction of arc repressor alanine-mutant's stability
title_sort protein quadratic indices of the macromolecular pseudograph s α carbon atom adjacency matrix 1 prediction of arc repressor alanine mutant s stability
topic Ciencias Exactas
Química
Farmacia
alanine-substitution mutant
protein
arc repressor
amino acid substitution
macromolecule
protein quadratic indices
mutant
protein stability
quantitative structure activity relation
QSPR
TOMOCOMD software
alanine
dimerization
stereoisomerism
url http://sedici.unlp.edu.ar/handle/10915/35111
work_keys_str_mv AT marreroponceyovani proteinquadraticindicesofthemacromolecularpseudographsacarbonatomadjacencymatrix1predictionofarcrepressoralaninemutantsstability
AT medinamarreroricardo proteinquadraticindicesofthemacromolecularpseudographsacarbonatomadjacencymatrix1predictionofarcrepressoralaninemutantsstability
AT castroeduardoalberto proteinquadraticindicesofthemacromolecularpseudographsacarbonatomadjacencymatrix1predictionofarcrepressoralaninemutantsstability
AT ramosdearmasronal proteinquadraticindicesofthemacromolecularpseudographsacarbonatomadjacencymatrix1predictionofarcrepressoralaninemutantsstability
AT gonzalezdiazhumberto proteinquadraticindicesofthemacromolecularpseudographsacarbonatomadjacencymatrix1predictionofarcrepressoralaninemutantsstability
AT romerozaldivarvicente proteinquadraticindicesofthemacromolecularpseudographsacarbonatomadjacencymatrix1predictionofarcrepressoralaninemutantsstability
AT torrensfrancisco proteinquadraticindicesofthemacromolecularpseudographsacarbonatomadjacencymatrix1predictionofarcrepressoralaninemutantsstability