Machine Learning Prediction of Enzymes’ Optimal Catalytic Temperatures
Abstract
Enzymes that have been genetically engineered to withstand high temperatures are
used by industry to make products with less waste and pollution. Different features
of protein structure affect the optimal catalytic temperature ("topt") at which enzymes
catalyze reactions most efficiently. We sought to use information from protein
structures to predict the topt. To do this, we analyzed the structures and optimal
catalytic temperatures of 1379 proteins in seven different ways. For a set of analyses
based on Delaunay atomic interactions, the atoms for each protein were categorized
by their Tsai atomic group, Popelier atomic group, or amino acid, and the
nearest neighbors of each atom were then found by Delaunay triangulation. Next,
the neighbors were classified by their atomic group and their frequencies were calculated.
For a separate analysis of atomic interactions (“threshold residue atomic interactions”),
the atoms for each protein were categorized by the beta carbon of their
amino acids. Any two beta carbons within 8 Å of each other were treated as interacting. A third set
of analyses based on the frequencies of each category of atom on the protein interior
and surface was also performed. Each atom was again categorized by Tsai atomic
group, Popelier atomic group, or amino acid residue. All of the frequencies in these
seven groups were separately used as the predictor variables in regression to predict
the response variable, the optimal catalytic temperature. Four different kinds
of regression were tried: elastic net, sparse group lasso, decision tree, and support
vector. The maximum R² on the test data was 0.4, which is
similar to results in previous work by Ulfenborg (2020). We found that defining
interactions and categories in greater detail did not give better results.
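
As an illustration of the threshold residue atomic interactions described above, the following is a minimal sketch, not the thesis code. It assumes each residue is supplied as a name plus the coordinates of its beta carbon, that "within 8 Å" means a distance of at most 8 Å, and that frequencies are expressed as fractions of all contacts. The function name, the input layout, and the toy coordinates are all hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import dist


def contact_pair_frequencies(residues, cutoff=8.0):
    """Relative frequency of residue-type pairs in beta-carbon contact.

    residues: list of (residue_name, (x, y, z)) tuples, one per residue,
              where the coordinates are those of the beta carbon (assumed input).
    cutoff:   distance threshold in angstroms (8 A in the abstract).
    """
    counts = Counter()
    # Compare every unordered pair of residues once.
    for (name_a, xyz_a), (name_b, xyz_b) in combinations(residues, 2):
        if dist(xyz_a, xyz_b) <= cutoff:
            # Sort the names so ALA-LEU and LEU-ALA are the same category.
            counts[tuple(sorted((name_a, name_b)))] += 1
    total = sum(counts.values())
    # Assumption: normalize counts to fractions of all contacts.
    return {pair: n / total for pair, n in counts.items()} if total else {}


# Toy coordinates, invented for illustration only.
toy = [
    ("ALA", (0.0, 0.0, 0.0)),
    ("LEU", (5.0, 0.0, 0.0)),
    ("LEU", (0.0, 6.0, 0.0)),
    ("ASP", (30.0, 0.0, 0.0)),
]
freqs = contact_pair_frequencies(toy)
print(freqs)
```

In this toy input, the two ALA-LEU pairs and the single LEU-LEU pair fall inside the cutoff, while ASP is too far from everything to contribute. A vector of such frequencies per protein is the kind of predictor the abstract describes feeding into regression.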
Degree
Student essay
Date
2023-03-22
Author
Finlinson Porter, Camille
Keywords
enzyme
protein
amino acid
protein structure
optimal catalytic temperature
Language
eng