Show simple item record

dc.contributor.authorFinlinson Porter, Camille
dc.date.accessioned2023-03-22T14:07:42Z
dc.date.available2023-03-22T14:07:42Z
dc.date.issued2023-03-22
dc.identifier.urihttps://hdl.handle.net/2077/75679
dc.description.abstractEnzymes that have been genetically engineered to withstand high temperatures are used by industry to make products with less waste and pollution. Different features of protein structure affect the optimal catalytic temperature ("topt") at which enzymes catalyze reactions most efficiently. We sought to use information from protein structures to predict the topt. To do this, we analyzed the structures and optimal catalytic temperatures of 1379 proteins in 7 different ways. For a set of analyses based on Delaunay atomic interactions, the atoms for each protein were categorized by their Tsai atomic group, Popelier atomic group, or by their amino acid, and the nearest neighbors of each atom were then found by Delaunay triangulation. Next, the neighbors were classified by their atomic group and their frequencies calculated. For a separate analysis of atomic interactions (“threshold residue atomic interactions”), the atoms for each protein were categorized by the beta carbon of their amino acids. Any beta carbons within 8Å were found to be interacting. A third set of analyses based on the frequencies of each category of atom on the protein interior and surface was also performed. Each atom was again categorized by Tsai atomic group, Popelier atomic group, or amino acid residue. All of the frequencies in these seven groups were separately used as the predictor variables in regression to predict the response variable, the optimal catalytic temperature. Four different kinds of regression were tried: elastic net, sparse group lasso, decision tree, and support vector. The predictions had maximum testing R2 values of 0.4. These results are similar to results in previous work done by Ulfenborg 2020. We found that being very detailed in defining interactions and categories did not give better results.en_US
dc.language.isoengen_US
dc.subjectenzymeen_US
dc.subjectproteinen_US
dc.subjectamino aciden_US
dc.subjectprotein structureen_US
dc.subjectoptimal catalytic temperatureen_US
dc.titleMachine Learning Prediction of Enzymes’ Optimal Catalytic Temperaturesen_US
dc.typetext
dc.setspec.uppsokTechnology
dc.type.uppsokH2
dc.contributor.departmentGöteborgs universitet/Institutionen för data- och informationsteknikswe
dc.contributor.departmentUniversity of Gothenburg/Department of Computer Science and Engineeringeng
dc.type.degreeStudent essay


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record