Show simple item record

dc.contributor.authorFinlinson Porter, Camille
dc.date.accessioned2023-03-22T14:07:42Z
dc.date.available2023-03-22T14:07:42Z
dc.date.issued2023-03-22
dc.identifier.urihttps://hdl.handle.net/2077/75679
dc.description.abstractEnzymes that have been genetically engineered to withstand high temperatures are used by industry to make products with less waste and pollution. Different features of protein structure affect the optimal catalytic temperature ("topt") at which enzymes catalyze reactions most efficiently. We sought to use information from protein structures to predict the topt. To do this, we analyzed the structures and optimal catalytic temperatures of 1379 proteins in 7 different ways. For a set of analyses based on Delaunay atomic interactions, the atoms for each protein were categorized by their Tsai atomic group, Popelier atomic group, or by their amino acid, and the nearest neighbors of each atom were then found by Delaunay triangulation. Next, the neighbors were classified by their atomic group and their frequencies calculated. For a separate analysis of atomic interactions (“threshold residue atomic interactions”), the atoms for each protein were categorized by the beta carbon of their amino acids. Any beta carbons within 8Å were found to be interacting. A third set of analyses based on the frequencies of each category of atom on the protein interior and surface was also performed. Each atom was again categorized by Tsai atomic group, Popelier atomic group, or amino acid residue. All of the frequencies in these seven groups were separately used as the predictor variables in regression to predict the response variable, the optimal catalytic temperature. Four different kinds of regression were tried: elastic net, sparse group lasso, decision tree, and support vector. The predictions had maximum testing R2 values of 0.4. These results are similar to results in previous work done by Ulfenborg 2020. We found that being very detailed in defining interactions and categories did not give better results.en
dc.language.isoengen
dc.subjectenzymeen
dc.subjectproteinen
dc.subjectamino aciden
dc.subjectprotein structureen
dc.subjectoptimal catalytic temperatureen
dc.titleMachine Learning Prediction of Enzymes’ Optimal Catalytic Temperaturesen
dc.typetext
dc.setspec.uppsokTechnology
dc.type.uppsokH2
dc.contributor.departmentGöteborgs universitet/Institutionen för data- och informationsteknikswe
dc.contributor.departmentUniversity of Gothenburg/Department of Computer Science and Engineeringeng
dc.type.degreeStudent essay


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record