ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION
Enzymes are organic compounds that can accelerate biochemical reactions (catalysts). The structure of enzymes consists of two main parts: coenzymes that are active in the enzyme's catalytic function, and apoenzymes that encompass proteins in the form of amino acid chains or sequence. In industr...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/81548 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:81548 |
---|---|
spelling |
id-itb.:815482024-07-01T08:01:32ZESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION Dira Kurnia, Muhammad Indonesia Final Project Enzyme Melting Point Estimation, Amino Acid Sequences, Physicochemical Properties, Multiple Linear Regression, Sequential Method. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/81548 Enzymes are organic compounds that can accelerate biochemical reactions (catalysts). The structure of enzymes consists of two main parts: coenzymes that are active in the enzyme's catalytic function, and apoenzymes that encompass proteins in the form of amino acid chains or sequence. In industry, the use of enzymes is crucial for improving the quality of food products and cleaning agents. However, enzyme performance can be limited due to factors such as environmental temperatures that do not match the enzyme's characteristics. To address this issue, the development of identification or estimation systems for enzyme melting points or optimal temperatures based on physicochemical properties is necessary. Statistical methods such as Multiple Linear Regression can be used to predict enzyme melting points based on the physicochemical properties of amino acid sequences. Additionally, a method called Sequential Method with three different approaches can assist in evaluating significant physicochemical properties in estimating enzyme melting points. This method can be applied to data obtained from Novozymes, a global biotechnology company from Denmark, through Kaggle.com, which provides information about amino acid sequences and enzyme melting points. Based on experiments, preprocessed data or nummerized amino acid sequence based on physicochemical properties and subsequently, the sequential method yielded a multiple linear regression model with a combination of several predictor variables built from data without outlier residuals from the previous model. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Enzymes are organic compounds that can accelerate biochemical reactions (catalysts). The structure of enzymes consists of two main parts: coenzymes that are active in the enzyme's catalytic function, and apoenzymes that encompass proteins in the form of amino acid chains or sequence. In industry, the use of enzymes is crucial for improving the quality of food products and cleaning agents. However, enzyme performance can be limited due to factors such as environmental temperatures that do not match the enzyme's characteristics. To address this issue, the development of identification or estimation systems for enzyme melting points or optimal temperatures based on physicochemical properties is necessary.
Statistical methods such as Multiple Linear Regression can be used to predict enzyme melting points based on the physicochemical properties of amino acid sequences. Additionally, a method called Sequential Method with three different approaches can assist in evaluating significant physicochemical properties in estimating enzyme melting points. This method can be applied to data obtained from Novozymes, a global biotechnology company from Denmark, through Kaggle.com, which provides information about amino acid sequences and enzyme melting points.
Based on experiments, preprocessed data or nummerized amino acid sequence based on physicochemical properties and subsequently, the sequential method yielded a multiple linear regression model with a combination of several predictor variables built from data without outlier residuals from the previous model. |
format |
Final Project |
author |
Dira Kurnia, Muhammad |
spellingShingle |
Dira Kurnia, Muhammad ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION |
author_facet |
Dira Kurnia, Muhammad |
author_sort |
Dira Kurnia, Muhammad |
title |
ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION |
title_short |
ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION |
title_full |
ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION |
title_fullStr |
ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION |
title_full_unstemmed |
ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION |
title_sort |
estimating enzyme melting point based on enzyme amino acid sequence using multiple linear regression |
url |
https://digilib.itb.ac.id/gdl/view/81548 |
_version_ |
1822997358518992896 |