ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION

Enzymes are organic compounds that can accelerate biochemical reactions (catalysts). The structure of enzymes consists of two main parts: coenzymes that are active in the enzyme's catalytic function, and apoenzymes that encompass proteins in the form of amino acid chains or sequence. In industr...

Full description

Saved in:
Bibliographic Details
Main Author: Dira Kurnia, Muhammad
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/81548
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:81548
spelling id-itb.:815482024-07-01T08:01:32ZESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION Dira Kurnia, Muhammad Indonesia Final Project Enzyme Melting Point Estimation, Amino Acid Sequences, Physicochemical Properties, Multiple Linear Regression, Sequential Method. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/81548 Enzymes are organic compounds that can accelerate biochemical reactions (catalysts). The structure of enzymes consists of two main parts: coenzymes that are active in the enzyme's catalytic function, and apoenzymes that encompass proteins in the form of amino acid chains or sequence. In industry, the use of enzymes is crucial for improving the quality of food products and cleaning agents. However, enzyme performance can be limited due to factors such as environmental temperatures that do not match the enzyme's characteristics. To address this issue, the development of identification or estimation systems for enzyme melting points or optimal temperatures based on physicochemical properties is necessary. Statistical methods such as Multiple Linear Regression can be used to predict enzyme melting points based on the physicochemical properties of amino acid sequences. Additionally, a method called Sequential Method with three different approaches can assist in evaluating significant physicochemical properties in estimating enzyme melting points. This method can be applied to data obtained from Novozymes, a global biotechnology company from Denmark, through Kaggle.com, which provides information about amino acid sequences and enzyme melting points. Based on experiments, preprocessed data or nummerized amino acid sequence based on physicochemical properties and subsequently, the sequential method yielded a multiple linear regression model with a combination of several predictor variables built from data without outlier residuals from the previous model. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Enzymes are organic compounds that can accelerate biochemical reactions (catalysts). The structure of enzymes consists of two main parts: coenzymes that are active in the enzyme's catalytic function, and apoenzymes that encompass proteins in the form of amino acid chains or sequence. In industry, the use of enzymes is crucial for improving the quality of food products and cleaning agents. However, enzyme performance can be limited due to factors such as environmental temperatures that do not match the enzyme's characteristics. To address this issue, the development of identification or estimation systems for enzyme melting points or optimal temperatures based on physicochemical properties is necessary. Statistical methods such as Multiple Linear Regression can be used to predict enzyme melting points based on the physicochemical properties of amino acid sequences. Additionally, a method called Sequential Method with three different approaches can assist in evaluating significant physicochemical properties in estimating enzyme melting points. This method can be applied to data obtained from Novozymes, a global biotechnology company from Denmark, through Kaggle.com, which provides information about amino acid sequences and enzyme melting points. Based on experiments, preprocessed data or nummerized amino acid sequence based on physicochemical properties and subsequently, the sequential method yielded a multiple linear regression model with a combination of several predictor variables built from data without outlier residuals from the previous model.
format Final Project
author Dira Kurnia, Muhammad
spellingShingle Dira Kurnia, Muhammad
ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION
author_facet Dira Kurnia, Muhammad
author_sort Dira Kurnia, Muhammad
title ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION
title_short ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION
title_full ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION
title_fullStr ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION
title_full_unstemmed ESTIMATING ENZYME MELTING POINT BASED ON ENZYME AMINO ACID SEQUENCE USING MULTIPLE LINEAR REGRESSION
title_sort estimating enzyme melting point based on enzyme amino acid sequence using multiple linear regression
url https://digilib.itb.ac.id/gdl/view/81548
_version_ 1822997358518992896