Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications

Large enzyme families such as the groups of zinc-dependent alcohol dehydrogenases (ADHs), long chain alcohol oxidases (AOxs) or amine dehydrogenases (AmDHs) with, sometimes, more than one million sequences in the non-redundant protein database and hundreds of experimentally characterized enzymes are...

Full description

Saved in:

Bibliographic Details
Main Authors:	Sirota, Fernanda L., Maurer-Stroh, Sebastian, Li, Zhi, Eisenhaber, Frank, Eisenhaber, Birgit
Other Authors:	School of Biological Sciences
Format:	Article
Language:	English
Published:	2022
Subjects:	Science::Biological sciences Zinc-Dependent Alcohol Dehydrogenase Sequence Alignment Conflict
Online Access:	https://hdl.handle.net/10356/154026
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-154026
record_format	dspace
spelling	sg-ntu-dr.10356-1540262023-02-28T17:11:06Z Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications Sirota, Fernanda L. Maurer-Stroh, Sebastian Li, Zhi Eisenhaber, Frank Eisenhaber, Birgit School of Biological Sciences Bioinformatics Institute, ASTAR Genome Institute of Singapore, ASTAR Science::Biological sciences Zinc-Dependent Alcohol Dehydrogenase Sequence Alignment Conflict Large enzyme families such as the groups of zinc-dependent alcohol dehydrogenases (ADHs), long chain alcohol oxidases (AOxs) or amine dehydrogenases (AmDHs) with, sometimes, more than one million sequences in the non-redundant protein database and hundreds of experimentally characterized enzymes are excellent cases for protein engineering efforts aimed at refining and modifying substrate specificity. Yet, the backside of this wealth of information is that it becomes technically difficult to rationally select optimal sequence targets as well as sequence positions for mutagenesis studies. In all three cases, we approach the problem by starting with a group of experimentally well studied family members (including those with available 3D structures) and creating a structure-guided multiple sequence alignment and a modified phylogenetic tree (aka binding site tree) based just on a selection of potential substrate binding residue positions derived from experimental information (not from the full-length sequence alignment). Hereupon, the remaining, mostly uncharacterized enzyme sequences can be mapped; as a trend, sequence grouping in the tree branches follows substrate specificity. We show that this information can be used in the target selection for protein engineering work to narrow down to single suitable sequences and just a few relevant candidate positions for directed evolution towards activity for desired organic compound substrates. We also demonstrate how to find the closest thermophile example in the dataset if the engineering is aimed at achieving most robust enzymes. National Research Foundation (NRF) Published version This research was supported by the National Research Foundation (NRF), Singapore, through a Competitive Research Programme (CRP) (Project ID NRF-CRP17-2017-03) to ZL and BE. 2022-05-24T05:33:34Z 2022-05-24T05:33:34Z 2021 Journal Article Sirota, F. L., Maurer-Stroh, S., Li, Z., Eisenhaber, F. & Eisenhaber, B. (2021). Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications. Frontiers in Bioengineering and Biotechnology, 9, 701120-. https://dx.doi.org/10.3389/fbioe.2021.701120 2296-4185 https://hdl.handle.net/10356/154026 10.3389/fbioe.2021.701120 34409021 2-s2.0-85112709777 9 701120 en NRF-CRP17-2017-03 Frontiers in Bioengineering and Biotechnology © 2021 Sirota, Maurer-Stroh, Li, Eisenhaber and Eisenhaber. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Science::Biological sciences Zinc-Dependent Alcohol Dehydrogenase Sequence Alignment Conflict
spellingShingle	Science::Biological sciences Zinc-Dependent Alcohol Dehydrogenase Sequence Alignment Conflict Sirota, Fernanda L. Maurer-Stroh, Sebastian Li, Zhi Eisenhaber, Frank Eisenhaber, Birgit Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications
description	Large enzyme families such as the groups of zinc-dependent alcohol dehydrogenases (ADHs), long chain alcohol oxidases (AOxs) or amine dehydrogenases (AmDHs) with, sometimes, more than one million sequences in the non-redundant protein database and hundreds of experimentally characterized enzymes are excellent cases for protein engineering efforts aimed at refining and modifying substrate specificity. Yet, the backside of this wealth of information is that it becomes technically difficult to rationally select optimal sequence targets as well as sequence positions for mutagenesis studies. In all three cases, we approach the problem by starting with a group of experimentally well studied family members (including those with available 3D structures) and creating a structure-guided multiple sequence alignment and a modified phylogenetic tree (aka binding site tree) based just on a selection of potential substrate binding residue positions derived from experimental information (not from the full-length sequence alignment). Hereupon, the remaining, mostly uncharacterized enzyme sequences can be mapped; as a trend, sequence grouping in the tree branches follows substrate specificity. We show that this information can be used in the target selection for protein engineering work to narrow down to single suitable sequences and just a few relevant candidate positions for directed evolution towards activity for desired organic compound substrates. We also demonstrate how to find the closest thermophile example in the dataset if the engineering is aimed at achieving most robust enzymes.
author2	School of Biological Sciences
author_facet	School of Biological Sciences Sirota, Fernanda L. Maurer-Stroh, Sebastian Li, Zhi Eisenhaber, Frank Eisenhaber, Birgit
format	Article
author	Sirota, Fernanda L. Maurer-Stroh, Sebastian Li, Zhi Eisenhaber, Frank Eisenhaber, Birgit
author_sort	Sirota, Fernanda L.
title	Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications
title_short	Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications
title_full	Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications
title_fullStr	Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications
title_full_unstemmed	Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications
title_sort	functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications
publishDate	2022
url	https://hdl.handle.net/10356/154026
_version_	1759857162199236608

Functional classification of super-large families of enzymes based on substrate binding pocket residues for biocatalysis and enzyme engineering applications

Similar Items