Multistate and functional protein design using RoseTTAFold sequence space diffusion

Protein denoising diffusion probabilistic models are used for the de novo generation of protein backbones but are limited in their ability to guide generation of proteins with sequence-specific attributes and functional properties. To overcome this limitation, we developed ProteinGenerator (PG), a s...

Full description

Saved in:
Bibliographic Details
Main Authors: Lisanza, Sidney (Author) , Gershon, Jacob Merle (Author) , Tipps, Samuel W. K. (Author) , Sims, Jeremiah Nelson (Author) , Arnoldt, Lucas (Author) , Hendel, Samuel J. (Author) , Simma, Miriam K. (Author) , Liu, Ge (Author) , Yase, Muna (Author) , Wu, Hongwei (Author) , Tharp, Claire D. (Author) , Li, Xinting (Author) , Kang, Alex (Author) , Brackenbrough, Evans (Author) , Bera, Asim K. (Author) , Gerben, Stacey (Author) , Wittmann, Bruce J. (Author) , McShan, Andrew C. (Author) , Baker, David (Author)
Format: Article (Journal)
Language:English
Published: 25 September 2024
In: Nature biotechnology
Year: 2024, Pages: 1-11
ISSN:1546-1696
DOI:10.1038/s41587-024-02395-w
Online Access:Verlag, kostenfrei, Volltext: https://doi.org/10.1038/s41587-024-02395-w
Verlag, kostenfrei, Volltext: https://www.nature.com/articles/s41587-024-02395-w
Get full text
Author Notes:Sidney Lyayuga Lisanza, Jacob Merle Gershon, Samuel W.K. Tipps, Jeremiah Nelson Sims, Lucas Arnoldt, Samuel J. Hendel, Miriam K. Simma, Ge Liu, Muna Yase, Hongwei Wu, Claire D. Tharp, Xinting Li, Alex Kang, Evans Brackenbrough, Asim K. Bera, Stacey Gerben, Bruce J. Wittmann, Andrew C. McShan & David Baker
Description
Summary:Protein denoising diffusion probabilistic models are used for the de novo generation of protein backbones but are limited in their ability to guide generation of proteins with sequence-specific attributes and functional properties. To overcome this limitation, we developed ProteinGenerator (PG), a sequence space diffusion model based on RoseTTAFold that simultaneously generates protein sequences and structures. Beginning from a noised sequence representation, PG generates sequence and structure pairs by iterative denoising, guided by desired sequence and structural protein attributes. We designed thermostable proteins with varying amino acid compositions and internal sequence repeats and cage bioactive peptides, such as melittin. By averaging sequence logits between diffusion trajectories with distinct structural constraints, we designed multistate parent-child protein triples in which the same sequence folds to different supersecondary structures when intact in the parent versus split into two child domains. PG design trajectories can be guided by experimental sequence-activity data, providing a general approach for integrated computational and experimental optimization of protein function.
Item Description:Gesehen am 14.04.2025
Physical Description:Online Resource
ISSN:1546-1696
DOI:10.1038/s41587-024-02395-w