Publication | Open Access
Multistate and functional protein design using RoseTTAFold sequence space diffusion
85
Citations
61
References
2024
Year
Structural BioinformaticsMolecular BiologyPg Design TrajectoriesMolecular DesignProtein FoldingComputational BiochemistryProtein FunctionFunctional Protein DesignDiffusion TrajectoriesProtein ModelingProtein Structure PredictionBioinformaticsProtein BioinformaticsStructural BiologyNatural SciencesComputational BiologyProtein EvolutionProtein EngineeringSystems BiologyMedicine
Protein denoising diffusion probabilistic models are used for the de novo generation of protein backbones but are limited in their ability to guide generation of proteins with sequence-specific attributes and functional properties. To overcome this limitation, we developed ProteinGenerator (PG), a sequence space diffusion model based on RoseTTAFold that simultaneously generates protein sequences and structures. Beginning from a noised sequence representation, PG generates sequence and structure pairs by iterative denoising, guided by desired sequence and structural protein attributes. We designed thermostable proteins with varying amino acid compositions and internal sequence repeats and cage bioactive peptides, such as melittin. By averaging sequence logits between diffusion trajectories with distinct structural constraints, we designed multistate parent-child protein triples in which the same sequence folds to different supersecondary structures when intact in the parent versus split into two child domains. PG design trajectories can be guided by experimental sequence-activity data, providing a general approach for integrated computational and experimental optimization of protein function.
| Year | Citations | |
|---|---|---|
Page 1
Page 1