TY - GEN
T1 - Generating Immune-aware SARS-CoV-2 Spike Proteins for Universal Vaccine Design
AU - Phillips, Dominic
AU - Gasser, Hans-Christof
AU - Kamp, Sebestyén
AU - Pałkowski, Aleksander
AU - Rabalski, Lukasz
AU - Oyarzún, Diego A.
AU - Rajan, Ajitha
AU - Alfaro, Javier Antonio
PY - 2022/7/22
Y1 - 2022/7/22
N2 - Dozens of SARS-CoV-2 vaccines have been approved for public use, yet there remains a risk that the virus evolves to escape vaccine protection. This motivates the development of universal vaccines capable of protecting against current and potentially new strains of the virus. A key challenge is the lack of computational tools to design new viral proteins capable of vaccine escape, which could serve as good targets for the development of universal vaccines. Here, we designed VAE capable of generating SARS-CoV-2 spike proteins with variable immune visibility to the cell-mediated immune response. We compared our model with two simpler generative models; a random-mutator and an 11-gram language model. All three models can generate stable, structurally valid sequences, yet only the VAE model can generate low immunogenicity sequences that interpolate smoothly along the principal variance directions of known natural sequences. This model provides an effective computational tool for the generation of spike protein sequences useful for universal vaccine design. We provide its source code at https://github.com/hcgasser/SpikeVAE.
AB - Dozens of SARS-CoV-2 vaccines have been approved for public use, yet there remains a risk that the virus evolves to escape vaccine protection. This motivates the development of universal vaccines capable of protecting against current and potentially new strains of the virus. A key challenge is the lack of computational tools to design new viral proteins capable of vaccine escape, which could serve as good targets for the development of universal vaccines. Here, we designed VAE capable of generating SARS-CoV-2 spike proteins with variable immune visibility to the cell-mediated immune response. We compared our model with two simpler generative models; a random-mutator and an 11-gram language model. All three models can generate stable, structurally valid sequences, yet only the VAE model can generate low immunogenicity sequences that interpolate smoothly along the principal variance directions of known natural sequences. This model provides an effective computational tool for the generation of spike protein sequences useful for universal vaccine design. We provide its source code at https://github.com/hcgasser/SpikeVAE.
M3 - Conference contribution
VL - 184
T3 - Proceedings of Machine Learning Research
SP - 100
EP - 116
BT - Proceedings of the 1st Workshop on Healthcare AI and COVID-19, ICML 2022
A2 - Xu, Peng
A2 - Zhu, Tingting
A2 - Zhu, Pengkai
A2 - Clifton, David A.
A2 - Belgrave, Danielle
A2 - Zhang, Yuanting
PB - PMLR
T2 - The 1st Workshop on Healthcare AI and COVID-19
Y2 - 22 July 2022 through 22 July 2022
ER -