AbAgym: a well-curated dataset for the mutational analysis of antibody-antigen complexes

This article has 0 evaluations Published on
Read the full article Related papers
This article on Sciety

Abstract

With monoclonal antibodies becoming one of the largest classes of biopharmaceuticals, it is important to have curated data to train computational models that can accelerate their design. Despite the massive amount of mutagenesis data generated on antibody-antigen interactions, only a few small, well-curated datasets are available. This paper introduces AbAgym, a manually curated dataset comprising approximately 335k mutations in antibody-antigen complexes, including one tenth of interface mutations, whose effects on antibody-antigen binding have been experimentally quantified through deep mutational scanning (DMS) experiments. We collected and curated 67 DMS datasets from the literature together with the three-dimensional structure of each antibody-antigen complex. We benchmarked the performance of established force field methods as well as recent machine learning models that predict the change in binding affinity upon mutation. The former achieved modest performance, whereas the latter performed only marginally better than random. Finally, our analysis of hotspot residues responsible for immune evasion highlights the importance of accounting for biological complexities, such as conformational changes or oligomeric states that influence antibody-antigen binding, which are often overlooked. Abagym is freely available for academic use at github.com/3BioCompBio/Abagym.

Related articles

Related articles are currently not available for this article.