Categorizing prediction modes within low-pLDDT regions of AlphaFold2 structures

This article has 7 evaluations Published on
Read the full article Related papers
This article on Sciety

Abstract

AlphaFold2 protein structure predictions are widely available for structural biology uses. These predictions, especially for eukaryotic proteins, frequently contain extensive regions predicted below the pLDDT 70 level, the rule-of-thumb cutoff for high confidence. This work identifies major modes of behavior within low-pLDDT regions through a survey of human proteome predictions provided by the AlphaFold Protein Structure Database. Thenear-predictivemode resembles folded protein and can be a nearly accurate prediction.Barbed wireis extremely unproteinlike, being recognized by wide looping coils, an absence of packing contacts, and numerous signature validation outliers, and it likely represents a nonpredicted region.Pseudostructurepresents an intermediate behavior with a misleading appearance of isolated and badly formed secondary structure-like elements. These prediction modes are compared with annotations of disorder from MobiDB, showing general correlation betweenbarbed wire/pseudostructureand many measures of disorder, an association betweenpseudostructureand signal peptides, and an association betweennear-predictiveand regions of conditional folding. To enable users to identify these regions within a prediction, a new Phenix tool is developed encompassing the results of this work, including prediction annotation, visual markup, and residue selection based on these prediction modes. This tool will help users develop expertise in interpreting difficult AlphaFold predictions and identify the near-predictive regions that can aid in molecular replacement when a prediction does not contain enough high-pLDDT regions.

Related articles

Related articles are currently not available for this article.