RataFold is a conformational variant atlas for proteins. For each protein, we provide the native fold (from AlphaFold) plus a curated set of alternative conformations — the shapes the same backbone could plausibly adopt.
AlphaFold predicts a single most-likely structure per protein. RataFold extends this with a conformational neighborhood — the alternative folds, intermediate states, and lower-energy configurations that exist around the native prediction. Useful for ensemble docking, druggable intermediate identification, and structural variation analysis.
RataFold uses proprietary conformational search technology to explore the vast space of possible backbone arrangements and surface the most relevant alternatives for each protein. For each target, we report the top-ranked alternative conformations along with their relative energy and structural distance from the native fold.
The alternative conformations provided here are candidate structures intended as starting points for downstream analysis (molecular dynamics validation, docking studies, experimental investigation). RataFold does not replace AlphaFold for single-structure prediction, nor real-physics simulation tools for stability assessment.
Proteins are not static. A single 3D prediction is a useful starting point, but the same amino-acid chain in a cell adopts many shapes — opening and closing to bind partners, flexing during catalysis, occasionally misfolding. Two foundational ideas:
RataFold surfaces a curated set of low-energy alternatives that may correspond to: binding-competent states, hinge / breathing motions, folding intermediates, allosteric forms, or partially-misfolded precursors of disease states.
Each protein record at
data/proteins/<PREFIX>/<ID>.json
has the following shape. .js twins exist for file:// use.
{
id: "P01308", // UniProt accession
name: "Insulin", // UniProt description
gene, organism, taxId, length, sequence,
native: {
source: "AlphaFold v6",
plddtMean: 52.91, // mean plDDT confidence
energy: 605.88, // RataFold force-field score
ca: [[x,y,z], …], // Cα coords (length = residues)
residues: [{ // full-atom per residue
resSeq: 1,
resName: "MET",
plddt: 64.44, // per-residue confidence
atoms: { N:[x,y,z], CA:…, C:…, O:…, CB:…, … }
}, …]
},
variants: [{ // one per discovered variant
rank, energy, stability,
rmsdNative, // overall Cα-RMSD vs native
rmsdConfident, // RMSD in AF-confident regions
afVerdict: "af-confirmed" | "af-plausible" | "af-divergent",
confidentResidueCount,
category, // twin | near-native | variant | conformer | outlier
ca: [[x,y,z], …],
residues: [ … ] // same shape as native.residues
}],
stats: {
variantsDiscovered, variantsBeatingNative, energyImprovement,
energyRange: { native, best, worstKept },
searchSpaceExplored: "10^883",
categoryCounts: { twin, nearNative, variant, conformer, outlier },
compute: { searchSeconds, evaluations, evaluationsPerSecond,
acceptanceRate, exhaustiveYearsExp, universeAgesExp }
},
generatedAt, framework
}
All data is released under CC-BY 4.0. Free to use with attribution. Native structures originate from the AlphaFold Protein Structure Database; please also cite Jumper et al. (2021) and Varadi et al. (2022).