APLSuite bundles antigen-processing calculations to speed CD4+ T cell epitope prediction
This paper describes APLSuite, a new software package that combines several computations used to predict which short protein fragments (epitopes) will be presented to CD4+ T cells. The key idea is to bring together measures of a protein’s shape and stability that affect how it is cut and presented inside the body. That process — called antigen processing — influences which peptides are actually available to bind immune molecules and trigger T cells.
The authors built APLSuite from four main parts: a distributed RESTful API framework (DRAF) to run algorithms on remote servers, a Python client that makes those APIs easy to call from code, a data science tool for managing computations, and a web-based graphical user interface for people who do not write code. The suite wires together methods that feed the Antigen Processing Likelihood (APL) model. These inputs include crystallographic B-factor (a measure of local disorder in experimental structures) or AlphaFold pLDDT (a confidence score from predicted structures), solvent accessible surface area (SASA, how exposed residues are to solvent), COREX-derived hydrogen exchange protection estimates (a proxy for local stability), and sequence entropy (how variable a position is among related sequences).
At a high level, APLSuite automates the whole workflow so users do not run separate tools by hand. The framework enforces standard data types for inputs and outputs so one tool’s result can feed the next. It also adds practical features for speed and scaling: cached results identified by parameter fingerprints to avoid repeated work; a resource-aware job scheduler that routes heavy tasks (for example, COREX) to servers with GPUs or many CPUs; and WebSocket support so users can monitor jobs without repeated polling. The team reports parallelized CPU and GPU versions of COREX that reduced runtimes from hours or days to under 180 seconds in their tests. The interface accepts common inputs such as Protein Data Bank files, PDB identifiers, or AlphaFold mmCIF files and can run on desktops or cloud servers.