HOME

Researchers Develop AI Model to Decode B-Cell Receptor "Fingerprints"

May 09, 2026 | By ZHAO Hailong; ZHAO Weiwei

Researchers led by Professors GU Hongcang and ZHANG Fan at the Institute of Health and Medical Technology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, have developed BCRInsight, an advanced AI language model capable of decoding the complex "fingerprints" of B-cell receptors (BCRs).

The study, published in Briefings in Bioinformatics, introduces a phenotype-aware contrastive learning framework that leverages self-supervised learning on large-scale sequence datasets.

B-cell receptors are not only molecular keys for antigen recognition, but also record the history of B-cell activation, differentiation, and clonal evolution. However, traditional bioinformatics methods struggle to capture these complex nonlinear relationships. While single-cell sequencing provides detailed insights, its high cost limits large-scale clinical applications. In this study, BCRInsight integrates amino acid sequences with gene annotations and metadata, in a manner analogous to paired sentence encoding in natural language processing. The model is built on a 12-layer Transformer encoder with 86 million parameters.

In benchmark evaluations, BCRInsight demonstrated strong performance. It accurately deconvolved B-cell subset compositions from bulk BCR-seq data at a fraction of the cost of single-cell approaches. In antibody paratope prediction, it achieved an AUROC of 0.962, outperforming nine leading international methods in direct comparisons.

They found that the model exhibited emergent structural awareness. Even without any 3D structural training, BCRInsight’s attention mechanisms consistently focused on critical HCDR3 loops, which are central to antigen binding, as well as other structurally important regions. This indicates that the model can infer structural and functional features directly from sequence and metadata alone, representing a significant breakthrough in predictive immunology.

BCRInsight offers a cost-effective tool for decoding immune repertoires and enables large-scale patient analysis. By identifying key paratope regions and functional patterns, it may also support antibody design, therapeutic optimization, and the development of personalized vaccines and immunotherapies, according to the team.

BCRInsight Model Framework (Image by ZHAO Hailong)


Attachments Download:
Contact

Reference
Related Articles
Copyright © Hefei Institutes of Physical Science, CAS All Rights Reserved
Record number: 皖ICP备05001008号