Research Highlights

My research focuses on developing machine learning and large language model-based methods for biological discovery, with an emphasis on antibody-antigen interaction modeling, antibody design and optimization, and structure-aware prediction of binding and neutralization. I also build data-efficient and interpretable AI frameworks for biomedical applications, including antiviral drug screening and medical image analysis, bridging computational modeling with real-world therapeutic challenges.

2025

J. Chem. Inf. Model. · 2025
Predicting Antibody-Antigen Interactions with Structure-Aware LLMs: Insights from SARS-CoV-2 Variants
DOI: 10.1021/acs.jcim.5c00973 ↗
Proposed a structure-aware LLM framework that is the first to jointly predict antibody binding and neutralization for SARS-CoV-2, consistently outperforming prior classifiers, especially on closely related viral variants. We further showed that refining ESM-2 with target-specific antibody sequences and structural information boosts performance substantially, improving binding AUC from 0.84 to 0.93 and neutralization AUC from 0.88 to 0.95, establishing a strong, generalizable baseline for antibody–antigen interaction modeling.
Web-based Research Platform · 2025
SPICE: Structural Protein Interaction Complex Evaluator
Faisal Bin Ashraf, Stefano Lonardi
Webapp: spice.cs.ucr.edu ↗
End-to-end web platform for comparative structural analysis of protein–protein complexes, supporting interface characterization, energetic analysis, and variant-level comparison.
J. Immunology · 2025  |  NeurIPS LXAI · 2024
ASPred: Identification of Antigen-Specific B-Cell Receptors from Single V(D)J Sequences Using Large Language Models
Karen Paco, Mariana Paco Mendivil, Zihao Zhang, ..., Faisal Bin Ashraf, ..., Stefano Lonardi, Fernando L. Barroso da Silva, Animesh Ray
DOI: 10.1093/jimmun/vkaf283.1885 ↗
ASPred is a large-language-model-based framework trained on antigen-BCR sequence pairs that accurately predicts antigen-specific B-cell receptors from full repertoires, successfully generalizes to previously unseen antigens, and validates top candidates via molecular dynamics simulations, demonstrating that interaction specificity is learnable directly from sequence data.

2024

Preprint (bioRxiv)  |  AAAI FMs4Bio · 2025
A Large Language Model Guides the Affinity Maturation of Antibodies Generated by Combinatorial Optimization Algorithms
Faisal Bin Ashraf, Karen Paco, Zihao Zhang, Christian J. Dávila Ojeda, Mariana P. Mendivil, Jordan A. Lay, Tristan Y. Yang, Fernando L. Barroso da Silva, Matthew H. Sazinsky, Animesh Ray, Stefano Lonardi
DOI: 10.1101/2024.12.19.629473 ↗
Developed Ab-Affinity, a hybrid large-language-model-driven antibody design framework that integrates genetic algorithms and simulated annealing to generate novel SARS-CoV-2-targeting antibodies, achieving >160-fold binding affinity improvements over experimental candidates and outperforming existing computational approaches while maintaining strong biophysical stability.
Heliyon · 2024
Enhancing Breast Cancer Classification via Histopathological Image Analysis: Leveraging Self-Supervised Contrastive Learning and Transfer Learning
DOI: 10.1016/j.heliyon.2024.e24094 ↗
Designed a lightweight, state-of-the-art breast cancer histopathology classifier that combines self-supervised contrastive learning and an efficient ResNet–Inception architecture, achieving up to 98% accuracy across multiple magnification levels with ~50% fewer parameters than existing leading models—enabling accurate, data-efficient, and scalable diagnostic AI.

2023

PLOS ONE · 2023
Bio-Activity Prediction of Drug Candidate Compounds Targeting SARS-CoV-2 Using Machine Learning Approaches
DOI: 10.1371/journal.pone.0288053 ↗
Developed and benchmarked machine-learning models for SARS-CoV-2 3CLpro inhibitor prediction, evaluating 27 classifiers and a neural network that achieved 91% accuracy on ChEMBL and 93% on combined ChEMBL–PubChem data, with F1-scores of 93–94% for active and inactive compounds. The study further validated scalability on >100,000 molecules and used SHAP-based interpretability to identify key molecular fingerprints, enabling data-driven and explainable antiviral drug screening.
SN Computer Science · 2023
YoNet: A Neural Network for Yoga Pose Classification
Faisal Bin Ashraf, Muhammad Usama Islam, Md Rayhan Kabir, Jasim Uddin
DOI: 10.1007/s42979-022-01618-8 ↗
Proposed a data-efficient deep learning architecture for yoga pose recognition that jointly captures spatial and depth features, outperforming state-of-the-art CNNs (ResNet, Inception, Xception) while achieving 94.9% accuracy and 95.6% precision using limited training data.