Preprint / Version 1

AI-Powered Analysis to Identify Potential Patient-Specific Therapy for Crohn's Disease Using Single-Cell RNA Sequencing

##article.authors##

  • Rohan Vellamcheti Cupertino High School

DOI:

https://doi.org/10.58445/rars.3793

Keywords:

Crohn's Disease, IBD, Machine-learning, Single-cell RNA sequencing, Single-cell AI, Random Forest

Abstract

Inflammatory bowel disease (IBD), affecting over 3 million Americans, describes chronic inflammatory disorders of the digestive tract, with Crohn's disease as a major subtype affecting the small intestine and colon. Patients often cycle through multiple therapies including corticosteroids, 5-aminosalicylates, immunomodulators, anti-TNF agents, anti-integrin antibodies, and JAK inhibitors before achieving remission. This trial-and-error approach prolongs disease activity, reduces quality of life, and burdens patients and the economy. The current treatment paradigm relies on population-level responses rather than patient-specific biology, therefore precision medicine approaches are needed.This project developed an AI-powered pipeline (automated sequential steps) identifying treatments by combining single-cell RNA sequencing, which measures gene activity in individual cells, with machine learning. Using data from 347,017 cells across multiple Crohn's disease patients, gene set enrichment analysis, which identifies abnormally active cellular processes, was performed across 2,186 pathways from KEGG, Hallmark, and Reactome databases. Singular value decomposition, which mathematically groups related pathways by similarity, compressed them into 20 summary patterns. Random Forest classifiers were trained for both tissues (small intestine and colon) to score treatment compatibility.The pipeline distinguished effective from ineffective treatment matches across 480 pathway-patient pairs. The small intestine model achieved 73.8% accuracy and area under the curve (AUC) of 0.811, while the colon model achieved 75.7% accuracy and 0.767, where 0.5 is random chance. All 480 recommendations received high confidence scores above 0.7.This pipeline offers a data-driven path toward treatments matched to each patient's unique biology, potentially transforming Crohn's disease treatment.

References

Lewis JD, Parlett LE, Jonsson Funk ML, Brensinger C, Pate V, Wu Q, et al. Incidence, Prevalence, and Racial and Ethnic Distribution of Inflammatory Bowel Disease in the United States. Gastroenterology. 2023 Nov;165(5):1197-1205.e2. doi:10.1053/j.gastro.2023.07.003

Groundbreaking Study Estimates Nearly 1 in 100 Americans Has IBD [Internet]. Crohn’s & Colitis Foundation. Available from: https://www.crohnscolitisfoundation.org/groundbreaking-study-led-the-crohns-colitis-foundation-estimates-nearly-1-100-americans-has

Kontola K, Oksanen P, Huhtala H, Jussila A. Increasing Incidence of Inflammatory Bowel Disease, with Greatest Change Among the Elderly: A Nationwide Study in Finland, 2000–2020. J Crohns Colitis. 2023 May 3;17(5):706–11. doi:10.1093/ecco-jcc/jjac177

Rosen MJ, Dhawan A, Saeed SA. Inflammatory Bowel Disease in Children and Adolescents. JAMA Pediatr. 2015 Nov 1;169(11):1053. doi:10.1001/jamapediatrics.2015.1982

Benchimol EI, Fortinsky KJ, Gozdyra P, Van Den Heuvel M, Van Limbergen J, Griffiths AM. Epidemiology of pediatric inflammatory bowel disease: A systematic review of international trends: Inflamm Bowel Dis. 2011 Jan;17(1):423–39. doi:10.1002/ibd.21349

Torres J, Mehandru S, Colombel JF, Peyrin-Biroulet L. Crohn’s disease. The Lancet. 2017 Apr;389(10080):1741–55. doi:10.1016/S0140-6736(16)31711-1

Sabino J, Verstockt B, Vermeire S, Ferrante M. New biologics and small molecules in inflammatory bowel disease: an update. Ther Adv Gastroenterol. 2019 Jan;12:1756284819853208. doi:10.1177/1756284819853208

Colombel JF, Sandborn WJ, Reinisch W, Mantzaris GJ, Kornbluth A, Rachmilewitz D, et al. Infliximab, Azathioprine, or Combination Therapy for Crohn’s Disease. N Engl J Med. 2010 Apr 15;362(15):1383–95. doi:10.1056/NEJMoa0904492

Colombel JF, Panaccione R, Bossuyt P, Lukas M, Baert F, Vaňásek T, et al. Effect of tight control management on Crohn’s disease (CALM): a multicentre, randomised, controlled phase 3 trial. The Lancet. 2017 Dec;390(10114):2779–89. doi:10.1016/S0140-6736(17)32641-7

Roda G, Chien Ng S, Kotze PG, Argollo M, Panaccione R, Spinelli A, et al. Crohn’s disease. Nat Rev Dis Primer. 2020 Apr 2;6(1):22. doi:10.1038/s41572-020-0156-2

Bertin L, Crepaldi M, Zanconato M, Lorenzon G, Maniero D, De Barba C, et al. Refractory Crohn’s Disease: Perspectives, Unmet Needs and Innovations. Clin Exp Gastroenterol. 2024 Oct;Volume 17:261–315. doi:10.2147/CEG.S434014

Bernstein CN, Hitchon CA, Walld R, Bolton JM, Sareen J, Walker JR, et al. Increased Burden of Psychiatric Disorders in Inflammatory Bowel Disease. Inflamm Bowel Dis. 2019 Jan 10;25(2):360–8. doi:10.1093/ibd/izy235

Hwang B, Lee JH, Bang D. Single-cell RNA sequencing technologies and bioinformatics pipelines. Exp Mol Med. 2018 Aug;50(8):1–14. doi:10.1038/s12276-018-0071-8

Zhang X, Li T, Liu F, Chen Y, Yao J, Li Z, et al. Comparative Analysis of Droplet-Based Ultra-High-Throughput Single-Cell RNA-Seq Systems. Mol Cell. 2019 Jan;73(1):130-142.e5. doi:10.1016/j.molcel.2018.10.020

Mukherjee PK, Nguyen QT, Li J, Zhao S, Christensen SM, West GA, et al. Stricturing Crohn’s Disease Single-Cell RNA Sequencing Reveals Fibroblast Heterogeneity and Intercellular Interactions. Gastroenterology. 2023 Nov;165(5):1180–96. doi:10.1053/j.gastro.2023.07.014

Kong L, Subramanian S, Segerstolpe Å, Tran V, Shih AR, Carter GT, et al. Single-cell and spatial transcriptomics of stricturing Crohn’s disease highlights a fibrosis-associated network. Nat Genet. 2025 Jul;57(7):1742–53. doi:10.1038/s41588-025-02225-y

Thomas T, Friedrich M, Rich-Griffin C, Pohin M, Agarwal D, Pakpoor J, et al. A longitudinal single-cell atlas of anti-tumour necrosis factor treatment in inflammatory bowel disease. Nat Immunol. 2024 Nov;25(11):2152–65. doi:10.1038/s41590-024-01994-8

Chen Y, Zou J. GenePT: A Simple But Effective Foundation Model for Genes and Cells Built From ChatGPT [Internet]. Bioinformatics; 2023 [cited 2026 Apr 19]. Available from: http://biorxiv.org/lookup/doi/10.1101/2023.10.16.562533 doi:10.1101/2023.10.16.562533

Barabási AL, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nat Rev Genet. 2011 Jan;12(1):56–68. doi:10.1038/nrg2918

Argelaguet R, Velten B, Arnol D, Dietrich S, Zenz T, Marioni JC, et al. Multi‐Omics Factor Analysis—a framework for unsupervised integration of multi‐omics data sets. Mol Syst Biol. 2018 Jun;14(6):e8124. doi:10.15252/msb.20178124

Hasin Y, Seldin M, Lusis A. Multi-omics approaches to disease. Genome Biol. 2017 Dec;18(1):83. doi:10.1186/s13059-017-1215-1

Human CD Fibrosis Study Using Single-Cell and Spatial Data [Internet]. Single Cell Portal, Broad Institute. Available from: https://singlecell.broadinstitute.org/single_cell/study/SCP2959/human-cd-fibrosis-study-using-single-cell-and-spatial-data

Downloads

Posted

2026-04-26