Identification of potential biological targets in sepsis based on random forest and artificial neural network models

Authors

  • Chenghong OuYang, Jiangxi Provincial People’s Hospital, The First Affiliated Hospital of Nanchang Medical College, Nanchang, 330006, China
  • Huiwei He, Jiangxi Provincial People’s Hospital, The First Affiliated Hospital of Nanchang Medical College, Nanchang, 330006, China
  • Zhiying Lin, Jiangxi Provincial People’s Hospital, The First Affiliated Hospital of Nanchang Medical College, Nanchang, 330006, China

DOI:

https://doi.org/10.3855/jidc.21442

Keywords:

sepsis, artificial neural network, random forest, biomarker discovery

Abstract

Background: Machine learning methods, including artificial neural networks, are increasingly being applied to research on human diseases.

Objective: To identify key gene markers related to sepsis based on random forest (RF) and artificial neural network (ANN) models.

Methodology: Key genes were screened with RF using the GSE9960, GSE13904, and GSE26440 datasets. A diagnostic model based on these core genes was then constructed with an ANN and validated.

Results: GSE9960, GSE13904, and GSE26440 were merged to form the training set, in which 452 differentially expressed genes (DEGs) were identified. GO and KEGG enrichment analyses, together with Cytoscape module analysis, were used to explore sepsis-related mechanisms. The DEGs were then screened with RF, yielding eight key genes: UPP1, ZNF600, GPR160, TRAT1, C3AR1, NAIP, MAL, and F5. An ANN was used to construct a screening model from these genes. On ROC curve analysis, the model achieved an AUC of 0.969 in the training cohort and 0.792 in the validation cohort. CIBERSORT-based immune profiling showed a marked depletion of lymphocyte populations (including B cells and T cells) in the experimental cohort compared with healthy controls, whereas regulatory T cell subsets showed a paradoxical increase in abundance. Immune suppression was the dominant process in sepsis.
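For readers unfamiliar with the AUC values reported in the Results, the following is a minimal sketch of how an area under the ROC curve can be computed from model scores. It uses the rank-based (Mann-Whitney) formulation: the AUC equals the probability that a randomly chosen positive sample receives a higher score than a randomly chosen negative one. The function name `roc_auc` and the labels and scores are hypothetical illustrations, not data or code from the study.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Rank-based AUC: fraction of (positive, negative) pairs in which the
    positive sample has the higher score; tied pairs count as one half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical example: 1 = sepsis, 0 = healthy control.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]  # made-up classifier outputs
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")
```

In this toy example, 8 of the 9 positive-negative pairs are ranked correctly, so the AUC is 8/9. An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation.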

Conclusions: The core-gene model constructed with RF and ANN is a robust approach to predicting and diagnosing sepsis, with good predictive accuracy. It may inform sepsis-related medical decisions and open new avenues for treating sepsis or improving its prognosis.

Published

2026-04-30

How to Cite

OuYang C, He H, Lin Z (2026) Identification of potential biological targets in sepsis based on random forest and artificial neural network models. J Infect Dev Ctries 20:537–550. doi: 10.3855/jidc.21442

Section

Original Articles