Machine learning differentiates between bulk and pseudo-bulk RNA-seq datasets

Boon How Low
Md Mamunur Rashid
Kumar Selvarajoo

0 evaluations Published on Jul 1, 2025

This article on Sciety

Abstract

Modern synthetic data generators and deconvolution methods rely heavily on single-cell (sc) RNA- seq data. Aggregated scRNA-seq (pseudo-bulk) is commonly assumed to closely match true bulk RNA-seq, making it a dependable benchmark for developing and evaluating new bioinformatics methods. Here, we investigated paired bulk and scRNA-seq datasets using machine learning techniques to assess the fidelity of pseudo-bulk profiles. Our results demonstrate that pseudo-bulks differ substantially from bulk RNA-seq in both analytic metrics and biological processes.

Related articles are currently not available for this article.