Prepare Query Data for Usage Fitting — prepare_query • CellProgramMapper

Prepares query expression data by:

Finding genes overlapping with reference
Validating data properties (non-negative, integer counts)
Scaling columns by standard deviation (without centering)

The scaling step matches the preprocessing in Python starCAT: sklearn.preprocessing.scale(X, with_mean=False)

Usage

prepare_query(query, ref_genes, verbose = TRUE)

Arguments

query: Query matrix (cells x genes)
ref_genes: Gene names in the reference
verbose: Print progress messages

Value

List with:

matrix: Processed query matrix
overlap_genes: Character vector of overlapping gene names