AggregateCellMetadata - collapse cell-level metadata down to sample-level metadata for use in pseudobulk testing. This is useful if you do not already have metadata for each sample in the experiment but the information is stored in the single-cell metadata, for example donor ID, unique sample (e.g. donorid_timepoint), age, or sex. Note that these are all sample-level variables. Only variables that take a single value per sample can be collapsed, where the samples are the columns of the pseudobulk data; single-cell intrinsic variables such as nUMI cannot. Only include metadata that you intend to adjust models for as covariates or random effects, because all variables will be referenced during count normalization and feature filtering.
Source: R/pseudobulk_helpers.r
Arguments
- cell.metadata
data frame of cell-level metadata with cells as rows and variables as columns, e.g. colData or Seurat@meta.data.
- sample_column
quoted character, e.g. "sample". This should indicate the variable corresponding to the columns of the pseudobulk gene expression data.
- variable_columns
experiment variables corresponding to sample-level data, for example c('SubjectID', 'timepoint', 'sex', 'age').
- pseudobulklist
the object output from PseudobulkList. Used to check that the samples in the aggregated metadata match the columns of the elements of the pseudobulk list.
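The consistency check described for the pseudobulk list can be sketched in base R as follows. This is only an illustration on toy data, not the package's internal code: the objects pb_toy and samplemd_toy are hypothetical, and the sketch assumes the sample-level metadata has one row per sample with the sample IDs as row names.

```r
# Hypothetical toy objects; names are illustrative, not part of the package.
samples <- c("d1_t0", "d1_t1", "d2_t0", "d2_t1")

# A pseudobulk list: one genes x samples matrix per cell type.
pb_toy <- list(
  celltype_A = matrix(1:8, nrow = 2, dimnames = list(c("g1", "g2"), samples)),
  celltype_B = matrix(8:1, nrow = 2, dimnames = list(c("g1", "g2"), samples))
)

# Sample-level metadata, assumed here to have one row per sample.
samplemd_toy <- data.frame(
  sample = samples,
  age = c(30, 30, 45, 45),
  row.names = samples
)

# TRUE for each cell type whose columns match the metadata rows, in order.
cols_match <- vapply(
  pb_toy,
  function(m) identical(colnames(m), rownames(samplemd_toy)),
  logical(1)
)
stopifnot(all(cols_match))
```

Checking every element of the list, rather than only the first, catches the case where one cell type is missing a sample or has its samples in a different order.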
Examples
if (FALSE) {
  # s is a Seurat object; pb is the object output from PseudobulkList
  samplemd = AggregateCellMetadata(
    cell.metadata = s@meta.data,
    sample_column = 'sample',
    variable_columns = c('subjectID', 'timepoint', 'age', 'sex'),
    pseudobulk.List = pb
  )
}
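To make the idea of collapsing concrete, here is a minimal base R sketch on toy data. It is a conceptual illustration under stated assumptions, not the function's actual implementation: the data frame cell_md and its values are made up, and the collapse is done with unique(), which may differ from what the package does internally.

```r
# Toy cell-level metadata (hypothetical): 6 cells from 3 samples.
cell_md <- data.frame(
  sample    = c("d1_t0", "d1_t0", "d1_t1", "d1_t1", "d2_t0", "d2_t0"),
  subjectID = c("d1", "d1", "d1", "d1", "d2", "d2"),
  timepoint = c("t0", "t0", "t1", "t1", "t0", "t0"),
  age       = c(30, 30, 30, 30, 45, 45),
  nUMI      = c(1200, 950, 1100, 1300, 800, 1020),
  stringsAsFactors = FALSE
)
sample_column <- "sample"
variable_columns <- c("subjectID", "timepoint", "age")

# Keep the sample column plus the requested variables, then drop duplicate rows.
sample_md <- unique(cell_md[, c(sample_column, variable_columns)])

# True sample-level variables leave exactly one row per sample.
stopifnot(!anyDuplicated(sample_md[[sample_column]]))
rownames(sample_md) <- sample_md[[sample_column]]

# A cell-intrinsic variable such as nUMI differs between cells of the same
# sample, so it does not collapse to one row per sample.
bad_md <- unique(cell_md[, c(sample_column, "nUMI")])
nrow(bad_md) > nrow(sample_md)
```

The one-row-per-sample check is what distinguishes a valid sample-level variable from a cell-level one: if any requested variable varies within a sample, the deduplicated table keeps more than one row for that sample.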