Single-cell RNA-seq datasets in diverse biological and clinical conditions provide great opportunities for the full transcriptional characterization of cell types.
However, the integration of these datasets is challeging as they remain biological and techinical differences. **Harmony** is an algorithm allowing fast, sensitive and accurate single-cell data integration.
Single-cell RNA sequencing (scRNA-seq) data often encountered technical artifacts called "doublets" which are two cells that are sequenced under the same cellular barcode.
Doublets formed from different cell types or states are called heterotypic and homotypic otherwise. These factors constrain cell throughput and may result in misleading biological interpretations.
DoubletFinder (McGinnis, Murrow, and Gartner 2019) is one of the methods proposed for doublet detection. In this notebook, we will illustrate an example workflow of DoubletFinder. We use a 10x Genomics dataset which captures peripheral blood mononuclear cells (PBMCs) from a healthy donor stained with a panel of 31 TotalSeqâ„¢-B antibodies (BioLegend).
Spatial transcriptomic studies are becoming increasingly common and large, posing important statistical and computational challenges for many analytic tasks. Here, we present SPARK-X, a non-parametric method for rapid and effective detection of spatially expressed genes in large spatial transcriptomic studies.
SPARK-X not only produces effective type I error control and high power but also brings orders of magnitude computational savings. We apply SPARK-X to analyze three large datasets, one of which is only analyzable by SPARK-X. In these data, SPARK-X identifies many spatially expressed genes including those that are spatially expressed within the same cell type, revealing new biological insights.
Expanded CRISPR-compatible CITE-seq (ECCITE-seq) which is built upon pooled CRISPR screens, allows to simultaneously measure transcriptomes, surface protein levels, and single-guide RNA (sgRNA) sequences at single-cell resolution. The technique enables multimodal characterization of each perturbation and effect exploration. However, it also encounters heterogeneity and complexity which can cause substantial noise into downstream analyses.
Mixscape (Papalexi, Efthymia, et al., 2021) is a computational framework proposed to substantially improve the signal-to-noise ratio in single-cell perturbation screens by identifying and removing confounding sources of variation.
In this notebooks, we demonstrate Mixscape's features using pertpy - a Python package offering a range of tools for perturbation analysis. The original pipeline of Mixscape implemented in R can be found here.
Recent spatial transcriptomics (ST) technologies have allowed us to capture cellular heterogeneity while retaining spatial information. However, ST datasets may lose single-cell resolution, limiting the discovery of cell-type-specific spatial patterns of localization and expression.
spacexr (Spatial-eXpression-R) is an R package providing two methods, i.e., Robust Cell Type Decomposition (RCTD) (Cable, Dylan M., et al., 2022) and Cell type-Specific Inference of Differential Expression (C-SIDE) (Cable, Dylan M., et al., 2022) for ST data. RCTD is proposed for cell type deconvolution, while leveraging references from another annotated single-cell RNA-seq data. C-SIDE identifies cell type-specific differential expression, accounting for localization of other cell types.
We will illustrate an example workflow in two notebooks, RCTD and C-SIDE, on a hippocampus Visium dataset provided by the authors. The notebooks are inspired from spacexr's vignettes and modified to demonstrate how the tool works on BioTuring's platform.