Provides curated, ready-to-run notebooks, multi-omics data, and applications specifically designed for bioinformaticians

Notebooks

Geneformer

scGPT

PopV

Visiumccc

Mixscape

NicheNet

CopyKAT

Monocle3

Tangram

Signac

All
Single-cell
Spatial RNA-seq
ATAC-seq
Multi-omics

Premium

Spatial charting of single-cell transcriptomes in tissues - celltrek

BioTuring

Single-cell RNA sequencing methods can profile the transcriptomes of single cells but cannot preserve spatial information. Conversely, spatial transcriptomics assays can profile spatial regions in tissue sections but do not have single-cell resolution. Here, Runmin Wei, Siyuan He, Shanshan Bai, Emi Sei, Min Hu, Alastair Thompson, Ken Chen, Savitri Krishnamurthy and Nicholas E. Navin developed a computational method called CellTrek that combines these two datasets to achieve single-cell spatial mapping through coembedding and metric learning approaches. They benchmarked CellTrek using simulation and in situ hybridization datasets, which demonstrated its accuracy and robustness. They then applied CellTrek to existing mouse brain and kidney datasets and showed that CellTrek can detect topological patterns of different cell types and cell states. Finally, they performed single-cell RNA sequencing and spatial transcriptomics experiments on two ductal carcinoma in situ tissues and applied CellTrek to identify tumor subclones that were restricted to different ducts, as well as specific T-cell states adjacent to the tumor areas.

Only CPU

CellTrek

Deep learning and alignment of spatially resolved single-cell transcriptomes with Tangram

BioTuring

Charting an organ's biological atlas requires us to spatially resolve the entire single-cell transcriptome, and to relate such cellular features to the anatomical scale. Single-cell and single-nucleus RNA-seq (sc/snRNA-seq) can profile cells comprehensively, but lose spatial information. Spatial transcriptomics allows for spatial measurements, but at lower resolution and with limited sensitivity. Targeted in situ technologies solve both issues, but are limited in gene throughput. To overcome these limitations we present Tangram, a method that aligns sc/snRNA-seq data to various forms of spatial data collected from the same region, including MERFISH, STARmap, smFISH, Spatial Transcriptomics (Visium) and histological images. **Tangram** can map any type of sc/snRNA-seq data, including multimodal data such as those from SHARE-seq, which we used to reveal spatial patterns of chromatin accessibility. We demonstrate Tangram on healthy mouse brain tissue, by reconstructing a genome-wide anatomically integrated spatial map at single-cell resolution of the visual and somatomotor areas.

Required GPU

Tangram

Multimodal single-cell chromatin analysis with Signac

BioTuring

The recent development of experimental methods for measuring chromatin state at single-cell resolution has created a need for computational tools capable of analyzing these datasets. Here we developed Signac, a framework for the analysis of single-cell chromatin data, as an extension of the Seurat R toolkit for single-cell multimodal analysis. **Signac** enables an end-to-end analysis of single-cell chromatin data, including peak calling, quantification, quality control, dimension reduction, clustering, integration with single-cell gene expression datasets, DNA motif analysis, and interactive visualization. Furthermore, Signac facilitates the analysis of multimodal single-cell chromatin data, including datasets that co-assay DNA accessibility with gene expression, protein abundance, and mitochondrial genotype. We demonstrate scaling of the Signac framework to datasets containing over 700,000 cells.

Only CPU

Required PFP

signac

Reference-free cell type deconvolution of multi-cellular pixel-resolution spatially resolved transcriptomics data - stdeconvolve

BioTuring

Recent technological advancements have enabled spatially resolved transcriptomic profiling but at multi-cellular pixel resolution, thereby hindering the identification of cell-type-specific spatial patterns and gene expression variation. To address this challenge, we develop STdeconvolve as a reference-free approach to deconvolve underlying cell types comprising such multi-cellular pixel resolution spatial transcriptomics (ST) datasets. Using simulated as well as real ST datasets from diverse spatial transcriptomics technologies comprising a variety of spatial resolutions such as Spatial Transcriptomics, 10X Visium, DBiT-seq, and Slide-seq, we show that STdeconvolve can effectively recover cell-type transcriptional profiles and their proportional representation within pixels without reliance on external single-cell transcriptomics references. **STdeconvolve** provides comparable performance to existing reference-based methods when suitable single-cell references are available, as well as potentially superior performance when suitable single-cell references are not available. STdeconvolve is available as an open-source R software package with the source code available at https://github.com/JEFworks-Lab/STdeconvolve .

Only CPU

STdeconvolve

Trends

BioTuring

Bioalpha-Biocolab: Enabling Large-Scale Data Uploads for BBrowserX single-cell analysis platform

Single-cell data analysis is revolutionizing biological research, but the datasets are often massive, which makes the submission process challenging. Bioalpha-Biocolab addresses this issue with advanced algorithms and efficient use of computational resources.

Required GPU

AlphaSC

BioTuring

pySCENIC: Single-Cell rEgulatory Network Inference and Clustering

SCENIC Suite is a set of tools to study and decipher gene regulation. Its core is SCENIC (Single-Cell rEgulatory Network Inference and Clustering), which infers transcription factors, gene regulatory networks and cell types from single-cell RNA-seq data. pySCENIC is a lightning-fast Python implementation of the SCENIC pipeline.

Only CPU

pySCENIC

BioTuring

BioTuring Massive-scale Analysis Solution for CellChat: Running analysis for massive-scale data from Seurat dataset

This tool provides a user-friendly, automated way to analyze large-scale single-cell RNA-seq datasets stored in RDS (Seurat) format. It lets users run various analysis tools on their data with a single command, streamlining the analysis workflow and saving time. Note that this notebook only demonstrates the tool; users can also run it directly from the command line. Currently, we support:
- CellChat: inference and analysis of cell-cell communication

Only CPU

CellChat

BioTuring

COMMOT: Screening cell-cell communication in spatial transcriptomics via collective optimal transport

In this notebook, we present COMMOT (COMMunication analysis by Optimal Transport), a package that infers cell-cell communication (CCC) in spatial transcriptomics by simultaneously considering numerous ligand–receptor pairs. It works on either spatial transcriptomics data or spatially annotated scRNA-seq data equipped with spatial distances between cells estimated from paired spatial imaging data. A collective optimal transport method handles complex molecular interactions and spatial constraints. Furthermore, we introduce downstream analysis tools that use machine learning models to infer spatial signaling directionality and genes regulated by signaling.

Only CPU

COMMOT
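To make the optimal transport idea concrete, here is a minimal pure-Python sketch of standard entropy-regularized optimal transport (Sinkhorn iterations) for a single ligand–receptor pair. It is not COMMOT's collective optimal transport, which handles many competing ligand and receptor species and spatial range limits at once. The function name `sinkhorn`, the toy positions, and the parameter values are all assumptions made for illustration, and both mass vectors are assumed to sum to the same total.

```python
import math


def sinkhorn(source_mass, target_mass, cost, eps=0.5, n_iter=500):
    """Entropy-regularized optimal transport via Sinkhorn iterations.

    source_mass[i]: ligand amount at sender spot i (hypothetical toy input)
    target_mass[j]: receptor amount at receiver spot j (same total as source)
    cost[i][j]:     spatial distance between sender i and receiver j
    Returns a transport plan whose entry [i][j] is the signal sent from i to j.
    """
    n, m = len(source_mass), len(target_mass)
    # Gibbs kernel: distant pairs receive exponentially smaller weight.
    kernel = [[math.exp(-cost[i][j] / eps) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(n_iter):
        # Alternately rescale rows and columns to match the two marginals.
        u = [source_mass[i] / sum(kernel[i][j] * v[j] for j in range(m))
             for i in range(n)]
        v = [target_mass[j] / sum(kernel[i][j] * u[i] for i in range(n))
             for j in range(m)]
    return [[u[i] * kernel[i][j] * v[j] for j in range(m)] for i in range(n)]


# Toy example: senders at x = 0 and x = 3, receivers at x = 1 and x = 4.
ligand = [0.7, 0.3]
receptor = [0.4, 0.6]
distance = [[1.0, 4.0],
            [2.0, 1.0]]
plan = sinkhorn(ligand, receptor, distance)
```

In this toy case almost all of the first receiver's capacity is filled by the nearer first sender, which is the intuition behind using spatial distance as the transport cost.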

BioTuring

SCEVAN: Single CEll Variational ANeuploidy analysis

In the realm of cancer research, grasping the intricacies of intratumor heterogeneity and its interplay with the immune system is paramount for deciphering treatment resistance and tumor progression. While single-cell RNA sequencing unveils diverse transcriptional programs, the challenge persists in automatically discerning malignant cells from non-malignant ones within complex datasets featuring varying coverage depths. Thus, there arises a compelling need for an automated solution to this classification conundrum. SCEVAN (De Falco et al., 2023), a variational algorithm, is designed to autonomously identify the clonal copy number substructure of tumors using single-cell data. It automatically separates malignant cells from non-malignant ones, and subsequently, groups of malignant cells are examined through an optimization-driven joint segmentation process.

Required GPU

scevan

BioTuring

Monorail-pipeline and Recount3

Monorail can be used to process local and/or private data, allowing results to be directly compared to any study in recount3. Taken together, the Monorail-pipeline tools help biologists maximize the utility of publicly available RNA-seq data, especially to improve their understanding of newly collected data. This notebook helps potential users of the Monorail RNA-seq processing pipeline (alignment/quantification) get started running their own data through it.

Only CPU

recount3

BioTuring

NicheNet: modeling intercellular communication by linking ligands to target genes

Computational methods that model how the gene expression of a cell is influenced by interacting cells are lacking. We present NicheNet, a method that predicts ligand–target links between interacting cells by combining their expression data with prior knowledge of signaling and gene regulatory networks. We applied NicheNet to the tumor and immune cell microenvironment data and demonstrated that NicheNet can infer active ligands and their gene regulatory effects on interacting cells.

Only CPU

nichenetr

BioTuring

BioTuring Data Converter: Seurat <=> Scanpy for single-cell transcriptomics and spatial transcriptomics data

This notebook illustrates how to convert a Seurat object into a Scanpy annotated data (AnnData) object, and an AnnData object into a Seurat object, using the BioStudio data transformation library (currently under development). This makes it possible to continue research with libraries that work with Scanpy in Python and with Seurat in R. The `seurat.to.adata` function can retain dimensional reductions (such as PCA, t-SNE and UMAP), Seurat clusters, and spatial information.

Only CPU

Scanpy

Seurat

BioTuring

Multimodal single-cell chromatin analysis with Signac

Only CPU

Required PFP

signac

BioTuring

Spatially informed cell-type deconvolution for spatial transcriptomics - CARD

Many spatially resolved transcriptomic technologies do not have single-cell resolution but measure the average gene expression for each spot from a mixture of cells of potentially heterogeneous cell types. Here, we introduce a deconvolution method, conditional autoregressive-based deconvolution (CARD), that combines cell-type-specific expression information from single-cell RNA sequencing (scRNA-seq) with correlation in cell-type composition across tissue locations. Modeling spatial correlation allows us to borrow the cell-type composition information across locations, improving accuracy of deconvolution even with a mismatched scRNA-seq reference. **CARD** can also impute cell-type compositions and gene expression levels at unmeasured tissue locations to enable the construction of a refined spatial tissue map with a resolution arbitrarily higher than that measured in the original study and can perform deconvolution without an scRNA-seq reference. Applications to four datasets, including a pancreatic cancer dataset, identified multiple cell types and molecular markers with distinct spatial localization that define the progression, heterogeneity and compartmentalization of pancreatic cancer.

Only CPU

card
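The mixture idea behind spot deconvolution can be sketched in a few lines of pure Python. This is only an illustration of the basic model (spot expression as a non-negative mix of cell-type reference profiles), solved per spot by projected gradient descent. It deliberately leaves out CARD's conditional autoregressive spatial prior, and the function name `deconvolve_spot` and the toy reference matrix are assumptions, not part of the CARD package.

```python
def deconvolve_spot(spot, reference, n_iter=500):
    """Estimate cell-type proportions for one spot.

    spot[g]:         observed expression of gene g in the spot
    reference[g][k]: mean expression of gene g in cell type k (from scRNA-seq)
    Minimises ||spot - reference @ w||^2 subject to w >= 0 by projected
    gradient descent, then rescales w so the proportions sum to 1.
    """
    n_genes, n_types = len(reference), len(reference[0])
    # A step of 1 / ||reference||_F^2 is small enough to guarantee convergence.
    step = 1.0 / sum(x * x for row in reference for x in row)
    w = [1.0 / n_types] * n_types
    for _ in range(n_iter):
        residual = [sum(reference[g][k] * w[k] for k in range(n_types)) - spot[g]
                    for g in range(n_genes)]
        grad = [sum(reference[g][k] * residual[g] for g in range(n_genes))
                for k in range(n_types)]
        # Gradient step followed by projection onto the non-negative orthant.
        w = [max(w[k] - step * grad[k], 0.0) for k in range(n_types)]
    total = sum(w)
    return [x / total for x in w] if total > 0 else w


# Toy reference: 3 genes x 2 cell types; the spot is a 30% / 70% mixture.
reference = [[10.0, 0.0],
             [0.0, 10.0],
             [5.0, 5.0]]
spot = [3.0, 7.0, 5.0]
proportions = deconvolve_spot(spot, reference)
```

A spatially informed method would additionally encourage neighboring spots to have similar proportions, which is what lets it borrow information across locations.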

BioTuring

SpaCET: Cell type deconvolution and interaction analysis

Spatial transcriptomics (ST) technology has made it possible to capture topographical gene expression profiles of tumor tissues, but single-cell resolution is potentially lost. Identifying cell identities in ST datasets from tumors or other samples remains challenging for existing cell-type deconvolution methods. Spatial Cellular Estimator for Tumors (SpaCET) is an R package for analyzing cancer ST datasets to estimate cell lineages and intercellular interactions in the tumor microenvironment. Generally, SpaCET infers the malignant cell fraction through a gene pattern dictionary, then calibrates local cell densities and determines immune and stromal cell lineage fractions using a constrained regression model. Finally, the method can reveal putative cell-cell interactions in the tumor microenvironment. In this notebook, we will illustrate an example workflow for cell type deconvolution and interaction analysis on breast cancer ST data from 10X Visium. The notebook is inspired by SpaCET's vignettes and modified to demonstrate how the tool works on BioTuring's platform.

Only CPU

SpaCET

Seurat

BioTuring

SoupX: removing ambient RNA contamination from droplet-based single-cell RNA sequencing data

Droplet-based single-cell RNA sequencing analyses assume that all acquired RNAs are endogenous to cells. However, a certain amount of cell-free mRNA floats in the input solution (referred to as 'the soup'), released by cells that lysed in the input solution. These background mRNAs are then distributed into the droplets with cells and sequenced alongside them, resulting in background contamination that confounds the biological interpretation of single-cell transcriptomic data. SoupX (Young and Behjati, 2020) is one of the methods proposed for ambient mRNA removal. In this notebook, we will illustrate an example workflow that applies SoupX to correct for ambient RNA in a dataset of 10k PBMCs. The output of SoupX is a modified counts matrix, which can be used with any downstream analysis tool.

Only CPU

SoupX
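The general idea of ambient RNA correction can be sketched as follows. This is a simplified illustration in pure Python, not SoupX's actual algorithm or API: it estimates a soup profile from empty droplets and subtracts the expected contamination from one cell, assuming the contamination fraction `rho` is already known. The function names `soup_profile` and `remove_ambient` and the toy counts are hypothetical.

```python
def soup_profile(empty_droplet_counts):
    """Ambient profile: each gene's share of all counts seen in empty droplets."""
    n_genes = len(empty_droplet_counts[0])
    totals = [sum(droplet[g] for droplet in empty_droplet_counts)
              for g in range(n_genes)]
    grand_total = sum(totals)
    return [t / grand_total for t in totals]


def remove_ambient(cell_counts, profile, rho):
    """Subtract the expected ambient counts from one cell.

    rho is the assumed fraction of the cell's counts that come from the soup.
    Corrected counts are clipped at zero.
    """
    n_umi = sum(cell_counts)
    return [max(observed - rho * n_umi * fraction, 0.0)
            for observed, fraction in zip(cell_counts, profile)]


# Toy data: 2 empty droplets and 1 cell, 3 genes each.
empty = [[2, 0, 8],
         [1, 1, 8]]
profile = soup_profile(empty)
cell = [50, 30, 20]
corrected = remove_ambient(cell, profile, rho=0.1)
```

Here the third gene dominates the soup, so it loses the largest share of its counts after correction, while genes that are rare in the soup are barely changed.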

BioTuring

MUON: multimodal omics analysis framework

Advances in multi-omics have led to an explosion of multimodal datasets to address questions from basic biology to translation. While these data provide novel opportunities for discovery, they also pose management and analysis challenges, thus motivating the development of tailored computational solutions. `muon` is a Python framework for multimodal omics. It introduces a multimodal data container, the `MuData` object. The package also provides state-of-the-art methods for multi-omics data integration. `muon` supports the analysis of both unimodal and multimodal omics data.

Required GPU

muon

BioTuring

FunPat: Function-based Pattern analysis on RNA-seq time series data

Dynamic expression data, nowadays obtained using high-throughput RNA sequencing (RNA-seq), are essential to monitor transient gene expression changes and to study the dynamics of transcriptional activity in the cell or in response to stimuli. FunPat is an R package designed to provide:
- a useful tool to analyze time series genomic data;
- a computational pipeline that integrates gene selection, clustering and functional annotations into a single framework to identify the main temporal patterns associated with functional groups of differentially expressed (DE) genes;
- an easy way to exploit different types of annotations from currently available databases (e.g. Gene Ontology) to extract the most meaningful information characterizing the main expression dynamics;
- a user-friendly organization and visualization of the outcome, automatically linking the DE genes and their temporal patterns to the functional information for an easy biological interpretation of the results.

Only CPU

FunPat

BioTuring

BPCells: Scaling Single Cell Analysis to Millions of Cells

BPCells is a package for high performance single cell analysis on RNA-seq and ATAC-seq datasets. It can analyze a 1.3M cell dataset with 2GB of RAM in under 10 minutes. This makes analysis of million-cell datasets practical on a laptop. BPCells provides:
* Efficient storage of single cell datasets via bitpacking compression
* Fast, disk-backed RNA-seq and ATAC-seq data processing powered by C++
* Downstream analysis such as marker genes, and clustering
* Interoperability with AnnData, 10x datasets, R sparse matrices, and GRanges

Only CPU

BPCells
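To give a feel for why bitpacking shrinks count data, here is a minimal pure-Python sketch. It is not BPCells' actual storage format; the function names `pack_bits` and `unpack_bits` are hypothetical. The point is that small integers such as UMI counts rarely need a full 32 bits, so a block of values can be stored using only as many bits per value as the largest value requires.

```python
def pack_bits(values):
    """Pack non-negative integers using the smallest fixed bit width that fits.

    Returns (packed bytes, bit width per value, number of values).
    """
    if any(v < 0 for v in values):
        raise ValueError("only non-negative integers can be packed")
    width = max(max(values, default=0).bit_length(), 1)
    packed = 0
    for i, v in enumerate(values):
        # Place value i in its own `width`-bit slot.
        packed |= v << (i * width)
    n_bytes = (len(values) * width + 7) // 8
    return packed.to_bytes(n_bytes, "little"), width, len(values)


def unpack_bits(data, width, count):
    """Recover the original integers from the packed bytes."""
    packed = int.from_bytes(data, "little")
    mask = (1 << width) - 1
    return [(packed >> (i * width)) & mask for i in range(count)]


counts = [0, 1, 3, 0, 2, 7, 1, 0]
data, width, n = pack_bits(counts)
restored = unpack_bits(data, width, n)
```

In this example the largest count is 7, so each value needs 3 bits and the eight counts fit in 3 bytes, compared with 32 bytes as 32-bit integers.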
