BFF and cellhashR: Analysis tools for accurate demultiplexing of cell hashing data

Gregory J. Boggy; G. W. Mcelfresh; Eisa Mahyari; Abigail B. Ventura; Scott G. Hansen; Louis J. Picker; Benjamin N. Bimber

doi:10.1093/bioinformatics/btac213

BFF and cellhashR: Analysis tools for accurate demultiplexing of cell hashing data

Gregory J. Boggy, G. W. Mcelfresh, Eisa Mahyari, Abigail B. Ventura, Scott G. Hansen, Louis J. Picker, Benjamin N. Bimber

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

Motivation: Single-cell sequencing methods provide previously impossible resolution into the transcriptome of individual cells. Cell hashing reduces single-cell sequencing costs by increasing capacity on droplet-based platforms. Cell hashing methods rely on demultiplexing algorithms to accurately classify droplets; however, assumptions underlying these algorithms limit accuracy of demultiplexing, ultimately impacting the quality of single-cell sequencing analyses. Results: We present Bimodal Flexible Fitting (BFF) demultiplexing algorithms BFFcluster and BFFraw, a novel class of algorithms that rely on the single inviolable assumption that barcode count distributions are bimodal. We integrated these and other algorithms into cellhashR, a new R package that provides integrated QC and a single command to execute and compare multiple demultiplexing algorithms. We demonstrate that BFFcluster demultiplexing is both tunable and insensitive to issues with poorly behaved data that can confound other algorithms. Using two well-characterized reference datasets, we demonstrate that demultiplexing with BFF algorithms is accurate and consistent for both well-behaved and poorly behaved input data.

Original language	English (US)
Pages (from-to)	2791-2801
Number of pages	11
Journal	Bioinformatics
Volume	38
Issue number	10
DOIs	https://doi.org/10.1093/bioinformatics/btac213
State	Published - May 15 2022

ASJC Scopus subject areas

Statistics and Probability
Biochemistry
Molecular Biology
Computer Science Applications
Computational Theory and Mathematics
Computational Mathematics

Access to Document

10.1093/bioinformatics/btac213

Cite this

@article{1875f36b72954c4eb1b488ffec7a3377,

title = "BFF and cellhashR: Analysis tools for accurate demultiplexing of cell hashing data",

abstract = "Motivation: Single-cell sequencing methods provide previously impossible resolution into the transcriptome of individual cells. Cell hashing reduces single-cell sequencing costs by increasing capacity on droplet-based platforms. Cell hashing methods rely on demultiplexing algorithms to accurately classify droplets; however, assumptions underlying these algorithms limit accuracy of demultiplexing, ultimately impacting the quality of single-cell sequencing analyses. Results: We present Bimodal Flexible Fitting (BFF) demultiplexing algorithms BFFcluster and BFFraw, a novel class of algorithms that rely on the single inviolable assumption that barcode count distributions are bimodal. We integrated these and other algorithms into cellhashR, a new R package that provides integrated QC and a single command to execute and compare multiple demultiplexing algorithms. We demonstrate that BFFcluster demultiplexing is both tunable and insensitive to issues with poorly behaved data that can confound other algorithms. Using two well-characterized reference datasets, we demonstrate that demultiplexing with BFF algorithms is accurate and consistent for both well-behaved and poorly behaved input data.",

author = "Boggy, {Gregory J.} and Mcelfresh, {G. W.} and Eisa Mahyari and Ventura, {Abigail B.} and Hansen, {Scott G.} and Picker, {Louis J.} and Bimber, {Benjamin N.}",

year = "2022",

month = may,

day = "15",

doi = "10.1093/bioinformatics/btac213",

language = "English (US)",

volume = "38",

pages = "2791--2801",

journal = "Bioinformatics",

issn = "1367-4803",

publisher = "Oxford University Press",

number = "10",

}

TY - JOUR

T1 - BFF and cellhashR

T2 - Analysis tools for accurate demultiplexing of cell hashing data

AU - Boggy, Gregory J.

AU - Mcelfresh, G. W.

AU - Mahyari, Eisa

AU - Ventura, Abigail B.

AU - Hansen, Scott G.

AU - Picker, Louis J.

AU - Bimber, Benjamin N.

PY - 2022/5/15

Y1 - 2022/5/15

N2 - Motivation: Single-cell sequencing methods provide previously impossible resolution into the transcriptome of individual cells. Cell hashing reduces single-cell sequencing costs by increasing capacity on droplet-based platforms. Cell hashing methods rely on demultiplexing algorithms to accurately classify droplets; however, assumptions underlying these algorithms limit accuracy of demultiplexing, ultimately impacting the quality of single-cell sequencing analyses. Results: We present Bimodal Flexible Fitting (BFF) demultiplexing algorithms BFFcluster and BFFraw, a novel class of algorithms that rely on the single inviolable assumption that barcode count distributions are bimodal. We integrated these and other algorithms into cellhashR, a new R package that provides integrated QC and a single command to execute and compare multiple demultiplexing algorithms. We demonstrate that BFFcluster demultiplexing is both tunable and insensitive to issues with poorly behaved data that can confound other algorithms. Using two well-characterized reference datasets, we demonstrate that demultiplexing with BFF algorithms is accurate and consistent for both well-behaved and poorly behaved input data.

AB - Motivation: Single-cell sequencing methods provide previously impossible resolution into the transcriptome of individual cells. Cell hashing reduces single-cell sequencing costs by increasing capacity on droplet-based platforms. Cell hashing methods rely on demultiplexing algorithms to accurately classify droplets; however, assumptions underlying these algorithms limit accuracy of demultiplexing, ultimately impacting the quality of single-cell sequencing analyses. Results: We present Bimodal Flexible Fitting (BFF) demultiplexing algorithms BFFcluster and BFFraw, a novel class of algorithms that rely on the single inviolable assumption that barcode count distributions are bimodal. We integrated these and other algorithms into cellhashR, a new R package that provides integrated QC and a single command to execute and compare multiple demultiplexing algorithms. We demonstrate that BFFcluster demultiplexing is both tunable and insensitive to issues with poorly behaved data that can confound other algorithms. Using two well-characterized reference datasets, we demonstrate that demultiplexing with BFF algorithms is accurate and consistent for both well-behaved and poorly behaved input data.

UR - http://www.scopus.com/inward/record.url?scp=85132218398&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85132218398&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btac213

DO - 10.1093/bioinformatics/btac213

M3 - Article

AN - SCOPUS:85132218398

SN - 1367-4803

VL - 38

SP - 2791

EP - 2801

JO - Bioinformatics

JF - Bioinformatics

IS - 10

ER -

BFF and cellhashR: Analysis tools for accurate demultiplexing of cell hashing data

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this