Sparse combinatorial inference with an application in cancer biology

Sach Mukherjee, Steven Pelech, Richard M. Neve, Wen Lin Kuo, Safiyyah Ziyad, Paul T. Spellman, Joe W. Gray, Terence P. Speed

Research output: Contribution to journalArticle

16 Scopus citations

Abstract

Motivation: Combinatorial effects, in which several variables jointly influence an output or response, play an important role in biological systems. In many settings, Boolean functions provide a natural way to describe such influences. However, biochemical data using which we may wish to characterize such influences are usually subject to much variability. Furthermore, in high-throughput biological settings Boolean relationships of interest are very often sparse, in the sense of being embedded in an overall dataset of higher dimensionality. This motivates a need for statistical methods capable of making inferences regarding Boolean functions under conditions of noise and sparsity. Results: We put forward a statistical model for sparse, noisy Boolean functions and methods for inference under the model. We focus on the case in which the form of the underlying Boolean function, as well as the number and identity of its inputs are all unknown. We present results on synthetic data and on a study of signalling proteins in cancer biology.

Original languageEnglish (US)
Pages (from-to)265-271
Number of pages7
JournalBioinformatics
Volume25
Issue number2
DOIs
StatePublished - Jan 1 2009

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Fingerprint Dive into the research topics of 'Sparse combinatorial inference with an application in cancer biology'. Together they form a unique fingerprint.

  • Cite this

    Mukherjee, S., Pelech, S., Neve, R. M., Kuo, W. L., Ziyad, S., Spellman, P. T., Gray, J. W., & Speed, T. P. (2009). Sparse combinatorial inference with an application in cancer biology. Bioinformatics, 25(2), 265-271. https://doi.org/10.1093/bioinformatics/btn611