A Community Challenge for Inferring Genetic Predictors of Gene Essentialities through Analysis of a Functional Screen of Cancer Cell Lines

Mehmet Gönen, Barbara A. Weir, Glenn S. Cowley, Francisca Vazquez, Yuanfang Guan, Alok Jaiswal, Masayuki Karasuyama, Vladislav Uzunangelov, Tao Wang, Aviad Tsherniak, Sara Howell, Daniel Marbach, Bruce Hoff, Thea C. Norman, Antti Airola, Adrian Bivol, Kerstin Bunte, Daniel Carlin, Sahil Chopra, Alden DeranKyle Ellrott, Peddinti Gopalacharyulu, Kiley Graim, Samuel Kaski, Suleiman A. Khan, Yulia Newton, Sam Ng, Tapio Pahikkala, Evan Paull, Artem Sokolov, Hao Tang, Jing Tang, Krister Wennerberg, Yang Xie, Xiaowei Zhan, Fan Zhu, Bahman Afsari, Tero Aittokallio, Jesse S. Boehm, Yu Chuan Chang, Tenghui Chen, Zechen Chong, Haitham Elmarakeby, Elana J. Fertig, Emanuel Gonçalves, Pinghua Gong, Christoph Hafemeister, William C. Hahn, Lenwood Heath, Łukasz Kędziorski, Niraj Khemka, Erh kan King, Mario Lauria, Mark Liu, Daniel Machado, Hiroshi Mamitsuka, Adam A. Margolin, Mateusz Mazurkiewicz, Michael P. Menden, Szymon Migacz, Zhi Nie, Paurush Praveen, Corrado Priami, Simone Rizzetto, Miguel Rocha, David E. Root, Cameron Rudd, Witold R. Rudnicki, Julio Saez-Rodriguez, Lei Song, Gustavo Stolovitzky, Joshua M. Stuart, Duanchen Sun, Bence Szalai, Difei Wang, Ling yun Wu, Guanghua Xiao, Jieping Ye, Yuting Ye, Wanding Zhou

Research output: Contribution to journalArticle

8 Scopus citations

Abstract

We report the results of a DREAM challenge designed to predict relative genetic essentialities based on a novel dataset testing 98,000 shRNAs against 149 molecularly characterized cancer cell lines. We analyzed the results of over 3,000 submissions over a period of 4 months. We found that algorithms combining essentiality data across multiple genes demonstrated increased accuracy; gene expression was the most informative molecular data type; the identity of the gene being predicted was far more important than the modeling strategy; well-predicted genes and selected molecular features showed enrichment in functional categories; and frequently selected expression features correlated with survival in primary tumors. This study establishes benchmarks for gene essentiality prediction, presents a community resource for future comparison with this benchmark, and provides insights into factors influencing the ability to predict gene essentiality from functional genetic screens. This study also demonstrates the value of releasing pre-publication data publicly to engage the community in an open research collaboration. Gönen et al. report the results of an open-participation DREAM challenge to critically assess the ability to predict gene essentiality on a novel functional screening dataset of 149 cancer cell lines. This study establishes benchmarks for gene essentiality prediction, presents a community resource for future comparison with this benchmark, and provides insights into factors influencing the ability to predict gene essentiality from functional genetic screens.

Original languageEnglish (US)
Pages (from-to)485-497.e3
JournalCell Systems
Volume5
Issue number5
DOIs
StatePublished - Jan 1 2017

Keywords

  • cancer genomics
  • community challenge
  • crowdsourcing
  • functional screen
  • machine learning
  • oncogene

ASJC Scopus subject areas

  • Pathology and Forensic Medicine
  • Histology
  • Cell Biology

Fingerprint Dive into the research topics of 'A Community Challenge for Inferring Genetic Predictors of Gene Essentialities through Analysis of a Functional Screen of Cancer Cell Lines'. Together they form a unique fingerprint.

Cite this