Prediction of Cancer Driver Genes Using a Deep Convolutional Network

Natalia Novoselova, Igor Tom


The paper describes a method for predicting genes associated with the development of cancer. The method applies the convolutional neural network for the purpose of predicting disease driver genes. Distinctive features of the method are the use of gene expression data to determine the topological structure of the network, the efficiency of prediction with limited information about genes associated with the disease, and the possibility of jointly including information on mutations and similarity of gene expression profiles to improve the accuracy of prediction.


Driver genes; gene expression; gene mutation; neural network; prediction

Full Text:



M. S. Lawrence et al., “Mutational heterogeneity in cancer and the search for new cancer-associated genes,” Nature, vol. 499, no. 7457, pp. 214–218, Jun. 2013.

D. Tamborero, A. Gonzalez-Perez and N. Lopez-Bigas, “OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes,” Bioinformatics, vol. 29, no. 18, pp. 2238–2244, Sep. 2013.

L. Cowen, T. Ideker, B. J. Raphael, and R. Sharan, “Network propagation: a universal amplifier of genetic associations,” Nature Reviews Genetics, vol. 18, no. 9, pp. 551–562, Jun. 2017.

J. P. Hou and J. Ma, “DawnRank: discovering personalized driver genes in cancer,” Genome Medicine, vol. 6, no. 7, Jul. 2014, Art. no. 56.

W. F. Guo et al., “Discovering personalized driver mutation profiles of single samples in cancer by network control strategy,” Bioinformatics, vol. 34, no. 11, pp. 1893–1903, Jun. 2018.

F. Li et al., “Effects of multi-omics characteristics on identification of driver genes using machine learning algorithms,” Genes, vol. 13, no. 5, Apr. 2022, Art. no. 716.

C. J. Tokheim, N. Papadopoulos, K. W. Kinzler, B. Vogelstein, and R. Karchin, “Evaluating the evaluation of cancer driver genes,” in Proceedings of the National Academy of Sciences, vol. 113, no. 50, Nov. 2016, pp. 14330–14335.

J. F. Cutigi et al., “Combining mutation and gene network data in a machine learning approach for false-positive cancer driver gene discovery,” in Advances in Bioinformatics and Computational Biology: 13th Brazilian Symposium on Bioinformatics, BSB 2020, São Paulo, Brazil, Nov. 23–27, 2020, pp. 81–92.

S. Zhang, H. Tong, J. Xu, and R. Maciejewski, “Graph convolutional networks: a comprehensive review,” Computational Social Networks, vol. 6, no. 1, Nov. 2019, Art. no. 11.

T. Cover and P. Hart, “Nearest neighbor pattern classification”, IEEE Transactions on Information Theory, vol. 13, no. 1, pp. 21–27, Jan. 1967.

P. C. Ng and S. Henikoff, “SIFT: Predicting amino acid changes that affect protein function,” Nucleic Acids Research, vol. 31, no 13, pp. 3812–3814, Jul. 2003.

National Cancer Institute, “The cancer genome atlas (TCGA)”. [Online]. Available: Accessed on: Sep. 6, 2023.

Z. Sondka et al., “The COSMIC cancer gene census: describing genetic dysfunction across all human cancers,” Nature Reviews Cancer, vol. 18, no. 11, pp. 696–705, Oct. 2018.

M. H. Bailey et al., “Comprehensive characterization of cancer driver genes and mutations,” Cell, vol. 173, no. 2, pp. 371–385, Apr. 2018.

A. Gulli and S. Pal, “Deep learning with Keras,” Packt Publishing Ltd, 2017, 318 p.

J. Lever, E. Y. Zhao, J. Grewal, M. R. Jones, and S. J. M. Jones, “CancerMine: a literature-mined resource for drivers, oncogenes and tumor suppressors in cancer,” Nature Methods, vol. 16, no 6, pp. 505–507, May 2019.

DOI: 10.7250/itms-2023-0002


  • There are currently no refbacks.

Copyright (c) 2023 Natalia Novoselova

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.