Conumee 2.0: enhanced copy-number variation analysis from DNA methylation arrays for humans and mice

Bioinformatics. 2024 Feb 1;40(2):btae029. doi: 10.1093/bioinformatics/btae029.

Abstract

Motivation: Copy-number variations (CNVs) are common genetic alterations in cancer and their detection may impact tumor classification and therapeutic decisions. However, detection of clinically relevant large and focal CNVs remains challenging when sample material or resources are limited. This has motivated us to create a software tool to infer CNVs from DNA methylation arrays which are often generated as part of clinical routines and in research settings.

Results: We present our R package, conumee 2.0, that combines tangent normalization, an adjustable genomic binning heuristic, and weighted circular binary segmentation to utilize DNA methylation arrays for CNV analysis and mitigate technical biases and batch effects. Segmentation results were validated in a lung squamous cell carcinoma dataset from TCGA (n = 367 samples) by comparison to segmentations derived from genotyping arrays (Pearson's correlation coefficient of 0.91). We further introduce a segmented block bootstrapping approach to detect focal alterations that achieved 60.9% sensitivity and 98.6% specificity for deletions affecting CDKN2A/B (60.0% and 96.9% for RB1, respectively) in a low-grade glioma cohort from TCGA (n = 239 samples). Finally, our tool provides functionality to detect and summarize CNVs across large sample cohorts.
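
For readers less familiar with the reported metrics, the following is a minimal sketch of how sensitivity and specificity for a focal deletion call could be computed from per-sample calls and a reference truth set. It is written in Python for illustration only and is not part of conumee 2.0; the function name `sensitivity_specificity` and the toy cohort are hypothetical and do not reproduce the TCGA data.

```python
def sensitivity_specificity(calls, truth):
    """Return (sensitivity, specificity) for boolean deletion calls.

    calls: list of bool, True if the method called a deletion in that sample
    truth: list of bool, True if the reference says the deletion is present
    """
    if len(calls) != len(truth):
        raise ValueError("calls and truth must have the same length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for c, t in zip(calls, truth) if c and t)
    fn = sum(1 for c, t in zip(calls, truth) if not c and t)
    tn = sum(1 for c, t in zip(calls, truth) if not c and not t)
    fp = sum(1 for c, t in zip(calls, truth) if c and not t)

    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative sample")

    sensitivity = tp / (tp + fn)  # fraction of true deletions that were detected
    specificity = tn / (tn + fp)  # fraction of non-deleted samples called correctly
    return sensitivity, specificity


# Hypothetical toy cohort: 4 samples with a true deletion, 20 without.
truth = [True] * 4 + [False] * 20
calls = [True] * 3 + [False] * 1 + [True] * 2 + [False] * 18

sens, spec = sensitivity_specificity(calls, truth)
print(f"sensitivity = {sens:.1%}, specificity = {spec:.1%}")
# prints "sensitivity = 75.0%, specificity = 90.0%"
```

With a conservative caller, missed deletions lower sensitivity while specificity stays high, which is the pattern seen in the figures reported above.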

Availability and implementation: Conumee 2.0 is available under an open-source license at: https://github.com/hovestadtlab/conumee2.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • DNA Copy Number Variations
  • DNA Methylation*
  • Genomics
  • Humans
  • Mice
  • Neoplasms* / genetics
  • Software