Tail-Greedy Unbalanced Haar Wavelet Segmentation for Copy Number Alteration Data

Ummi, Maharani Ahsani; Barber, Stuart; Wood, Henry M.; Gusnanto, Arief

Statistics > Applications

arXiv:2604.22364 (stat)

[Submitted on 24 Apr 2026]

Title:Tail-Greedy Unbalanced Haar Wavelet Segmentation for Copy Number Alteration Data

Authors:Maharani Ahsani Ummi, Stuart Barber, Henry M. Wood, Arief Gusnanto

View PDF HTML (experimental)

Abstract:Detecting copy number alterations (CNAs) from next-generation sequencing data remains challenging, particularly for short segments under noisy conditions. Existing segmentation methods often suffer from high false positive rates or fail to reliably detect short aberrations, especially in low-coverage data. In this study, we propose a modified tail-greedy unbalanced Haar (TGUHm) method that introduces a dual-thresholding strategy to improve segmentation accuracy. The proposed approach effectively suppresses spurious spikes while preserving sensitivity to both short and long CNA segments. Extensive simulation studies under Gaussian and heavy-tailed noise demonstrate that TGUHm consistently achieves higher true positive rates and lower false positive rates compared to state-of-the-art methods, including CBS, HaarSeg, and FDRSeg. In particular, the proposed method improves detection accuracy for short segments while maintaining competitive overall performance. Application to real cancer genomic data further confirms the practical utility of the method, revealing biologically meaningful CNAs associated with known cancer-related genes. These results suggest that TGUHm provides a robust and effective framework for CNA detection in challenging sequencing settings.

Comments:	17 pages, 9 figures
Subjects:	Applications (stat.AP); Computation (stat.CO)
MSC classes:	65T60
ACM classes:	G.3
Cite as:	arXiv:2604.22364 [stat.AP]
	(or arXiv:2604.22364v1 [stat.AP] for this version)
	https://doi.org/10.48550/arXiv.2604.22364

Submission history

From: Arief Gusnanto [view email]
[v1] Fri, 24 Apr 2026 08:54:40 UTC (1,113 KB)

Statistics > Applications

Title:Tail-Greedy Unbalanced Haar Wavelet Segmentation for Copy Number Alteration Data

Submission history

Access Paper:

Current browse context:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Applications

Title:Tail-Greedy Unbalanced Haar Wavelet Segmentation for Copy Number Alteration Data

Submission history

Access Paper:

Current browse context:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators