AlphaMap An Open-Source Python Package For The Visual Annotation of Proteomics Data With Sequence-Specific Knowledge
AlphaMap An Open-Source Python Package For The Visual Annotation of Proteomics Data With Sequence-Specific Knowledge
doi: 10.1093/bioinformatics/btab674
Advance Access Publication Date: 29 September 2021
Applications Note
Received on July 30, 2021; revised on September 2, 2021; editorial decision on September 13, 2021; accepted on September 22, 2021
Abstract
Summary: Integrating experimental information across proteomic datasets with the wealth of publicly available se-
quence annotations is a crucial part in many proteomic studies that currently lacks an automated analysis platform.
Here, we present AlphaMap, a Python package that facilitates the visual exploration of peptide-level proteomics data.
Identified peptides and post-translational modifications in proteomic datasets are mapped to their corresponding pro-
tein sequence and visualized together with prior knowledge from UniProt and with expected proteolytic cleavage sites.
The functionality of AlphaMap can be accessed via an intuitive graphical user interface or—more flexibly—as a Python
package that allows its integration into common analysis workflows for data visualization. AlphaMap produces
publication-quality illustrations and can easily be customized to address a given research question.
Availability and implementation: AlphaMap is implemented in Python and released under an Apache license. The
source code and one-click installers are freely available at https://github.com/MannLabs/alphamap.
Contact: [email protected]
Supplementary information: Supplementary data are available at Bioinformatics online.
standard Python module installed via PyPI (Python Software encourage the broader community to integrate AlphaMap in their
Foundation, n.d.) or directly from its GitHub repository. own data analysis and visualization workflows with the possibility
In line with the AlphaPept ecosystem (Strauss et al., 2021), we to easily adopt the code according to specific needs.
make the AlphaMap code openly available on GitHub, using its
many supporting features for unit and system testing via GitHub
Fig. 1. (A) Overview of the AlphaMap workflow from MS data upload to the interactive sequence visualization. (B) Exemplary sequence visualization for epidermal growth
factor receptor (EGFR). A zoom-in on a selected sequence region, indicated by dashed lines, is provided at the lower part of the panel
AlphaMap: Interactive protein sequence visualization 3
recently introduced AlphaPept framework (Strauss et al., 2021). In processing with the goal to achieve a more complete sequence
contrast to Protter (Omasits et al., 2014), users can select multiple coverage.
independent datasets for co-visualization. These could either have
been processed by the same or with different MS analysis tools. It is
also possible to select only a single sample, or a subset of samples of
Demichev,V. et al. (2020) DIA-NN: neural networks and interference correc- Levitsky,L.I. et al. (2019) Pyteomics 4.0: five years of development of a Python
tion enable deep proteome coverage in high throughput. Nat. Methods, 17, proteomics framework. J. Proteome Res., 18, 709–714.
41–44. Müller,J.B. et al. (2020) The proteome landscape of the kingdoms of life.
Desiere,F. et al. (2006) The PeptideAtlas project. Nucleic Acids Res., 34, Nature, 582, 592–596.
D655–D658. Omasits,U. et al. (2014) Protter: interactive protein feature visualization and in-