Project scaffold for genomic notebooks and analysis, managed with uv.
- Install uv (one-time):
curl -LsSf https://astral.sh/uv/install.sh | sh- Create the environment and install deps:
uv sync- Register a Jupyter kernel for this project (optional but recommended):
uv run python -m ipykernel install --user --name rogen-aging --display-name "Python (rogen-aging)"- Launch JupyterLab and start working in
notebooks/:
uv run jupyter lab- Add a runtime dependency:
uv add <package>- Add a dev-only dependency (linters, tests, etc.):
uv add --dev <package>- Install optional genomics extras:
uv add .[genomics]src/rogen_aging/: Python package for shared codenotebooks/: genomic analysis notebooksdata/: put large/local data here (git-ignored)
This project targets Python 3.12 (configured in pyproject.toml).
The notebooks/AlphaGenome.ipynb notebook performs a comprehensive analysis of gene lists for Alzheimer's and Parkinson's diseases. To run it, you'll need to set up your environment with the necessary API keys and data files.
The notebook requires API keys for NCBI and AlphaGenome. Follow these steps to set them up:
- Copy the
.env.examplefile to.env:
cp .env.example .env- Edit the
.envfile and add your actual API keys:
NCBI_API_KEY=your_actual_ncbi_api_key_here
ALPHA_GENOME_API_KEY=your_actual_alpha_genome_api_key_hereImportant: The .env file is git-ignored and will not be committed to version control, keeping your API keys secure.
The notebook needs two data files:
Supplementary Table 3.xlsx: An Excel file with gene lists.longevitymap.sqlite: A SQLite database from the Longevity Map.
You should place these files in the data/ directory. This directory is included in .gitignore to prevent large data files from being committed to your repository.
Once you've set up your API keys and data files, you can launch JupyterLab:
uv run jupyter labNow, you can open notebooks/AlphaGenome.ipynb and run the cells. The notebook is configured to read the data files from the data/ directory and the API keys from your environment.
This repository includes a complete methylation calling pipeline for Oxford Nanopore sequencing data.
- METHYLATION_PIPELINE_README.md - Comprehensive user guide
- docs/METHYLATION_PIPELINE_USAGE.md - Detailed step-by-step usage guide
- Pipeline Scripts:
pipeline_validation.sh- Basecalling and methylation extractiondownstream_analysis.R- DMR calling and analysisnotebooks/DownstreamMethylationAnalysis.ipynb- Interactive R notebook
The methylation pipeline integrates three main tools:
- Dorado - Basecalling with methylation-aware models
- Modkit - BAM to bedMethyl conversion
- DMRcaller - Differential methylation analysis
# 1. Run basecalling and methylation extraction
./pipeline_validation.sh
# 2. Run downstream analysis
Rscript downstream_analysis.R
# Or use the interactive notebook
uv run jupyter lab
# Open: notebooks/DownstreamMethylationAnalysis.ipynbFor detailed instructions, see METHYLATION_PIPELINE_README.md.