Add SQLAlchemy deferred loading by JadeCara · Pull Request #7249 · ethyca/fides

JadeCara · 2026-01-26T18:10:04Z

Description Of Changes

🎯 Out-of-memory errors on workers when processing DSR tasks

One area of opportunity identified was the loading of large data while querying tasks. This PR follows the existing pattern of deferring the loading of large columns on Privacy Request objects and updates the following:

Added a query_with_deferred_data() method for lazy loading
Applied to cached_erasure_results_by_collection_key()
Applied to cached_consent_results_by_collection_key()
Applied to scheduler and service layer queries
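
For readers unfamiliar with the technique, here is a minimal sketch of SQLAlchemy deferred loading. It is not the code from this PR: the `DemoTask` model, its `access_data` column, and the in-memory SQLite engine are assumptions made for illustration, and only the method name `query_with_deferred_data` is borrowed from the description above.

```python
from sqlalchemy import Column, Integer, String, Text, create_engine, inspect
from sqlalchemy.orm import Session, declarative_base, defer

Base = declarative_base()


class DemoTask(Base):
    """Hypothetical stand-in for a task row that carries a large payload."""

    __tablename__ = "demo_task"

    id = Column(Integer, primary_key=True)
    status = Column(String)
    access_data = Column(Text)  # stand-in for a large JSON column


def query_with_deferred_data(session):
    """Return a query that leaves the large column out of the initial SELECT."""
    return session.query(DemoTask).options(defer(DemoTask.access_data))


engine = create_engine("sqlite://")  # in-memory database, for the demo only
Base.metadata.create_all(engine)

# Insert one row whose payload is large relative to its metadata.
with Session(engine) as session:
    session.add(DemoTask(id=1, status="pending", access_data="x" * 1000))
    session.commit()

# Load the row in a fresh session so nothing is already cached on the object.
with Session(engine) as session:
    task = query_with_deferred_data(session).filter(DemoTask.id == 1).one()
    # The deferred column has not been fetched yet.
    unloaded_before = set(inspect(task).unloaded)
    # Touching the attribute triggers a second, targeted SELECT.
    payload_length = len(task.access_data)
    unloaded_after = set(inspect(task).unloaded)
```

Code that only reads `status` never pays for the payload, while code that does need the payload still gets it on first attribute access, at the cost of one extra query per row.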

Code Changes

src/fides/api/models/privacy_request/privacy_request.py - Updated to use query with deferred data
src/fides/api/models/privacy_request/request_task.py - Added query_with_deferred_data
src/fides/api/service/privacy_request/request_service.py - Updated to use query with deferred data
src/fides/api/task/scheduler_utils.py - Updated to use query with deferred data

Steps to Confirm

Start FidesPlus pointed at this endpoint
Run one or more access requests and one or more erasure requests.
All tests should pass.

Pre-Merge Checklist

vercel · 2026-01-26T18:10:09Z

The latest updates on your projects. Learn more about Vercel for GitHub.

2 Skipped Deployments

Project	Deployment	Review	Updated (UTC)
fides-plus-nightly	Ignored	Preview	Jan 26, 2026 10:23pm
fides-privacy-center	Ignored		Jan 26, 2026 10:23pm

greptile-apps · 2026-01-26T20:54:13Z

Greptile Overview

Greptile Summary

This PR adds SQLAlchemy deferred loading to prevent out-of-memory (OOM) errors when processing DSR tasks. The changes implement a new query_with_deferred_data() method on RequestTask that defers loading of large JSON columns (_access_data, _data_for_erasures, collection, traversal_details) when only metadata is needed.

Key changes:

Added RequestTask.query_with_deferred_data() with configurable deferred loading for access and erasure data columns
Applied deferred loading to methods that only need task metadata: get_pending_downstream_tasks(), upstream_tasks_objects(), cached_erasure_results_by_collection_key(), cached_consent_results_by_collection_key(), and requeue_polling_tasks()
Refactored use_dsr_3_0_scheduler() to check for RequestTasks first (cheap SQL count) before checking cache, avoiding the expensive get_raw_access_results() call that was causing 622MB+ memory usage per call
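
The reordering described in the last item can be sketched as a cheap check that short-circuits an expensive one. This is only an illustration under stated assumptions: the function name, the two callables, and the return semantics below are hypothetical and do not reproduce the actual `use_dsr_3_0_scheduler()` logic.

```python
from typing import Callable, List


def should_use_task_scheduler(
    count_request_tasks: Callable[[], int],
    legacy_cache_has_results: Callable[[], bool],
) -> bool:
    """Hypothetical decision helper: run the cheap check before the costly one."""
    # Fast path: a COUNT query returns a single integer, not row payloads.
    if count_request_tasks() > 0:
        return True
    # Slow path: reached only when no task rows exist. Assumed here to be a
    # key-existence check rather than a load of the cached results themselves.
    return not legacy_cache_has_results()


# Record which checks run, to show that the cache is never consulted
# when task rows already exist.
calls: List[str] = []


def fake_count() -> int:
    calls.append("count")
    return 3


def fake_cache_check() -> bool:
    calls.append("cache")
    return True


decision = should_use_task_scheduler(fake_count, fake_cache_check)
```

With three task rows present, only the count runs, so the memory cost of reading cached results is avoided entirely on that path.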

The implementation follows established patterns from PrivacyRequest.query_with_deferred_data() and correctly applies deferred loading only where metadata queries are performed, not where actual data payloads are needed.

Confidence Score: 4/5

This PR is safe to merge with minimal risk, addressing critical OOM issues through deferred loading
Score reflects solid implementation following established patterns and addressing a critical performance issue. Minor concern about lack of test coverage for the new deferred loading functionality, but the changes are straightforward and well-documented. The refactoring of use_dsr_3_0_scheduler() is a significant optimization that prevents OOM errors on the hot path.
Pay close attention to src/fides/api/task/scheduler_utils.py to ensure the cache key prefix logic is correct

Important Files Changed

Filename	Overview
src/fides/api/models/privacy_request/request_task.py	Added `query_with_deferred_data()` method with comprehensive docstring; applied to `get_pending_downstream_tasks()` and `upstream_tasks_objects()`
src/fides/api/models/privacy_request/privacy_request.py	Applied deferred loading to `get_raw_masking_counts()` and `get_consent_results()` to avoid loading large JSON columns
src/fides/api/task/scheduler_utils.py	Refactored to check RequestTasks first (fast path), then check cache for DSR 2.0 data instead of loading access results (prevents OOM)

changelog/7249.yaml

Co-authored-by: Adrian Galvan <adrian@ethyca.com>

Co-authored-by: Jade Wibbels <jade@ethyca.com> Co-authored-by: Adrian Galvan <adrian@ethyca.com>

Add SQLAlchemy deferred loading

42c7126

Jade Wibbels added 3 commits January 26, 2026 11:15

changelog

2fa2180

.

3908398

fixed prefix to include encoded prefix

d692c29

JadeCara marked this pull request as ready for review January 26, 2026 20:51

JadeCara requested a review from a team as a code owner January 26, 2026 20:51

JadeCara requested review from galvana and thabofletcher and removed request for a team January 26, 2026 20:51

greptile-apps bot reviewed Jan 26, 2026

galvana approved these changes Jan 26, 2026

changelog/7249.yaml (outdated, resolved)

Update changelog/7249.yaml

c467e74

Co-authored-by: Adrian Galvan <adrian@ethyca.com>

JadeCara added this pull request to the merge queue Jan 26, 2026

Merged via the queue into main with commit 1a30b6c Jan 26, 2026
55 checks passed

JadeCara deleted the ENG-2126-deferred-loading branch January 26, 2026 23:25

JadeCara added a commit that referenced this pull request Jan 29, 2026

Add SQLAlchemy deferred loading (#7249)

029f8b1

Co-authored-by: Jade Wibbels <jade@ethyca.com> Co-authored-by: Adrian Galvan <adrian@ethyca.com>

JadeCara merged 5 commits into main from ENG-2126-deferred-loading

2 participants
