Fix the logical replication timeout during large transactions.
authorAmit Kapila <[email protected]>
Wed, 11 May 2022 05:21:04 +0000 (10:51 +0530)
committerAmit Kapila <[email protected]>
Wed, 11 May 2022 05:21:04 +0000 (10:51 +0530)
commitd6da71fa8f28faa68823e163f318ffb38a7a9a54
tree6a384644d3e56ec4f4ccc1232d87f1eff0a41df8
parentca9e9b08e453523314a3b8e87d1894edb23b6e8d
Fix the logical replication timeout during large transactions.

The problem is that we don't send keep-alive messages for a long time
while processing large transactions during logical replication where we
don't send any data of such transactions. This can happen when the table
modified in the transaction is not published or because all the changes
got filtered. We do try to send the keep_alive if necessary at the end of
the transaction (via WalSndWriteData()) but by that time the
subscriber-side can timeout and exit.

To fix this we try to send the keepalive message if required after
processing certain threshold of changes.

Reported-by: Fabrice Chapuis
Author: Wang wei and Amit Kapila
Reviewed By: Masahiko Sawada, Euler Taveira, Hou Zhijie, Hayato Kuroda
Backpatch-through: 10
Discussion: https://postgr.es/m/CAA5-nLARN7-3SLU_QUxfy510pmrYK6JJb=bk3hcgemAM_pAv+w@mail.gmail.com
src/backend/replication/logical/logical.c
src/backend/replication/pgoutput/pgoutput.c
src/backend/replication/walsender.c
src/include/replication/logical.h