pgdedup: Self-hosted PostgreSQL backup manager with BTRFS block-level deduplication
Innovative PostgreSQL backup solution with BTRFS block-level deduplication. Single binary with zero runtime dependencies and advanced backup management.
pgdedup is a pragmatic tool for managing PostgreSQL backups, aimed at organizations with limited storage that want an efficient, self-hosted backup strategy. It uses the BTRFS filesystem together with dduper to achieve block-level deduplication, potentially reducing storage requirements by up to 85% for consecutive backups.
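To make the 85% figure concrete, here is a rough back-of-the-envelope sketch. It assumes one particular reading of the claim: the first backup is stored in full, and each later backup shares a fixed fraction of its blocks with the backups already on disk. The function name and the numbers are illustrative and do not come from pgdedup itself.

```python
def deduplicated_usage(full_size_gb, backups, shared_fraction):
    """Estimate on-disk usage (GB) for a chain of full backups after dedup.

    Assumption: the first backup is stored whole, and every later backup
    contributes only its non-shared blocks.
    """
    if backups < 1:
        raise ValueError("need at least one backup")
    if not 0.0 <= shared_fraction <= 1.0:
        raise ValueError("shared_fraction must be between 0 and 1")
    unique_per_backup = full_size_gb * (1.0 - shared_fraction)
    return full_size_gb + (backups - 1) * unique_per_backup


# Seven nightly backups of a 100 GB cluster, 85% of blocks shared each night.
naive = 100 * 7
estimated = deduplicated_usage(100, 7, 0.85)
print(f"without dedup: {naive} GB, with dedup: about {estimated:.0f} GB")
```

Under these assumptions the saving grows with the number of retained backups, because only the first copy is paid for in full. Real results depend on how much of the database actually changes between runs.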
The core innovation lies in how backups are stored: instead of relying on traditional incremental or compressed backups, pgdedup takes full backups with pg_basebackup, stores them uncompressed on BTRFS, and then uses filesystem-level deduplication to minimize the space actually consumed. Keeping the backups uncompressed matters, because compression tends to make otherwise identical data differ on disk and leaves the deduplicator little to share. The approach is simple and can save significant storage, particularly for databases with predictable change patterns.
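The backup step described above can be sketched as a command builder. This is a minimal illustration, not pgdedup's actual invocation: the host, user, target path, and the choice of WAL and checkpoint options are assumptions. Only standard pg_basebackup options are used, and no compression flag is passed, so the plain-format output stays uncompressed. The deduplication pass with dduper is left out because the source does not describe how it is invoked.

```python
import shlex


def basebackup_command(target_dir, host="localhost", user="postgres"):
    """Build a pg_basebackup command for a full, uncompressed backup.

    All argument values are hypothetical defaults chosen for illustration.
    """
    return [
        "pg_basebackup",
        "--host", host,
        "--username", user,
        "--pgdata", target_dir,     # directory on the BTRFS volume
        "--format=plain",           # plain files, no tar archive
        "--wal-method=stream",      # include the WAL needed for a consistent copy
        "--checkpoint=fast",        # start right away instead of waiting
    ]


cmd = basebackup_command("/mnt/backups/2024-03-31")
print(shlex.join(cmd))
# To actually run it: subprocess.run(cmd, check=True)
```

Building the command as a list rather than a string avoids shell-quoting problems if a path contains spaces.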
While still in early beta, pgdedup already offers compelling features such as Grandfather-Father-Son (GFS) retention policies, point-in-time recovery support, and a minimalist design with a compact 4.5 MB binary. Its zero runtime dependencies and straightforward configuration make it an attractive option for database administrators seeking a lightweight, self-managed backup solution.
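For readers unfamiliar with Grandfather-Father-Son retention, the following sketch shows the general idea: keep the most recent daily backups, plus the newest backup from each of the last few weeks and months. The function, its parameters, and the default counts are hypothetical; pgdedup's own retention settings are not described here and may differ.

```python
from datetime import date, timedelta


def gfs_keep(backup_dates, daily=7, weekly=4, monthly=12):
    """Return the set of backup dates a simple GFS policy would retain."""
    newest_first = sorted(set(backup_dates), reverse=True)
    keep = set(newest_first[:daily])  # sons: most recent daily backups

    def keep_newest_per_bucket(bucket_of, limit):
        seen = []
        for d in newest_first:
            bucket = bucket_of(d)
            if bucket in seen:
                continue  # a newer backup already represents this bucket
            if len(seen) == limit:
                break     # enough buckets covered
            seen.append(bucket)
            keep.add(d)

    # fathers: newest backup in each ISO (year, week)
    keep_newest_per_bucket(lambda d: tuple(d.isocalendar())[:2], weekly)
    # grandfathers: newest backup in each (year, month)
    keep_newest_per_bucket(lambda d: (d.year, d.month), monthly)
    return keep


# Sixty consecutive daily backups ending on 2024-03-31.
dates = [date(2024, 3, 31) - timedelta(days=i) for i in range(60)]
kept = gfs_keep(dates)
print(len(kept), "of", len(dates), "backups retained")
```

Everything not in the returned set would be a candidate for deletion. With deduplicated storage, removing a backup only frees the blocks that no remaining backup still shares.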