Posted Mar 7, 2026

Data Platform Reliability Engineer, Postgres

Job Description:
• Manage the lifecycle of Postgres databases - platform RDS clusters and customer project databases.
• Design and execute strategies for low-downtime major version upgrades and database migrations.
• Proactively identify and resolve database performance issues before they impact users.
• Build and maintain comprehensive monitoring, alerting, and observability for database systems.
• Write detailed run books, technical documentation, and operational guides.
• Identify reliability risks and implement preventative measures.
• Participate in on-call rotation to support our global platform.
• Work with development teams to optimize database schema and query patterns.
• Analyze and optimize slow queries, connection pooling, and resource utilization.
• Tune Postgres configurations for different workload patterns.
• Monitor and address database bloat, vacuum strategies, and WAL management.
• Partner with platform engineers, product teams, and SREs to deliver reliable database services.
• Communicate database changes and maintenance windows clearly to stakeholders.
• Share knowledge and mentor team members on Postgres best practices.

Requirements:
• Deep understanding of Postgres internals, architecture, and advanced features.
• Production experience with replication (logical and physical), backups, and disaster recovery.
• Strong command of query optimization, EXPLAIN plans, indexing strategies, and performance tuning.
• Experience managing Postgres at scale in cloud environments.
• Hands-on experience with AWS RDS for platform infrastructure.
• Familiarity with cloud infrastructure concepts, networking, and storage systems.
• Understanding of IaC tools and automation approaches.
• Experience with other cloud database services (GCP Cloud SQL, Azure Database) is a plus.
• Track record of maintaining high-availability database systems.
• Obsessive about monitoring, observability, and measuring what matters.
• Proactive approach to identifying and mitigating risks.
• Experience with production troubleshooting and supporting live systems.
• You write clear run books and technical documentation.
• You're good at explaining complex database concepts to different audiences.
• You record decisions and share knowledge effectively in async environments.
• You thrive operating independently with high-level guidance.
• You see problems through to resolution, not just escalation.
• You automate repetitive tasks and build tools to make the team more effective.
• Proficiency in TypeScript or Go (we can teach these).
• Experience with Postgres extensions and customization.
• Contributions to Postgres or database-related open source projects.
• Familiarity with backup tools like WAL-G, pgBackRest, or Barman.
• Experience with database migration strategies and tooling.
• Background in SRE or DevOps practices.

Benefits:
• Fully Remote
• ESOP
• Tech Allowance
• Health Benefits
• Annual Off-Sites
• Flexible Work
• Professional Development