Job Description:
• Manage the lifecycle of Postgres databases - platform RDS clusters and customer project databases.
• Design and execute strategies for low-downtime major version upgrades and database migrations.
• Proactively identify and resolve database performance issues before they impact users.
• Build and maintain comprehensive monitoring, alerting, and observability for database systems.
• Write detailed run books, technical documentation, and operational guides.
• Identify reliability risks and implement preventative measures.
• Participate in on-call rotation to support our global platform.
• Work with development teams to optimize database schema and query patterns.
• Analyze and optimize slow queries, connection pooling, and resource utilization.
• Tune Postgres configurations for different workload patterns.
• Monitor and address database bloat, vacuum strategies, and WAL management.
• Partner with platform engineers, product teams, and SREs to deliver reliable database services.
• Communicate database changes and maintenance windows clearly to stakeholders.
• Share knowledge and mentor team members on Postgres best practices.
Requirements:
• Deep understanding of Postgres internals, architecture, and advanced features.
• Production experience with replication (logical and physical), backups, and disaster recovery.
• Strong command of query optimization, EXPLAIN plans, indexing strategies, and performance tuning.
• Experience managing Postgres at scale in cloud environments.
• Hands-on experience with AWS RDS for platform infrastructure.
• Familiarity with cloud infrastructure concepts, networking, and storage systems.
• Understanding of IaC tools and automation approaches.
• Experience with other cloud database services (GCP Cloud SQL, Azure Database) is a plus.
• Track record of maintaining high-availability database systems.
• Obsessive about monitoring, observability, and measuring what matters.
• Proactive approach to identifying and mitigating risks.
• Experience with production troubleshooting and supporting live systems.
• You write clear run books and technical documentation.
• You're good at explaining complex database concepts to different audiences.
• You record decisions and share knowledge effectively in async environments.
• You thrive operating independently with high-level guidance.
• You see problems through to resolution, not just escalation.
• You automate repetitive tasks and build tools to make the team more effective.
• Proficiency in TypeScript or Go (we can teach these).
• Experience with Postgres extensions and customization.
• Contributions to Postgres or database-related open source projects.
• Familiarity with backup tools like WAL-G, pgBackRest, or Barman.
• Experience with database migration strategies and tooling.
• Background in SRE or DevOps practices.
Benefits:
• Fully Remote
• ESOP
• Tech Allowance
• Health Benefits
• Annual Off-Sites
• Flexible Work
• Professional Development