Job Description:
• Design, deploy and maintain cloud infrastructure for data and ML workloads using Infrastructure as Code.
• Manage and evolve AWS-based data platform components running on Kubernetes (EKS).
• Provision and maintain services such as EMR on EKS, SageMaker, MWAA (Managed Airflow), Lambda, API Gateway and Step Functions.
• Implement and maintain IAM roles, permissions and governance policies aligned with compliance requirements.
• Support orchestration frameworks used by data teams (DBT, Airflow, Step Functions).
• Collaborate with data engineers to troubleshoot infrastructure or platform issues affecting pipelines.
• Participate in platform observability initiatives (metrics, logging and monitoring).
• Maintain Terraform modules and deployment pipelines.
• Support platform migrations and organizational AWS changes when required.
• Contribute to platform reliability, scalability and operational excellence.
Requirements:
• 3+ years of experience working with AWS cloud infrastructure
• Strong experience with Terraform or similar Infrastructure as Code tools
• Experience deploying and operating containerized workloads on Kubernetes / EKS
• Solid understanding of AWS IAM, roles and security best practices
• Experience with serverless architectures (Lambda, API Gateway, Step Functions)
• Experience supporting data or ML platforms from an infrastructure perspective
• DevOps mindset and experience managing CI/CD or infrastructure automation
• Strong troubleshooting skills across distributed systems.
Benefits:
• Remote
• Professional development opportunities