Data Pipeline & AI Infrastructure Developer

Remote Full-time
We're looking for an experienced machine learning and data engineer to build the systems that power our embodied AI research and production. In this role, you'll own the build-out of critical components of our data pipelines and compute infrastructure, ensuring our research team has reliable, high-performance platforms to train and deploy advanced robotics models. Data Pipelines You'll build and maintain large-scale data ingestion systems that capture multimodal robotics data (video, point clouds, proprioception, and action trajectories), handling the end-to-end flow from ingestion through transformation, quality assurance, and delivery to training systems. You'll ensure data reliability, versioning, and reproducibility across terabytes of embodied data while building observability and dataset management tooling. Your work directly determines the quality and scale of data our AI systems learn from. AI Cluster Infrastructure You'll architect and operate our training infrastructure—Kubernetes-based HPC clusters, GPU orchestration, distributed training, and model deployment—optimizing resource allocation, monitoring cluster health, and ensuring high availability. You'll build automation and tooling that makes research code production-ready, enables efficient multi-tenant experiments, and lets the team move fast. Your infrastructure enables breakthroughs in robotic intelligence. What you bring You're fluent in Python and comfortable with systems languages (C, C++, Rust, or Go). You have deep experience building data pipelines or infrastructure at scale. You know Kubernetes, distributed systems, and HPC environments well. You've worked with large-scale data storage, workflow orchestration, and compute resource management. You understand Linux systems, networking, and real-time constraints. You bridge the gap between research and production. You debug across layers and value reliability, observability, and clean abstractions. You're excited to work in a fast-moving environment where your infrastructure directly enables cutting-edge AI research and real-world robotic deployments. Apply tot his job
Apply Now →

Similar Jobs

Software Engineer, Data Platform-Slack (Senior SWE/Staff SWE)

Remote Full-time

Data Platform Support Engineer

Remote Full-time

Analytics Platform Engineer Associate

Remote Full-time

Senior Software Engineer (Data Platform)

Remote Full-time

Senior/Staff Software Engineer, Data

Remote Full-time

Data Platform Engineer

Remote Full-time

Senior Privacy Analyst, FedRAMP

Remote Full-time

Data Loss Prevention (DLP) Analyst

Remote Full-time

Cyber Security Analyst @ Texas Remote in USA

Remote Full-time

IT Security Analyst 3 - IS - Data Security - FT - Day - Remote SoCal

Remote Full-time

Lighting Designer

Remote Full-time

**Virtual Customer Service Associate – Empathetic and Skilled Contact Center Professional**

Remote Full-time

Identity Access Manager Engineer – Sr Level – Remote in Cumberland, MD

Remote Full-time

Sales Associate – Amazon Store

Remote Full-time

**Experienced Customer Support Agent (Remote) - Night Shift at arenaflex**

Remote Full-time

Environmental Program Specialist 4 (PCN 187474)

Remote Full-time

**Entry-Level Data Entry Clerk - Logistics and Supply Chain Solutions - Immediate Hiring with Comprehensive Training**

Remote Full-time

Experienced Seasonal Customer Support Agent – Chat & Phone Representative for Exceptional Customer Experience

Remote Full-time

Warehouse Worker - Order Selector - 1st Shift

Remote Full-time

**Experienced Customer Service Representative – Technical Support Specialist (Remote 24/7) – Florida or New York Residents**

Remote Full-time
← Back to Home