Offline
Modal Logo

Software Development — US

The production cloud for AI. Run inference, training, batch processing and sandboxes with sub-second cold starts, instant autoscaling across thousands of GPUs and a developer experience that feels local.

Modal

Member of Technical Staff - Platform Engineering

Full time • New York; Stockholm

$150K – $350K • Offers Equity

The Role:

At Modal, we sell cloud services atop which our customers run their critical production systems. As a rapidly growing new cloud infrastructure company, we seek to improve our reliability dramatically while scaling the size of our platform, customer base, and our team. This role is for people who are deep systems thinkers, love stacking nines, and thrive from making others move faster at scale.

Responsibilities

  • Identifying architectural changes to improve reliability and performance.
  • Fostering a culture of reliability across Modal’s engineering organization.
  • Defining and implementing operational processes such as deployments, upgrades, etc.
  • Operating systems like Kubernetes, Postgres, Redis, etc.
  • Participating in on-call rotations, and responding to production incidents.

Requirements

  • 5+ years of experience writing high-quality production code.
  • 2+ years of on-call experience for critical production services.
  • Strong cloud skills, and deep familiarity with at least one hyperscaler cloud (AWS preferred).
  • Familiarity with auto scaling, fleet management, and capacity planning at scale.
  • Experience operating databases, monitoring, CI/CD, and other infrastructure, at scale
  • Experience owning and scaling Kubernetes clusters to thousands of nodes a plus.
  • Experience with systems safety research (e.g. STAMP) and control theory a plus.
  • Ability to work in-person in our NYC or Stockholm offices.
Apply Now