People Matter

Software Engineer, Infrastructure

Scaled Cognition

Scaled Cognition

Software Engineering, Other Engineering
Mountain View, CA, USA
Posted on Mar 30, 2026

Location

Mountain View; Boston; New York

Employment Type

Full time

Location Type

Hybrid

Department

Engineering

Scaled Cognition is the world’s only model lab dedicated exclusively to customer experience and pioneering agentic models purpose-built for reliable action-taking enterprise applications. Backed by Khosla Ventures, the company’s flagship Agentic Pretrained Transformer (APT) eliminates hallucinations, enforces enterprise policies and increases reliability in real-world CX workflows. Founded by serial AI entrepreneurs, former Microsoft Corporate Vice President of Conversational AI Dan Roth, and UC Berkeley AI Professor Dan Klein, and built by a team of world-class PhD researchers and engineers, Scaled Cognition advances the science of agentic AI to deliver safe, policy-aligned automation that enterprises can trust.

As a Software Engineer, Infrastructure at Scaled Cognition you will:

  • Design and improve inference infrastructure that powers our AI models.

  • Benchmark, profile, monitor and analyze latency and throughput and drive improvements throughout the stack.

  • Collaborate with research scientists and product engineers to streamline the end-to-end process from model development to production deployment.

You might be the right person for the job if you:

  • Are a continuous learner and are eager to explore new tools and technologies.

  • Thrive in a dynamic environment and are comfortable adapting to changing priorities while maintaining a focus on delivering high-quality solutions.

  • Have successfully navigated projects with significant product and technical ambiguity, and you excel at the intersection of complex technical challenges and user-focused solutions.

Key Qualifications:

  • Experience deploying systems on major cloud platforms (AWS, GCP, Azure).

  • Prior experience designing and implementing GPU infrastructure / tooling.

  • A strong sense for scalability and developing secure, highly reliable environments.