Backend Intern - Inference Pipelines & Diagnostics

Sarvam AI

Sarvam AI

Software Engineering

Bengaluru, Karnataka, India

Posted on May 7, 2026

Location

Bengaluru

Employment Type

Full time

Location Type

On-site

Department

Engineering

About Sarvam

Sarvam is building the bedrock of Sovereign AI for India. The company is developing India's full-stack sovereign AI platform, building across research, models, infrastructure and applications with a singular focus on making AI genuinely work for India. Sarvam works with leading enterprises and public institutions and is backed by Lightspeed, Peak XV, and Khosla Ventures. Sarvam partners with India's leading brands, including Tata Capital, SBI Life, CRED, IDFC, and LIC.

About the Role

We're looking for a Backend Intern to join Sarvam's engineering team and own meaningful workstreams in two critical areas: our inference pipeline infrastructure and diagnostics API services. You'll work on the systems that serve AI models at scale, help build robust APIs for diagnostics and observability, and contribute to the data pipelines that keep everything running reliably. Strong performers will be fast-tracked to a full-time offer at the end of the internship. Preferred background: AI/ML or Computer Science.

What You'll Do

• Build and optimise backend services for LLM inference pipelines in Python or Node.js

• Develop and maintain diagnostics API services for model observability and health monitoring

• Integrate LLM APIs and manage request routing, latency, and error handling across inference flows

• Design and query SQL and NoSQL databases to support pipeline state management and diagnostics data

• Build and maintain data pipelines to support inference workloads and operational metrics

• Deploy and manage services on cloud infrastructure (AWS or GCP) using version-controlled codebases on Git

• Collaborate with ML engineers and platform teams to debug, profile, and improve system performance

What We're Looking For

• Proficiency in Python or Node.js; comfortable writing clean, production-quality backend code

• Solid understanding of REST API design, including diagnostics and observability endpoints

• Familiarity with SQL and at least one NoSQL database (e.g. MongoDB, Redis, or DynamoDB)

• Working knowledge of Git for version control and collaborative development

• Basic exposure to cloud platforms — AWS or GCP

• Interest in LLMs and familiarity with LLM API integration patterns

• Background in Computer Science, AI, or Machine Learning preferred

Bonus Points

• Prior exposure to inference serving frameworks (e.g. vLLM, TGI, Triton, or similar)

• Experience with monitoring and observability tooling (e.g. Prometheus, Grafana, or OpenTelemetry)

• Familiarity with containerisation and orchestration (Docker, Kubernetes)

• Contributions to open-source projects in backend or ML infrastructure

Why Sarvam?

Sarvam is a fast-moving, high talent-density team building full-stack AI for India, working on problems that push the frontiers of AI with real population-scale impact.

• Work alongside researchers, engineers, builders, and business leaders who move fast and hold each other to a very high bar

• High ownership and high impact, from day one

• Everything we do is AI-first, from the way we build and ship to the way we think about problems

• You can work on problems that could change how an entire country learns, works, and communicates

If you want to work on problems at the frontier of AI in India, Sarvam is the place to be.