Sr. Forward Deployed Software Engineer - Dubbing Platform
Sarvam AI
Software Engineering
Bengaluru, Karnataka, India
Location
Bengaluru
Employment Type
Full time
Location Type
On-site
Department
Engineering
About Sarvam
Sarvam is building the bedrock of Sovereign AI for India. The company is developing India’s full-stack sovereign AI platform, building across research, models, infrastructure and applications with a singular focus on making AI genuinely work for India. Sarvam works with leading enterprises and public institutions and is backed by Lightspeed, Peak XV, and Khosla Ventures. Sarvam partners with India’s leading brands, including Tata Capital, SBI Life, CRED, IDFC, and LIC.
About the Role
We’re looking for a Senior FDSE to lead complex, high-touch enterprise deployments of Sarvam’s AI Dubbing Platform. You will be the senior technical partner to media companies, OTT platforms, content studios, and enterprise localization teams — owning everything from integration architecture to pipeline tuning, while ensuring clients achieve production-quality multilingual dubbed content at scale.
Beyond hands-on deployment and support, you will set the technical standard for field engineering on the dubbing platform: defining integration playbooks, driving escalation resolution, and feeding product-critical field intelligence back to the dubbing engineering team. This role carries significant customer-facing and mentoring responsibility.
You will co-own the most ambitious content localization problem in India: building a platform that dubs video into 12+ Indian languages while preserving speaker voice, tone, and timing. From ASR accuracy tuning to TTS voice quality and translation fidelity, this is a platform-defining role at the intersection of ML, media, and enterprise delivery.
What You’ll Do
Lead end-to-end integration of Sarvam’s dubbing platform into enterprise content workflows (OTT, media houses, ed-tech, enterprise L&D)
Own the technical relationship with strategic accounts — scoping requirements, designing integration architecture, and ensuring production readiness
Debug and resolve complex pipeline issues across the full dubbing stack: audio separation, ASR, translation, TTS, and video stitching
Tune pipeline parameters (VAD thresholds, translation glossaries, TTS voice profiles, audio mixing) for client-specific content types
Drive presales engagements — leading technical discovery, scoping POC deployments, and presenting to content/engineering leadership
Build and maintain integration playbooks, API guides, and troubleshooting runbooks for the dubbing platform
Define SLA governance across enterprise accounts — setting expectations for turnaround time, quality benchmarks, and escalation resolution
Act as the primary technical liaison between enterprise clients and Sarvam’s dubbing product and ML engineering teams
Mentor and provide technical guidance to FDSE engineers in the field
Contribute fixes and improvements back to internal platform codebases when client deployments surface bugs or gaps
What We’re Looking For
5–8 years of experience in field engineering, solutions engineering, technical account management, or senior client-facing engineering roles
Strong Python proficiency — ability to read, debug, and contribute to production FastAPI services and ML pipelines (non-negotiable)
Experience with audio/video processing workflows: FFmpeg, codec pipelines, media formats, or streaming infrastructure (non-negotiable)
Proven track record working with enterprise media, OTT, or content localization clients
Comfort operating across the stack: REST APIs, async job queues (Celery/Redis or similar), PostgreSQL, cloud storage (Azure/GCP/AWS), Kubernetes
Strong debugging instincts — ability to trace failures across distributed systems (API → queue → worker → ML inference → storage)
Experience owning SLA management and escalation governance across multiple enterprise accounts
Excellent communication skills — comfortable engaging CXO/VP-level stakeholders at media companies
Bonus Points
Prior experience with speech/NLP systems: ASR, TTS, machine translation, or audio ML
Familiarity with Indic languages and the nuances of multilingual content (code-mixing, transliteration, regional dialects)
Experience with ML serving infrastructure: Triton, ONNX Runtime, or similar model-serving frameworks
Background in media localization, subtitling, or dubbing workflows (even manual/traditional)
Experience with WebSocket-based real-time systems or event-driven architectures
Contributions to open-source audio/video/NLP tools
Why Sarvam?
Sarvam is a fast-moving, high talent-density team building full-stack AI for India, working on problems that push the frontiers of AI with real population-scale impact.
Work alongside researchers, engineers, builders, and business leaders who move fast and hold each other to a very high bar
High ownership and high impact, from day one
Everything we do is AI-first, from the way we build and ship to the way we think about problems
You can work on problems that could change how an entire country learns, works, and communicates
If you want to work on problems at the frontier of AI in India, Sarvam is the place to be.