Sr Software Development Engineer
Volterra
At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.
Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive.
Position Summary: At F5 Inc., we strive to bring better digital operations through intelligent automation and remediation. We're seeking a Senior Software Development Engineer with strong expertise in runbook automation, observability platforms, cloud infrastructure, and SRE/DevOps practices to design and implement intelligent automation services that enhance incident response, enable self-healing capabilities, and improve operational reliability across distributed cloud platforms.
What You'll Do:
Design and Develop Scalable Automation Solutions
- Design, develop, and implement automation, remediation, and runbook execution services on distributed cloud platforms
- Build and maintain observability pipelines, alerting mechanisms, runbook execution workflows, and automated response systems
- Incorporate engineering best practices into design, coding, testing, debugging, documenting, and support of automation and runbook solutions
- Work closely with peers to design and code robust, scalable automation services and runbook orchestration engines integrated with cloud infrastructure
- Develop integration layers with ITSM tools, monitoring platforms, incident management systems, and runbook repositories
- Design and develop runbook automation frameworks with parameterization, approval workflows, and execution tracking
- Implement ML-based anomaly detection, predictive analytics, and intelligent runbook selection for proactive incident prevention
- Design and implement test automation for automation frameworks, runbook execution, and remediation workflows
- Scale and optimize control plane components to support system reliability and operational efficiency
Drive Reliability and Performance
- Collaborate with SREs, QA, and engineering teams to ensure high availability and performance of automation and runbook systems
- Continuously reduce toil through intelligent automation and runbook development
- Support operational issues with timely fixes and provide proper root cause analysis
- Participate in incident response and implement lessons learned through automation improvements
- Apply SRE principles including SLIs, SLOs, and error budgets to automation platforms
Collaborate and Grow
- Participate in code reviews, technical design discussions, and architecture planning
- Mentor junior engineers on automation best practices, runbook development, and system design
- Work with cross-functional teams to understand operational pain points and build automation solutions
- Contribute to the continuous improvement of development processes and platform capabilities
Qualifications:
Must-Have:
- Software Engineering Experience – 6-8 years of software development experience with 3+ years in DevOps, SRE, automation, runbook development, or related fields
- Programming Skills – Strong programming skills in Python, Go, Java, or similar languages
- Runbook Automation & Observability – Hands-on experience with:
- Runbook automation platforms (Rundeck, StackStorm, Ansible Tower/AWX, Ansible)
- Workflow orchestration or job scheduling systems
- Observability tools (Prometheus, Grafana, ELK Stack, Splunk, Datadog)
- Monitoring frameworks, alerting mechanisms, and log aggregation
- Incident management and ITSM tool integration (ServiceNow, PagerDuty, Jira)
- Linux & Kubernetes – Solid experience with:
- Linux system administration and shell scripting
- Container technologies (Docker) and orchestration systems (Kubernetes)
- Understanding of Kubernetes architecture, deployments, and services
- Container image optimization and management
- Cloud Platforms – Hands-on experience with cloud platforms (AWS, Azure, or GCP) and cloud-native technologies
- Understanding of cloud services for compute, storage, networking, and serverless
- Experience with at least one major cloud provider's core services
- SRE/DevOps Practices – Experience with:
- Infrastructure as Code (Terraform, Ansible, CloudFormation)
- CI/CD methodology, Git-based workflows, and modern development practices
- Site Reliability Engineering principles and practices
- Incident response and on-call support
- Networking & Architecture – Understanding of:
- L3-L7 networking concepts and protocols
- Microservices architecture and API design
- Event-driven architectures and messaging systems
- Database Systems – Database experience with SQL and/or NoSQL technologies
- Communication & Problem-Solving – Excellent written and verbal communication skills, strong analytical, troubleshooting, and end-to-end debugging capabilities
Nice-to-Have:
- Advanced Observability – Experience with:
- Time-series databases (InfluxDB, TimescaleDB, Prometheus)
- Distributed tracing and APM tools (Jaeger, Zipkin, New Relic, Dynatrace)
- OpenTelemetry implementation and customization
- AI/ML for Operations – Understanding or basic experience with:
- Machine learning concepts and their application in IT operations
- Anomaly detection and predictive analytics
- Intelligent alerting and log analysis techniques
- Advanced Automation Tools – Experience with:
- Configuration management tools (Chef, Puppet, SaltStack)
- ChatOps frameworks and integrations
- Service mesh technologies (Istio, Linkerd)
- Cloud & Kubernetes – Experience with:
- Multiple cloud providers (AWS, Azure, GCP)
- Advanced Kubernetes features (operators, CRDs, custom controllers)
- GitOps workflows (ArgoCD, Flux)
- Helm and Kustomize
- Additional Skills:
- Experience with streaming platforms (Kafka, Kinesis, Pub/Sub)
- Knowledge of chaos engineering and resilience testing
- Understanding of incident management and ITIL principles
- Familiarity with eBPF for observability
- Cloud certifications (AWS, Azure, GCP, CKA, CKAD)
- Collaboration – Proven ability to work effectively with SREs, QA, application developers, and network engineers
Education:
- Typically requires a minimum of 10+ years of related experience with a bachelor's degree; or 3+ years and a master's degree
Environment:
- Freedom and Learning: Embrace an environment that fosters freedom, continuous learning, and ownership
- Mentorship: Benefit from great mentors with solid backgrounds in various areas, eager to contribute to your professional development
- Team Collaboration: Join a great team where you will feel at home from day one, contributing to a positive and supportive workplace culture
The Job Description is intended to be a general representation of the responsibilities and requirements of the job. However, the description may not be all-inclusive, and responsibilities and requirements are subject to change.
Please note that F5 only contacts candidates through F5 email address (ending with @f5.com) or auto email notification from Workday (ending with f5.com or @myworkday.com).
Equal Employment Opportunity
It is the policy of F5 to provide equal employment opportunities to all employees and employment applicants without regard to unlawful considerations of race, religion, color, national origin, sex, sexual orientation, gender identity or expression, age, sensory, physical, or mental disability, marital status, veteran or military status, genetic information, or any other classification protected by applicable local, state, or federal laws. This policy applies to all aspects of employment, including, but not limited to, hiring, job assignment, compensation, promotion, benefits, training, discipline, and termination. F5 offers a variety of reasonable accommodations for candidates. Requesting an accommodation is completely voluntary. F5 will assess the need for accommodations in the application process separately from those that may be needed to perform the job. Request by contacting accommodations@f5.com.