People Matter

Site Reliability Engineer (SRE) - CloudVision

Arista Networks

Arista Networks

Software Engineering
Vancouver, BC, Canada
Posted on Wednesday, June 12, 2024

Company Description

Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in an increasingly interconnected world. Our solutions are designed to not only meet the current demands of the digital landscape but to also anticipate and adapt to future challenges.

At Arista we value the diversity of thought and perspectives that each employee brings to the table. We believe that fostering an inclusive environment, where individuals from various backgrounds and experiences feel welcome, is essential for driving creativity and innovation.

Our commitment to excellence has earned us several prestigious awards, such as Best Engineering Team, Best Company for Diversity, Compensation, and Work-Life Balance. At Arista, we take pride in our track record of success and strive to maintain the highest standards of quality and performance in everything we do.

Job Description

Who You’ll Work With

SREs at Arista combine strong software and systems engineering with a passion for operating production systems at scale. As an SRE you’ll be part of the team responsible for our global service fleet.

What You’ll Do
As an SRE you’ll be responsible for our global CloudVision service fleet. This includes:

  • Building the CI/CD lifecycle for services, from inception and design to deployment and scaling
  • Improving operational processes through automation
  • Identifying key service indicators to be used in capacity planning
  • Owning disaster recovery and management
  • Driving infrastructure and cloud-based application security design
  • Leading sustainable incident response and blameless postmortems
  • Being an active member of our globally distributed on-call team

Arista’s CloudVision is an enterprise network management and streaming telemetry SaaS offering. CloudVision is deployed on Kubernetes across global regions using Spinnaker for our CI/CD pipeline. Our tech stack runs on GKE, using HBase/Hadoop as main distributed database and storage layer, ElasticSearch for powering search data, ClickHouse for fast real time queries of flow data, our own Kafka-based distributed real time stream processing layer for analytics, and TensorFlow for ML analysis. Our monitoring system is built on top of Prometheus, Grafana, Loki, and other OSS tools.

Qualifications

  • BS/MS degree in Computer Science or a relevant experience subject.
  • 4+ years software engineering experience.
  • Experience developing or managing deployments of distributed database systems or scale out applications for a SaaS environment.

Compensation Information:

The new hire base pay for this role has a salary range of $100,000 to $140,000. Arista offers different pay ranges based on work location, so that we can offer consistent and competitive pay appropriate to the market. The actual base pay offered will be based on a wide range of factors, including skills, qualifications, relevant experience, and work location. The pay range provided reflects base pay only and in addition certain roles may also be eligible for discretionary Arista bonuses and equity. Employees in Sales roles are eligible to participate in Arista’s Sales Incentive Plan, which pays commissions calculated as a percentage of eligible sales. US-based employees are also entitled to benefits including medical, dental, vision, wellbeing, tax savings and income protection. The recruiting team can share more details during the hiring process specific to the role and location.

#LI-GR1