People Matter

Staff Site Reliability Engineer (Customer Identity Cloud)

Okta

Software Engineering, Customer Service
United States
Posted on Jul 5, 2024

Get to know Okta

Okta is The World’s Identity Company. We free everyone to safely use any technology—anywhere, on any device or app. Our Workforce and Customer Identity Clouds enable secure yet flexible access, authentication, and automation that transforms how people move through the digital world, putting Identity at the heart of business security and growth.

At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences.

Join our team! We’re building a world where Identity belongs to you.

As a Staff Site Reliability Engineer, you will champion all things pertaining to reliability at Okta on our Customer Identity Cloud (CIC) product. Working closely with product engineers, quality engineers, platform engineers and architecture teams, your primary focus will be on ensuring production systems remain operational at all times, while continually setting and achieving long-term performance, reliability and scalability goals in a platform with an exponential growth plan for the coming years.

With CIC’s increased dedication to ensuring customer availability expectations are exceeded in every way, you will play a key role as we evolve our system architecture to meet the demands of enormous growth and support the hundreds of millions of users who rely on us to provide uninterrupted access to business-critical enterprise and consumer applications.

You will:

  • Be a core contributor to Okta’s FedRAMP initiative
  • Collaborate with engineering teams to improve availability, reliability, and observability of their services.
  • Participate in regular on-call rotations to ensure 24/7 coverage of all critical systems
  • Use existing monitoring tools to identify problems and resolve and/or escalate to service teams
  • Implement changes to enable or improve infrastructure resilience, monitoring, and alerting
  • Lead the development and continuous refinement of SRE tools and processes to improve software delivery, observability, reliability, and operational efficiency.
  • Code, script, and develop daily in Go, Terraform, Helm, etc.
  • Optimize existing systems and eliminate toil through simplification and automation.

You might be a good fit if you:

  • Hold U.S. Person status (e.g., a U.S. Citizen, National, Lawful Permanent Resident, Refugee or Asylee)*
  • Have experience working in a FedRAMP environment
  • Have 6+ years of industry experience as a Site Reliability Engineer or in adjacent disciplines (DevOps, Platform, etc.)
  • Are proficient in Golang
  • Have experience in managing infrastructure with Terraform at scale
  • Are comfortable working with a fully distributed team
  • Have 4+ years of experience as a software developer in a SaaS environment
  • Have experience in a production environment supporting large-scale, mission-critical applications
  • Have demonstrable expertise working with Microsoft Azure and/or Amazon Web Services.
  • Have production on-call experience in a 24/7 cloud-based environment
  • Have a good understanding of microservices, cloud infrastructure (AWS, Azure, GCP), databases (SQL, NoSQL, key/value), containers (Docker, Kubernetes), web technologies (WebSockets, HTTP) and networking (SSL, routing, VPN)
  • Have exceptional communication skills, including technical writing in English
  • Have a systematic problem-solving approach, coupled with a strong sense of ownership and drive
  • Are comfortable with the Agile software development methodology
  • Love working as part of a team, but can work effectively in a remote environment where tasks may be self-driven

Below is the annual base salary range for candidates located in California, Colorado, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: https://rewards.okta.com/us.

The annual base salary range for this position for candidates located in California (excluding San Francisco Bay Area), Colorado, New York, and Washington is between:
$160,000 and $240,000 USD
The annual base salary range for this position for candidates located in the San Francisco Bay Area is between:
$179,000 and $269,000 USD

What you can look forward to as a full-time Okta employee!

Okta cultivates a dynamic work environment, providing the best tools, technology and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility with which it works, so that all employees are enabled to be the most creative and successful versions of themselves, regardless of where they live. Find your place at Okta today! https://www.okta.com/company/careers/.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable laws. If reasonable accommodation is needed to participate in the job application or interview process, please use this Form to request an accommodation.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Privacy Policy at https://www.okta.com/privacy-policy/.