Cloud Reliability Engineer - Ops & Automation

ThoughtSpot

Operations
Hyderabad, Telangana, India
Posted on Oct 1, 2024

We are looking for a Cloud Reliability Engineer to join our team and focus on maintaining the reliability and availability of our cloud-based infrastructure. The ideal candidate will work in shifts and handle on-call duties to ensure smooth cloud operations by managing incidents and change requests within defined SLAs. Additionally, you'll contribute to efforts to automate operational tasks, reduce manual interventions, and improve overall efficiency.

Responsibilities:

  • Monitor and manage cloud-based infrastructure to ensure high availability, performance, and security.

  • Respond to alerts and incidents, troubleshooting and resolving issues swiftly to minimize downtime.

  • Perform root cause analysis and post-incident reviews to improve system reliability and prevent future incidents.

  • Handle change requests within established SLAs, ensuring seamless updates to the production environment.

  • Participate in a shift-based schedule and on-call rotation to support critical infrastructure.

  • Collaborate with Engineering and Field teams to resolve service requests in a timely manner.

  • Automate routine operational tasks to reduce manual interventions and operational toil.

  • Identify opportunities for further automation in cloud operations and implement solutions to streamline processes.

  • Assist in the optimisation and maintenance of monitoring and alerting systems for cloud environments.

Required Skills/Qualifications:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.

  • 3-5 years of experience in cloud operations, system administration, or related fields.

  • Familiarity with cloud platforms such as AWS, GCP, or Azure.

  • Experience in automating operational tasks using scripting languages (e.g., Python, Bash).

  • Strong problem-solving skills, particularly in managing incidents under pressure.

  • Understanding of ITIL processes, including incident and change management.

  • Familiarity with monitoring tools and incident management platforms.

  • A proactive mindset focused on improving operational processes and reducing manual work through automation.

Preferred Skills:

  • Experience with cloud-native tools (e.g., CloudWatch, Stackdriver) and automation frameworks.

  • Basic knowledge of containers and Kubernetes (EKS/GKE/AKS).

  • Familiarity with Linux systems, cloud networking, and troubleshooting.

  • Experience with CI/CD pipelines and DevOps tools for automation and infrastructure as code.

  • Interest in identifying and implementing automation to reduce operational toil and improve efficiency.

Working Conditions:

  • Shift-based work with on-call responsibilities.

  • Fast-paced, collaborative environment with an emphasis on automation and cloud operations.

What makes ThoughtSpot a great place to work?

ThoughtSpot is the experience layer of the modern data stack, leading the industry with our AI-powered analytics and natural language search. We hire people with unique identities, backgrounds, and perspectives—this balance-for-the-better philosophy is key to our success. When paired with our culture of Selfless Excellence and our drive for continuous improvement (2% done), ThoughtSpot cultivates a respectful culture that pushes norms to create world-class products. If you’re excited by the opportunity to work with some of the brightest minds in the business and make your mark on a truly innovative company, we invite you to read more about our mission, and apply to the role that’s right for you.

ThoughtSpot for All

Building a diverse and inclusive team isn't just the right thing to do for our people, it's the right thing to do for our business. We know we can’t solve complex data problems with a single perspective. It takes many voices, experiences, and areas of expertise to deliver the innovative solutions our customers need. At ThoughtSpot, we continually celebrate the diverse communities that individuals cultivate to empower every Spotter to bring their whole authentic self to work. We’re committed to being real and continuously learning when it comes to equality, equity, and creating space for underrepresented groups to thrive. Research shows that in order to apply for a job, women feel they need to meet 100% of the criteria while men usually apply after meeting 60%. Regardless of how you identify, if you believe you can do the job and are a good match, we encourage you to apply.