Our solutions, hosted on our IRIS Smart Manufacturing platform, combine equipment and process domain expertise in Mining & Metals, Oil & Gas, and Chemicals & Petrochemicals with the state of the art in data science, machine learning and process optimization. The IRIS platform can work in hybrid mode and is built using a microservices architecture. The applications are containerised, run on Azure Kubernetes Service, and use Azure Blob Storage, Data Lake, IoT Hub, Event Hub, Event Grid and Queues.
The development team embraces a DevOps culture and delivers products using Continuous Delivery principles. You will play a key role in managing multiple installations of the platform across cloud and on-premises environments, fixing security vulnerabilities on a monthly basis, conducting audits and contributing to the automation of installations. You will have a great opportunity to work with a world-class team and the latest technologies, and to learn and contribute.
Job Description
Roles & Responsibilities:
· Contribute to the IRIS Platform operations roadmap and execute planned research and development.
· Deploy and manage cloud infrastructure on Azure and AWS using best practices and governance standards.
· Lead production operations, including incident management, troubleshooting, and root cause analysis.
· Automate deployment so that the IRIS Platform can be installed easily across multiple cloud providers, such as Azure and AWS, and on-premises Kubernetes distributions, such as K3s and RKE2.
· Work with customers and the Services team to support multiple installations of the IRIS platform.
· Conduct monthly audits, including vulnerability scans, DAST tests, and reviews of backups, cost and access.
· Develop CI/CD pipelines for microservices-based projects so that they are deployable across multiple environments.
· Implement security best practices in cloud and on-premise environments to address vulnerabilities and compliance requirements.
· Develop and maintain Helm charts for packaging and deploying Kubernetes applications.
· Manage operations of the databases and messaging systems (Postgres, Elastic, Redis and Kafka), including configuration, scalability and backups.
· Implement auto-scaling of various platform services within the Kubernetes environment.
· Automate infrastructure provisioning with Terraform, Ansible, or similar IaC tools.
· Operate Kubernetes clusters, ensuring scalability, reliability, and security.
· Implement monitoring, logging, and alerting to maintain SLAs and availability targets.
· Apply security best practices and enforce compliance standards across cloud and deployment pipelines.
· Be the operations contact point for multiple product teams.
Mandatory skills:
· 4+ years of hands-on experience with AWS and/or Azure
· 3+ years of experience working with Linux systems
· 4+ years of commercial experience with Kubernetes
· 2+ years of experience working with Docker
· 4+ years of experience setting up CI/CD pipelines (Azure DevOps or similar)
· 2+ years of experience with automation tools such as Terraform and Ansible
· 2+ years of experience with cloud platforms such as Azure, AWS or GCP
· Good knowledge of security best practices, vulnerability management and testing tools such as Acunetix, Snyk, Checkmarx or Trivy
· Able to troubleshoot problems in production and comfortable doing root cause analysis
· Able to work in an Agile environment
Desirable:
· Good working or operational knowledge of databases (Postgres, Elasticsearch, Redis or similar)
· Exposure to configuring web servers such as Nginx
· Working knowledge of monitoring tools such as Grafana and Prometheus
· Working knowledge of a messaging framework such as Event Hub, Kafka, RabbitMQ or similar
About Us
What We Offer
· Competitive salary and benefits package.
· Flexible hybrid working model.
· Opportunities for professional growth and development.
· Collaborative and inclusive work environment.
· Access to the latest technologies and tools.
· Opportunity to make a tangible impact on cutting-edge Retail/CPG AI solutions.