People Matter

Senior AI/MLOps Engineer

Tarana Wireless

Software Engineering, Data Science
San Francisco, CA, USA
Posted on Wednesday, June 5, 2024
This position focuses on the end-to-end workflow for our AI/ML models and data. It will require our team members to wear different hats as we scale our deployments, our maturity and our organization.

  • Data engineering - work with our cloud system data engineers and our data scientists to ensure high data quality and reliability in our data warehouse. Design and implement our feature store to support repeatable, high-quality model builds.
  • AI/MLOps - design and implement an improved model workflow including EDA, feature engineering, training, evaluation, deployment and monitoring. Implement versioning, observability and reporting. Monitor efficiency, cost, performance and reliability.
  • DevOps - We are a product-oriented company, so you will need to work with software and DevOps engineers on our teams to build out GitOps pipelines for new APIs, clusters and tools that we develop. This includes deployment tools, security, observability and alerting.

Our process is highly iterative, high-velocity and focused on quality and operational excellence. We use small, product-focused teams that own features and products end-to-end, so you will improve infrastructure and implement strategic changes as part of the product creation and release process. We value general software skills, and the ability to work interactively with an LLM to experiment and implement infrastructure, over experience with specific technologies.

In your first year at Tarana, you will take ownership of the end-to-end model lifecycle, and help us to consistently deliver high-quality, high-reliability, cost-effective machine learning and AI systems as part of various products. You will guide our strategic decisions around technology and implementation by working with other engineers and executives to develop business-focused designs based on experience and empirical data.

This is a hands-on role in a low-overhead team of builders.

Required Skills & Experience:

  • BS or higher in Computer Science, MS preferred
  • 5-12 years of experience building large-scale ML/AI models and systems

Knowledge, Skills and Abilities:

  • Strong understanding of software systems and programming, with and without assistance from an LLM
  • Strong Python, Spark, Pandas and SQL skills. Scala and Rust are a bonus
  • Strong knowledge of cloud platforms such as AWS, Azure or Google Cloud and experience with infrastructure-as-code tools like Terraform or CloudFormation.
  • Proficiency in containerization technologies such as Docker and container orchestration platforms like Kubernetes.
  • Experience with CI/CD tools such as GitLab CI/CD, GitHub Actions or CircleCI.
  • Familiarity with schema design and data warehouse architectures.
  • Familiarity with machine learning frameworks and libraries such as PyTorch, TensorFlow and scikit-learn.
  • Understanding of DevOps/Agile/Lean core principles and how to apply them
  • Strong problem-solving and troubleshooting skills for complex systems
  • Experience with ML workflow tools such as MLflow, Metaflow and/or Kubeflow.
  • Experience with monitoring, metrics and logging for model performance and data pipelines (Prometheus, Grafana, etc.)

The salary range for this position is $180,000 to $240,000.

Compensation will be determined based on several factors including, but not limited to: skill set, years of experience and the employee’s geographic location.

Tarana provides competitive benefits to employees in this role, including medical, dental and vision benefits, 401(k) match, flexible time off and stock options.

Since our founding in 2009, we’ve been on a mission to accelerate the pace of bringing fast and affordable internet access, and all the benefits it provides, to the 90% of the world’s households who can’t get it. Through a decade of R&D and more than $400M of investment, we’ve created a unique next-generation fixed wireless access technology, powering our first commercial platform, Gigabit 1 (G1). It delivers a game-changing advance in broadband economics in both mainstream and underserved markets, using either licensed or unlicensed spectrum. G1 started production in mid-2021 and has now been installed by over 160 service providers globally. We’re headquartered in Milpitas, California, with additional research and development in Pune, India.

G1 has been developed by an incredibly talented and pioneering core technical team. We are looking for more world-class problem solvers who can carry on our tradition of customer obsession and ground-breaking innovation. We’re well funded, growing incredibly quickly, maintaining a superb results-focused culture while we’re at it, and all grooving on the positive difference we are making for people all over the planet. If you want to help make a real difference in this world, apply now!