People Matter

Sr Machine Learning Operations Engineer

Blue River

Software Engineering, Operations
Santa Clara, CA, USA
Posted on Mar 15, 2024

We’re Blue River, a team of innovators driven to create intelligent machinery that solves monumental problems for our customers. We empower our customers – farmers, construction crews, and foresters – to implement safer and more sustainable solutions, driving increased profitability with less reliance on scarce labor. We believe that focusing on the small stuff – pixel-by-pixel and task-by-task – leads to big gains. With our partners at John Deere, we have the ability to bring innovative computer vision, machine learning, robotics, and product management solutions to scale production, maximizing their potential impact.

Our people are at the heart of what we do. Through cross-discipline collaboration, this mission-driven and daring team is eager to define the new frontier of mobile robotics. We are always asking hard questions, rapidly iterating, and getting our boots in the field and on-site to figure it out. We won’t give up until we’ve made a tangible and positive impact on the planet.

Summary

Blue River is seeking a Machine Learning Ops Engineer specializing in data and ML infrastructure to join our team. Our machine learning platform helps manage the various components of the ML application development life cycle, starting from data ingestion, annotation, and exploration, to model training, deployment, and monitoring. All of these components are interdisciplinary, so you will be working closely with roboticists, ML researchers, and Safety & Perception teams.

This is a high-impact opportunity for a problem-solver who can deftly handle multiple competing priorities and deliver solutions in a timely manner. You like to automate anything you do, and you document it for the benefit of others. You should be adept at prioritizing multiple issues and have strong expertise in troubleshooting complex production issues.

Job Responsibilities

  • Lead the design, development, and implementation of on-premises and cloud MLOps solutions that support the delivery of machine learning models.
  • Work with stakeholders across the organization to understand their machine learning needs and requirements, and to develop MLOps solutions that meet those needs.
  • Guide efforts to improve the stability, security, efficiency, and scalability of systems.
  • Collaborate with ML engineers, data engineers, and software engineers to build scalable and efficient data pipelines and model training and deployment systems.
  • Develop and maintain monitoring and management tools to ensure the reliability and performance of our on-premises MLOps infrastructure.
  • Drive automation initiatives for model deployment and infrastructure provisioning.
  • Stay up-to-date with the latest trends and technologies in MLOps, LLMOps, machine learning, and artificial intelligence, and share your knowledge with the team to help us stay at the forefront of the field.
  • Create and maintain comprehensive documentation for MLOps processes, configurations, and best practices.
  • Mentor junior members of the team and help them grow their skills and expertise in MLOps.

Required Experience and Skills

  • 5 or more years of professional experience in MLOps, data engineering, software engineering, or a related field.
  • Experience designing and building MLOps infrastructure, including data pipelines, model training and deployment systems, and monitoring and management tools.
  • Experience in supporting highly scalable data systems and services written in Python.
  • Familiarity with Kubernetes container orchestration, microservices, and cloud platforms such as AWS, GCP, or Azure.
  • A passion for automation and for building tools in Python.
  • Bachelor's degree or higher in Computer Science, Math, or another quantitative field.

Preferred Experience and Skills

  • Experience with cloud workflow platforms such as Kubeflow, MLflow, or Airflow.
  • Prior background with AWS services and the Databricks platform.
  • Experience building ETL workflows and data warehouse solutions.
  • Familiarity with ML frameworks such as TensorFlow and PyTorch, and with harnessing GPUs for distributed model training.
  • Experience with infrastructure templating tools such as Terraform or CloudFormation.
  • Strong communication skills and ability to work effectively across multiple technical teams.

At Blue River, we’re passionate about creating an inclusive workplace that promotes and values diversity. While we have more work to do to advance diversity and inclusion, we’re investing in our programs, including recruiting, mentorship, career development, and learning & development to ensure they support our Diversity, Equity, and Inclusion goals.

We support each employee in living a full life, enabling a thriving career, and accomplishing a meaningful, challenging mission while collaborating with incredible people. We are dedicated to building a diverse and inclusive workplace, so if you’re excited about this role but your experience doesn’t align completely with the job description, we encourage you to apply anyway. We are an equal opportunity employer and do not discriminate based on race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment. Please contact us to request an accommodation.

The US annual base salary range for this position is $133,000 - $238,000, along with eligibility for Blue River’s bonus and benefit programs.

Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your location during the hiring process. During the recruitment process, we may identify an alternative role or level to which you are more suited. If your ideal role at Blue River differs from the advertised position, we will provide an updated pay range as soon as possible during the hiring process.
