Full-Time Director of Machine Learning Engineering
Chalice is hiring a remote Full-Time Director of Machine Learning Engineering. The career level for this job opening is Manager and is accepting USA, UK based applicants remotely. Read complete job description before applying.
Chalice
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
About Us: Chalice Custom Algorithms (chalice.ai) is the leading AI solution for brands applying their own data and analytics to real-time decisioning in ad buying.
About the Role: We are seeking a Director of Machine Learning Engineering (MLE) to lead the architecture, deployment, and optimization of our machine learning infrastructure and models. This role will report directly to the VP of Data Science and Analytics and manage our Senior MLE(s).
Key Responsibilities:
- Lead the design, development, and deployment of scalable machine learning systems leveraging Databricks, PySpark, Rust, and Kubernetes.
- Oversee LLMs and neural network predictive modeling pipelines, ensuring performance, observability, and alignment with product strategy.
- Own the full ML lifecycle—from data ingestion and feature engineering to model deployment and performance monitoring.
- Collaborate with the Engineering Architect to build robust, fault-tolerant, and maintainable ML pipelines that support enterprise scale.
- Manage and mentor a growing team of MLEs and collaborate closely with software engineers.
- Partner with product, data science, and engineering teams to translate business objectives into technical solutions.
- Guide adoption of emerging technologies to improve model efficiency and latency.
- Ensure compliance with audit, privacy, and security standards in all ML engineering efforts.
Qualifications:
- 8+ years of industry experience in machine learning engineering, with at least 3+ years leading ML engineering teams.
- Deep experience building low-latency machine learning systems using Kubernetes, Rust, PySpark, and Databricks.
- Proven track record in designing and deploying LLM- and neural network-based solutions at scale.
- Mastery of MLOps tools and processes (MLflow, Prometheus, Grafana, Airflow, etc.).
- Strong experience with cloud platforms (AWS, Databricks) and production data engineering at scale.
- Excellent communication and collaboration skills.
- Advertising technology or performance marketing experience is a strong plus.