Full-Time AI & Machine Learning SRE
Oomnitza is hiring a remote Full-Time AI & Machine Learning SRE. The career level for this job opening is Experienced and is accepting Ireland based applicants remotely. Read complete job description before applying.
Oomnitza
Job Title
Posted
Career Level
Career Level
Locations Accepted
Share
Job Details
Oomnitza offers a versatile Enterprise Technology Management platform orchestrating and automating key business processes for IT.
Our SaaS solution, with agentless integrations, best practices, and low-code workflows, enables enterprises to leverage their existing infrastructure and automate processes like offboarding, onboarding, audits, refresh forecasting, and more. We reduce reliance on manual tasks and tickets.
We help leading companies improve efficiency, expedite audits, mitigate cyber risk, and eliminate redundant IT spend. The Oomnitza team seeks an experienced AI & ML Site Reliability Engineer passionate about AI, machine learning, and data science.
Responsibilities:
- Build and maintain Oomnitza’s big data analytics platform centralizing data.
- Design and build scalable, secure AI infrastructure for training and deploying models/software.
- Implement and manage vector databases for high-dimensional data and knowledge graphs.
- Develop and integrate Retrieval-Augmented Generation (RAG) and GraphRAG systems.
- Work with data scientists to fine-tune LLMs for business applications.
- Deploy, manage, and monitor ML models in production.
- Implement CI/CD for machine learning pipelines.
- Develop and manage AI agents for task automation.
- Monitor model performance, retraining, and governance.
- Collaborate with data scientists, ML engineers, and teams for development/deployment.
Qualifications:
- Bachelor’s degree in CS, Engineering, Data Science.
- 5+ years experience in SRE, DevOps, ML Ops.
- Experience with cloud platforms (AWS, GCP, Azure).
- Proficient in deploying ML models (regression, decision trees, neural networks).
- Experience with data processing tools (Apache Spark, Hadoop, Airflow).
- Experience with AI/ML tools (TensorFlow, PyTorch, LangChain, Hugging Face).
- Strong understanding of vector databases and knowledge graph tools.
- Experience with RAG, GraphRAG systems.
- Proficient in programming languages (Python, Bash).
- Experience implementing CI/CD for ML pipelines.
- Experience with ML version control systems.