Full-Time LLM Engineer

Thaloz is hiring a remote Full-Time LLM Engineer. The career level for this job opening is Experienced and is accepting Brazil based applicants remotely. Read complete job description before applying.

Thaloz

Job Title

LLM Engineer

Posted

Career Level

Full-Time

Career Level

Experienced

Locations Accepted

Brazil

Job Details

We are looking for a talented and innovative Large Language Model (LLM) Engineer with specialized experience in Retrieval Augmented Generation (RAG) to join our dynamic team. The ideal candidate will be instrumental in the development, optimization, and deployment of RAG-based LLM solutions that meet various business needs.

This role requires a deep understanding of Data Science, Machine Learning (ML), Natural Language Processing (NLP), and the fundamentals of LLMs.

The successful candidate will have hands-on experience with:

  • agent-based prompting techniques such as LangChain
  • API-driven development architectures for AI products
  • usage of vector databases
  • information retrieval pipelines to extract actionable insights

Strong listening skills and an ability to understand business requirements are essential for designing effective solutions tailored to stakeholder needs.

Job Responsibilities

  • Designing RAG Solutions: Develop robust RAG-based LLM solutions that enhance data retrieval processes while ensuring high-quality output aligned with business objectives.
  • Prompt Engineering: Craft effective prompts that efficiently guide LLMs towards generating outputs that fulfill specific business requirements; iterate on prompt designs based on performance metrics.
  • AI Product Development: Design, develop, and refine AI products based on user feedback and stakeholder requirements; ensure alignment with program priorities through agile methodologies.
  • Support ML Engineering Teams: Collaborate closely with ML engineering teams during the deployment phase of APIs in production environments; assist in troubleshooting issues related to model performance or integration challenges.
  • Documentation Creation: Document Generative AI use cases comprehensively; create manuals detailing workflows, methodologies employed in solution design, and best practices adopted throughout the process.
  • Retrieval Augmented Generation Maintenance: Develop and maintain RAG concepts following established data science principles; ensure adherence to industry standards while fostering innovation.
  • Pipeline Optimization: Ensure efficient operation of data pipelines and processes according to company security policies; promote Responsible AI practices within all project phases.
  • Collaboration & Communication: Work closely with cross-functional teams including product managers, data scientists, software engineers, and stakeholders to align goals effectively; communicate technical concepts clearly across diverse audiences.

Requirements

  • Minimum of 3 years' experience working with Python for machine learning applications along with expertise in AWS Cloud Computing services.
  • Familiarity with AWS Bedrock's suite of LLM models along with associated prompting strategies tailored for optimal performance.
  • At least 1 year of direct experience working specifically within NLP frameworks/LLMs focusing on Retrieval Augmented Generation models.
  • Proficiency in Python programming language utilized extensively for data science projects involving AI development.
  • Familiarity with agent-based frameworks like LangChain which support advanced interactions between users/models.
  • Experience leveraging AI training frameworks such as TensorFlow or PyTorch alongside Hugging Face Transformers library for model building/training tasks.
  • Demonstrated expertise in prompt engineering techniques aimed at fine-tuning large language models tailored toward specific applications or industries.
  • Knowledgeable about vector databases including practical experience implementing embedding techniques necessary for enhanced information retrieval capabilities.

Preferred Skills

  • Experience utilizing orchestration tools designed specifically for AI product development (e.g., ArgoCD)
  • Ability to create detailed diagrams illustrating data flows/solutions which can effectively communicate complex ideas/processes among team members/internal stakeholders alike.
  • Prior experience deploying UI/frontend components related directly back into existing large language model systems would be advantageous.

FAQs

What is the last date for applying to the job?

The deadline to apply for Full-Time LLM Engineer at Thaloz is 18th of April 2025 . We consider jobs older than one month to have expired.

Which countries are accepted for this remote job?

This job accepts [ Brazil ] applicants. .

Related Jobs You May Like

Conversational Designer (Google Dialogflow)

All cities, India
8 hours ago
Conversation Design
Conversational AI
Google Dialogflow
Miratech
Full-Time
Experienced

Google CCAI BOT Developer

All cities, India
8 hours ago
Conversational AI
Google Dialogflow
Java
Miratech
Full-Time
Experienced

Solution Architect (Google Dialogflow)

All cities, India
2 days ago
API Integrations
Google Dialogflow CX
Solution Architect
Miratech
Full-Time
Experienced

Conversational Designer (Google Dialogflow)

All cities, India
1 week ago
Conversation Design
Conversational AI
Google Dialogflow
Miratech
Full-Time
Experienced

Senior Product Manager

USA
1 week ago
AI/ML
Communication
Language Models
Seekr
Full-Time
Senior Manager