Back to all jobs

LLM / GenAI Engineer

Work from home Full-time role Hiring

About The Role The role is focused on architecting and scaling production-grade generative AI features, moving beyond basic API wrappers to build robust, deterministic systems powered by large language models. The engineer will design orchestration layers, optimize retrieval-augmented generation (RAG) workflows, and implement strict evaluation and guardrail systems to ensure safety, accuracy, and low latency at scale. The team works at the intersection of modern software engineering and applied AI. This role involves collaborating with backend engineers and product owners to integrate intelligence into core platform workflows, ensuring LLM applications are observable, cost-effective, and highly performant.

Key Responsibilities

  • Design and optimize advanced RAG pipelines, utilizing hybrid search, query rewriting, and reranking strategies to maximize retrieval quality.
  • Implement systematic LLM evaluation pipelines using frameworks like Ragas, TruLens, or custom LLM-as-a-judge architectures to measure hallucination and accuracy.
  • Integrate and manage enterprise-grade vector databases such as Pinecone, Milvus, or pgvector, including indexing strategies and metadata filtering.
  • Develop agentic workflows and multi-agent systems using frameworks like LangGraph, Autogen, or custom state machines.
  • Deploy, fine-tune, and optimize open-source models (e.g., Llama, Mistral) using LoRA, QLoRA, and quantization techniques for specialized tasks.
  • Build robust guardrails and alignment layers using tools like NeMo Guardrails or Llama Guard to ensure safe and deterministic model behavior.
  • Monitor LLM latency, cost, and token usage in production using tracing tools such as LangSmith, Phoenix, or Arize.

What We Are Looking For

  • 3-6 years of professional software engineering experience, with at least 1.5 years dedicated to building and deploying LLM applications in production.
  • Deep proficiency in Python and familiarity with asynchronous programming, FastAPI, and containerization via Docker.
  • Hands-on experience with LLM orchestration frameworks like LangChain, LlamaIndex, or DSPy.
  • Strong understanding of modern NLP techniques, embedding models, vector spaces, and semantic search.
  • Experience deploying production applications on AWS, GCP, or Azure, utilizing managed Kubernetes or serverless containers.
  • Bachelor's or Master's degree in Computer Science, Data Science, or a related quantitative technical field.
  • Bonus: Experience with vLLM, TensorRT-LLM, custom model hosting, or contribution to open-source GenAI frameworks.

Apply To This Job

Related remote jobs

Lead Machine Learning Engineer - REMOTE

Work from home Full-time role

[Remote] Applied Machine Learning Engineer, Circuit Design - New College Grad 2026

Work from home Full-time role

Immediate Hiring: Remote Machine Learning engineer jobs

Work from home Full-time role

Data Science/Machine Learning Engineer (Remote, Continental United States)

Work from home Full-time role

AI Prompt Engineer, Remote

Work from home Full-time role

Prompt Engineer (100% Worldwide Remote) in Los Angeles, CA in vidIQ

Work from home Full-time role

[Remote] AI Prompt Engineer & Evaluator | $50/hr Remote

Work from home Full-time role

[Remote] AI/Prompt Engineer – Intern/Entry Level

Work from home Full-time role

Project Lion - Senior Prompt Engineer - United States (Remote, Part-Time)

Work from home Full-time role

Junior Prompt Engineer; Remote

Work from home Full-time role

Experienced Virtual Receptionist and Data Entry Clerk – Remote Work Opportunity at arenaflex

Work from home Full-time role

Experienced Data Entry Specialist – Entry-Level Opportunity at arenaflex

Work from home Full-time role

Customer Success Manager Leader – Team Development, Retention & Expansion Strategy at arenaflex

Work from home Full-time role

Entry-Level Remote Data Entry Clerk – Launch Your Career with arenaflex’s Growing Digital Operations Team

Work from home Full-time role

EP Mapping Specialist, CAS- Dover/ Lewis

Work from home Full-time role

Experienced Part-Time Remote Data Entry Specialist – Web Application Development and Customer Support

Work from home Full-time role

Clinical Research Associate I (Central: IL, CO, MI, KS, TN)

Work from home Full-time role

Data Engineer (Databricks, SQL)

Work from home Full-time role

Senior Packaging Engineer

Work from home Full-time role

Entry Level Remote Data Entry Specialist – Content Operations Support at arenaflex (Hiring Now, No Experience Required)

Work from home Full-time role