See all roles

AI Engineer (RAG Specialist)

Work from home Full-time role Hiring

AI Engineer (RAG Specialist)

We are looking for a skilled AI Engineer specializing in Retrieval-Augmented Generation (RAG) to join our team. Your primary focus will be bridging the gap between static LLMs and dynamic, proprietary data. You won't just be "calling an API"; you will be architecting the entire data lifecycle-from ingestion and chunking strategies to advanced retrieval and response synthesis. The ideal candidate understands that the secret to a great RAG system isn't just the LLM, but the quality of the retrieval and the nuances of the vector database.

US Citizenship Required

Key Responsibilities

Pipeline Architecture: Design and deploy end-to-end RAG pipelines using frameworks like LangChain, LlamaIndex, or Haystack.

Data Engineering: Develop robust ETL processes to ingest unstructured data (PDFs, docs, web scrapes) into high-performance vector stores.

Retrieval Optimization: Implement and tune advanced retrieval techniques, including Hybrid Search (keyword + semantic), Re-ranking (Cross-Encoders), and Parent-Document Retrieval.

Vector Database Management: Manage and scale vector databases such as Pinecone, Weaviate, Milvus, or Chroma.

Evaluation & Benchmarking: Establish rigorous evaluation frameworks (e.g., RAGAS, TruLens) to measure faithfulness, relevancy, and hit rates.

Performance Tuning: Optimize embedding models and prompt engineering to reduce latency and "hallucinations."

Technical Qualifications

Language Proficiency: Advanced Python (preferred) or TypeScript.

LLM Expertise: Hands-on experience with OpenAI GPT-4, Anthropic Claude, or open-source models like Llama 3 via Ollama or vLLM.

Vector Expertise: Deep understanding of embeddings, similarity metrics (Cosine, Euclidean), and indexing strategies (HNSW, IVF).

NLP Fundamentals: Familiarity with tokenization, context windows, and attention mechanisms.

Cloud/DevOps: Experience deploying AI applications on AWS, GCP, or Azure using Docker/Kubernetes.

Preferred Skills

• Experience with Agentic RAG (Multi-step reasoning and tool-use).

• Knowledge of Graph Databases (Neo4j) for GraphRAG implementations.

• Contributions to open-source AI projects.

• Background in traditional Information Retrieval (Elasticsearch/Solr).

Apply To This Job

You might like

Caregiver - Home Health Aide - Non-Medical

Work from home Full-time role

Preschool Photographer - Seasonal

Work from home Full-time role

Chef

Work from home Full-time role

Senior Director, Product Management

Work from home Full-time role

Senior Manager, Product Marketing

Work from home Full-time role

Panel Fabricator

Work from home Full-time role

Senior Software Engineer

Work from home Full-time role

Summer Student Coordinator

Work from home Full-time role

Business System Analyst, Clinical Data Management

Work from home Full-time role

Senior Business System Analyst - Clinical Operations

Work from home Full-time role

Sr. Manager, Emerging Surgeon Development

Work from home Full-time role

Experienced Part-Time Remote Amazon Chat Specialist – Delivering Exceptional Customer Service with blithequark

Work from home Full-time role

Make Money Online Virtual Assistant Jobs for Teens No Experience

Work from home Full-time role

Geochemical Data Science Fellow

Work from home Full-time role

Need Resource Room Teacher (1.0 Con) in Kennewick, WA

Work from home Full-time role

Remote Sales Representative

Work from home Full-time role

[Remote Part-time jobs] American Express Remote (No Degree) – Data Ent – USA Remote Jobs

Work from home Full-time role

Experienced Customer Sales and Service Representative – Delivering Exceptional Experiences with arenaflex

Work from home Full-time role

Virtual Customer Service/Sales Representative – Unlocking Exceptional Benefits for Families at blithequark

Work from home Full-time role

Immediate Hiring: Experienced USPS Mail Sortati...

Work from home Full-time role