Quick answers for candidates and hiring teams reviewing this listing.
You optimize answer engines that combine live retrieval, citations, and LLMs—quality and latency at query scale matter more than offline benchmarks alone.
Hybrid SF is typical. Confirm schedule and US work authorization with the team.
Link writeups showing retrieval improvements, citation accuracy fixes, or cost/latency wins on search-like products.
Suggested
Profiles ranked by overlap with this role's skills and tools—handy if you're hiring for a team or comparing backup candidates.
Other listings that share skills or tools with this one—useful if you want comparable stacks or backup options.
Cohere
Glean
Lone Star Autonomy
Thames Inference Labs
Paris · 50-200 employees · Startup · GenAI: Yes
How to read this score
Scores reward proof density: skills, projects, use cases, and experience. When you filter talent search or view matches from a job, we also show which required stack items are present or missing.
Context fit
Has: Python, Docker
Missing: LangChain, Vector DBs
Context fit
Has: Python, LangChain
Missing: Vector DBs, Docker
Context fit
Has: Python, Docker
Missing: LangChain, Vector DBs
Context fit
Has: Python, LangChain
Missing: Vector DBs, Docker
Cohere builds enterprise-grade language models and APIs used globally from our Toronto headquarters. Join as a Senior LLM Engineer to ship RAG, embeddings, and fine-tuning workflows for regulated customers. What you will do: Ship production LLM and retrieval systems with offline evals, monitoring, and…
Perks · GPU access, equity, Toronto waterfront office.
Glean connects enterprise knowledge with LLM search and agents used by Fortune 500 teams. LLM Engineer, enterprise RAG (Palo Alto / hybrid): connectors, ACL-aware retrieval, and quality evals. What you will do: Deliver production-grade enterprise RAG with measurable quality, latency, and cost…
Perks · Equity, hybrid, comprehensive medical.
Lone Star Autonomy delivers agentic assistants for professional services and internal ops teams across the US. We emphasise transparency, approvals, and traceability, not black-box autonomy. Join our Austin office (hybrid) as a Senior LLM Engineer focused on tool use, workflow engines, and enterprise…
Perks · No state income tax; competitive total compensation.
Thames Inference Labs ships assistant and RAG platforms for regulated customers in banking, insurance, and critical infrastructure. We blend pragmatic delivery with strong evaluation and rollback discipline so models never become a black-box liability. We are hiring a Staff LLM Engineer based in…
Perks · Private medical, pension, learning budget, cycle scheme.
Our applied team ships agentic assistants for operations teams across the DACH region. We need a Staff engineer who can own architecture for multi-step workflows, safety layers, and multilingual rollouts. Berlin hybrid; you will collaborate with research, infra, and compliance from day one. What you…
Perks · Relocation support, conference budget, sabbatical policy.