Join a high-agency team taking on unsolved challenges at the frontier of AI.
Context engineering is the art of providing AI with the right information at the right time. For AI agents to deliver personalized, accurate experiences, they need systematic access to user preferences, business data, and temporal relationships beyond static facts.
We're developing the foundational infrastructure that orchestrates context retrieval and assembly from user memory and business data. This context engineering foundation will power the next generation of AI applications that truly understand users and business scenarios.
Backed by Leading Investors
Platinum medical, dental, and vision insurance
Highly competitive salary and equity compensation. 401K plan + employer matching. Unlimited PTO.
Flexible in office culture in San Francisco. Remote work options and periodic travel to San Francisco if based outside the Bay Area
Monthly stipend toward your mobile plan.
Open positions from Work at a Startup.
San Francisco, United States / Remote (US)
Zep is the memory and context layer for AI agents. As a Senior Applied Research Engineer, you'll explore novel approaches to memory, context, and context generation, then own those ideas all the way to production.
This is a research role with a hard applied bent. We're not hiring ML researchers chasing publications. We're hiring engineers who can run rigorous experiments, train and evaluate models, and ship the result as production code our customers depend on.
What you'll do
What we're looking for
Nice to have
Tech stack: Python, Rust/C++/Go, PyTorch, vLLM/SGLang, AWS.
This role is probably NOT a fit if:
We respect your time and keep our interview process tight and focussed.
Screening Call (w/ Daniel, our Founder) → Team Calls (2-3 hours back-to-back, may include a presentation) → Decision Call (Daniel, again)
San Francisco, United States / Remote (US)
Zep is the memory and context layer for AI agents. As a Senior AI Engineer, you'll build low-latency backend systems, operate them in production on AWS, and ship LLM-powered capabilities our customers depend on.
You'll have the opportunity to work on Graphiti (25K+ GitHub stars), Zep’s popular open-source context graph framework.
This is a senior backend role centered on running LLM workloads at significant scale. We're not hiring ML researchers or data scientists. We're hiring engineers who have already lived through the messy reality of taking an LLM application from demo to production.
What you'll do
What we're looking for
Nice to have
Tech stack: Go, Python, TypeScript, AWS.
This role is probably NOT a fit if:
We respect your time and keep our interview process tight and focussed.
Screening Call (w/ Daniel, our Founder) → Team Calls (2-3 hours back-to-back, may include a presentation) → Decision Call (Daniel, again)
San Francisco, United States / Remote (US)
Zep is the memory and context layer for AI agents. As Lead Forward Deployed Engineer, you'll embed with customer engineering teams to integrate Zep into their production agent systems: diagnosing context-quality failures, designing memory architectures around their data, and shipping the integrations that make their agents actually work in the wild.
This is an applied AI engineering role with a customer surface. We're not looking for ML researchers or data scientists. We're looking for engineers who have already lived through the messy reality of taking an agent from demo to production.
What you'll do
What we're looking for
Tech stack: Python, TypeScript, AWS or GCP, Docker.
This role is probably NOT a fit if:
We respect your time and keep our interview process tight and focussed.
Screening Call (w/ Daniel, our Founder) → Team Calls (2-3 hours back-to-back, may include a presentation) → Decision Call (Daniel, again)
At Zep, we believe in moving quickly when we spot talent.
A short call with Daniel, our founder.
An opportunity to assess how well you fit into our collaborative, team-focused environment.
A one-on-one discussion about your role, goals, and potential contributions to Zep's growth.