Morning Stack · hiring index

Engineering Manager, Evals

Cursor · San Francisco

Listed by Cursor on Ashby, posted 2026-06-03 · last seen 2026-06-03. Read directly from the company's applicant-tracking system, not LinkedIn or Indeed.

What we pulled from this posting

Lead evaluation systemsmachine learningpythondata analysisteam leadershipproduct strategymetrics designai/ml

From the company's job description

Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering. Our organization is very flat, and our team is small and talent dense. We particularly like people who are truth-seeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code.

ABOUT THE ROLE

As an Engineering Manager on the Evals team at Cursor, you’ll lead the group responsible for creating high-signal evaluation datasets for coding agents and building the tools engineers use to write and run them. The team also owns online evaluation systems that track agent quality in production, and the close integration between online and offline evaluations.

The evaluation systems that this team builds, including CursorBench https://cursor.com/blog/cursorbench, are critical in the development of our coding models and the quality of our Cursor agents https://cursor.com/blog/continually-improving-agent-harness. Your impact will compound across every Cursor product and every Cursor model by making quality measurable, comparable, and easy to improve.

WHAT YOU’LL DO

YOU MAY BE A FIT IF

#LI-DNI

More open roles at Cursor