About the Company
We are a seed-stage AI company building the industry standard for evaluating and benchmarking large language models on real enterprise tasks.
About the Role
As a Research Scientist, you will develop new benchmarks, methodologies, and evaluation pipelines that shape how cutting-edge models are assessed, compared, and deployed in production environments. Your work will directly influence model selection and safety decisions across foundation model labs, high-growth AI product companies, and Fortune-scale enterprises.
Responsibilities
Benchmarking & Model Analysis
Design New Benchmarks from Scratch
Advance Automated Evaluation Methodologies
Cross-functional Collaboration
Qualifications
Required Skills
Preferred Skills
Pay range and compensation package
Equal Opportunity Statement
Visa sponsorship available. Relocation support. Health & dental coverage. Lunch + dinner provided, snacks & coffee. Unlimited PTO. Weekly happy hours with community guests. Team events (bowling, hiking, rock climbing, etc.). Swag program (hats, etc.).
Work Environment & Culture
In-person, San Francisco HQ (required). Core hours: 9–5, some teammates extend voluntarily. Most team members work 1 weekend day per week (flexible). High-ownership, low-ego, collaborative. Live demos Mondays, team lunch Thursdays, community Fridays. Early-stage pace, applied focus—not academic publishing.
Tech Environment
(while research-focused, exposure beneficial) Backend: Python / Django. Frontend: React + TypeScript. Infra: AWS. Evaluation frameworks + internal tooling.
Why This Role Is Unique
The company already collaborates with foundation model labs, high-growth AI vertical product companies, and Fortune 500 enterprises (not publicly facing). ChatGPT Vals AI $5M seed raised, runway of 2+ years at current burn. Only one research scientist is being hired—true founding impact. Opportunity to define industry standards for model trust, reliability, and certification. Positioned to become the rating agency for generative AI.
Haverford College's Department of Political Science invites applications for a full-time, benefits-eligible Visiting Assistant Professorappointment in American Politics to begin July 1, 2026, with classes starting on August 31, 2026. This appointment is for 1-year and...
...REGISTERED NURSE | Maternal Child JOB SUMMARY The Registered Nurse (RN) provides professional nursing care to assigned OB/GYN... ...health status, plan of care, and anticipated outcomes. Provides service excellence to all customers. Maintains professional standards...
...Healthcare Facilities Solutions (HFS) is rapidly expanding, and we are seeking Commercial Real Estate Agents to join our team for this exciting opportunity. This position requires someone who is a self-starter and a multi-tasker. They should also be comfortable thriving...
...Project Manager (Entry-Level to Senior) Construction | Ground-Up | Tilt-Up, Concrete, Cold Storage Overview We are seeking Project Managers at multiple levels (Entry, Mid-Level, Senior) to support and lead ground-up construction projects, with a focus on tilt-up...
...As a Plumbers Helper , you will support residential and commercial plumbing projects by installing plumbing systems and related materials. Job Duties: Assists with residential and commercial plumbing installation from underground/rough-ins, stack out to trim outs...