Available Position

HUD
San Francisco, CA, United States
Full-time
Posted Today

Job Description

**About HUD**

HUD (YC W25) is developing agentic evaluations for Computer Use Agents (CUAs) that browse the web. Our CUA Evals framework is the first comprehensive evaluation tool for CUAs.

**Our Mission:** People don't actually know if AI agents are working. To make AI agents work in the real world, we need detailed evaluations for a huge range of tasks. We're backed by Y Combinator and work closely with frontier AI labs to provide agent evaluation infrastructure at scale.

**About the Role**

HUD is a fast-growing startup. If you can't find a role on our job board, feel free to suggest a new role, and we'll reach out if we find a good fit.

**Open Opportunities:**

• Building new evaluations and eval environments for HUD's CUA evaluation framework
• Building out our CUA evals framework
• Conducting outbound sales, developing partnerships, and improving the developer experience for CUA developers
• Leading and supporting teams of research engineers as they build out our evals
• General startup operations as we scale

**Experience**

Strong candidates may have:

• Engagement with AI safety and AI alignment
• Understanding of LLM evaluation frameworks, particularly multimodal and agentic evaluations
• Familiarity with using and deploying the latest AI tools for operational efficiency
• Experience in full-stack LLM deployment, particularly for multimodal and agentic AI evaluations
• Prior experience in fast-growing startup teams

**Team & Company Details**

**Team Size:** ~15 people currently, mostly full-time and in-person, with some remote.

**Our Team:** Our team includes 4 international Olympiad medallists (IOI, ILO, IPhO), serial AI startup founders, and researchers with publications at ICLR, NeurIPS, and other top venues.

**Company Stage:** We have received $2 million in seed funding, plus very strong demand and revenue growth beyond that. We are scaling profitably and fast to meet demand.

**Logistics**

**Employment:** Full-time preferred, but we're willing to consider internships.

**Location:** Remote-friendly, but if you're in the San Francisco Bay Area, we have an office you can work in. We prioritize applicants who can attend meetings in Pacific Time (UTC-7:00/-8:00) or China/Singapore Time (UTC+8:00).

**Visa Sponsorship:** We provide relocation and visa support for strong full-time candidates. Part-time, contract, and internship arrangements are fully remote.

**Timeline:** Applications are rolling. The process involves 1-2 interviews and takes less than a week. We prioritize operational aptitude and cultural fit. Motivated candidates are encouraged to apply even if they don't meet all criteria.
