API Development Jobs

2 jobs found

HUD

**About HUD** HUD (YC W25) is developing agentic evaluations for Computer Use Agents (CUAs) that browse the web. Our CUA Evals framework is the first comprehensive evaluation tool for CUAs. **Our Mission:** People don't actually know if AI agents are working. To make AI agents work in the real world, we need detailed evaluations for a huge range of tasks. We're backed by Y Combinator and work closely with frontier AI labs to provide agent evaluation infrastructure at scale. **About the Role** HUD is a fast-growing startup. If you can't find a role on our job board, feel free to suggest a new role, and we'll reach out if we find a good fit. **Open Opportunities:** • Building new evaluations/eval environments for HUD's CUA evaluation framework • Building out our CUA evals framework • Conducting outbound sales, developing partnerships and improving developer experience for CUA developers • Leading and supporting teams of research engineers as they build out our evals • General startup operations as we scale **Experience** Strong candidates may have: • Engagement with AI Safety and AI alignment • Understanding of LLM evaluation frameworks, particularly multimodal and agentic evaluations • Familiarity in using and deploying latest AI tools for operational efficiency • Experience in fullstack LLM deployment, particularly for multimodal and agentic AI evaluations • Prior experience in fast-growing startup teams **Team & Company Details** **Team Size:** ~15 people currently, mostly full-time in-person, but some remote. **Our Team:** Our team includes 4 international Olympiad medallists (IOI, ILO, IPhO), serial AI startup founders, and researchers with publications at ICLR, NeurIPS and other top venues. **Company Stage:** We have received $2 million in seed funding, plus very strong demand and revenue growth beyond that. We are scaling profitably and fast to meet demand. **Logistics** **Employment:** Full-time preferred, but we're willing to consider internship offers. **Location:** Remote-friendly, but if you're in the San Francisco Bay Area, we do have an office you can work in. We prioritize applicants who can attend meetings in Pacific Time (UTC-7:00/8:00) or China/Singapore Time (UTC +8:00). **Visa Sponsorship:** We provide support for relocation and visas for strong full-time candidates. For part-time/contract/internship arrangements, we'll work fully remote. **Timeline:** Applications are rolling. The process involves 1-2 interviews and takes less than a week. We prioritize operational aptitude and cultural fit. Motivated candidates are encouraged to apply even if they don't meet all criteria.

San Francisco, CA, United States
Full-time

HUD

**About HUD** HUD (YC W25) is developing agentic evals for Computer Use Agents (CUAs) that browse the web. Our CUA Evals framework is the first comprehensive evaluation tool for CUAs. **Our Mission:** People don't actually know if AI agents are working. To make AI agents work in the real world, we need detailed evals for a huge range of tasks. We're backed by Y Combinator and work closely with frontier AI labs to provide agent evaluation infrastructure at scale. **About the Role** HUD is a fast-growing startup. If you can't find a specific role on our job board, we encourage you to suggest a position that aligns with your expertise – we'll reach out if we find a good fit. **Potential Opportunities:** - Building new evaluations/eval environments for HUD's CUA evaluation framework - Developing our CUA evals framework - Conducting outbound sales, developing partnerships, and improving developer experience for CUA developers - Leading and supporting teams of research engineers as they build out our evals - General startup operations as we scale **Experience** Strong candidates may have: - Engagement with AI Safety and AI alignment - Understanding of LLM evaluation frameworks, particularly multimodal and agentic evaluations - Familiarity in using and deploying latest AI tools for operational efficiency - Experience in fullstack LLM deployment, particularly for multimodal and agentic AI evaluations - Prior experience in fast-growing startup teams **Team & Company Details** **Team Size:** ~15 people currently, mostly full-time in-person, with some remote team members. **Our Team:** Includes 4 international Olympiad medallists (IOI, ILO, IPhO), serial AI startup founders, and researchers with publications at ICLR, NeurIPS, and other top venues. **Company Stage:** We have received $2 million in seed funding, plus strong demand and revenue growth beyond that. We are scaling profitably and rapidly to meet demand. **Employment Details** **Employment Type:** Full-time preferred, but we'll consider internship offers. **Location:** Remote-friendly, with an office available in the San Francisco Bay Area. We prioritize applicants who can attend meetings in Pacific Time (UTC-7:00/8:00) or China/Singapore Time (UTC +8:00). **Visa Sponsorship:** We provide support for relocation and visas for strong full-time candidates. For part-time/contract/internship arrangements, we work fully remote. **Timeline:** Applications are reviewed on a rolling basis. The process involves 1-2 interviews and takes less than a week. We prioritize operational aptitude and cultural fit. Motivated candidates are encouraged to apply even if they don't meet all criteria.

Singapore, Singapore
Full-time