Job Overview: We are looking for experienced Cybersecurity Experts to contribute to the development of advanced evaluation environments for frontier AI systems.
Duties and Responsibilities: Design vulnerable multi-component applications and security" style="border-bottom: 1px dotted #007bff !important;">security challenge environments across languages such as Go, Python, Node.js, or Rust. Develop realistic exploit chains combining multiple vulnerability categories and attack vectors. Build deterministic evaluation environments using Docker and automated validation tooling. Create security-focused test cases and verification logic for vulnerability detection and remediation workflows. Review and analyze AI-generated outputs to identify gaps in reasoning, security understanding, or exploit detection. Develop adversarial scenarios involving misleading documentation, obfuscated code, edge cases, and hidden attack paths. Model real-world vulnerability classes inspired by CVEs, bug bounty findings, and production security incidents. Ensure evaluation tasks remain scalable, reproducible, and resistant to contamination from public datasets. Collaborate with cross-functional teams working on AI evaluation, benchmarking, and automated testing systems.
Required Qualifications: 4+ years of experience in cybersecurity, application security, vulnerability research, or offensive security. Hands-on experience with vulnerability discovery, exploit development, secure code review, or patch validation. Strong understanding of web security, authentication, sessions, OAuth, JWT, SSRF, injection attacks, and access control vulnerabilities. Experience with cryptographic vulnerabilities, filesystem attacks, or privilege escalation scenarios. Experience using security tools such as SAST, fuzzers, IAST, or similar security testing frameworks. Strong coding skills in at least two of the following languages: Go, Python, Node.js, Rust. Experience working with Docker and containerized environments. Familiarity with Linux internals and system-level behavior. Experience with bug bounty programs, CTFs, red teaming, or CVE research is a strong plus.
Additional Notes: Commitments Required: 8 hours per day with an overlap of 4 hours with PST. Employment type: Contractor assignment (no medical/paid leave). Duration of contract: 4 weeks+. Interview: 2x technical interviews.
Info
Job Posting Disclaimer
All job postings on this site are shared for informational purposes only. The responsibility for the accuracy of job descriptions, requirements, qualifications, and other details rests entirely with the employer or organization offering the position. We do not verify or guarantee the authenticity of these listings.
Applicants are encouraged to perform their own due diligence and confirm all information directly with the employer before submitting an application.
We are not responsible for any actions, decisions, or outcomes resulting from applying to a job listed here. All interviews, selection processes, and job offers are conducted solely by the employer or organization.
Exercise caution and watch out for fraudulent job offers. Never provide sensitive personal information or make payments to secure a position.