Job Overview: We are looking for a SwarmBench Task Engineer specializing in planning and operations to design and build complex, multi-agent benchmark tasks that simulate real-world planning, scheduling, and operational decision-making scenarios.
Duties and Responsibilities: Design and develop multi-agent benchmark tasks involving planning, scheduling, resource allocation, and operational decision-making (project management, logistics, incident response, capacity planning); create constraint-rich problem statements with multiple interacting variables; develop verification scripts to evaluate feasibility, completeness, and optimality; build decomposition strategies; model real-world operational scenarios with dependencies, timelines, and resource constraints; collaborate on improving task quality, coverage, and evaluation rigor.
Required Qualifications: 5+ years of experience in operations, project management, logistics, or supply chain; strong ability to formalize constraints, dependencies, and scheduling logic; proficiency in Python for building verification and validation scripts; strong structured problem-solving and decomposition skills; clear and precise technical writing skills; experience with AI coding benchmarks (e.g., SWE-bench, Terminal-bench); hands-on experience with Docker (Dockerfiles, image builds, debugging).
Additional Notes: Nice to have: experience with optimization techniques (linear programming, constraint satisfaction, scheduling algorithms); background in operations research; experience with simulation or modeling tools; knowledge of AI planning systems or automated reasoning; project management experience or certifications (PMP, Agile, etc.).
Perks of Freelancing With Turing: Work in a fully remote environment; opportunity to work on cutting-edge AI projects with leading LLM companies.
Offer Details: Commitments Required: 40 hours per week, with 4 hours of overlap with PST. Engagement Type: Contractor assignment (no medical/paid leave). Duration of Contract: 4 weeks (adjustable based on engagement). Evaluation Process: Take-home assessment.