Summary: The Software Engineer role uses Bash and shell scripting to analyze, solve, and document benchmark tasks in Docker and Linux system administration. The work involves evaluating AI agent outputs and synthesizing information across files and configurations to assess system architecture and improve agent performance. This is a remote, hourly contract position with a commitment of 10 to 40 hours per week. Candidates must have professional software engineering experience and a strong command of Bash, Linux, Docker, Git, and Python.
Key Responsibilities:
- Analyze, solve, and document benchmark tasks involving Docker, shell scripting, and Linux system administration.
- Evaluate AI agent outputs for correctness, reproducibility, and reliability across multi-step CLI workflows.
- Provide detailed, evidence-based reasoning grounded in terminal behavior and code structure.
- Synthesize information across files and configurations to assess end-to-end system architecture.
- Develop high-quality reference solutions and diagnostic insights to improve agent performance.
- Collaborate asynchronously with research teams and reviewers within a structured benchmark environment.
Key Skills:
- Professional experience in software engineering with strong Bash and shell scripting expertise.
- Deep familiarity with Linux environments and terminal-based workflows.
- Strong knowledge of Docker, Git, Python, and distributed systems concepts.
- Ability to trace, debug, and clearly explain complex system behaviors.
- Strong analytical thinking and structured problem-solving skills.
- Commitment to clarity, rigor, and methodological accuracy.
- Based in the United States, United Kingdom, Canada, Australia, or New Zealand.
Salary (Rate): £90.00/hr
City: undetermined
Country: United Kingdom
Working Arrangements: remote
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
Position: Software Engineer (Bash and Shell Script)
Type: Hourly contract
Compensation: $80–$90/hour
Location: Remote
Commitment: 10–40 hours/week
Application Process (takes 20 minutes)
- Upload resume
- Interview (15 min)
- Submit form