Negotiable
Outside
Remote
USA
Summary: The Lead Engineer for Infrastructure Operations is responsible for managing and optimizing eCommerce infrastructure, particularly within AWS and CDN environments. This role involves designing high-availability systems, automating operations, and ensuring security best practices. The ideal candidate will have extensive experience with AWS services and Fastly CDN management, contributing to the performance and scalability of digital commerce platforms. Collaboration with development and DevOps teams is essential to enhance site performance and reliability.
Key Responsibilities:
- Design, deploy, and maintain AWS infrastructure to support high-availability, high-traffic eCommerce applications
- Manage and configure Fastly CDN services, including writing, testing, and optimizing VCL code for caching, routing, and security use cases
- Collaborate with developers and site reliability engineers to improve site performance, latency, monitoring, alerting, and uptime
- Implement infrastructure-as-code (IaC) using tools like Terraform, CloudFormation, or AWS CDK
- Monitor system performance and troubleshoot infrastructure issues across AWS and CDN layers
- Automate infrastructure operations, deployment workflows, health checks and incident response
- Support CI/CD pipelines and application rollouts in collaboration with DevOps and engineering teams
- Maintain security best practices including IAM, network controls, and data protection standards
- Participate in on-call rotation and incident management for infrastructure-related issues
Key Skills:
- 5+ years of experience with AWS (EC2, S3, CloudFront, Lambda, API Gateway, RDS, Route 53, etc.)
- 2+ years of experience configuring and managing Fastly CDN, including custom VCL scripting and Distributed Compute services and/or applicable web development experience
- Strong knowledge of web caching, HTTP/2, TLS, and CDN optimization techniques
- Solid scripting and automation experience with Node.js, Bash, Python, or similar languages
- Familiarity with containerization technologies and certificate management (Docker, ECS, EKS)
- Experience implementing infrastructure-as-code (e.g., Terraform, CloudFormation)
- Experience with monitoring and observability tools (e.g., Datadog, CloudWatch, New Relic)
- Understanding of CI/CD processes and tools (e.g., Jenkins, GitHub Actions, CodePipeline)
- Strong analytical and troubleshooting skills, especially in high-traffic environments
- Excellent communication and documentation skills
Salary (Rate): undetermined
City: undetermined
Country: USA
Working Arrangements: remote
IR35 Status: outside IR35
Seniority Level: undetermined
Industry: IT
Job Title: Lead Engineer Infrastructure Operations
Location: Remote
Department: Engineering / DevOps / Infrastructure
About the Role
We are looking for a highly skilled Engineer with deep experience in eCommerce infrastructure, AWS cloud-native services, and CDN/WAF management, ideally Fastly VCL (Varnish Configuration Language) and NG-WAF. You will play a key role in maintaining, optimizing, and evolving our digital commerce platform to deliver fast, secure, and scalable customer experiences.
Responsibilities
- Design, deploy, and maintain AWS infrastructure to support high-availability, high-traffic eCommerce applications
- Manage and configure Fastly CDN services, including writing, testing, and optimizing VCL code for caching, routing, and security use cases
- Collaborate with developers and site reliability engineers to improve site performance, latency, monitoring, alerting, and uptime
- Implement infrastructure-as-code (IaC) using tools like Terraform, CloudFormation, or AWS CDK
- Monitor system performance and troubleshoot infrastructure issues across AWS and CDN layers
- Automate infrastructure operations, deployment workflows, health checks and incident response
- Support CI/CD pipelines and application rollouts in collaboration with DevOps and engineering teams
- Maintain security best practices including IAM, network controls, and data protection standards
- Participate in on-call rotation and incident management for infrastructure-related issues
Required Qualifications
- 5+ years of experience with AWS (EC2, S3, CloudFront, Lambda, API Gateway, RDS, Route 53, etc.)
- 2+ years of experience configuring and managing Fastly CDN, including custom VCL scripting and Distributed Compute services and/or applicable web development experience
- Strong knowledge of web caching, HTTP/2, TLS, and CDN optimization techniques
- Solid scripting and automation experience with Node.js, Bash, Python, or similar languages
- Familiarity with containerization technologies and certificate management (Docker, ECS, EKS)
- Experience implementing infrastructure-as-code (e.g., Terraform, CloudFormation)
- Experience with monitoring and observability tools (e.g., Datadog, CloudWatch, New Relic)
- Understanding of CI/CD processes and tools (e.g., Jenkins, GitHub Actions, CodePipeline)
- Strong analytical and troubleshooting skills, especially in high-traffic environments
- Excellent communication and documentation skills
Preferred Qualifications
- Experience in a large-scale retail or e-commerce environment
- Node.js experience preferred
- Signal based WAF management preferred (ex: Signal Sciences)
- Knowledge of edge security practices such as WAF configuration, bot protection, or TLS management
- Experience with performance tuning and synthetic monitoring
AWS certifications (e.g., Solutions Architect, DevOps Engineer, Cloud Architect)