Negotiable
Outside
Remote
USA
Summary: The Operating Systems Specialist role involves supporting the administration, maintenance, deployment, and troubleshooting of enterprise-level Unix, Linux, and Windows environments. The position requires extensive hands-on experience with distributed systems and cloud-based infrastructure. The candidate will be responsible for incident management, automation of operational tasks, and ensuring performance and availability across complex server environments. This is a long-term remote position requiring Public Trust Clearance.
Key Responsibilities:
- Provide incident response and deployment support for Unix, Linux, and Windows server platforms (on-site and remote).
- Administer and troubleshoot middle-tier components including web/application servers, containers, messaging systems, cloud infrastructure, automation tools, and monitoring platforms.
- Execute tasks via ServiceNow in compliance with CLIENT-defined SLAs and change management protocols.
- Lead technical response for server, database, and middleware issues within a distributed infrastructure.
- Analyze system performance and implement tuning and capacity plans.
- Develop and execute disaster recovery plans and preventative maintenance procedures.
- Create and maintain technical and project documentation as required.
- Participate in hardware and software deployments and configuration management.
- Utilize CLIENT-approved tools for system monitoring, software distribution, and security management.
- Track and maintain server and system inventory and deployment records.
- Implement escalation and fault resolution procedures.
- Maintain all relevant system documentation, including configuration changes and performance metrics.
- Provide operational and incident reports to CLIENT Management as requested.
- Conduct root cause analysis and provide action plans for fault remediation.
- Support application deployments and collaborate with development teams on testing and implementation.
- Ensure adherence to CLIENT's quality assurance and security standards.
Key Skills:
- Advanced experience with Unix/Linux/Windows server administration in production environments.
- Middleware technologies (e.g., Apache, Tomcat, WebLogic, JBoss).
- Containers and orchestration tools (e.g., Docker, Kubernetes).
- Scripting languages (e.g., Bash, PowerShell, Python).
- Monitoring tools (e.g., Nagios, Zabbix, Prometheus).
- Cloud platforms (e.g., AWS, Azure, or Google Cloud Platform).
- ServiceNow or equivalent ITSM platforms.
- Strong understanding of backup and restore procedures, change control, and system hardening.
- Ability to analyze and resolve complex infrastructure and deployment issues.
Salary (Rate): undetermined
City: undetermined
Country: USA
Working Arrangements: remote
IR35 Status: outside IR35
Seniority Level: undetermined
Industry: IT
Job Title: Operating Systems Specialist
Positions Available: Multiple Positions
Clearance: Must be able to get Public Trust Clearance
Duration: Long Term
Location: Remote
Job Description:
The Operating Systems Specialist will support the administration, maintenance, deployment, and troubleshooting of enterprise-level Unix, Linux, and Windows environments. The role requires extensive hands-on experience managing distributed systems, middle-tier technologies, and supporting cloud-based and on-prem infrastructure components. The candidate must be proficient in managing incidents, automating operational tasks, and maintaining performance and availability across complex server environments.
Primary Responsibilities:
Provide incident response and deployment support for Unix, Linux, and Windows server platforms (on-site and remote).
Administer and troubleshoot middle-tier components including:
o Web/application servers
o Containers and container orchestration platforms
o Messaging systems
o Cloud infrastructure and services
o Automation/scripting tools
o Monitoring platforms
Execute tasks via ServiceNow in compliance with CLIENT-defined SLAs and change management protocols.
Lead technical response for server, database, and middleware issues within a distributed infrastructure.
Analyze system performance and implement tuning and capacity plans.
Develop and execute disaster recovery plans and preventative maintenance procedures.
Create and maintain technical and project documentation as required.
Participate in hardware and software deployments and configuration management.
Utilize CLIENT-approved tools for system monitoring, software distribution, and security management.
Track and maintain server and system inventory and deployment records.
Implement escalation and fault resolution procedures.
Maintain all relevant system documentation, including configuration changes and performance metrics.
Provide operational and incident reports to CLIENT Management as requested.
Conduct root cause analysis and provide action plans for fault remediation.
Support application deployments and collaborate with development teams on testing and implementation.
Ensure adherence to CLIENT s quality assurance and security standards.
Technical Requirements:
Advanced experience with:
o Unix/Linux/Windows server administration in production environments
o Middleware technologies (e.g., Apache, Tomcat, WebLogic, JBoss)
o Containers and orchestration tools (e.g., Docker, Kubernetes)
o Scripting languages (e.g., Bash, PowerShell, Python)
o Monitoring tools (e.g., Nagios, Zabbix, Prometheus)
o Cloud platforms (e.g., AWS, Azure, or Google Cloud Platform)
o ServiceNow or equivalent ITSM platforms
Strong understanding of backup and restore procedures, change control, and system hardening.
Ability to analyze and resolve complex infrastructure and deployment issues.
Please share your Word Format resume with complete contact details, work status and the expected rate.