

Experienced IT Support Engineer specialising in enterprise Windows, macOS, and Linux environments, with strong expertise in endpoint deployment, OS upgrades, and Active Directory administration. Skilled in ServiceNow-based incident and change management, delivering SLA-driven support across business-critical systems. Brings practical exposure to Azure, AWS, disaster recovery, backup solutions, and security-focused operations, underpinned by clear technical documentation and strong service ownership.
• Supported enterprise-scale HPC platforms including ARCHER2 (5,634 compute nodes), Cirrus, and Tursa, operating in a 24/7 production environment.
• Provided Linux-based operational support for large-scale compute, storage, and networked systems serving thousands of users.
• Supported Slurm-based batch scheduling, assisting with job failures, queue behaviour, and workload troubleshooting.
• Provided operational support for Lustre-backed parallel filesystems, identifying availability and performance issues and escalating appropriately.
• Used Ansible to automate repeatable configuration and validation tasks across 50–100+ Linux systems, reducing manual errors and improving consistency.
• Wrote Python and Bash scripts for diagnostics, system checks, and incident triage.
• Responded to alerts and participated in coordinated incident response and maintenance windows, including out-of-hours work.
• Documented procedures and incidents using Git-based workflows, improving operational consistency and handover.
• Designed, deployed, and supported Linux-based server infrastructure hosting 6+ business-critical services.
• Provided ongoing operational and escalation support for 40–60 users, maintaining >99.9% service availability.
• Utilised Ansible to automate Linux server configuration and maintenance across 10–20 servers, improving consistency and reducing configuration drift.
• Developed and maintained 10+ Ansible playbooks supporting repeatable operational tasks and faster recovery.
• Created technical documentation and operational runbooks, supporting maintainability and handover.
• Administered business-critical Linux and Windows systems supporting vessel chartering, bunkering operations, and crude oil exports for 25–40 users.
• Maintained 24/7 availability of core systems, where downtime had direct commercial and operational impact.
• Conducted regular audits across 10–15 endpoints and servers, ensuring secure access control and compliance.
• Designed disaster recovery and business continuity procedures, reducing recovery times during outages.
• Executed system upgrades within maintenance windows, achieving