System Administrator
TCPWave Pvt Ltd
Hyderabad , India
06 2022 - 08 2024
- Manage and maintain Linux servers and systems in Vcenter, AWS, GCE and Azure environments and ensuring optimal performance and uptime.
- Perform regular system updates, patch management, and package installations to keep systems secure and up-to-date.
- Monitor system health, performance, and logs using tools like Nagios, Zabbix, Netdata, Prometheus and Grafana to proactively identify and resolve issues.
- Serve as the escalation point for L1 support teams by troubleshooting complex Linux-related issues.
- Diagnose and resolve hardware and software problems, including network, storage, and application issues on Linux systems.
- Handle incident management and participate in root cause analysis (RCA) to prevent future occurrences.
- Develop and maintain shell scripts (Bash) for automating routine tasks and improving operational efficiency.
- Work with configuration management tools like Ansible for automating server provisioning and configuration.
- Implement security best practices for Linux systems, including user access management, firewall configuration, and vulnerability scanning.
- Assist in compliance audits by ensuring systems meet security standards.
- Manage backup schedules and perform data recovery operations to ensure data integrity and availability.
- Regularly test disaster recovery procedures to ensure business continuity in case of critical system failures.
- Configure and manage network services on Linux systems, such as DNS, DHCP, VPN, and NFS/SMB file shares.
- Troubleshoot network connectivity issues, working with network teams to resolve latency or downtime issues.
- Work closely with cross-functional teams (development, network, storage) to address infrastructure challenges and support projects.
- Maintain up-to-date documentation on systems, processes, and procedures to ensure knowledge sharing and continuity.
- Provide guidance and support to junior administrators (L1) to enhance their troubleshooting skills and knowledge of Linux systems.
- Conduct training sessions or knowledge-sharing workshops on Linux best practices and new tools/technologies.
