Summary
Overview
Work history
Education
Skills
Work availability
Languages
Timeline
Hi, I’m

Krishna Koushik Gogulapaty

London,UK
Krishna Koushik Gogulapaty

Summary


Open for relocation anywhere in UK Currently on Tier2, would require Sponsorship to switch

DevOps, AWS Kubernetes, Linux Admin, CI/CD, Python Highly motivated and experienced DevOps Engineer with over 5 years of experience in designing, implementing, and maintaining scalable and secure infrastructure on public and private cloud platforms. Proficient in automation, CI/CD, containerization, and monitoring technologies. Proven track record of improving operational efficiency, reducing costs, and delivering high-quality services to clients.

Overview

8
years of professional experience

Work history

Aidizital Solutions
London

DevOps Engineer/Site Reliability Engineer
10.2022 - Current

Job overview

  • Here as some details about the key technologies on which I am working
  • AWS :
  • Designing, implementing, and managing cloud infrastructure
  • Troubleshooting by reviewing logs, monitoring metrics and getting feedback from users
  • Reviewing logs, errors or warnings in CloudWatch or CloudTrail
  • Review Security Group 3network ACL's, IAM policies
  • Planning disaster recovery (DR) by identifying critical systems and services like application, data etc
  • Develop DR based on recovery time objective (RTO) and recovery point objective(RPO)
  • Implementing High availability by including ELB, Autoscaling, Multi AZ in RDS
  • Design, build, manage, and operate the infrastructure and configuration of SaaS applications with a focus on automation and infrastructure as code
  • Ensure infrastructure security compliance
  • Disaster recovery planning and regional failover testing
  • Automatic scaling of infrastructure and services
  • Conduct post-mortems to analyze and prevent repeat failures
  • Kubernetes:
  • Ensuring reliability and high availability of the cluster
  • Monitoring the cluster health, resource usage and application performance with tools like Prometheus and Grafana
  • Regularly updating Kubernetes cluster for bug fixed, security patches
  • Managing worker nodes like adding or removing nodes, monitoring node health and resources
  • Backing up Cluster configuration and data for DR
  • Testing the recovery process
  • Maintaining the security of the cluster by configuring network security policies, managing user access and authentication, monitoring the cluster for security threats
  • Troubleshooting network connectivity, application failures
  • Analyzing logs, metrics and other data to identify the root cause of problem
  • Site Reliability Engineering (SRE):
  • Monitoring the health, performance, and availability of production systems and services using tools like Prometheus, Grafana
  • Automating operational tasks and procedures using tools like Ansible, Terraform, or Jenkins to reduce manual work and improve efficiency
  • Responding to and resolving incidents in production environments, working closely with development teams to identify and resolve the root cause of the problem
  • Azure DevOps/Jenkins:
  • Used Azure DevOps to build a complete pipeline with integration of multiple plugins like Sonarqube, Nexus, Kubernetes, Docker compose
  • Setting up build and release pipelines, configuring testing and deployment environments, and automating the entire pipeline
  • Implementing IAC using Terraform and automating with Azure Devops
  • Application Performance Monitoring: Setting up and monitoring application performance using tools like Application Insights, Log Analytics, or other third-party tools, to ensure that applications are performing optimally
  • Implementing security best practices and regulatory compliance in the Azure environment
  • This includes monitoring security vulnerabilities, applying security patches, configuring access controls, and enforcing compliance policies
  • Setting up and maintaining monitoring and logging solutions to track system and application health, monitor resource utilization, and troubleshoot issues
  • Docker:
  • Developing and building docker images
  • Writing dockerfiles, configuring dependencies, and optimizing image size
  • Deploying Docker containers to various environments, such as development, staging, and production
  • Monitoring the health and performance of Docker containers
  • Integrating Docker into your CI/CD pipeline to automate the building, testing, and deployment
  • Ensuring the security of Docker containers by implementing best practices, such as restricting access, scanning for vulnerabilities, and using secure images
  • Optimizing the performance of Docker containers and minimizing resource utilization by tuning settings like CPU and memory usage
  • Working collaboratively with cross-functional teams such as developers, testers, and operations teams to ensure smooth integration and delivery of products
  • CI/CD:
  • Design and Maintain CI/CD Pipelines
  • Creating CI/CD pipelines with Jenkins, ArgoCD and Spinnaker
  • Develop and optimize continuous integration and deployment processes (CI/CD)
  • Created several CI/CD pipelines with Azure DevOps
  • IAAC:
  • Expertise in systems architecture and design, infrastructure automation using Terraform
  • Develop and maintain Terraform configs, extend these configurations to support new services, features and multiple environments
  • Configure CI/CD pipelines with Jenkins
  • Configure Zero-downtime deployments
  • Python:
  • Ability to write scripts in an administrative language (Python, Shell)
  • Puppet/Ansible:
  • Tasks related to Software/Package Installation
  • Tasks related to the changes to be made to the configuration files and Scheduling jobs
  • Tasks related to Installing, Configuring, and managing various services
  • Configure Puppet Modules and manifests
  • Monitoring:
  • Establish monitoring and alerting of production systems and critical applications
  • Monitor and ensure smooth operation of production and test environments by executing common sysadmin (system administration) tasks and automating repetitive tasks
  • Occasionally provide support and initial troubleshooting when required by reviewing dashboards and logs to ensure system issues are timely addressed
  • Setup production monitoring with Zabbix, Grafana and Prometheus
  • Setting up 24/7/365 production logging, monitoring, and alerting
  • Experience with logging, monitoring, and alerting tools such as New Relic, ELK, DataDog, and PagerDuty
  • Linux:
  • Maintaining and provisioning Linux servers (Centos, Ubuntu)
  • Experience supporting and managing RedHat LINUX operating systems
  • Strong Linux system administration and troubleshooting skills with a focus on performance tuning, security hardening and high availability
  • GIT:
  • Maintaining GIT repository
  • Implementing GitFlow and GITOps practices
  • Handling merge request
  • Configuring Post and pre commit hooks
  • Define Branching and release strategy
  • Advice on Solution versioning control best practices
  • Non-Technical Skills:
  • Own on-call shift to prevent incidents
  • Great communication skills
  • Likes to work in a team environment
  • Knowledge and experience using Jira and Confluence
  • Proficiency with case documentation and digital communications
  • Experience managing the entire customer issue lifecycle from case creation to development escalation, to resolution
  • Experience writing escalation tickets to fully describe customer issues and dive deep into root causes
  • Experience working within an Agile/Scrum SDLC
  • Troubleshoot and solve customer issues on production deployments
  • Ensure that SLAs are met in executing operational tasks
  • Manage risk and resolves issues that affect release scope , schedule, and quality.

Saba Software

DevOps Engineer
08.2016 - 08.2021

Job overview

  • CI/CD:
  • Task: To setup CI/CD pipelines on cloud platforms
  • Designing DevOps workflow for multiple applications by orchestrating Test, Build, Release and Deploy phases through various CI/CD pipelines using Git, Jenkins, Docker, Kubernetes, Ansible & Cloud formation tools
  • Built end to end CI/CD Pipelines in Jenkins to retrieve code, compile applications, perform tests and push build artifacts to Nexus Artifactory
  • Automated CI/CD pipeline for the monitoring tools Docker containers and written script to test them
  • Performed all necessary day-to-day Subversion/GIT support for different projects and responsible for designing and maintenance of the Subversion/GIT Repositories, views, and the access control strategies
  • Used GIT as source code repositories and managed GIT repositories for branching, merging, and tagging
  • Monitoring Setup:
  • Task: To setup OpenNMS and Nagios monitoring for RS and NTT production Servers for monitoring Severs and Service Availability
  • Installing and configuring NMS to poll Production Servers for availability of services like Ping, SSH, Http, Tomcat, Redis, Active MQ etc
  • Configuring SNMP to create SNMP Graph for all servers with Disk Space Utilization, CPU Load, Service Availability
  • Configuring Dashboard with Alarms, Events, and graphs with 120 sec refreshes
  • Scheduling Reports via Email for Service Availability for particular time Period
  • Creating Notification via Emails about Alarms generated
  • Configured NMS to be available via https traffic
  • Monitoring Remote servers' connectivity with Remote Pollers
  • Created Threshold for alerting disk space at reaching 80 %
  • Integrating CLMS to Create Events in NMS Events Dashboard
  • Integrating Nessus for Vulnerability Scan
  • Configuring PEN proxy for Partner Connectivity Monitoring
  • Setting up Cron for daily backup from PG_SQL server
  • 3) AWS environment setup:
  • Worked on Amazon Web Services daily to work on EC2 instances, S3 buckets, Amazon Glacier and EBS
  • Worked on the migration from VMware to AWS and used Chef to automate the infrastructure in AWS by creating EC2, S3, RDS, VPC and Route 53
  • Managed storage in AWS using Elastic Block Storage, S3, created Volumes and configured Snapshots
  • Configured Inbound/Outbound in AWS Security groups according to the requirements
  • Extensively automated the deployments using AWS by creating IAMs and integrated the Jenkins with AWS plugins to pipeline the code
  • Created Cloud Formation templates to automate the AWS Deployments
  • 4) Docker and Kubernetes:
  • Configured Docker container for branching purposes and Implemented Docker automation solution for Continuous Integration/Continuous Delivery model
  • Deployed Kubernetes application on multiple cloud provides like AWS EKS, GKE and AKS
  • Created CI/CD pipelines for deployment to Kubernetes clusters.

Salmon Commerce Pvt Ltd

DevOps Engineer
03.2015 - 06.2016

Job overview

  • Major Projects:
  • Cloud Hosting:
  • Deploy and monitor scalable infrastructure on Amazon web services (AWS) & configuration management using Puppet
  • Deploy Ansible and Jenkins to completely provision and manage AWS EC2 instances, volumes, DNS, and S3
  • Involved in creating and working through Docker images, containers, and Docker Consoles for managing Application Life cycle
  • Azure :
  • Used Azure DevOps to build a complete pipeline with integration of multiple plugins like Sonarqube, Nexus, Kubernetes, Docker compose
  • Build pipelines to orchestrate containers with Azure Kubernetes services
  • Optimized CI/CD pipelines for best performance
  • Used multiple agents to run pipeline on many servers in parallel
  • Containerization:
  • Involved in creating and working through Docker images, containers, and Docker Consoles for managing Application Life cycle
  • Responsible for continuous integration and continuous delivery using Jenkins, Maven and Chef
  • Worked with Red Hat OpenShift Container Platform for Docker and Kubernetes
  • Used Kubernetes to manage containerized applications using its nodes, ConfigMaps, selector, Services, and deployed application containers as Pods
  • Troubleshoot and resolved Build failures due to infrastructure issues reduced by 95% stabilizing the build process
  • Setup and executed process to code review system effectively
  • Developed and implemented Software Release Management strategies for various applications according to the agile process
  • System Administration:
  • Performed Installation, Configuring and troubleshooting the WebLogic Application Servers 8.1/9.2
  • Installed and configured Apache 1.3 as a plug-in on the UNIX environment and deployed applications
  • Involved in Supporting LDAP, admin Console, iPlanet, WebLogic Applications, Sales Tools
  • Worked closely with designers, developers, and partners
  • Installed, configured, and maintained iPlanet web server on Solaris 2.x
  • Worked for Application Deployment on WebLogic 8.1 on a single server and a cluster of Servers
  • Worked on the Red Hat Enterprise 4/5 platform and Sun Solaris one
  • Configured Clusters on WebLogic using the WebLogic Console
  • Performed hot deployment of the application on WebLogic7.0and 8.1
  • Performed analysis Memory Leaks, JVM heap settings and tuning
  • Configured SSL communication between Admin Server and Node Manager
  • Troubleshoot network problems between servers (clustering failover issues)
  • Designed, tested, and realized a wide range of services based on the latest technologies
  • Migration of application from WebLogic 8.1 Installation and administration of WebLogic
  • Installation of SUN Solaris-2.x, 8 on Sun Boxes Building a development, testing, staging Environment
  • Configured database connectivity using JDBC and Configured WebLogic Virtual hosts
  • Managed and Monitored the JVM performance by tuning WebLogic Heap Size, garbage collection, JDBC Pools
  • Involved in 24/7 support for production environments.

Education

Middlesex University

MSc
2022

Skills

  • Top Skills:
  • Cloud Services: Amazon Web Services(Major) , Azure, Google Cloud
  • Container tools: Docker, Kubernetes
  • OS : Linux
  • Provisioning Tools: Puppet, Ansible, Terraform, CloudFormation , HELM
  • App Servers: Apache Tomcat, JBOSS, Apache tomcat, WebSphere, WebLogic, Python
  • Version Control Systems: GIT
  • Build Automation: Azure DevOps, CI/CD, Jenkins, GITHUB Actions, Aws Code build, Pipeline, Code build and Code pipeline for CI/CD
  • Build Tools: Bamboo, Maven, ANT, Gradle
  • Artifactory: Nexus, Jfrog
  • Code Quality: SonarQube, Trivy
  • Continuous Deployment: ArgoCD , FluxCD
  • Service Mesh: Istio
  • Databases: MySQL, Oracle, PostgreSQL
  • Ecommerce Solutions: Hybris, Intershop, Magento, IBM WebSphere, Oracle SOA
  • Linux Versions : RHEL, CentOS, Ubuntu
  • Web Servers: Apache, Nginx
  • Upgrading and Patching: Spacewalk, RHN Satellite
  • Virtualization: Docker, Vagrant, VirtualBox, VMware
  • Directory Services: Open LDAP
  • Load Balancing: HA Proxy, AWS Elastic Load Balancing (ELB), Autoscaling
  • Monitoring tools: Zabbix, Nagios DataDog, Grafana, Dynatrace, ELK, Azure Monitoring, CloudWatch, Prometheus, Grafana, Kibana, New Relic, DataDog
  • Log Management: Splunk, Elasticsearch, Kibana, Logstash, Solr
  • Networking: TCP/IP, DNS, DHCP, HTTP, TLS
  • Other AWS Services Knowledge: S3, CloudFront, SNS, SQS, ECS
  • Troubleshooting, Server security, Service monitoring, Backup management, Capacity planning, Strategic planning, Documentation, SLA's, Agile workflow, JIRA
  • Experience working in a hybrid production environment (SaaS/PaaS - AWS Azure on-premises)
  • LANGUAGE
  • Scripting: Python, Bash
  • Kubernetes
  • Software development
Availability
See my work availability
Not available
Available
Monday
Tuesday
Wednesday
Thursday
Friday
Saturday
Sunday
morning
afternoon
evening
swipe to browse

Languages

English
Fluent

Timeline

DevOps Engineer/Site Reliability Engineer

Aidizital Solutions
10.2022 - Current

DevOps Engineer

Saba Software
08.2016 - 08.2021

DevOps Engineer

Salmon Commerce Pvt Ltd
03.2015 - 06.2016

Middlesex University

MSc
Krishna Koushik Gogulapaty