Platform/Infrastructure Engineer Resume
Atlanta, GA
SUMMARY
- Over 6 years of IT experience in System Administration, Cloud and DevOps, with expertise in designing, installing, configuring and automating various implementations of CentOS, RHEL and Ubuntu, and experience with AWS and Microsoft Azure in a DevOps culture through Continuous Integration (CI) and Continuous Deployment/Delivery (CD) as an iterative process and automation of Infrastructure as Code.
- Involved in deploying the content cloud platform on Amazon Web Services using EC2 instances, S3, RDS, EBS, Elastic Load Balancer, Auto Scaling groups and AMIs. Hands-on experience configuring network architecture on AWS with VPC, subnets, Internet gateway, NAT and route tables.
- Experienced in working as a System Administrator on Red Hat Enterprise Linux (RHEL) based distributions, including tasks like troubleshooting, security, backup, disaster recovery, system builds, server builds, installs, upgrades, patches, migration, performance monitoring and fine-tuning.
- Experience with configuration management and automation tools such as Ansible, Chef, Puppet and custom-built tooling, and with designing cloud-hosted solutions to manage Amazon Web Services (AWS) infrastructure, along with experience across the specific AWS product suite.
- Extensively worked on routing data through AWS network infrastructure for cost-effective, highly available and fault-tolerant applications, using Elastic Load Balancer, Auto Scaling, CDN, Route 53 and VPC Peering. Also used EBS to store persistent data and mitigate failures.
- Experience in developing Azure ARM templates to perform immutable deployment of Azure VMs with the corresponding Storage accounts, Azure Key Vault, Azure Blob storage and Azure disks.
- Worked on implementation, configuration and administration of VMware ESXi 5.1, 5.5, 6.0 and 6.5, vCenter, vMotion, VMware SRM 5.x/6.0, DNS, DHCP, NIS, NFS and SMTP, as well as Physical to Virtual migration, host clustering with HA and DRS, Site Recovery Manager, and managing SAN datastores with iSCSI, NFS and FC.
- Hands-on expertise in building Kubernetes clusters in the cloud and deploying clustered applications. Built a Jenkins pipeline to drive all microservice builds out to the Docker registry and then deploy them to Kubernetes, and created and managed pods using Kubernetes.
- Experience in deploying Kubernetes clusters using Ansible playbooks through the kubeadm cluster deployment tool, and in configuring NGINX Plus as an ingress controller.
- Experience in creating Azure Load Balancers and Azure autoscaling to design fault-tolerant, cost-effective and highly available systems. Hands-on experience creating user and group accounts and attaching policies to them using Azure Active Directory, and sound knowledge of using Azure Load Balancer and Block Blob storage for storing data.
- Experience in working with the OpenShift platform to manage Docker containers and Kubernetes clusters, primarily used to perform rolling deployments of Java Spring Boot-based applications.
- Worked on creating users and groups with their permissions and roles, startup and shutdown scripts, cron jobs, file system maintenance, setting up NFS and configuring NTP.
- Experience in installing, configuring and maintaining Apache/Tomcat, Samba, Sendmail, httpd, SMTP, DHCP, NFS, NIS, NIS+, LDAP, DNS, Squid, Postfix, FTP and remote access, along with security management and security troubleshooting skills.
- Experience in implementing branching strategies with feature-based branches on GitHub and resolving merge conflicts on Git-based tools like GitHub, VSTS, Bitbucket and GitLab.
- Designed and implemented a CI system: configured Jenkins servers and Jenkins nodes, created the required Perl and Python scripts to deploy different applications, and implemented code coverage and unit test plug-ins with Maven in Jenkins. Implemented CI workflow, project testing and deployments with Jenkins.
- Experience with Atlassian JIRA installation, administration and maintenance, with the ability to create JIRA workflows, screen schemes, permission schemes and notification schemes for the JIRA bug/issue tracker.
- Installed, configured and managed monitoring tools such as Splunk, Nagios, Icinga and ELK for resource monitoring, network monitoring and log trace monitoring.
- Experience in Red Hat Linux (5.x, 6.x & 7.x), CentOS (5.x, 6.x & 7.x), Ubuntu and Windows Server (2008 R2/2012 R2) environments, and in configuring servers using Kickstart and PXE boot installations to install Linux on several machines at once.
- Extensive experience in creating LVM logical volumes, volume groups and file systems; increasing and reducing logical volumes; managing swap; and mounting and unmounting file systems. Strong knowledge of RAID technologies.
- Strong knowledge of the principles and best practices of Software Configuration Management (SCM) in Agile, Scrum and Waterfall methodologies. Worked on JavaScript, Web Services and databases like Oracle, MS Access and SQL Server.
- Monitored servers and checked network performance using Nagios, AppDynamics, Dynatrace, Splunk and Zabbix for resource monitoring, network monitoring and log trace monitoring, along with logging tools like ELK.
- Worked with bug tracking tools like JIRA, Remedy and Bugzilla, and configured various workflows, configurations and plugins for the JIRA bug tracker.
TECHNICAL SKILLS
Operating Systems: RHEL/CentOS 5.x/6.x/7, Ubuntu/Debian/Fedora, Solaris 7/8/9/10, Windows Server 2008/2012/2016
Build/Automation Tools: Ansible, Puppet, Chef, Ant, Maven, Jenkins, Hudson, TeamCity & Bamboo
Languages: Shell, Bash, Ruby, Groovy, HashiCorp Configuration Language (HCL) and Python scripting
Databases: MySQL, MongoDB, Cassandra, PostgreSQL, SQL Server
Web/App Server: Apache, IIS, HIS, Tomcat, WebSphere, WebLogic, JBoss, GlassFish
Bug Tracking Tools: JIRA, Fisheye, Crucible, Rally, Remedy, IBM ClearQuest, Bugzilla, HP Quality Center
Version Control Tools: Subversion, Git, TortoiseCVS, VisualSVN, IBM ClearCase, Perforce
Cloud technologies: AWS EC2, VPC, EBS, AMI, SNS, RDS, Aurora, Redshift, CloudWatch, CloudFormation, AWS Config, S3, Lambda, CloudTrail, IAM, VMware, Microsoft Azure, Google Cloud Platform, OpenStack
PROFESSIONAL EXPERIENCE
Confidential - Atlanta, GA
Platform/Infrastructure Engineer
Responsibilities:
- Administered, provisioned and configured various AWS cloud services such as Amazon EC2, S3, IAM, CodeBuild, CodeCommit, Lambda, EKS, ECR, Cloud9 and CloudWatch.
- Contributed significantly to enhancing the infrastructure as code written in HCL (HashiCorp Configuration Language) to provision the infrastructure for the QA and Production environments using Terraform.
- Resolved service requests for issues hindering the regular functioning of the infrastructure and for security configuration changes, such as updating the listener rules of the Application Load Balancer (ALB) and modifying IAM policies attached to the IAM roles of AWS services like Lambda, EC2, CodeBuild and CloudFormation stacks, so that those services had the access they needed to function properly.
- Composed and configured various serverless Lambda functions, triggered on cron schedules through CloudWatch, to handle maintenance of the cloud infrastructure, such as taking daily instance snapshot backups and deleting stale AMIs (Amazon Machine Images) and snapshots older than a month to cut costs.
- Provided workspaces to users by provisioning and configuring compute resources (EC2 instances), granting role-based access using IAM and linking the resources to the users, who could then access the workspace through the AWS Cloud9 terminal.
- Scripted Packer files to enable services like the Qualys Agent as soon as the EC2 instance spins up, simplifying asset discovery and tracking. The custom image is baked using AWS CodeBuild and the generated images are stored in Amazon ECR.
- Assisted other platform teams in porting the HCL (HashiCorp Configuration Language) Infrastructure as Code to their platforms, tailored to each team’s particular environments, requirements and configurations.
- Deployed a Jenkins sandbox environment in the non-prod and prod EKS cluster environments, implemented shared libraries and modularized the CI-CD toolset.
- Collaborated with platform associates and automated the CI-CD process for all the micro-services.
- Modified the CI-CD environment by integrating configuration as code: hardcoded the plugins in the Jenkins Dockerfile and pushed the image to Amazon ECR, from which it could be pulled and deployed onto the EKS cluster.
- Monitored and administered the Jenkins CI-CD environment using the Slack notifier, and resolved build issues to ensure all services deployed successfully on the EKS cluster following a code commit by a developer.
- Documented the entire CI-CD toolset and the processes implemented in the project on Confluence, which was then used to onboard new teammates.
- Resolved and administered service requests to grant access to AWS resources, GitHub and Jenkins.
- Worked on migrating the Jenkins server to the Amazon Web Services cloud and moving jobs from Git, resolved conflicts related to merging source code in Git, and followed up with code quality analysis using SonarQube and bug fixes.
- Deployed the NGINX ingress controller on the non-prod and production EKS clusters, and configured and whitelisted the IPs of the individual micro-services and Jenkins.
- Installed and deployed cert-manager on the production cluster, and configured the cluster issuer to represent the certificate authorities.
- Installed and configured the AppDynamics agent on individual micro-services to monitor and manage the performance and availability of applications across the cloud environment.
- Imported data and backed high-performance metrics by configuring a dedicated Grafana instance on Grafana Cloud, and integrated log aggregation and storage with Loki.
- Provided the monitoring solution using Prometheus to collect time-series data and raise alerts on performance at the container and micro-service level.
- Installed and configured Fluent Bit to collect, process and forward logs to Splunk for seamless monitoring of cluster-level resources.
- Deployed various monitoring and logging tools on the non-prod and production clusters, such as Prometheus, Grafana, Alertmanager, kube-state-metrics, node-exporter and AppDynamics, to monitor performance at the cluster and microservice levels.
- Collaborated with other teams to develop Jenkins shared libraries used across the organization for static code analysis, building, publishing to ECR and deploying on the cluster for all Node.js and Spring Boot based microservices.
- Developed a Spring Boot Helm chart to deploy Spring Boot based microservices, which is called by the Jenkins shared libraries in the final helm-deploy stage.
Confidential - Houston, TX
Cloud Engineer
Responsibilities:
- Provisioned, configured and troubleshot AWS cloud services including EC2, S3, RDS, ELB, EBS, Auto Scaling groups, CloudWatch and CloudFront, and managed IAM accounts (with MFA) and IAM policies to meet security audit and compliance requirements.
- Configured Elastic Load Balancer (ELB) for high availability across multiple subnets in multiple availability zones, and used Amazon Route 53 to manage DNS zones and give public DNS names to Elastic Load Balancer IPs.
- Created Python scripts to fully automate AWS services, including build servers, deploying EC2 instances in AWS and data center environments, CloudFront distributions, Elasticsearch and managing database security groups on AWS.
- Worked with Terraform to automate VPCs, ELBs, security groups, SQS queues and S3 buckets, and integrated Terraform with Jenkins and Git to build a continuous integration and test automation framework.
- Managed multiple Azure subscriptions, automating the provisioning of Azure VMs along with their Blob storage, file blobs and Azure disks.
- Automated the installation of the ELK agent with Ansible playbooks and used Ansible to deploy security tools and manage web applications, mount points and packages.
- Used Docker Hub to pull images, ran containers based on those images, configured automation using containers, and implemented several Tomcat/WebSphere instances on the Docker engine to run many containerized application servers.
- Used Terraform scripts to automate the creation of AWS resources such as subnets, security groups and route tables, and tasks such as encrypting S3 buckets and the EBS volumes backing AMIs.
- Migrated the current application to a microservice architecture using Docker container technology with Kubernetes and REST APIs, and worked with Docker and Kubernetes to perform CI/CD on public and private clouds.
- Orchestrated container applications using OpenShift and Kubernetes for container operations in AWS, created Kubernetes pods, and used the Kubernetes cluster to maintain services, load balancing and network policies. Provided PaaS on public and private clouds in VMware cloud and improved security using OpenShift.
- Worked on Kubernetes control plane to create API objects to maintain Kubernetes clusters in their desired state and run the applications on them.
- Implemented and automated non-relational databases like MongoDB and Cassandra, as well as relational databases like MySQL and PostgreSQL, and their clusters by creating Ansible playbooks for AWS, Azure and on-premises environments.
- Designed and wrote code to develop and configure the systems that power the Splunk multi-tenant architecture in the organization, and created Splunk applications to analyze Big Data. Strong knowledge of Splunk components such as indexers, search heads, forwarders, index replication, indexer clusters and the deployment server.
- Wrote scripts in Bash, Ruby, Python and PowerShell for setting up baselines, branching, merging and automation processes across environments using version control tools like Git and SVN on Linux and Windows platforms, and wrote troubleshooting Python code for the Lambda service.
- Used Python and the Flask micro framework to develop a RESTful API, starting with an endpoint that returns static data (dictionaries). Created a class with a few specializations and a few endpoints to insert and retrieve instances of these classes, and investigated how to run the API in a Docker container.
- Implemented AWS solutions using EC2, S3, RDS, EBS, ELB and Auto Scaling groups, created Python scripts to automate the backup of EC2 EBS volumes, and configured cron jobs to create snapshots of the volumes through the AWS API for EC2 instance storage.
- Developed end-to-end build and deployment automation scripts using Maven, associating Maven plugins with Jenkins, building artifacts from pom.xml files, pushing the artifacts to the Nexus repository and deploying them on a Tomcat server.
- Worked on source control management with GitHub and GitLab enterprise-level repositories, including configuring user access levels, monitoring logs, identifying merge conflicts and managing the master repository. Wrote scripts to back up and restore GitHub repositories, and used Jenkins checkouts for continuous integration.
- Designed and configured Gerrit on top of Git so that approval of changes is restricted to selected users other than the owner. Installed and configured the Gerrit client for pushing commits to Gerrit using different submit types.
- Created automated tests in Jenkins to retest products thoroughly with each change and ran automated tests in parallel, supporting Agile development and making bugs easier to find and fix.
- Worked on migrating the Jenkins server to the Amazon Web Services cloud and moving jobs from Git, resolved conflicts related to merging source code in Git, and followed up with code quality analysis using SonarQube and bug fixes.
- Worked with AWS deployment services such as AWS CloudFormation and AWS Elastic Beanstalk, and with Terraform, for efficient deployment of application infrastructure and for automating the creation of VPCs, ELBs, security groups, subnets, EC2 instances, RDS, SQS queues and S3 buckets, while continuing to replace the rest of the infrastructure.
- Worked on configuring fully automated server build management and monitoring, and on analyzing network traffic and security appliances with Splunk to identify intrusions and discover infection vectors and compromised accounts.
- Wrote new Nagios plugins to monitor Linux cluster nodes configured using Red Hat Cluster Suite, and worked with the implementation team to build and engineer servers on Ubuntu and RHEL.
Confidential
DevOps Engineer
Responsibilities:
- Provided detailed support to Microsoft Azure customers in the design, development, debugging and migration of on-premises infrastructure to public or private clouds. Designed and developed Cloud Service projects and deployed them to Web Apps, PaaS and IaaS.
- Worked as Cloud Administrator, configuring Web Apps/Function Apps, VNet integration, HCM, Application Gateway, App Insights, Active Directory, Azure Key Vault, encryption and security on Azure using ARM templates and PowerShell scripts.
- Planned and implemented disaster recovery solutions, capacity planning, data archiving, backup/recovery strategies, and performance analysis and optimization.
- Worked with Ansible to configure and integrate servers across cloud and on-premises environments, including designing and patching servers and creating new server replicas that contain all the packages and patches for the environment.
- Troubleshot Azure development, configuration and performance issues, and managed Azure VMs and networking (Load Balancers, App Gateway, Traffic Manager, etc.).
- Automated various infrastructure activities like continuous deployment, application server setup and stack monitoring using Ansible playbooks run through CI tools like Rundeck and Jenkins.
- Implemented cluster services using Docker and Kubernetes to manage local deployments, building a self-hosted Kubernetes cluster with Terraform and Ansible and deploying application containers.
- Used Ansible to manage web applications, environment configuration files, users, mount points and packages, drove end-to-end deployment of various components on the Azure cloud platform, and implemented Continuous Integration using Jenkins and Git.
- Developed PowerShell scripts and ARM templates to automate the provisioning and deployment process. Configured VM networking with reserved IPs, Health Monitors, Firewall rules, VM scale sets and availability sets in the Azure Cloud.
- Deployed microservices, including provisioning the Azure environment. Used Docker for virtualization to ship, run and deploy the application securely and speed up the Build/Release Engineering process.
- Created and scheduled cron jobs for tasks such as backups, system monitoring and removal of unnecessary files.
- Regularly collaborated with engineers in many other Microsoft verticals and disciplines: Browsers, Intune, Networking, Virtualization, Directory Services and App Services.
- Interacted with multiple teams responsible for the Azure platform to fix Azure platform bugs. Provided 24/7 on-call support for Azure configuration and performance issues. Excellent client interaction skills and proven experience in working independently as well as in a team.
- Migrated Java scripts and database scripts from Oracle, MS SQL Server and MySQL into different environments like Development, QA, UAT and Production on Red Hat Enterprise Linux.
- Participated in daily meetings, product backlog and other Scrum activities and artifacts in collaboration with the team.