Site Reliability Engineer Resume
Seattle, WA
SUMMARY
- Around 5 years of experience in the IT industry with a strong background in Linux administration, build and release, DevOps, and cloud, and knowledge of practices including configuration management, containerization, continuous integration, and continuous delivery.
- Experience in automation of code deployment, support and administrative tasks across multiple cloud providers such as Amazon Web Services, Microsoft Azure, Google Cloud and OpenStack.
- Proficiency with AWS developer tools like AWS CLI, CloudFormation templates and workflows.
- Expertise in utilizing AWS services such as EC2, RDS, S3, Lambda, Elastic Beanstalk, EFS, Glacier, Storage Gateway, DynamoDB, ElastiCache, Redshift, VPC, CloudFront, Route53, Direct Connect, API Gateway, EBS, AMI, SNS, CloudWatch, ELB, Auto Scaling, IAM, OpsWorks, KMS.
- Sound knowledge of setting up databases in AWS using RDS, configuring storage with S3 buckets, and backing up instances to S3 buckets.
- Experience in managing Infrastructure as Code with tools like Chef, Puppet, Ansible, Terraform, and CloudFormation templates.
- Exposure to building highly scalable and reliable Azure cloud services using IaaS, DBaaS, and PaaS offerings such as Azure Data Factory, OMS, Key Vault, Traffic Manager, Service Bus, Virtual Machines, and App Services components.
- Knowledge of container-based deployments using Docker, Docker Swarm, and Kubernetes.
- Demonstrated proficiency in creating and maintaining Chef cookbooks and defining recipes using Ruby.
- Good knowledge of writing Ansible playbooks and Salt formulas to automate application deployment, configuration, and system package management.
- Experienced in setting up CI/CD pipelines for automation of agile process using tools like Jenkins, TeamCity, Visual Studio Team Services.
- Well experienced in managing source code using version control tools like Git and Team Foundation Version Control.
- Familiarity with build tools like Ant, Gradle, Maven, MSBuild, Visual Build, and Gulp for building deployable artifacts from source, and with static code analysis using SonarQube.
- Experience in administering and configuring Nexus Repository for Maven builds.
- Strong knowledge of Monitoring and log analytics tools like Nagios, Dynatrace, New Relic, Splunk, Elasticsearch, Logstash, and Kibana (ELK), Grafana, Datadog, Sumo Logic, AppDynamics.
- Expertise in Design and development of multi-tier applications using J2EE, Spring, Spring Boot, Web Services, Hibernate, Spring MVC, JSF, Struts, JavaScript, XML.
- Experience with scripting languages like Shell, Bash, Python, PHP, and Ruby, and with YAML, for automating deployment.
- Deep understanding of web and application servers like JBoss, WebSphere, Nginx, Apache Tomcat, and WebLogic.
- Good knowledge of virtual environments such as VMWare ESXi, Hyper-V, VirtualBox, Xen.
- Practical knowledge in configuring servers to provide network services including HTTP/HTTPS, TCP/IP, FTP, SSH, DNS, LDAP, SMTP, SFTP
- Experienced in software development methodologies like Agile/Scrum and Waterfall, and in microservices architecture.
- Knowledge in managing and administering Linux, Windows, and Linux/Windows hybrid environments.
- Ready to provide 24/7 on-call support on a rotation basis.
- Ability to work effectively and professionally with all teams to deliver a high-quality product on time.
- Capable of learning new technologies and processes and successfully applying them to projects and operations.
TECHNICAL SKILLS
Cloud Environment: Amazon Web Services, Azure, Google Cloud Platform, OpenStack
Containerization Tools: Docker, Docker Swarm, Kubernetes, OpenShift, Mesos, Marathon
Configuration Management: Puppet, Chef, Ansible, SaltStack
Continuous Integration: Jenkins, Bamboo, Hudson
Build tools: Maven, Ant
Version Control: Git, TFVC, TFS, Subversion (SVN)
Monitoring tools: Nagios, Splunk, ELK, Grafana, New Relic, Datadog
Ticketing tools: Jira, Scrum, Bugzilla, ServiceNow, Pivotal Tracker
Webservers: Apache Tomcat, Nginx, WebSphere, JBoss, WebLogic
Virtualization tools: VMware, VirtualBox, Hyper-V, Vagrant
Operating Systems: RedHat, CentOS, Ubuntu, Debian, Windows, macOS
Repository management tools: Nexus, JFrog Artifactory
Programming & Scripting Languages: HTML, Python, Bash, Shell, Ruby, Perl, XML
Databases: SQL Server, MongoDB, Cassandra DB, Oracle, MySQL, PostgreSQL
Networking: HTTP, HTTPS, TCP/IP, UDP, DNS, FTP, SSH, SNMP, SFTP
PROFESSIONAL EXPERIENCE
Confidential, Seattle, WA
Site Reliability Engineer
Responsibilities:
- Created an Azure Container Registry to store all the Docker images for the application and worked closely with the developers on deploying the applications.
- Configured Azure Application Insights for monitoring and troubleshooting Azure resources.
- Managed and configured Azure WebJobs to run continuous and scheduled tasks and wrote ARM templates to deploy Azure services.
- Implemented Azure App Service Environment and configured DNS and Application Gateway to restrict applications and APIs to intranet-only access.
- Worked with Terraform key features such as Infrastructure as code, Execution plans, Resource Graphs, Change Automation.
- Built a proof of concept and created multiple Terraform modules to manage configurations and applications and to automate the installation process on Azure.
- Created on-premises HA Kubernetes clusters in different environments for the Confidential careers, rewards, and menu pages. Also responsible for managing and troubleshooting multiple on-premises Kubernetes clusters by setting up monitoring and alerting.
- Configured Rancher for managing Kubernetes cluster and installed Kubewatch to send notifications to Slack.
- Installed Helm for easy deployment of applications and deployed the containerized NodeJS applications.
- Configured an Ingress Controller to distribute incoming traffic to the related services and installed the Kubernetes dashboard to give developers a readable view of all the clusters.
- Administered and maintained the Docker runtime environment, including versioning and lifecycle management of Docker images; experienced with Docker orchestration frameworks.
- Set up Jenkins as a service inside the Kubernetes cluster to reduce failover downtime to minutes and to automate Docker container deployment without using a configuration management tool.
- Participated in the on-call rotation supporting the Confidential web application (except for transactions), which is a mix of legacy ASP.NET and NodeJS applications, a customer-driven platform for UI/UX orchestration, content services, and a few APIs that are responsible for Offers, Rewards, Recommendations, etc.
- Configured security and system settings in Jenkins, added multiple nodes to Jenkins, and configured SSH for continuous deployment.
- Wrote Salt formulas to containerize legacy applications including Memcached and Elasticsearch.
- Wrote Shell/Python scripts to automate the deployment process and assisted developers in deployments using Azure DevOps
- Worked extensively with Jenkins and Docker for continuous integration and end-to-end automation of all builds and deployments.
- Reduced critical vulnerabilities on Windows servers that had not been patched for a few years by writing a PowerShell script and applying 100+ patches to each server.
- Wrote a PowerShell script to collect application logs from Azure and send them to Splunk.
- Extensive experience with Git operations such as forking, branching, merging, tagging, pushing, pulling, and cloning.
- Used GitHub repositories for developer commits and cloned the code into Jenkins.
- Used Ansible for multi-tier deployment and microservices deployment by writing playbooks.
- Managed and configured multiple instances of Splunk forwarders, indexers, search heads and deployment servers.
- Participated in the on-call rotation and provided 24x7 support for Confidential mobile and web applications along with the customer-driven platform for UI/UX orchestration.
- Worked closely with application business owners and operations teams in an Agile/Scrum/Kanban environment to deliver a high-quality product.
Environment: Azure, Kubernetes, Docker, Ansible, SaltStack, Terraform, Helm, Git, Jenkins, Bash, Datadog, New Relic, Splunk, GlusterFS, Heketi, Python, PagerDuty, ITSM, Jira, Confluence
Confidential, Fort Lee, NJ
AWS DevOps Engineer
Responsibilities:
- Wrote CloudFormation templates to automate provisioning of Auto Scaling, EC2, VPC, and other services.
- Reduced server crashes under heavy website traffic by spinning up EC2 instances integrated with ELB and Auto Scaling.
- Used Terraform in AWS Virtual Private Cloud (VPC) to automatically set up and modify settings by interfacing with the control layer.
- Used AWS Elastic Beanstalk for deploying and scaling web applications and services developed with Java, PHP, Node.js, Python, Ruby, etc.
- Achieved a high level of security by maintaining the security groups and controlling the inbound and outbound traffic allowed to reach the instances.
- Defined lifecycle policies for EC2 Container Registry to automatically remove outdated container images.
- Worked on AWS API Gateway custom domains and record sets in Amazon Route53 for applications hosted in the AWS environment.
- Designed and implemented Azure Site Recovery both for disaster recovery and for migrating workloads from on-premises to Azure.
- Managed Clusters with various Servers in Azure Cloud Resource Groups.
- Collaborated on planning the move to Google Cloud Platform.
- Worked on CI/CD pipeline and created sandbox, UAT and Production Environments in Google Cloud Platform
- Used Kubernetes to deploy and manage containers in AWS, which helped make our application platform-independent.
- Used Kubernetes to deploy, load balance, scale, and manage Docker containers with multiple namespace versions.
- Containerized all the Spring Boot (Java) and Node.js applications using Docker.
- Added and dropped Linux capabilities for containers using Docker's --cap-add and --cap-drop options, respectively.
- Used Ansible for multi-tier deployment and microservices deployment by writing playbooks
- Assisted in analyzing internal and external data by querying Google BigQuery on Google Cloud Platform with Standard and Legacy SQL.
- Leveraged several Jenkins plugins, including the aws-ec2 plugin, to automate tasks like code coverage, metrics, and job creation; regularly ran Jenkins builds and delivered the artifacts to the Nexus repository.
- Automated MySQL container deployment in Docker and monitored these containers using Nagios.
- Worked on infrastructure with Docker containerization, and created Docker containers leveraging existing Linux containers in addition to creating Docker containers from scratch.
- Created a Jenkins job to build the artifacts using Maven and deployed the Terraform templates to create the stack.
- Integrated Maven with Bash to create automatic builds and deployments on application servers
- Installed and configured monitoring tools like ELK and Nagios and used them to monitor network services and host resources.
Environment: AWS, GCP, Kubernetes, Git, Jenkins, Ansible, Docker, MySQL, Nexus, Apache Tomcat, Pivotal Tracker, Maven, ELK, Nagios
Confidential, Newport News, VA
AWS DevOps Engineer
Responsibilities:
- Assisted in designing AWS CloudFormation templates to create custom-sized VPCs, subnets, and NAT to ensure successful deployment of web applications and database templates.
- Instantly launched or scaled up high performance computing clusters on AWS with low latency.
- Managed a CDN on Amazon CloudFront (origin server/S3) to improve site performance.
- Used IAM to provide security and to manage user access by creating policies and assigning those policies to groups of users.
- Created Shell and Python scripts to automate the creation of AMIs through pre-boot and bootstrapping techniques
- Used AWS Elastic Beanstalk for deploying and scaling web applications and services developed with Java, PHP, Node.js, Python, Ruby, etc.
- Optimized the AWS cloud infrastructure for production, staging, and development environments, troubleshot performance issues, and resolved production incidents.
- Reduced storage costs by creating a Lambda job to move infrequently accessed files from AWS S3 to Glacier for archival storage
- Created additional Docker containers as slave nodes for Jenkins using custom Docker images and pushed the images to Elastic Container Registry.
- Administered and maintained the Docker runtime environment, including versioning and lifecycle management of Docker images; experienced with Docker orchestration frameworks.
- Deployed code on web application servers like WebSphere, WebLogic, Apache Tomcat, JBoss, and built microservices using API Gateway.
- Containerized all the Spring Boot (Java) and Node.js applications using Docker.
- Wrote Chef cookbooks for various DB configurations to modularize and optimize end-product configuration, converted production support scripts to Chef recipes, and provisioned AWS servers using Chef recipes.
- Used Docker Compose to run multiple containers, wrote compose.yml files to configure and scale multiple application services, and dockerized applications by creating Docker images from Dockerfiles.
- Developed builds using Maven as the build tool and used CI tools to kick off builds from one environment to another.
- Implemented Jenkins Master-Slave mechanism by configuring EC2 instances as slaves.
- Implemented availability, performance and log monitoring over AWS cloud platform using Splunk and Grafana.
- Wrote Bash scripts to deploy Java-based packages such as EAR and WAR files on the Tomcat application server.
- Troubleshot web servers by checking config files and open ports.
Environment: AWS, Git, Jenkins, Docker, Nagios, MySQL, Nexus, Apache Tomcat, Pivotal Tracker, Chef, Splunk, Grafana, Python, Shell scripting, WebSphere, Terraform, Kanban, Jira, Confluence
Confidential
Build & Release Engineer
Responsibilities:
- Performed software configuration and release management activities for three different Java applications.
- Developed Perl and shell scripts for automation of build and release process.
- Involved in defining, documenting, negotiating and maintaining Product/Application Release Roadmap.
- Integrated Git into Jenkins to automate the code check-out process.
- Performed Git merges and resolved any conflicts once the code was deployed into the production environment or a feature branch was ready to be released.
- Worked with an Agile development team to deliver an end-to-end continuous integration/delivery product using Rundeck integrated with Jenkins.
- Responsible for the development and maintenance of processes and associated scripts/tools for automated build, testing, and deployment of the products to various departments.
- Modified pom.xml files, performed releases with Maven release plugin
- Installed and Administered Jenkins CI for Ant and Maven Builds.
- Maintained and tracked inventory using Jenkins and set alerts for when servers were full and needed attention.
- Extended an existing Puppet configuration for an operations tools start-up to enable seamless full infrastructure provisioning for site redundancy and staging/development environments.
- Used Jira to track issues and change management
- Involved in administration and troubleshooting of various Web Application Servers (WAS).
Environment: Git, Jenkins, Maven, Puppet, Jira, Perl, Shell scripting, MS SQL, MySQL, WAS, Agile methodology
Confidential
Linux System Engineer
Responsibilities:
- Resolved TCP/IP network access problems for clients. Developed, maintained, and updated various UNIX shell scripts for service control (start, stop, restart, recycle).
- Maintained security by installing and configuring SSH (Secure Shell) for encrypted access on Ubuntu and Red Hat Linux.
- Monitored server performance and troubleshot server-related problems using SNMP.
- Scheduled various regular, periodic, future, and queued tasks using crontabs.
- Submitted reports such as routine performance analysis, capacity analysis, and security audit analysis to the customer for necessary planned changes.
- Monitored system performance, Server load and bandwidth issues.
- Managed program cost, schedule and reported program performance metrics to management
- Installed Oracle patches and performed troubleshooting; created and modified application-related objects; created profiles, users, and roles; and maintained system security.
- Configured iptables and used tcpdump and the Nmap port scanner to test the firewall configuration.
- Deployed and managed CAD file servers with improved availability through RAID storage sub-systems.
- Performed root cause analysis and troubleshooting to investigate problems and implement appropriate resolutions
Environment: Unix, Linux, Cron, NFS, SNMP, SSH, Oracle, NIS, RAID, CAD