We provide IT Staff Augmentation Services!

Aws/sre Devops Engineer Resume

0/5 (Submit Your Rating)

CA

SUMMARY

  • AWS/SRE cloud engineer with 10 plus years of experience in IT industry comprising of Systems Administration, Software Configuration Management (SCM), Build and Release management.
  • Currently working on Enterprise - wide development team for Tomcat/Java Based Applications, using CloudFormation/Service Catalog to support multiple Dev/test/pre-prod Environments.
  • Experienced in working on DevOps/Agile operations comprising of delivery management, design, development, release & deployment, cloud implementation, containerization and managing Infrastructure as code.
  • In-depth understanding in all phases of the software development life cycle (SDLC) with specific focus on developing CI/CD pipelines that helps quality software releases.
  • In-depth understanding of principles and best practices of Software Configuration Management (SCM) in Agile, Scrum and Kanban methodologies.
  • Developed scripts to automate DevOps functions, i.e., source code checking with continuous integration, automated test tool configuration, application topology configuration, automated deployment, and proactive environment monitoring & management, etc.
  • Experienced in using version control tools Git, Gitlab, GitHub and Bit bucket and has expert knowledge of source code management concepts (branching, merging, integration, versioning, etc.)
  • Configured and maintained Jenkins to implement the CI process and integrated the tool with Ant and Maven to schedule the builds and set up Continuous Integration for major releases in Jenkins.
  • Experience in administering and configuring Nexus Repository Manager for Maven builds.
  • Hands on experience writing CloudFormation and Terraform templates to automate the process of building the required infrastructure as code.
  • Experienced in writing bash scripts to automate deployments through CloudFormation and terraform.
  • Experienced in configuring and managing Chef master server and also update and create modules and push them to Chef clients.
  • Automated JBoss/Tomcat application server deployments using Chef manifests. Used these manifests to replicate various environments.
  • Managed Linux and Windows virtual servers on AWS EC2 using Chef Server. Configured, monitored and distributed multi-platform servers using Chef.
  • Worked on Ansible binary and playbook version control, established procedure for GitHub repository on how to checkout, change test, merge ansible code.
  • Experienced in containerizing the application utilizing Docker and ECS and managing containers in production.
  • Experience with container-based deployments using docker and also working with docker images, docker hub and docker registries.
  • Used Jenkins pipeline to drive all microservices builds out to the Docker registry and then deployed to Kubernetes, Created Pods and managed using Kubernetes.
  • Created automated image builds using packer, Jenkins and Kubernetes.
  • Wrote terraform template to deploy Kubernetes cluster on AWS and managed kubernetes administration knowledge on designing the application on AWS taking advantage of Fault tolerance and Disaster recovery.
  • Hands on experience using AWS services like CloudFormation, Auto scaling, ECS, lambda, EC2, EBS, IAM, S3, ELB, RDS, VPC, Route 53, Cloud Watch, etc.
  • In-depth knowledge on AWS infrastructure, logical workflow of how routing works in VPC, how route tables should be associated with subnets, difference between public and private subnets, AWS best practices.
  • Created AWS CloudFormation templates (CFT) to create custom-sized VPC, subnets, EC2 instances, ELB, security groups. Worked on tagging standard for proper identification and ownership of EC2 instances and other AWS services like Cloud Watch, RDS, S3, Route53, EBS.
  • Experienced in writing lambda function to monitor application or a particular service status and sent alerts to slack using slack web hooks.
  • Experience with setting up HA proxy solution for application monitoring using Grafana, Auto scaling, Elastic container service and Application load balancer.
  • Created RDS Aurora DB with cross region replication using CloudFormation template and used that database as session manager to implement session service for Grafana nodes using ephemeral ports.
  • Wrote lambda function using python and automated consul-backup solution for all the key values and ACLs from Consul cluster to AWS S3.
  • Experienced in using monitoring tools like Grafana, Kibana and Splunk.
  • Experience in Log Insight and Cloud watch services to diagnose problems and configuring alerts.
  • Setting up Auto scaling of the instance group using AWS command line tools and AWS cloud environment for Dev/QA
  • Troubleshoot and resolve Splunk performance, Log Monitoring issues, Dashboard creation, Data models.
  • Installed, tested and deployed monitoring solutions with Splunk services.
  • Designed and developed distributed private cloud system solution using kubernetes on CoreOS
  • Expertise in working with different Bug Tracking Tools like JIRA, Clear Quest and Quality center.
  • Experience is using Tomcat, JBOSS, WebLogic and WebSphere Application servers for deployments.
  • Coordinated with the Offshore and Onshore teams for Production Releases.
  • Performed numerous server migrations on both Linux/Unix and Windows servers. Migrations include moving all clients and their data, configuration settings, testing and verifying everything is correct with zero downtime.
  • Knowledge about Microsoft Azure in creating cloud environment and managing the application servers across Microsoft managed global data cent.
  • Excellent Analytical and Communication skills, Quick Learner, Versatile Team player, Organized, Self-Motivated, have ability to pick up new technologies and solve complex technical problems.
  • Experience in creating custom tools to stop AWS environments whenever not being used based on the schedule and start on demand whenever needed.
  • Wrote lambda function that generates AWS Cost report based on Platform, Project tags (Taxonomy) and created a static website hosted using python flask application.
  • Worked on Access Control for cross functional teams utilizing IAM policies, roles and groups.
  • Infrastructure as Code CloudFormation, Terraform

TECHNICAL SKILLS

Configuration Management Tools: Chef, Vagrant, Ansible.

Continuous Integration: Jenkins

Build Tools: Ant, Maven

Project management: Jira, Confluence.

Version Control: Git, CVS, Subversion.

Cloud Platforms: AWS, Oracle

Containerization: Docker, ECS, Kubernetes

Virtualization: Platform VMware and Oracle VirtualBox

Ticketing Tools: JIRA, ServiceNow, Rally

Monitoring Tools: Grafana, Splunk, Cloud Watch, Nagios, Pager, Ocean, Kibana, Telegraph, Chronograph

Operating Systems: Linux (Red Hat 4/5/6,7), Ubuntu, CentOS, Windows Variants and Unix.

Database: RDS, Oracle, DynamoDB

Programming Languages: Python, Java

Scripting: Python, Bash, Ruby, Groovy

Web/App Servers: Nginx, Tomcat, Apache 2.x, 3.x, JBoss

Testing Tool: Selenium

PROFESSIONAL EXPERIENCE

Confidential, CA

AWS/SRE DevOps Engineer

Responsibilities:

  • Designed, built, and deployed a multitude application utilizing almost all of the AWS stack (Including Cloud formation, Service catalog, EC2, R53, S3, RDS, SNS, SQS, IAM, EIP, EBS, ECS, LAMBDA, VPC), focusing on high-availability, fault tolerance, and auto-scaling with Cloud watch monitoring.
  • Performed AWS cloud deployments for web applications utilizing cloud formation as IAC and orchestrating the service catalog products to standardize the deployment process.
  • Worked on creating custom tools using bash and python to automate the CI/CD pipelines.
  • Created a tool using python and boto3 that can help start/stop the AWS non prod environments which in turn can help reduce the AWS cost expenditure.
  • Expertise in right sizing of the infrastructure and keeping the bill of the cloud Infra optimized.
  • Hands on experience to design/build/setup an application’s database, backend apps, deployment strategy, release management, configuration management from scratch on AWS cloud.
  • Wrote custom python boto3 scripts to check the limits of AWS services using Trusted Advisor Service and alert DevOps team using SNS Topic when the service usage reached to warning (50%) and critical (80%) to increase the service limit capacity.
  • Wrote cloud custodian policies to auto tag all AWS resources key as creator name value as IAM user who created the resource, sends them email if they didn't follow security best practices. For Instance, security group ingress has 0.0.0.0/0 custodian immediately sends email to creator and auto deletes the rule.
  • Designed & implemented VPC service for extension of customer's on-premises data center with AWS Cloud using
  • AWS VPC and VPN & Direct connect services.
  • Used CloudFront to deliver content from AWS edge locations to users, allowing for further reduction of load on front-end servers.
  • Written helm templates and packaged, deployed helm charts as part of CI/CD process into the PKS Clusters.
  • Parameterize helm templates using helm functions and conditional statements to deploy one helm chart to dev, qa, uat and prod clusters.
  • Written Kubernetes yaml manifests to deploy scale, load balance, scale and manage docker containers with multiple namespaces for PKS Clusters.
  • Build application using Jenkins’s server and troubleshoot build and release job failures, resolve, work with engineering team on resolution

Environment: AWS, Cloud Formation, Service Catalog, Jenkins, Ansible, IAC, CI/CD, Docker, ECS, Kubernetes, Helm

Confidential, OH

DevOps AWS Engineer

Responsibilities:

  • Extensively used AppDynamics to monitor and trouble shoot server metrics, Business transactions and application response time.
  • Written Splunk quires to monitor latency, errors, HTTP Codes to troubleshoot application functionality.
  • Created/maintained continuous integration and continuous delivery workflows using tools Jenkins, Cloud formation, bitbucket, Maven, Artifactory over different environments to facilitate an agile development process.
  • Migrated Jenkins server jobs and all the configuration used by cross functional teams to Jenkins Enterprise.
  • Worked on a platform to ensure cost savings using Native AWS resources and best practices.
  • Designed and implemented a project that requires an application to run on-demand to migrate customer data using container capabilities.
  • Lift and shift of legacy data center application to AWS cloud.
  • Setup K8S Cluster, Continuous docker image build using Jenkins, Continuous pod deployment - configuration management - monitoring.
  • Created custom AWS Terraform modules (Ec2-ASG, CLB, ALB, NLB, EFS, RDS).
  • Automated application deployment related scripts to configure windows servers.
  • Designing technical architecture and develop cost-effective solutions using various configuration management tools to make infrastructure compliant with the company cloud policies along with ensuring high availability.
  • Interact with developers and enterprise configuration management team for changes to best practices and tools to eliminate non-efficient practices and bottlenecks.
  • Worked with AWS EC2/VPC/S3/SQS/SNS based infrastructure automation through cloud formation, Jenkins, python, bash Scripts.
  • Automated Node.js app deployment using Elastic Beanstalk to different environments like dev, qa, preprod and prod
  • Created and maintained continuous integration (CI) using tools Jenkins/Maven over different environments to facilitate an agile development process which is automated and repeatable enabling teams to safely deploy code many times a day while ensuring operational best practices are supported.
  • Responsible for orchestrating CI/CD processes by responding to Git triggers, human input, and dependency chains and environment setup.
  • Integration of automated Build with Deployment Pipeline. Currently installed Chef Server and clients to pick up the Build from Jenkins repository and deploy in target environments (Integration, QA, and Production).
  • Worked on Cloud automation using AWS Cloud Formation templates.
  • Configured AWS Identity and Access Management (IAM) users and groups for improved login authentication.
  • Experience with Auto scaling, Self-healing mechanism, Chaos Engineering, Performance optimization techniques.
  • Monitoring & alerting setup experience with AWS, New Relic, Data Dog, PagerDuty, Splunk, Prometheus, Grafana, Kibana, ELK etc.
  • Proven ability to participate with other functional teams in systems integration and design including writing operational specifications, test plans and requirements management with attention to detail.
  • UNIX/LINUX and Windows server expertise in system installation, configuration, administration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures.
  • Web Application (NodeJS, Apache, IIS, Nginx) expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures.

Environment: AWS, Terraform, Docker, Kubernetes, Helm, Jenkins, CI/CD, Maven, App Dynamics, Cloud Formation, VPC, Chef, ECS, EKS

Confidential, Austin, TX

DevOps Engineer

Responsibilities:

  • Created and maintained the Microservices using YAML scripting, to implement Infrastructure as a Code (IaC), over the private cloud virtual machines
  • Setup a cluster using BareMetal, OCI, AWS.
  • Experience on Working with creating docker compose files, build, docker images, docker registry and handling multiple images primarily for middleware installations and domain configurations.
  • Expertise in writing Ansible playbooks, helm charts, templates and yaml scripting.
  • Expertise in writing bash and python scripting.
  • Having experience in creating Jenkins Pipeline workflow using plug-in to integrate SCM Tools, Code Has Experience in Compilation and Building packages using Maven build tool.
  • Having experience in build tools Such as Maven to generate project artifacts and troubleshooting in build issues.
  • Experience in working on DevOps /Agile operations process and tools area (Code review, unit, test automation, Build & Release automation, Environment, Service, Incident and Change.
  • Support the code builds by integrating with continuous integration tool (Jenkins) for automation of daily process.
  • Written Maven Scripts, installed Jenkins, written shell script for end-to-end build and deployment automation.
  • Branching, Tagging, Release Activities on Version Control Tools GIT.
  • Responsible for user management, plug-in management, Build/ Deploy setup and end-end job setup of all projects in Jenkins.
  • Installation / Configuration of Nexus Repository Manager.
  • Bug tracking using JIRA tool.
  • Good experience on monitoring, alerting, logging frameworks such as Prometheus, Elasticsearch, Grafana.
  • Worked on migration of the legacy application to new OCI/AWS cloud-based infrastructure, and implemented it till production.
  • Worked on the creation of a Blue-Green deployment model for the investment portfolio env.
  • Worked on fixing application vulnerabilities identified by the Security team, worked on certificate renewals.
  • Worked on automating the deployment of the legacy application using the in-house deployment tool and shell scripts.
  • Successfully setup Splunk monitoring tool with Clustered environments and used extensively to collect the logs, search, analyze, and create visualizations.
  • Helped development, QA, and other teams to troubleshoot, identify and fix various issues identified while working with our investment banking application environments.
  • Used Terraform IaC for the creation of system accounts, virtual machines, various other resources and holds good experience and certification in Terraform.
  • To setup Nexus using Ansible playbook, worked on CI/CD pipeline setup to integrate multiple DevOps tools together, wherein nexus setup using Ansible was one of the tool setup and integration process.
  • Possesses broad experience with automation, deployment tools along with a wide range of infrastructure suite of components like Internal Load Balancer, Artifact’s storage tool, Confluence, Splunk, JIRA, GitLab, Deployment tool, Subversion, AppDynamics …etc.
  • Worked on various other initiatives at organization level like, interview process automation… etc.

Environment: Oracle Cloud, Terraform, Jenkins, CI/CD, Bitbucket, VCN, ELK, EFK, Oracle IAM, EC2, Grafana, Kibana, Docker, Load Balancer

Confidential

System Administrator

Responsibilities:

  • Coordinated with Release Management regarding appropriate system releases among other development platforms.
  • Installing and configuring Oracle Identity and access Manager for provisioning and reconciliation with other resources in the enterprise.
  • Working on Web gate installations and Configurations.
  • Have good experience on deploying of java applications in WebLogic and troubleshooting the deployment and runtime failures.
  • Hands on Traffic Flopovers from Primary to DR during Outage Scenarios.
  • Designed and configured reconciliation of various resources with active sync.
  • Designed and configured role-based access control in Identity Manager.
  • Updated and automated Release reports for Change Management.
  • Developed a continuous deployment pipeline using Jenkins, shell scripts.
  • Deployed and developed J2EE applications in WebLogic.
  • Troubleshoot, optimize performance and health check the solutions.
  • Wrote scripts to automate administration tasks using shell scripts.
  • Used Shell scripts to replicate production build environments on a local dev box using Vagrant and Virtual Box.
  • Configured Splunk search head cluster and indexer cluster to maintain high availability and fault tolerance
  • Supported Clients Environment, and systems troubleshooting that involving RHEL and SUSE servers in a large-scale environment.
  • Performing software installations and upgrades to Red Hat Linux 4.x, 5.x, 6.x and maintaining them in accordance with established policies, procedures and service level agreements.
  • Creating and maintaining User accounts, profiles, security, rights, disk space and process monitoring.
  • Performance Management & Tuning of RHEL.
  • Coordinated with security team to deploy of patches and packages to RHEL and Solaris servers.
  • Hands on experience in installing and maintaining of Splunk on servers.
  • Monitoring system resources, logs, disk usage, scheduling backups and restore.
  • Configuring SUDO and granting root permission to users for performing certain activities.
  • File system tuning, growing, and shrinking with Veritas File system 3.5/4. x.
  • Modifying user's password policies, group policy, UID and GID assignment policies.
  • Monitored the performance of the system using top, sar, ps, prstat, vmstat, netstat, iostat, and cpuinfo to check the CPU utilization, bottlenecks of I/O devices, memory usage and network traffic.
  • Troubleshoot network connectivity used ping, netstat, ifconfig, and trace route commands. Login to the remote system using sudo, putty, and telnet. Transfer files across the systems on the network. Linux technical support and prepared technical documentation for check in verification.
  • Installed, configured and supported Apache, JBoss, Oracle 10g, and Clear case. Configuration of virtual hosts, plugins and SSL certificates on web servers and domains and deployments on application servers.

Environment: RHEL, SUSE Linux, Apache, JBoss, VMware, Java, Red Hat Linux4,5, x6 VMware ESX 3.5/4.1, Windows Server 2008 R2, sun Solaris 8,9,10, Solaris Volume Manager, WebLogic, Oracle 9i/10g.

We'd love your feedback!