We provide IT Staff Augmentation Services!

Site Reliability Engineer (sre) Resume

5.00/5 (Submit Your Rating)

Jersey City New, JerseY

SUMMARY:

  • A proactive, result oriented IT Professional with 6+ years of experience as a SCM, DevOps Engineer in solving complex
  • Problems with creative solutions, supporting development, Deployment operations in different environments. Experienced in all phases of Software Development Life Cycle (SDLC), Quality Assurance Life Cycle (QALC), Linux Administration, Software
  • Configuration Management (SCM), Continuous Integration (CI), Continuous Deployment (CD), Release Management, and experienced in Automating, Configuring and deploying instances on Cloud Computing Platforms like AWS, Microsoft Azure, GoogleCloud
  • Experience and expertise in GCP environment in particular Google Big Query, Google Pub/sub, Google Spanner, Dataflow Compute Engine, Google Storage.
  • Experience in providing highly available and fault tolerant applications utilizing orchestration technologies like Kubernetes and Apache Mesos on Google Cloud Platform.
  • Experience in designing a Terraform and deploying it in cloud deployment manager to spin up resources like cloud virtual networks, Compute Engines in public and private subnets along with AutoScaler in Google Cloud Platform.
  • Architected real time google Cloud Dataflow batch/streaming and analytics solutions for BIG ETL. Served as the single point f accountability for client satisfaction across multiple cloud infrastructure projects.
  • Experience in Google Cloud components, Google container builders and GCP client libraries and cloud SDK's
  • Used Google Cloud Platform (GCP) services like Compute Engine, Cloud Functions, Cloud DNS, Cloud Storage and SaaS PaaS and IaaS concepts of Cloud computing architecture and implementation using GCP.
  • Experienced in provisioning of IaaS, PaaS, SaaS virtual machines and web/worker roles on Microsoft Azure classic and Azure

PROFESSIONAL EXPERIENCE:

Confidential, Jersey city, New Jersey

Site Reliability Engineer (SRE)

Responsibilities:

  • Created Azure ExpressRoute to establish connection from Azure to On - premise datacenter. Working knowledge on AzureFabric, Micro services, Lot & Docker containers in Azure. Created Azure cloud services, Azure storage, Azure active directory, Azure service bus. Create and manage
  • Azure AD tenantsand configure application integration with Azure AD. Integrate on-premises Windows AD tenants and Configure applicationintegration with Azure AD. Configured continuous integration from Source control, setting up build definition within Visual Studio Team Services (VSTS)and configure continuous delivery to automate the deployment of ASP.NET MVC applications to Azure web apps. Maintain storing s and secrets for Azure APIM and Azure Application Gateway and involved in combining allRelease pipelines into Single Release pipeline. We used AZURE DevOps, VSTS & PCF. Managing keys by creating the keys and attaching them to library & Variable Groups with the help of Key Vault. Created premise applications on cloud platform Azure in dealing with Azure IaaS - Virtual Networks, Virtual Machines, CloudServices, Resource Groups, Express Route, Traffic Manager, VPN, Load
  • Balancing, Application Gateways, Auto-scaling. Implemented Azure SQl for handling most of the database management functions such as upgrading, patching, backups,and monitoring without user involvement. Implemented ETL (Extract, transform, and load) process to import all the data, clean it in place, and then store it in arelational data engine. Created GIT repository for storing Terraform files and maintaining versioning. Converted existing Terraform modules thathad version conflicts to utilize cloud formation during Terraform deployments to enable more control or missing capabilities. Converted existing Terraform modules that had version conflicts to utilize cloud formation during Terraform deployments to enable more control or missing capabilities. Utilized Docker for running different programs on single VM, Docker images includes setting the entry point and volumes,also ran Docker containers and worked on installing
  • Docker and creation of Docker container images, tagging and pushingthe images. Evaluated Kubernetes for Docker container orchestration. Managed Kubernetes charts using Helm and created reproduciblebuilds of the Kubernetes applications, templatize Kubernetes manifests, provide a set of configuration parameters to customize the deployment and Managed releases of Helm packages. Implemented Ansible Playbooks with Python, SSH to Manage Configurations of Open Stack Nodes and Test Playbooks onAWS instances using Python. Working with Ansible Tower to manage Web Applications, Config Files, Data Base, Commands, User Mount Points, Packagesand for running playbooks stream in real-time and amazed to see the status of every running job without any further reloads. Working with

Confidential, Dublin, OHIO

AWS DevOps Engineer

Responsibilities:

  • Responsible for creating tagging standards for proper identification and ownership of EC2 instances and other AWS resources. Configured and designed EC2 instances in all the environments to meet high availability and complete security. Setting up the Cloud Watch alerts for EC2 instances and using in Auto scaling launch configuration. Used IAM to create new accounts, roles, and groups. Extensively automated the deployments using AWS by creating IAM sand integrated the Jenkins with AWS plugins to pipeline the code. Designed and developed AWS Cloud Formation templates to create custom VPC, Subnets,
  • NAT to ensure deployment of webapplications. Created Multiple AWS instances, set the security groups, Elastic Load Balancer and AMIs, Auto scaling to design costeffective, fault tolerant and highly available systems. Setup Terraform to create stacks in AWS from the scratch and updated the terraform as per the organizations requirementon a regular basis. Created templates for AWS infrastructure as a code using Terraform to build staging and production environments. Setup Kubernetes to manage containerized applications using its nodes, Config Maps, Selector, Services, and deployedapplication containers as Pods.
  • Building/Maintaining Docker container clusters managed by Kubernetes Linux, Bash, GIT, Docker, on GCP (Google Cloud Platform). Utilized Kubernetes and Docker for the runtime environment of the CI/CD system to build, test deploy. Involved in development of test environment on Docker containers and configuring the Docker containers using Kubernetes. Involved in Upgrade of Jenkins & Artifactory Server by scheduling backups in S3. Managed the Code Repository by maintaining code in GIT, improve practices of branching and code merge to custom needsof development team. Responsible for creating and maintaining automated builds for projects written in java, PHP using Jenkins. Designed and Implemented CI (Continuous Integration) system, configuring Jenkins servers, Jenkins nodes, creatingrequired scripts (Perl, Python) Wrote Ansible Playbooks with Python SSH as the Wrapper to Manage Configurations of Open Stack Nodes and
  • TestPlaybooks on AWS instances using Python. Involved in Docker container snapshots, attaching to a running container, removing images, managing Directory structures,and managing containers. Worked on PaaS service like OpenShift provided by the RedHat and Streamlined installation of OpenShift on partner cloudinfrastructure such as AWS. Involved in creating different applications like Kafka, Zookeeper, and Solar and used to expose any route to the externaltraffic around OpenShift. Installed and configured Splunk to monitor applications deployed on application server, by analyzing the application and server log files. Worked on setup of various dashboards, reports, and alerts in Splunk. Adapted to use Nagi

Confidential

Sr. Cloud and DevOps Engineer

Responsibilities:

  • Skilled in Infrastructure Development and Operations involving AWS Cloud platforms, EC2, EBS, S3, VPC, RDS, SES, ELB, Autoscaling, Cloud Front, Cloud Formation, Elastic Cache, Cloud Watch, SNS. Involved in designing and deploying multitude applications utilizing almost all the AWS stack (Including EC2, Route53, S3, RDS, Dynamo DB, SNS, SQS, IAM) focusing on high - availability, fault tolerance, and auto-scaling in AWS Cloud Formation. Involved in Planning, deploying, monitoring, and maintaining AWS cloud infrastructure consisting of multiple EC2 nodes and VMware Vm's as required in the environment.
  • Responsible for AWS platform and its dimensions of scalability including VPC, EC2, ELB, S3, and EBS, Route53. Involved in cloud automation using AWS cloud Formation Templates, Chef. Good knowledge in architecting and deploying of fault tolerant, cost effective, highly available and secure servers in AWS. Utilized Elastic Load Balancers with EC2 auto scaling groups Used Identify and Access Management (IAM) to assign roles and to create and manage AWS users and groups, and usepermissions to AWS resources. Expertise in Appdynamics Controller administrative activities like user management, application management, monitoringcontroller performance etc. Good understanding of Open shift platform in managing Docker containers and Kubernetes Clusters Configured Apache Web server in the Linux AWS cloud environment using CHEF automation. Exposure in Elastic Cloud Computing (EC2) instances utilizing auto scaling
  • Elastic Load Balancing, and Glacier forour QA and UAT environments as well as infrastructure servers for GIT and CHEF. Good skills in Install, configuration, and operation of Red Hat Open shift. Involved in creating Snapshots and Amazon Machine Images (AMI's) of EC2 Instance for snapshots and creating clone'sinstances. Utilized Configuration Management tool CHEF and created Chef Cookbooks using recipes to automate system operations. Involved in setting up the Chef repo, Chef Work stations and Chef Nodes and also involved in chef-infra maintenance includingbackup/security fix on Chef Server Use build tools to aggregate projects using Apache, Ant, Maven, Groovy tools, and Gradle Involved in Creating test branches from master branch of each repositories on GIT to perform testing of Gradle upgrade to LSRand then assisted DEV teams to do the same successfully. Involved in Pipelined Application Logs from App Servers to Elastic Search (ELK Stack) through Log stash
  • Worked on Built new headless framework for system agent and different agent plug-in. Used Gradle and Jenkins to triggerbuild process. Involved in Coordinate/assist developers with establishing and applying appropriate branching, labeling/naming conventionsusing Subversion (SVN) and GIT source control. Worked on integrating application logs with Splunk and wro

Confidential

Build & Release Engineer

Responsibilities:

  • Launched AWS EC2 Cloud Instances using Amazon Images (Linux/ Ubuntu) and configuring launched instances with respect to specific applications. Performed S3 buckets creation, and policies on the IAM role - based policies and customizing the JSON template. Created Virtual Private Cloud (VPC) and brought instances under them based on the requirement and also created Publicand private subnets in the VPC and attaching them to the EC2 instances based on the requirement. Implemented a 'server less' architecture using API Gateway, Lambda, and Dynamo DB and deployed AWS Lambda codefrom
  • Amazon S3 buckets. Created a Lambda Deployment function and configured it to receive events from your S3 bucket. Involved in designing and deploying multiple applications utilizing almost all the AWS stack (Including EC2, Route53, S3, RDS, DynamoDB, SNS, SQS, IAM) focusing on high-availability, fault tolerance, and auto- scaling in AWS Cloud Formation. Implemented AWS Code Pipeline and Created Cloud formation and JSON templates in Terraform for infrastructure as code. Involved in using Terraform to migrate legacy and monolithic systems to Amazon Web Services and provisioned the universally available EC2 Instances using Terraform and cloud formation and wrote new plugins to support newfunctionality in Terraform and adapted to Terraform for deploying infrastructure in AWS as per the requirement. Created script to build and push docker images in Docker Hub and maintained and supported
  • Docker containers running onLinux machines and created docker-compose.yaml file templates to deploy images in docker containers managed byDocker Swarm. Created Jenkins pipelines to drive all micro services builds out to the Docker registry and then deployed to Kubernetes,created Pods and managed using Kubernetes. Involved in development of the test environment on Docker containers and configuring the Docker containers usingKubernetes and managed local deployments in Kubernetes, creating local cluster and deploying application containers. Configured Ansible to manage all existing servers and automate the build/configuration of new servers and deployedmicroservices, including provisioning AWS environments using Ansible Playbooks. Involved in writing various custom Ansible playbooks for deployment orchestration and developed Ansible Playbooks to simplify and automate day-to-day server administration tasks. Involved in setting up Jenkins Master and multiple slaves for the entire team as a CI tool as part of the Continuousdevelopment and deployment process.
  • Created Jobs for Builds and Deployments, installed several plug-ins in Jenkins to support multiple tools required for the implementation of projects. Implemented Jenkins Code Deploy plugin to deploy to AWS and used to automate the build process and deploy the application to Tomcat server. Set up CI/CD pipelines for Microservices and int

Confidential

VMware and Linux Administrator

Responsibilities:

  • Installation and Configuration of Redhat, SLES 9, 10 servers. Installation, configuration and management of Apache and Tomcat servers, maintenance of local and Network based Printers. Applying patches to fix the holes that are found during the quarterly scan basis using Nessus scan. Upgraded and maintained servers, operating

    Systems and patches. Install/configure/maintain the Linux servers, NIS, DNS, NFS, Mailing List, Send mail, apache, ftp, sshd host firewall IP Tablesfor Redhat Linux and Centos. Troubleshoot various systems problems such as application related issues, network related issues, hardware related issues etc. Applied Shell scripting (ksh, bash) to automate system administration jobs, and automated tasks using bash, cron shell scripts. Involved in installing subversion version control and creating and administering repositories. Involved in taking the weekly backups of the repositories and managing the repositories. Did the user management for the Linux based servers and also installing different applications on the different environments.

    Designed and configured ESXi 4.x infrastructure environment for huge Data Center migration. Physical to virtual (VMware) Migrations for over 180 servers using VMware Converter Standalone. Managed and maintained virtual computing environment based on VMware and administered virtual windows and Linuxoperation systems. Create templates and deploying Virtual Machines through templates, cloning Virtual Machines and managing Virtual Centerpermissions.

    Creating and Managing VMware cluster with HA and DRS, resource pools for Virtual Machines. Perform Life Cycle Management for ESXi hosts. Worked on capacity planning and management of Virtual machines in VMwareenvironment. Managing and maintaining Linux running servers in same environment Rebuilt OS and automate the processes Responsible for troubleshooting the issues on the servers and provide a solution in a timely fashion. Install / Configure VIO servers as well as configuring virtual Ethernet shared Ethernet (SEA), virtual SCSI and NPIV on IBM Power6 and 7 systems. Administration of Red Hat Linux DNS, Web Server and built software packages on Red Hat Linux (RPM) Installed, upgraded and configured SUN Solaris 9/10 on Sun Servers.

    Troubleshooting issues related to NFS, SSH, NIS, DNS, FTP, VMWARE, NETBACKUP, VERITAS (VCS), ZONES, LVM, RAID, FileSystem, Permissions, Performance Monitoring, IP Bonding, Multipathing, NAS, SAN, Storage, V - Center, Opsware, PowerBroker, etc. Rectifying hardware failure and coordinating with vendors like Symantec, Oracle, Dell, and HP to get them repaired. Involve multiple teams sometime for providing a quick resolution to a high priority ticket.

We'd love your feedback!