We provide IT Staff Augmentation Services!

Site Reliablity Engineer Resume

0/5 (Submit Your Rating)

Plano, TX

SUMMARY

  • Overall, 8 years of IT experience in DevOps, Release Engineering, CI/CD, Security, build/release, Infrastructure Automation, Cloud technologies etc.
  • Worked in various methodologies likeAgile,Scrum, Kanban,Waterfalland RUP and following practices like planning sessions, retrospectives, daily standups, pair programming, code reviews, TDD, ATDD.
  • Hands on experience in Amazon Web Services AWS provisioning and good knowledge ofAWS services like EC2, S3, Glacier, AWS Lambda, ELB (Load Balancers), IAM (Identity Access and Management), Cloud Watch, RDS, SNS, SQS, SWF and EBS and Dynamo DB.
  • IntegratedJenkinswithDockercontainer using Cloud beesDockerpipeline plugin and provisioned the EC2 instance using Amazon EC2 plugin.
  • Experience in writingInfrastructure as a code (IaC) inTerraform, Azure resource management, AWS Cloud formation.Created reusableTerraform modulesin both Azure and AWS cloud environments.
  • Expertise in Architecting and ImplementingAzure ServiceOffering, such asAzure cloud services, Azure storage, IIS, Azure Active Directory (AD), Azure Resource Manager (ARM), Azure Storage, Azure, Blob Storage, Azure VMs, SQL Database, Azure Functions, Azure Service Fabric, Azure Monitor, and Azure Service Bus.
  • Working Experience onAzure Cloud Services, Azure Storage, SQL Azureand in different PaaS Solutions with Web, and worker Roles and Azure Web Apps.
  • Implementing & maintaining branching and build/release strategies utilizing GIT, Bit - bucket, Azure repos. Experience in deploying resources using Azure Resource Management (ARM) templates.
  • Create clusters inGoogle Cloudand manage the clusters usingKubernetes. Using Jenkins to deploy code to Google Cloud, create new namespaces, creating docker images and pushing them tocontainer registryofGoogle Cloud.
  • Build and configure a virtual data center in theGoogle cloud platformto support Enterprise Data Warehouse hosting includingVirtual Private Cloud (VPC), Public and Private Subnets, Security Groups, Route Tables,Google Cloud Load Balancing.
  • Advising developers on code optimizations and best practices for AWS, Google Cloud/Elastic Search and Extensively used Google stack driver for monitoring the logs of bothGKEandGCPinstances and configured alerts from Stack driver for some scenarios.
  • Used Jenkins CI/CD pipelines to drive all micro services builds out to the Docker registry and then deployed to Kubernetes, Created Pods and managed using Kubernetes.
  • Implemented a CI/CD pipeline usingAzure DevOps (VSTS, TFS)in both cloud and on-premises withGIT, MS Build, Docker, Mavenalong withJenkinsplugins.
  • Experience in Kubernetes to deploy scale, load balance and manage Docker containers with multiple names spaced versions and good understanding of Open Shift Platform in managing Docker Containers and Kubernetes Clusters.
  • Good experience in writing Helm Charts, Kubernetes YAML files for deployment of microservices into Kubernetes clusters.
  • Expertise withDockerimages using aDockerfile, worked on container snapshots, removing Images, and managingDockervolumes. Orchestration ofDockerimages and Containers usingKubernetesby creating master and node.
  • Responsible for administrating and maintaining Jenkins and Jenkins slaves on Windows and Linux (Debian/Ubuntu). Created many Jenkins slaves and set up jobs on master to run slaves.
  • WroteAnsible PlaybookswithPython, SSHas the Wrapper to Manage Configurations ofAWS Nodesand Test Playbooks on AWS instances usingPython, run Ansible Scripts to provision development servers.
  • Written Chef Cookbooks and recipes to Provision several pre-prod environments consisting of Cassandra DB installations, WebSphere installation and creation of profiles.
  • Experience in setting upNexus Repository Managerfrom scratch and configuring different repositories for bothANT/Mavenbuilds.
  • Expert in working on version control systems like Subversion and GIT and used Source code management client tools like Git Bash, GitHub, Bit bucket, Git GUI, and other command line applications etc.
  • Good experience ofUNIXshell scripts and PowerShell scripts for software automation and involved in writingPython, Perlandshellscripts for compilation and deployment process.
  • Good experience in using Elasticsearch, Kibana and CloudWatch, Nagios, Splunk, Prometheus and Grafana for logging and monitoring.
  • ImplementedJIRAfor ticketing and as well as defect tracking system and configure various workflows, customizations and plug-ins forJIRAbug/issue tracker, integratedJenkinswithJIRA,GitHub, Maven.
  • Developed and maintained mostly Python and some Perl ETL scripts to scrape data from external web sites and load cleansed data into a MySQL DB.

TECHNICAL SKILLS

Cloud Platforms: AWS, Azure, GCP, Snowflake

Automation Tools: Ansible, Terraform, CloudFormation, Chef, Puppet

Continuous Integration (CI): Jenkins, Bamboo and Gitlab

Containers: Docker, Kubernetes

Build Tools: ANT, Maven, Gradle

Repositories: Nexus, Artifactory

Version Control Tools: GIT, Bit Bucket

Infrastructure as Code: Terraform, AWS Cloud formation, OpenStack

Configuration Management: Ansible, Chef, Puppet

Languages: Python, SQL, YAML

Web Servers: Apache Tomcat, Nginx, WebLogic, Web Sphere, JBoss

Database Systems: Oracle DB, SQL, PL/SQL, MySQL, AWS RDS, Dynamo DB, Redis, Cassandra DB, Mango DB

Scripting Languages: Perl, Python, Bash/Shell Scripting

Storage: NetApp, EMC, Pure storage, VMware, Gluster, Ceph

Scripting Languages: Python, Perl, Shell Scripting

Networking Protocols: TCP/IP, DNS, DHCP, Cisco Routers/Switches, WAN, LAN, FTP/TFTP, SMTP.

Monitoring Tools: Apica, Datadog, Splunk, Stack drive, CloudWatch, AppDynamics, New Relic, Grafana, Dynatrace.

PROFESSIONAL EXPERIENCE

Confidential, Plano, TX

Site Reliablity Engineer

Responsibilities:

  • Worked on Azure Site Recovery and Azure Backup- Deployed Instances on Azure environments and in Data centers and migrating to Azure usingAzure Site Recoveryand collecting data from all Azure Resources using Log Analytics and analyzed the data to resolve issues.
  • Configured Azure Multi-Factor Authentication (MFA) as a part of Azure AD Premium to securely authenticate users and worked on creating custom Azure templates for quick deployments and advancedPowerShellscripting. Deployed Azure SQL DB with GEO Replication, Azure SQL DB Sync to standby database in another region & fail over configuration.
  • Worked onServerless services, created and configuredHTTPTriggers in theAzure Functionswith application insights formonitoringandperformingload testing on the applications using theVisual Studio Team Services(VSTS)alsocalled asAzure DevOps Services.
  • Created Azure Automation Assets, Graphical runbook, PowerShell runbook that will automate specific tasks, deployed Azure-AD Connect, configuring Active-Directory Federation Service (ADFS) authentication flow, ADFS installation using Azure AD Connect, and involved in administrative tasks that include Build, Design, Deploy of Azure environment.
  • Implemented a CI/CD pipeline, Jenkins (TFS Plugin installed), Team Foundation Server (TFS), GitHub and Azure Container service, whenever a new TFS/GitHub branch gets started, Jenkins, our Continuous Integration (CI) server, automatically attempts to build a new Docker container from it.
  • Implemented a CI/CD pipeline usingAzure DevOps (VSTS, TFS)in both cloud and on-premises withGIT, MS Build, Docker, Mavenalong withJenkinsplugins.
  • Managed Azure Infrastructure Azure Web Roles, Worker Roles, VM Role, Azure SQL, Azure Storage, Azure AD Licenses, Virtual Machine Backup and Recover from a Recovery Services Vault using Azure PowerShell and Azure Portal.
  • Architect & implement medium to large scale BI solutions on Azure using Azure Data Platform services (Azure Data Lake, Data Factory, Data Lake Analytics, Stream Analytics, Azure SQL DW, HDInsight/Databricks, NoSQL DB)
  • Deployed the Azure VM’s, Azure Storage (Blob, Files) using ARM Templates and PowerShell scripts.
  • Experience in designing, configuring, and deploying solutions on Microsoft Azure usingARMTemplates, Azure PowerShell Module and, Azure CLI focusing on high - availability and auto-scaling.
  • Working Experience onAzure Databrickscloud to organizing the data into notebooks and making it easy to visualize data using dashboards.
  • Deploying windowsKubernetes (K8s)cluster withAzure Container Service (ACS)fromAzure CLIand UtilizedKubernetesandDockerfor the runtime environment of theCI/CDsystem tobuild,testandOctopus Deploy.
  • UsingAnsiblecreated multipleplaybooksfor machine creations andSQL server, cluster serverand my SQL installations.
  • UsedAnsibleto Setup/teardown ofELKstack (Elasticsearch, Log stash, Kibana) and troubleshoot the build issues withELKand work towards the solution.
  • DevelopedAnsibleplaybooks forSplunkin cloud environments with auto scaling for task force initiatives requiring big data analysis.
  • CreatedDockerslave Nodes using custom Docker images and pulled them toACRand worked on all major components of Docker such asDocker Daemon,Docker Hub,Docker Images,Dockerregistry.
  • Designed, wrote, and maintained systems in Python scripting for administering GIT, by using Jenkins as a full cycle continuous delivery tool involving package creation, distribution, and deployment onto Tomcat application servers via shell scripts embedded into Jenkins jobs.
  • MaintainedArtifactsin binaryrepositoriesusingJ-FrogArtifactory and pushednew Artifactsby configuring theJenkinsprojectJenkins Artifactory plugin.
  • Involved inJiraas defecttrackingsystem and configure various workflows, customizations, andpluginsforJira bug/issue trackerintegratedJenkinswithJira,GitHub.
  • Creating and Configuring Azure Redis cache, configuring cache clients using ARM, Azure PowerShell and adding and removing objects from the cache.
  • Built andmanageda highly availablemonitoring infrastructureto monitor differentapplication servers, and its components usingNagios.
  • Developed Micro services tools using Python, Shell scripting, XML to automate some of the menial tasks.

Environment: s: Jenkins, Docker, Kubernetes, Helm, Kubeapps, Kubelens, Terr grunt, Git lab, AWS, Azure, GCP, Shell Scripting, PowerShell, Confluence, Terraform, Snowflake, Artifactory, Ansible, Splunk, Python, ELK stack, Grafana, Kibana, Istio, Spinnaker

Confidential, Atlanta, GA

Site Reliablity Engineer

Responsibilities:

  • Hands on experience in Amazon Web Services (AWS) provisioning and good knowledge of AWS services like EC2, Elastic Load-balancers, EC2 Container Service (Docker Containers), S3, Elastic Beanstalk, CloudFront, Elastic Filesystem, RDS, DynamoDB, DMS, VPC, DirectConnect, Route53, CloudWatch, CloudTrail, CloudFormation, IAM, EMR, ElasticSearch.
  • Launching Amazon EC2 Cloud Instances using Amazon Web Services (Linux/ Ubuntu) and configuring launched instances with respect to specific applications.
  • Working on Multiple AWS instances, set the security groups, Elastic Load Balancer and AMIs, Auto scaling to design cost effective, fault tolerant and highly available systems.
  • Develop CloudFormation scripts to automate EC2 instances and created CloudFormation templates and deployed AWS resources using it.
  • Implemented Elastic Load Balancer (ELB's) and Auto-scaling groups in AWS on production EC2 instances to build Fault-Tolerant on highly available applications.
  • Implemented a serverless architecture using API Gateway, Lambda, DynamoDB and deployed AWS Lambda code from Amazon S3 buckets.
  • Created Ansible Playbooks Provision Tomcat servers, Nginx and other applications.
  • Worked on GitLab files that will be used to perform the build process in Gitlab.
  • Created inventory in Ansible for automating the continuous deployment and wrote playbooks using YAML scripting.
  • Developed data warehouse model in snowflake for over 100 datasets using whereScape.
  • Implemented a CI/CD pipeline with Docker, Jenkins and GitHub by virtualizing the servers using Docker for the Dev and Test environments by achieving needs through configuring automation using Containerization.
  • CreatedDockerimages using aDocker file, worked onDockercontainer snapshots, removing images and managingDockervolumesand implementedECS inAWS.
  • Install Kubernetes from scratch and Configure ETCD, deployments, Daemon Sets, Network Policies etc. for managing COSA architecture on Kubernetes.
  • Implementation of Nginx ingress for applications for Kubernetes clusters and monitors the traffic across the environments.
  • Created Docker images using a Docker file, worked on Docker container snapshots, removing images and managing Docker volumes.
  • Creating and Managing ELK Stack for Log-processing and visualizations.
  • Splunk SPL (Search Processing Language) and Dashboarding/Visualization. Setup dashboards for network device logs.
  • Implement Apache Kafka clusters for log storage to be processed by Logstash nodes.
  • Used Terraform to set up the AWS infrastructures such as launching the EC2 instances, S3 buckets objects, VPC, Subnets, etc.
  • Implement python scripts to manage auto-scaling properties using AWS Lambda.
  • Implemented Kubernetes to deploy scale, load balance, scale and manage Docker containers with multiple namespace versions.
  • Deployed Elastic Cloud Enterprise on the AWS with the help of custom written Terraform, Ansible and Shell scripts.
  • Deployed services using Helm charts in Kubernetes and customize the charts based on requirements.
  • Deployed Open Policy Agent for Terraform and EKS based deployments and wrote policies from scratch to implement the best security standards.
  • Interact with product management, project management and development teams to develop a strong understanding of the project.
  • Monitored and visualized VPC Flow Logs, Infrastructure Logs, Cross Account EKS and CloudWatch logs using the ELK stack.
  • Facilitate 24/7 production deployment of applications dependent on other teams during the off-hour’s maintenance windows.

Environment: s: Jenkins, Docker, Kubernetes, Helm, Kubeapps, Kubelens, Terr grunt, Git lab, AWS, Azure, GCP, Shell Scripting, PowerShell, Confluence, Terraform, Snowflake, Artifactory, Ansible, Splunk, Python, ELK stack, Grafana, Kibana, Istio, Spinnaker

Confidential

Site Reliablity Engineer

Responsibilities:

  • Plan, deploy, monitor, and maintain Amazon AWS cloud infrastructure consisting of multiple EC2 nodes and VMware VM's as required in the environment.
  • Created AWS cloud formation templates for the migration of applications from on-premises to AWS and to create custom-sized VPC, subnets, EC2 instances, ELB, security groups.
  • Creating EC2 instances in secured private subnet and deploying them in load balancing, assigning Elastic IP Addresses for EC2 instances for QA & UAT environments and GIT servers
  • Used Amazon S3 to backup database instances periodically to save snapshots of data.
  • Used Cloud Watch logs to move application logs to S3 and create alarms raised by applications.
  • Used Amazon Route53 to manage DNS zones and give public DNS names to elastic load balancers.
  • Implemented and maintained the monitoring and alerting of production and corporate servers/storage using AWS CloudWatch.
  • Implemented a GIT mirror for SVN repository, which enables users to use both GIT and SVN.
  • Used Bitbucket, a git’s solution for distributed version control system for collaborating our code with the team.
  • Used Terraform to set up the AWS infrastructures such as launching the EC2 instances, S3 buckets objects, VPC, Subnets, etc.
  • Implemented Ansible playbooks, inventory, dynamic inventory and automated Existing cloud environment.
  • Implemented container-based deployments usingDocker, working withDocker images, Docker Hub, Docker Compose, Docker-registriesandKubernetes.
  • Based on organizational requirements, prepared POCs for different automation tools and presented to higher management which helped them take decisions towards future migrations
  • Implemented and maintained complete continuous integration and continuous deployment (CI/CD) pipeline by using Jenkins.
  • Monitored the Jenkins server for the upgrades and new plugin installations required to run the CI/CD jobs.
  • Created the dockerized deployment environments and maintained most of the deployment environments like QA, staging, Production environments.
  • Used Kubernetes to provide a platform for automating deployment, scaling, and operations of application containers across clusters of hosts.
  • Created private cloud using Kubernetes that supports DEV, TEST, and PROD environments.
  • Generated UNIX scripts for build activities in QA, Staging and Production environments
  • Created scripts in Python to automate log rotation of multiple logs from web servers.
  • Responsible for Continuous Integration (CI) and Continuous Delivery (CD) process implementation from Dev to QA, QA to Pre-Prod, Pre-Prod to Production systems using Jenkins, Bamboo, GIT, SVN and Chef automation tool.
  • Implemented Continuous Integration using Jenkins and GIT.
  • Setup Datadog monitoring across different servers and on cloud environment.
  • Created Datadog dashboards for various applications and monitored real-time and historical metrics.
  • Created system alerts using various Datadog tools and alerted application teams based on the escalation matrix.
  • Installation and configuration of Splunk apps to onboard data sources into Splunk
  • Manage SPLUNK user accounts (create, delete, modify, etc.)
  • Create data retention policies and perform index administration, maintenance and optimization
  • Supports, Monitors and manages the SIEM environment

Environment: s: AWS, EC2, EBS, ELB, AMI, Splunk, auto scaling, VMware, S3, Route53, SNS, Ubuntu, RHEL, Windows, GIT, Docker, Terraform, Ansible, Kubernetes, puppet, PowerShell, chef, Nginx, Python, Splunk, Jira, GIT hub, Jenkins, JAVA/J2EE applications.

Confidential, San Jose, CA

Build & Release Engineer

Responsibilities:

  • Automated build and deployment process for application and leading up to building a continuous integration system using Jenkins.
  • Implementing a Continuous Delivery framework using Jenkins, Chef, Maven & Nexus in Linux environment.
  • Managing virtual machines using VM ware, Oracle Virtual box, Docker.
  • Primarily used Ruby to write GIT applications and Chef Recipes.
  • Integration of Maven/Nexus, Jenkins, GIT, Confluence and Jira.
  • Branching and merging, resolve conflict, push changes to TFS branches etc. experience with operating systems including deployment, maintaining and troubleshooting.
  • Managing day-to-day user administration like adding or deleting users, password aging users/groups, NFS using theTCP/IPapplications such as Telnet,FTP, and R-logins.
  • Responsible for nightly and weekly builds for different modules. Perform Deployment of Release to various QA & UAT in Linux environments.
  • Responsible for design and maintenance of the GIT repositories, views, and the access control strategies.
  • Created the ANT scripts and extension of existing ANT scripts for deployment of applications.
  • Used ANT and MAVEN as a build tool on java projects for the development of build artifacts on the source code.
  • Responsible for build and deployment automation using VM Ware ESX, Chef, Puppet.
  • Using Confluence for technical documentation, setting up a knowledge base, setting up an intranet and Confluence for software teams.
  • Setting up Change Management process and standardizing the policies for various applications.
  • Developed Linux, UNIX, Perl and Shell Scripts for manual deployment of the code to various environments.
  • Automate the processes using PowerShell and windows command shell.
  • Configured and monitored distributed and multi-platform servers using Nagios and App Dynamics.
  • Setup the Chef Workstation, Chef Server and chef nodes. Worked in creating, uploading new cookbooks and recipes.
  • Troubleshoot the build issue during the Jenkins build process. Setup various non-production environments for validating various applications.
  • Very good understanding of Microservices architecture, adapting Industry best practices for microservices while we break down existing apps into micro services.
  • Created dynamic routing/load balancing capability enabling large application scaling, used ingress rules and Ingress Controllers -Nginx.
  • Orchestrated CI/CD processes by responding to Git triggers, human input, and dependency chains and environment setup.
  • Create custom log metrics usingStack driverlogging and create charts and alerts using the custom log metrics.
  • Troubleshooting and performance tuning issues with applications like Oracle10g, MySQL 5. xand Java Applications on operating systems which included RHEL.

Environment: s: Subversion, Artifactory, Selenium, Jenkins, Ansible, Nagios, Zabbix, GIT, Chef, Maven, Jira, Nexus, Nginx, ANT, Linux/Unix, SQL, MYSQL VMware, Shell Scripting, Dynatrace PUTTY, PowerShell, CONFLUENCE, SOA.

Confidential

Assistant Professor

Responsibilities:

  • Teach courses in their subject area.
  • Managed servers, workstations, computers, laptops, printers, routers, switches and existing network applications.
  • Worked with Telnet, FTP, TCP/IP, iLink, rlogin, used to inter-operate hosts.
  • Performed reorganization of disk partitions, file systems, hard disk addition, and memory upgrade.
  • Conducted daily classes assignments and tasks.
  • Monitored system activities, log maintenance, and disk space management.
  • Assess student’s progress by grading papers, tests, and other work.
  • Supported all LAN, WAN and other networking systems.
  • Reviewed servers and application logs.
  • Designs and develops programs and applications to supports operations.

Environment: Unix, Linux, C, C++, HTML, CSS, JavaScript, Python.

Confidential

System Administrator(Linux/DevOps)

Responsibilities:

  • Installation of Ubuntu and RHEL operating systems on HP and dell.
  • Installation and configuration of Webserver (Apache 2.2.17), MySQL 5.5 and PHP in a LAMP stack.
  • Provided support for server and workstation upgrades, backup and disaster recovery monitoring, user account setup and security administration.
  • Set up and administered user and groups accounts, setting permissions for users.
  • Used LVM for the management of logical volumes including creation of physical volumes in Linux.
  • Created disk partitions and file systems and extended storage space using Logical Volume manager.
  • User Group creation, monitoring and maintaining log for system status/health using Linux commands and Nagios system monitor.
  • Collaborated with Network Admin in Installing, configuring, securing, and implementing slave replication on DNS BIND servers.
  • Installed, updated, and configured the necessary packages using the RPM and YUM software/package management.
  • Performed various activities on version control tool GIT.
  • Strong understanding in writing the automation of processes using the Shell script with Bash and Python.
  • Troubleshooting OS / Hardware related issues and monitor server health on a day-to-day basis.
  • Responsible for backup of all servers and recovery of all servers

Environment: Ubuntu, RHEL 5/5.5/6.0, Nagios, LVM, DNS, DHCP, NFS, HTTPD, LAMP stack, DNS, DHCP, NFS, HTTPD.

We'd love your feedback!