We provide IT Staff Augmentation Services!

Site Reliability Engineer Resume

4.00/5 (Submit Your Rating)

Rockville, MD

SUMMARY

  • Professional experience of over 7 + years in the field of IT that includes Development, designing with a foremost focus on DevOps, provisioning, build and release, Cloud Automation, Continuous Integration/Deployment, Configuration Management, Source Code Management, Virtualization technologies, System Administration and cloud migration.
  • Experience with an in - depth level of understanding in the strategy and practical implementation of AWS Cloud-Specific technologies including EC2, EBS, S3, VPC, RDS, SES, ELB, EMR, ECS, Cloud Front, Cloud Formation, Elastic Cache, RedShift, Lambda, SNS, Dynamo DB
  • Designed, configured and managed public/private cloud infrastructures utilizing Amazon Web Services.
  • Created load balancers in auto scaling group for creating highly available environment.
  • Creating snapshots and Amazon machine images (AMIs) of the instances for backup and creating clone instances.
  • Good knowledge in Developed Java API to connect with AWS S3 Services for storing and retrieving data stored in the S3 buckets.
  • Experienced in designing and establishing the connectivity between VPCs of same region (VPC Peering), design VPC tunneling between VPCs in different regions.
  • Leveraged “Azure Automation” and PowerShell, Chef and Ansible to automate processes in the Azure cloud.
  • Hands on experience with Microsoft Azure cloud services, Storage Accounts and Virtual Networks.
  • Expertise in Google Cloud Platform (GCP)Services Like Compute Engine, Cloud functions, Cloud DNS, Cloud Storage and Cloud Deployment and SaaS, PaaS and IaaS concepts of Cloud computing architecture and Implementation using GCP.
  • Experience in migrating the Legacy application into GCP platform and managing the GCP services such as Compute engine, cloud storage, BigQuery, VPC, Stack Driver, Load Balancing and IAM.
  • Deployed and Monitored Micro Services using Pivotal Cloud Foundry (PCF) and managed domains and routes with the cloud foundry.
  • To map more complex dependencies and identify network issues and tool for building, changing, and versioning infrastructure safely and efficiently by using Terraform.
  • Automated infrastructure provisioning on AWS using Terraform and Ansible
  • Used Chef Automate to build and experienced in using Chef and Knife commands to manage nodes, Cookbooks, Chef recipes, Chef attributes, Chef templates, Run-lists, environments.
  • Experience in installing and configuring the Ansible management node to deploy configuration to the end user nodes and Writing Ansible playbooks to deploy configuration to the production’s servers.
  • Configuring and managing Puppet master server and experience in updating and creating modules and pushing them to Puppet clients and dealing with Puppet manifest files to deploy automated tasks to many servers at once and used Puppet to create manifests, modules, files, and packages stored in GIT repository.
  • Experience in working on several Docker components like Docker-Hub, engine, Compose, Swarm and Docker registry and creating Images and Containers and pushing them to Docker Hub, optimized infrastructure by making use of resources effectively and decreased cost with Docker.
  • Familiarity with Kubernetes cluster management and administration, creating pods and managing them by updating resources depending on the requirement. set Jenkins by integrating various Devops tools such as Nexus, SonarQube, Maven, and maintain and configuring Jenkins by continuous integration (CI) and End to End automation for all build and deployments, involved in writing Groovy scripts for building CI/CD pipeline with Jenkins file.
  • Hands on experience in installing and administrating CI tools like Bamboo and Team City and other Tools like JIRA, Atlassian stack of tools like Stash, Confluence.
  • Well experienced in Branching, Merging, Tagging, and maintaining the version across the environments using SCM tools like GIT on Linux platforms (Red-hat, centos, ubuntu) and involved in the Branching strategy, thereby creating different branches in the existing workflow process using revision control tool GIT.
  • Hands on experience with Bitbucket in Branching, merging strategies, Tagging and maintain versions across windows and Linux platforms and integrating to push and pull the code for automation.
  • Hands on experience using MAVEN, ANT as build tools for writing build.xml and pom.xml and building of deployable artifacts from source code and administering and configuring Nexus repository Manager for Maven builds.
  • Monitoring tool like Nagios is used as a monitoring tool to identify and resolve infrastructure critical processes and integrated application logs with Splunk and wrote custom Splunk queries for alerting.
  • Creating and administrating Splunk Software Development Kit for JavaScript, Python on and AWS Cloud formation templates for Splunk distributed cluster deployment.
  • Working knowledge on Python /Bash/Shell /YAML scripts used to automate day to data administrative tasks. Tested various Linux Environments in the Development/Test stages to make the environment and better and clearing the problems if they are occurred, and effective in creating and assigning roles in AWS Lambda to run Python scripts.

TECHNICAL SKILLS

Cloud Platforms: AWS, Microsoft Azure, Google Cloud Platform (GCP), OpenStack and PCF.

CI/CD Tools: Jenkins, Bamboo, TeamCity.

Containerization and Orchestration: Docker, Kubernetes Clusters.

Configuration Management Tools: Ansible, Puppet and Chef.

Source Control Management Tools: GIT, Bitbucket and SVN.

Build Tools: Maven, ANT and Gradle.

Tracking Tools: JIRA and Orange Scrum.

Artifact Repositories: Nexus and Artifactory.

Logging & Monitoring Tools: Nagios, Splunk and ELK Stack (Elastic search, Log stash & Kibana), CloudWatch.

Web Servers and Application Servers: Apache, Nginx, JBOSS, Apache Tomcat and WebLogic.

Databases: Amazon Aurora, Dynamo DB, MongoDB, Oracle, SQL Server, MySQL.

Operating Systems: Windows, Linux/Unix and MAC OS.

Network Services and Topologies: LDAP, DNS, Web, FTP, Sub netting, LAN, VPC, WAN and firewalls.

Scripting languages: Shell, Python, SQL, XML, HTML, CSS3, Ruby, JSON and YAML.

Infrastructure as a code(IAC): Terraform, CloudFormation and Azure Resource Manager Templates.

PROFESSIONAL EXPERIENCE

Confidential, Rockville, MD

Site Reliability Engineer

Responsibilities:

  • Worked on Azure ExpressRoute to establish connection from Azure to On-premise datacenter. Working knowledge on Azure Fabric, Micro services, Lot & Docker containers in Azure.
  • Worked on Azure cloud services, Azure storage, Azure active directory, Azure service Bus. Create and manage Azure AD tenants and configure application integration with Azure AD. Integrate on-premises Windows AD tenants and Configure application integration with Azure AD.
  • Worked onAzure Resource Managerfor creating and deploying templates, assigning roles, and getting activity logs.
  • Creating,validating,and reviewing solutions and effort estimate of converting existing workloads from classic toARMbasedAzureCloud Environment.
  • IntegratedAzure DevOpswithTerraformandranADO CI/CD pipelinesfor creating the infrastructure on Azure, utilizedADO Pipelinesfor end-to-endCI/CDdeployment.IntegratedAzure Log AnalyticswithAzure VMsfor monitoring thelog files, store them and track metrics and usedTerraformas a tooland manageddifferent infrastructure resourceson Cloud,VMware, andDockercontainers.
  • Created theAzurestorage accounts,VNET’s,Subnets, assigned thepublic IP, altered thesecurity groups,NIC’sandAzure VM’sfor internal applications and assignedAzure Auto Scaling.
  • Wrotepower shellscripts to create the parameter files automatically for all the services inAzure Resource Manager and worked withNagiosforAzure Active Directory&LDAPand Data consolidation forLDAP users.
  • Worked on Premises applications on cloud platform Azure in dealing with Azure IaaS - Virtual Networks, Virtual Machines, Cloud Services, Resource Groups, Express Route, Traffic Manager, VPN, Load Balancing, Application Gateways, Auto-scaling.
  • Maintain storing certificates and secrets for Azure APIM and Azure Application Gateway and used Blue/Green Deployment to maintain zero Run time during deployment process.
  • Worked on Google Cloud Components, Google container builders and GCP client libraries and cloud SDK’s. Experience on Google Cloud Platform (GCP)and worked on OpenStack-manuals and Security Guide to the OpenStack Community.
  • Configured a Google Cloud Virtual Private Cloud (VPC) and Puppet Subnet Group for isolation of resources. Architecting the infrastructure on Google Cloud Platform using GCP services and automated GCP infrastructure using GCP Cloud Deployment manager.
  • Secured the GCP infrastructure with Private, Public subnets as well as security groups and leveraged the Google cloud services such as compute, auto-scaling and VPC to build secure, Scalable systems to handle unexpected workloads.
  • Using GIT repository for storing Terraform files and maintaining versioning. Converted existing Terraform modules that had version conflicts to utilize cloud formation during Terraform deployments to enable more control or missing capabilities.
  • Converted existing Terraform modules that had version conflicts to utilize cloud formation during Terraform deployments to enable more control or missing capabilities.
  • Written the Ansible playbooks which are the entry point for Ansible provisioning. Where the automation is defined through tasks using YAML format to setup continuous Delivery pipeline and ran Ansible Scripts to provision Dev servers.
  • Written Ansible playbooks for automating tasks and managed Ansible to configure Web Apps and deploy them on AWS server.
  • Used Docker for running different programs on single VM, Docker images includes setting the entry point and volumes, also ran Docker containers and worked on installing Docker and creation of Docker container images, tagging and pushing the images.
  • Worked on Docker registry, Machine, Hub and creating, attaching, networking of Docker containers, container orchestration using Kubernetes for clustering, load balancing, scaling and service discovery using selectors, nodes and pods.
  • Used Kubernetes for container operation in Azure and used Kubernetes clusters as a network and load balancer, and chosen Kubernetes is also good at running web applications in a cluster way, also used in multiple services by creating images and reduced space.
  • Maintaining Jenkins in various multiple environments by installing packages on Jenkins master and slaves and perform regular security updates for Jenkins.
  • Migrated the Jenkins free style jobs to pipeline jobs by writing Jenkins file in groovy and written helper functions Library and reused those libraries in Jenkins file.
  • Implemented collaborative development environment using GIT, GitHub and integrated it with Jenkins, and Maintained branches/forks in GitHub version control for the changes made in cookbooks as per release.
  • Developed a fully automated continuous integration system using GIT, Jenkins and custom tools developed in Python and Bash.
  • Used build tool Maven for building deployable artifacts such as jar, war, and ear from source code and Artifactory Repository like Sonatype Nexus for Maven and ANT builds to upload artifacts using Jenkins. managed and monitored the server and network infrastructure using Splunk applied blackouts for any outages, pulling reports by providing them to the client.
  • Implementing a Continuous Delivery framework using Jenkins, Maven & Nexus in Linux environment.
  • Supported multiple teams for multiple applications including .Net and Java/j2EE.
  • Identifying opportunities to improve infrastructure that effectively and efficiently utilize the Windows server 2008/2012, Microsoft Visual Studio, windows PowerShell, Cloud infrastructure.
  • Build and release of Cloud based products containing Linux (Centos, RHEL, Ubuntu) and Windows environments, using PowerShell, Python, Ruby.
  • Created Terraform modules to create instances in AWS & automated process of creation of resources in AWS using Terraform.
  • AmazonIAMservice enabled to grant permissions and resources to users. Managed roles and permissions of users with the help ofAWSIAM.

Environment: AWS, Azure, PCF, Terraform, Chef, Ansible, Docker, Kubernetes, Jenkins, GIT, Maven, Splunk.

Confidential, Irving, TX

Sr Cloud/DevOps Engineer

Responsibilities:

  • Build servers using AWS includes importing necessary volumes, launching EC2 instances, S3 for object static webpages and created security groups, auto scaling, load balancer, Route 53 and SNS as per architecture using IAAC tools like Cloud formation JSON Templates.
  • Worked on AWS EC2 Instances creation, setting up VPC, launching EC2 instances different kind of private and public subnets based on the requirements for each of the applications and Used IAM to assign roles, to create and manage AWS users, groups, and required permissions to use AWS resources.
  • Wrote Ansible Playbooks with Python SSH as the Wrapper to manage configuration of AWS Nodes and Test Playbooks on AWS instances using Python and ran Ansible Scripts to provision Dev servers.
  • Infrastructure buildout, maintenance & automation, collaborated with infrastructure team to maintain servers using Terraform for provisioning, Ansible for automating software configuration. Servers were spread across various regions and availability zones on AWS.
  • Created Terraform modules to create instances in AWS & automated process of creation of resources in AWS using Terraform.
  • Developed Ansible Playbooks using YAML scripts for launching different EC2 virtual servers in the cloud using Auto-Scaling and Amazon Machine Images (AMI).
  • Used Ansible server to manage and configure nodes, Managed Ansible Playbooks with Ansible roles. Used file module in Ansible Playbook to copy and remove files on remote systems.
  • Created AWS, VPC network for the installed instances and configured the Security Groups and Elastic IP’s and used EC2 Container Service to support Docker containers to run applications on cluster of EC2 instances.
  • Building and maintaining Docker container clusters managed by Kubernetes, Linux, Bash, GIT, Docker, on GCP. Utilized Kubernetes and Docker for the runtime environment of the CI/CD system to build, test, deploy.
  • Efficiently worked on building the data centers using Amazon Web Service, installed images on Amazon web services using Jenkins/Hudson, GIT.
  • Designing and implementing CI (Continuous integration) system and configuring Jenkins servers and nodes by writing required scripts (Bash & Python) and creating configuring VMs.
  • Used GIT version control to manage the source code and integrating with Jenkins to support build automation and integrated with JIRA to monitor the commits and worked with Ansible tower for scheduling playbooks and used GIT repository to store these playbooks and implemented continuous deployment pipeline with Jenkins.
  • Installed and administrated Nexus repository to deploy the artifacts generated by Maven and to store the dependent jars which are used during the build.
  • Utilized Splunk and New Relic for monitoring of logging, software, operating system, and hardware resources and used these monitoring tools for working of instances in AWS platform.
  • Automate the Build and deploy of all internal Java &SC environments using various continuous integration tools and by Python Scripting language.
  • Worked with PCF OAuth2 server and PCF login server to provide identity management and configured PCF nsync, PCF BBS, PCF cell repo to keep application running
  • Involved in automation of Linux based servers (Centos, RHEL, Ubuntu) and windows-based servers (2009R2, 2012R2) form on-premises to cloud platforms such as PCF.
  • Developed automation Scripting in Python using Chef to deploy and manage Java application across Linux servers.
  • Administrated Linux servers for several functions including managing Apache/Tomcat, and MYSQL Puppet in both development and production.
  • Managed Virtual servers (Ubuntu, RHEL, Centos) on AWS EC2 using Open-source Chef Server.
  • Perform daily maintenance routines on Linux servers, monitoring system access, managing file space and tuning the system for optimum performance.

Environment: AWS, GCP, OpenStack, Terraform, Chef, Ansible, Docker, Kubernetes, Jenkins, GIT, Maven, New Relic.

Confidential, Madison, NJ

DevOps engineer

Responsibilities:

  • Involved in designing and deploying multiple application utilizing AWS stack and implemented AWS solutions like EC2, S3, IAM, EBS, Elastic Load Balance (ELB), Security Group, Auto Scaling.
  • Automated and implemented the Cloud Formation Stacks for creating AWS resources like VPC, Subnets, Gateways, Auto-Scaling, Elastic-Load-Balancers (ELB), creating DB Instances and many others across different Availability Zones.
  • User management including IAM level (AWS Console level) by creating roles to allow the multiple users to switch roles and editing trust relationship to allow switch from main account to other account and at AWS instance level as well.
  • Implemented Chef Recipes for Deployment on build on internal Data Centre Servers. Also re-used and modified same Chef Recipes to create a Deployment directly into Amazon EC2 instances.
  • Worked with Chef servers and management application that can use Service Now data to bring computers into a desired state by managing files, and used Chef attributes, Chef templates, Chef recipes, for managing the configurations across various nodes.
  • Written Chef cookbooks to automate the installation and configuration of Docker VM’s, Jenkins master and slave setup using swarm Jenkins plugin on AWS VM’s.
  • Wrote Chef Cookbooks and recipes in Ruby to provision several pre-prod environments consisting of Cassandra DB installations, WebLogic domain creations and several proprietary middleware installations.
  • Used TeamCity Enterprise CI and distributed build that supports all of the environments to run the build, promotions and deployments, and Used Shell Scripts to automate the deployment process, and also uploading the code in GITHUB and generating build number using TeamCity.
  • Performed SVN to Bitbucket migration and managed branching strategies using GIT workflow. Managed User access control, Triggers, workflows, hooks, security, repository control in Bitbucket.
  • Created snippets that allow developers to share code segments and generated pulled requests for code review and comments using Bitbucket.
  • Built and Deployed Java/J2EE to Tomcat Application servers in an Agile continuous integration process and automated the whole process using Maven and Automated Weekly releases with Maven Scripting for Compiling Java Code, Debugging and placing Builds into Maven Repository.
  • Installed and configured, managed Monitoring Tools such as Nagios and used to identify and resolve infrastructure problems before they affect critical processes and worked on Nagios Event handlers in case of automatic restart of failed applications and services.
  • Worked on integrating Nagios with cloud watch as a monitoring solution, implementing monitoring tool as Nagios for analyzing and monitoring the network loads on the individual machines.
  • Developed Shell scripts for automation of the build and release process, to monitor repositories developed Custom Scripts and Used for automating process and collecting reports for daily tasks.
  • Extensive exposure to configuration management policies and practices with regards to SDLC along with automation of Scripting using Bash.
  • Installation of Solaris 8, 9 based servers with Jumpstart and RedHat Linux EL 4.x based servers using Kickstart development, test, and production environments.
  • Performed Web logic Server administration tasks such as installing, configuring, monitoring and performance tuning on Linux Environment, maintaining security by installing and configuring SSH encryption to access on Ubuntu and RHEL Linux.

Environment: AWS, Chef, Ansible, Bamboo, Bitbucket, Maven, Nagios.

Confidential, Houston, TX

Infrastructure Engineer

Responsibilities:

  • Excellent experience and knowledge with designing installing and implementing VMware ESX server, VMware virtual center, setting up V-Motion, HA, DRS, and related VMware products such as VM Workstation, VMware converter VM Backup products and other products for virtualization.
  • Excellent troubleshooting skills both with hardware and software in windows environment and Managing and Monitoring Dynamic Host Configuration Protocol DHCP and DNS .
  • Wrote custom script in Puppet for package management (rpm, yum) in RHEL Linux and worked closely with the development and operations organizations to implement the necessary tools and process to support the automation of builds and deployments.
  • Worked in building Puppet Enterprise modules using Puppet DSL to automate infrastructure provisioning and configuration automation across the environments and done node classifications and external node classifiers and parameterization for Puppet modules.
  • Worked with Puppet administrator, adding new Puppet enterprise nodes to the master, deactivating the nodes and troubleshooting connection issues, and troubleshooting, event application and reporting various Puppet issues and starting or restarting the Puppet enterprise services.
  • Configured Jenkins to build and deploy by setting up SonarQube, Maven, Nexus to build CI/CD pipeline which includes to trigger auto builds, code analysis, and deploy it nexus for various projects.
  • Maintain and track inventory using Jenkins and set alerts when the servers are full and need attention, and integrated GIT with Jenkins to automate the code checkout process with the help of Jenkins DSL plugin.
  • Used Webhooks for pushing the commits from GIT to Jenkins and written Groovy scripts to automate Jenkins Pipeline and set up the automate the build in periodic time and set alerts to notify after the build.
  • Build end to end CI/CD Pipelines in Jenkins to build CI/CD Pipeline and code deployment guide for Developers, Testers and Production management.
  • Used SonarQube for continuous inspection of code quality and to perform automatic reviews of code to detect bugs and Automated Nagios alerts and email notifications using Python script.
  • Used GIT version control to manage the source code and integrating GIT with Jenkins to support build automation and integrated with Nagios to monitor the commits.
  • Created ear, war, jar files using ANT scripts and responsible for builds and managing the testing and Pre-Pod environments, and setup ANT script-based jobs in Jenkins and worked with Jenkins pipelines.
  • Involved in developing custom scripts using Python, JSON, PowerShell, Perl, Shell to automate jobs, and wrote Python scripts for automated backup -ups and Cron Jobs.
  • Developed UNIX and Perl Scripts for manual deployment of the code to the different environments and E-mail the team when the build is completed.
  • Wrote Puppet modules for installing and managing Java Versions and Build and Deployment of the Java application onto different environments Dev, QA.
  • Configured Jenkins for .Net applications using MS build and PowerShell Scripting and Used Maven as a build tool on Java projects for development of build artifacts of the source code.
  • Administrated Linux servers for several functions including managing Apache/Tomcat server, mail server, and MYSQL Puppet in both development and production.

Environment: Puppet, Jenkins, ANT, GIT, Nagios, Linux, Puppet, windows.

Confidential

Build & Release Engineer

Responsibilities:

  • Developed and implemented Software Release Management strategies for various applications according to the agile process.
  • Participated in weekly release meetings with Technology stakeholders to identify and mitigate potential risks associated with the releases.
  • Imported and managed multiple corporate applications into Subversion (SVN) and provided end-user training for all Subversion (SVN) users to effectively use the tool.
  • Built and Deployed Java/J2EE to a web application server in an Agile continuous integration environment and also automated the whole process.
  • Created and maintained the Shell/python deployment scripts for TC Server/Tomcat web application servers.
  • Involved in editing the existing MAVEN files in case of errors or changes in the project
  • Requirements and managed Maven project dependencies by creating parent-child relationships between projects.
  • Installed Jira and confluence on Tomcat server.
  • Installed and configured Nagios monitoring tool to manage infrastructure.
  • Responsible for the development and maintenance of processes and associated scripts/tools for automated build, testing and deployment of the products to various developments.
  • Installed/Configured/Managed Jenkins CI Automation and responsible for the Plugin Management, User Management, Build/Deploy Pipeline Setup and End-End Job Setup of all the projects.
  • Used Puppet to make sure all the Jenkins slaves are in sync.
  • Installed/Configured and Managed Nexus and Repository Manager and all the Repositories.
  • Created the Release process of the artifacts and did upgrades to applications using puppet manifests.

Environment: Subversion (SVN), Jenkins, Java/J2EE, ANT, MAVEN, Puppet, Tomcat, Shell Scripts, UNIX, Linux, Putty, Oracle.

Confidential

Software Engineer

Responsibilities:

  • Installing and configuring Linux servers, user administration and profiles.
  • Managed Disk file systems, Server performance, Users creation and granted file access permissions.
  • Participated in Java/J2EE and developed XML, HTML, Shell Scripts.
  • Worked in infrastructure team on installation, configuration and administration of RHEL 5.x/6.x/7.x and SUSE Linux 10.x/11.x
  • Installed, configured and maintained DNS servers, Mail servers, FTP servers, NFS, and RPM.
  • Installing Linux servers using Kickstart and Cloning on ESX servers.
  • Performing migrations of applications from different UNIX servers like Solaris, HP-UX to Red hat Linux.
  • Worked with various Agile development teams to standardize branching and tagging of code in our TFS repository and maintain code base integrity.
  • Implemented SUDO, SSH, SOX (Sarbanes Oxley) security compliance and other security tools.
  • Worked on installing, integrating, tuning and troubleshooting Apache tomcat, WebSphere and WebLogic application server.
  • Worked with network operations like configuring firewall, router, storage and security.
  • Involved in installation, configuration, performance tuning, security, backup, recovery and update/patches of Linux and Unix servers using Crontab.
  • Extensive use of Bash shell for scripting and worked on backup and restore process.
  • Configured Linux servers like FTP, NFS, Telnet, SSH, send mail, Apache and Package management using RPM and YUM.

Environment: Linux, Java/J2EE, XML, HTML, Shell, RHEL, SUSE Linux, RPM, YUM, Crontab, Tomcat, WebLogic, WebSphere, Agile, TFS, Solaris, ESX, HP-UX, UNIX, SUDO, SSH, SOX, Firewall, Kickstart.

We'd love your feedback!