Sr.devops/site Reliability Engineer Resume
Houston, TX
SUMMARY
- 8+ years of overall experience working in DevOps and Java. 6+ years of experience in the field of DevOps Engineer in application configurations, code compilation, packaging, building, automating, managing and releasing code from one environment to other and deploy to servers in various cloud platforms like AWS, Azure and GCP.
- Expertise in Architecting and Implementing Azure Service Offering, such as Azure cloud services, Azure storage, IIS, Azure Active Directory (AD), Azure Resource Manager (ARM), Azure Storage, Azure, Blob Storage, Azure VMs, SQL Database, Azure Functions, Azure Service Fabric, Azure Monitor, and Azure Service Bus.
- Hands on experience on Backup and restore Azure services and in Design and configure Azure Virtual Networks (VNets), subnets, Azure network settings, DHCP address blocks, DNS settings, security policies and routing. Azure cloud services, Blob storage, Active directory, Azure Service Bus, Cosmos DB.
- Expertise in Azure Scalability and Azure Availability - Build VMs availability sets using the Azure portal to provide resiliency for IaaS based solution and Virtual Machine Scale Sets (VMSS) using Azure Resource Manager (ARM) to manage network traffic.
- Experience in migrating on-premises applications to Azure and configured VNETs and subnets as per the project requirement also performed PowerShell scripting to do Patching, Imaging, and Deployments in Azure.
- Knowledge of Azure Site Recovery and Azure Backup and Configured the Azure Backup agent and virtual machine backup, Enabled Azure Virtual machine backup from the Vault and configured the Azure Site Recovery.
- Implemented a CI/CD pipeline using Azure DevOps (VSTS, TFS) in both cloud and on-premises with GIT, MS Build, Docker, Maven along with Jenkins plugins.
- Provisioned the highly available EC2 Instances using Terraform and cloud formation and wrote new plugins to support new functionality in Terraform.
- Experience in writing Infrastructure as a code (IAAC) in Terraform, Azure resource management, AWS Cloud formation. Created reusable Terraform modules in both Azure and AWS cloud environments.
- Experience in Infrastructure Development and Operations involving AWS Cloud Services, EC2, EBS, VPC, RDS, SES, ELB, Auto scaling, CloudFront, Cloud Formation, Elastic Cache, API Gateway, Route 53, Cloud Watch,AppDynamcis, SNS.
- Expertise in building CI/CD on AWS environment using AWS Code Commit, Code Build, Code Deploy and Code Pipeline and experience in using AWS CloudFormation, API Gateway, and AWS Lambda in automation and securing the infrastructure on AWS.
- Experience in changing over existing AWS infrastructure to Serverless architecture (AWS Lambda, AWS Kinesis) through the creation of a Serverless Architecture using AWS Lambda, API gateway, Route 53, S3 buckets.
- Experience in Kubernetes to deploy scale, load balance, and manage Docker containers with multiple names spaced versions using Helm charts
- Experience in managed Kubernetes services such as Google Kubernetes engine (GKE), Azure Kubernetes and AWS Elastic container service.
- Hands-on in using OpenShift for container orchestration with Kubernetes, container storage, automation, to enhance container platform multi-tenancy.
- Experience with OpenShift, Kubernetes architecture and design, troubleshooting issues and multi-regional deployment models and patterns for large-scale applications.
- Proficiency in using Docker Hub, Docker Engine, Docker images, Docker Weave, Docker Compose, Docker Swarm, and Docker Registry and used containerization to make applications platform when moved into different environments.
- Proficient in creating Docker images using Docker File, worked onDocker containersnapshots, removing images, and managing Docker volumesand implemented Docker automation solution for CI/CD model.
- Extensive experience in installing, configuring and administering Jenkins CI tool on Linux machines and used Jenkins Pipelines to drive all Microservices builds out to the Docker Registry and then deployed to Kubernetes, Created Pods and managed using Kubernetes.
- Involvement in developing APIs using Kubernetes to manage and specify the copies of the containers to run the actual servers in the cloud environment.
- Scheduled, deployed and managed container replicas onto a node cluster using Kubernetes and deployed Docker containers through Kubernetes to manage the microservices using its Nodes, Pods, ConfigMaps, Selectors, Services.
- Expertise in configuring the monitoring and alerting tools according to the requirement like Prometheus and Grafana. setting up alerts and deployed multiple dashboards for individual applications in Kubernetes.
- Experience in Configuration management tools such as Chef, Ansible, and Puppet. Wrote Chef recipes and cookbooks in Ruby scripting.
- Hands-on experience in bootstrapping the nodes using knife and automated by testing Chef Recipes, Cookbooks with test-kitchen and chef spec. Refactored Chef and Ops Works in AWS cloud environment.
- Experienced in Ansible Tower, which provides an easy-to-use dashboard and role-based access control and in developing Ansible playbooks for managing the application/OS configuration files in GIT hub, integrating with Jenkins, and Verifying with Jenkins plugins, deploying the application in Linux environment.
- Experience in Ansible setup, managing hosts file, Using YAML linter, authoring various playbooks and custom modules with Ansible and Ansible Playbooks to automate in AWS services like Launching EC2, Provisioning IAM, Configuring VPC, EBS, Monitoring using Cloud Watch and Cloud Trail.
- Expert in Setting up Continuous Integration (CI) by configuring Build, Code, Deploy and test automation Jobs in Jenkins for different applications and in creation and deployment using Codeship to automate branch & project creation in Git using (Groovy language) in Jenkins file and automating (using Ansible).
- Professional in deploying and configuring Elasticsearch, Logstash, Kibana (ELK) and AWS Kinesis for log analytics and skilled in monitoring servers using Nagios, Splunk, AWS CloudWatch, Azure Monitor and ELK.
- Skilled with Python, Bash/Shell, PowerShell, Ruby, Perl, PowerShell, JSON, YAML, Groovy. Developed Shell and Python Scripts used to automate day to day administrative tasks and automation of the build and release process.
- In-depth knowledge and experience in implementing Service-Oriented Architecture (SOA) using XML based SOAP/RESTful API Web Services.
- Worked on google cloud platform (GCP) services like compute engine, cloud load balancing, cloud storage, cloud SQL, stack driver monitoring and cloud deployment manager.
- Experience in Designing, Architecting and implementing scalable cloud-based web applications using AWS and GCP.
- Experience in providing highly available and fault tolerant applications utilizing orchestration technologies like Kubernetes and Apache Mesos on Google Cloud Platform.
- Experience in installing, configuring, managing MYSQL, SQL Server, PostgreSQL, Oracle, MongoDB & Cassandra. Ability to build deployment, build scripts and automated solutions using scripting languages such as Bash, Shell, Ruby, and Python.
- Experience in System Administration, Configuration, upgrading, Patches, Troubleshooting, Security, Backup, Disaster Recovery, Performance Monitoring and Fine-tuning on Unix & Linux Systems.
- Examined whether users, owners, custodians, systems and networks are in compliance with internal security procedures and external laws and SOX regulations.
TECHNICAL SKILLS
Cloud Platforms: AWS, Azure, Google Cloud Platform (GCP), OpenStack and PCF.
Operating Systems: Windows, Linux/Unix and MAC OS.
Continuous Integration Tools: Jenkins, Bamboo, VSTS, Gitlab CI, Azure Devops.
Containerization & Orchestration Tools: Docker, Docker Swarm, Kubernetes Clusters.
Configuration Management Tools: Ansible, Puppet and Chef.
Source Control Management & Version Control Tools: GIT, Git Hub, Bitbucket and SVN.
Build Tools: Maven, ANT and Gradle.
Tracking Tools: JIRA and Orange Scrum.
Artifact Repositories: Nexus and Jfrog.
Logging & Monitoring Tools: Nagios, Splunk and ELK Stack, CloudWatch, Grafana, Prometheus.
Web / Application Servers: Apache Tomcat, Nginx, WebSphere, WebLogic, JBoss, Samba, SQL Server.
Database: Amazon Aurora, Dynamo DB, MongoDB, Oracle, SQL Server, MySQL.
Network Services and Topologies: DNS, Web, FTP, Sub netting, LAN, VPC.
Scripting languages: Shell, Python, SQL, Groovy, Golang, XML, HTML, JSON and YAML.
Infrastructure Spin-up Tools: Terraform, CloudFormation and Azure Resource Manager Templates.
PROFESSIONAL EXPERIENCE
Confidential, Houston, TX
Sr.Devops/Site Reliability Engineer
Responsibilities:
- Worked on converting design into infrastructure code developing solutions using AZURE(VNET, VM, BLOB, LB, SQL Database, AAD, Resource Manager, Traffic Manager, Application Insights, CDN, Operational Insights) using orchestration tools Jenkins, Terraform & Ansible.
- Responsible for implementing containerized based applications on Azure Kubernetes by usingAzure Kubernetes service (AKS),Kubernetes Cluster, which are responsible for cluster management, Virtual Network to deploy agent nodes,Ingress API Gateway,My SQL Databases and Cosmo DBfor stateless storage of external data, and setup reverse proxy Nginx in the cluster.
- Hands on experience in Azure DevOps in using all the modules like Azure Boards, Azure Pipelines, Azure Repos, Azure Test plans, and Azure Artifacts.
- Worked in Creation and configuring Load balancers and auto scaling groups to distribute traffic in a cost efficient, fault tolerant in a highly available environment.
- EvaluatedKubernetesfor Docker container orchestration. Managed Kubernetescharts using Helm and created reproducible builds of the Kubernetes applications, templatize Kubernetes manifests, provide a set of configuration parameters to customize the deployment and Managed releases of Helm packages.
- Worked on setting up dashboards for monitoring server infrastructure and SNS alerts for corrective actions or notifications using Azure Application Insights, Splunk, Prometheus & Grafana.
- Created Azure Blob Storage to store files for serving static content for web applications.Configured storage with various life cycle policies to archive the infrequently accessed data to storage classes based on requirements.
- Experience in managing Terraform projects in a team environment maintaining state using storage and developing pipelines to deploy infrastructure using Jenkins.
- Configured and launching VM instances using AMI’s of Linux (Azure, RHEL, SUSE, Ubuntu).
- Creating roles, users, groups and also implemented MFA to provide additional security toAzureaccount and its resources.
- Managed Azure global components such as users, roles and policies, key pairs and policies using Ansible playbooks or python scripts using Azure CLI.
- Implemented Domain name service (DNS) through Traffic Manager for highly available and scalable applications.
- Created and Managed Storage volumes for storing application files for use within VM instances.
- Experience in creating SQL database instances, configured snapshots to take backups of volumes and images to store launch configurations of the Virtual Machines.
- Scripted Templates for Azure infrastructure as a code using ARM / Terraform templates to build sandbox, development, staging and production environments.
- Managed Ansible playbooks for installing Apache, Tomcat & Jboss EAP servers and delivered through pipelines to Ansible Tower.
- Build Automation process implementation using Jenkins along with Python and Shell scripts to automate routine administrative tasks reducing manual intervention.
- Implemented Continuous Integration and Continuous Delivery using Jenkins and GIT from scratch building custom pipelines running automation tests.
- Well versed in using Atlassian confluence for creating documentation and JIRA for issue management as well as Agile/Scrum Tool.
- Used ServiceNow for managing Incidents, Requests, problems and integrated change processes with Jenkins for standard changes.
- Build additional Docker Slave nodes for Jenkins using custom built Docker images and instances.
- Worked on google cloud platform (GCP) services like compute engine, cloud load balancing, cloud storage, cloud SQL, stack driver monitoring and cloud deployment manager.
- Setting-up GCP Firewall rules to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN (content delivery network) to deliver content from GCP cache locations drastically improving user experience and latency.
- Worked with high volume datasets from various sources like Oracle, Text files, Relational Tables and xml targets. Responsible for mandatory checks, date field checks and number field checks inETLpackages.
Confidential, Chicago, IL
Sr AWS Cloud DevOps Engineer
Responsibilities:
- Provisioned and administered EC2 instances and configuring EBS, Simple Storage(S3) cross region replication, Elastic Load Balancer, configure Auto scaling, setting up CloudWatch alarms, Virtual Private Cloud (VPC), mapping with multi-AZ VPC instances and RDS, based on architecture.
- Worked on Amazon EC2 setting up instances, virtual private cloud (VPCs), and security groups and created AWS Route53 to route traffic between different regions and used BOTO3 and Fabric for launching and deploying instances in AWS.
- Configured Amazon S3, Elastic Load Balancing, IAM and Security Groups in Public and Private Subnets in VPC, created storage cached and storage volume gateways to store data and other services in the AWS.
- Architected and configured a virtual data center in the AWS cloud to support Enterprise Data Warehouse hosting including Virtual Private Cloud (VPC), Public and Private Subnets, Security Groups and Route Tables.
- Used Security Groups, Network ACLs, Internet Gateways, NAT instances and Route tables to ensure a secure zone for organizations in AWS public cloud.
- Create develop and test environments of different applications by provisioning Kubernetes clusters on AWS usingDocker, Ansible, and Terraform.
- Automate Datadog Dashboards with the stack through Terraform Scripts. Wrote terraform scripts for CloudWatch Alerts.
- Created aContinuous Deliveryprocess to include support building of Docker Images and publish into a private repositoryNexus v3.
- Responsible for installation & configuration of Jenkins to support various Java builds and Jenkins plugins to automate continuous builds and publishingDocker images to the Nexus repository.
- Automated applications andMySQLcontainer deployment inDocker using Pythonand monitor them usingNagios.
- Experience with container-based deployments using Docker, working with Docker images, Docker Hub and Docker-registries and Kubernetes.
- Building/Maintaining Docker container clusters managed by Kubernetes Linux, Bash, GIT, Docker. Utilized Kubernetes and Docker for the runtime environment of the CI/CD system to build, test deploy.
- UsedJenkins pipelinesto drive all microservices builds out to theDocker registryand then deployed toKubernetes, CreatedPodsand managed usingKubernetes.
- Deploying windows Kubernetes cluster with Azure Container Service from Azure CLI and Utilized Kubernetes and Docker for the runtime environment of the CI/CD system to build, test and Octopus Deploy. UtilizedKubernetesfor the runtime environment of theCI/CDsystem to build, test deploy.
- Automated configuration management and deployments usingAnsibleplaybooks andYamlfor resource declaration. And creating roles and updating Playbooks to provision servers by usingAnsible.
- Installing, Configured and management inAnsibleCentralized Server and creating the playbooks to support various middleware application servers, and involved in configuring theAnsibletower as a configuration management tool to automate repetitive tasks.
- Wrote theAnsible YAMLscripts to configure the remote servers. Integrated Jenkins andAnsible. Designed and developedJenkinsBuild deployments.
- Installing Ansible Tower and using it to manage systems. Used Ansible Tower, which provides an easy-to-use dashboard and role-based access control, so that it's easier to allow individual teams access to use Ansible for their deployments.
- Setup full CI/CD pipelines so that each commit a developer makes will go through standard process of software lifecycle and gets tested well enough before it can make it to the production.
- Helped individual teams to set up their repositories in bit bucket and maintain their code and help them setting up jobs which can make use of CI/CD environment.
- Installed and configured Apache HTTP Server, IHS and WebSphere Application Server and also configuration of Virtual Hosts for Apache HTTP Server, and Tomcat Application Server.
- Creation, configuration and monitoring Shards sets. Analysis of the data to be shared, choosing a shard Key to distribute data evenly. Architecture and Capacity planning for MongoDB clusters. Implemented scripts for mongo DB import, export, dump and restore.
- Worked on MongoDB database concepts such as locking, transactions, indexes, Sharding, replication, schema design. Created multiple databases with sharded collections and choosing shard key based on the requirements. Experience in managing MongoDB environment from availability, performance and scalability perspectives.
- Responsible for managing MongoDB environment with high availability, performance and scalability perspectives.
- Installation and implementation of the Splunk App for Enterprise Security and documented best practices for the installation and performed knowledge transfer on the process.
- Configured Syslog server for the forwarding the logs to Splunk server via network protocols like TCP and UDP.
- Developed Splunk infrastructure and related solutions as per automation tool sets. Involved in building the application which works on mainly Groovy Scripting.
Confidential, Houston, TX
Sr. DevOps/Cloud Engineer
Responsibilities:
- Automated the cloud deployments using chef, python (boto,fabric) and AWS Cloud Formation Templates.
- Installed Workstation, Bootstrapped Nodes, Wrote Recipes, Cookbooks and uploaded them to Chef-server, Managed On-site OS/Applications/Services/ Packages using Chef as well as AWS for EC2/S3&ELB with Chef Cookbooks.
- Create Chef Automation tools and builds, and do an overall process improvement to any manual processes.
- Wrote Chef Cookbooks for various DB configurations to modularize and optimize end product configuration.
- Involved in the installation and configuration of Nginx as a web server.
- Experience with Build Management tools Ant and Maven for writing build.xmls and pom.xmls
- Integration of Automated Build with Deployment Pipeline. Currently installed Chef Server and clients to pick up the Build from Jenkins repository and deploy in target environments (Integration, QA, and Production).
- Utilized WAR and JAR files for deployment of enterprise apps.
- Building and deployment of Java applications on to different environments such as QA, UAT and Production.
- Branching, Tagging, Release Activities on Version Control Tools: SVN, GitHub.Extensive usage of Tortoise SVN in windows environment for version control activities.
- Worked with cloud providers and API's for Amazon (AWS) EC2, S3, VPC with Cloud Sigma (EU) and GFS storage.
- Implemented Chef Recipes for Deployment on build on internal Data Centre Servers. Also re-used and modified same Chef Recipes to create a Deployment directly into Amazon EC2 instances.
- Evaluated testing of Chef Recipes - Concept of Test-Driven Development for Infrastructure as a Code. Implemented Chef Cookbooks for OS component configuration to keep AWS server's template minimal.
- Wrote Chef Cookbooks for various DB configurations to modularize and optimize end product configuration.
- Virtualized the servers using the Docker for the test environments and dev-environments needs. And also configuration automation using Docker containers.
- Build artifacts are deployed into Tomcat instances and were integrated using the Perl and Shell scripts.
- Implementing a Continuous Delivery framework using Jenkins, Puppet, Maven& Nexus in Linux environment. Manage deployment automation using Puppet, MCollective, Hiera, and Custom Puppet modules, Ruby.
- Involved in periodic archiving and storage of the source code for disaster recovery. Prepared Junit test cases and executed server configurations.
- Supported and developed tools for integration, automated testing and release management. Used Anthill pro as Continuous Integration (CI) server.
Confidential, Chicago, IL
DevOps Engineer
Responsibilities:
- Configured Security group for EC2 instances for puppet master and puppet agents and Developing AWS cloud formation templates by setting up Auto scaling for EC2 instances which are Responsible for automation on AWS Cloud formation template.
- Automate creating AWS components like EC2 instances, Security Groups, S3, ELB, RDS, IAM through Cloud Formation templates.
- Configured AWS EC2 instances using AMI (Linux/Ubuntu) for specific application. Maintained EC2 instances and S3 buckets on AWS.
- Worked on creating load-based Auto scaling to increase capacity at peak times and reduce the costs at off-peaks times for production infrastructure.
- Setup Kubernetes to deploy scale, load balance and manage Docker containers with multiple name space versions for development, test and production environments
- Configured Continuous Integration and Continuous Delivery build pipelines in Jenkins to create Containers for the application, virtualized servers using Docker for the development, test and prod environment needs.
- Configured Docker
- Trusted Registry (DTR) to store and distribute images used in Jenkins workflow to deploy applications to Kubernetes.
- In depth knowledge of CI (Continuous Integration) and CD (Continuous Deployment) methodologies using Jenkins, Maven, Ant, and Puppet.
- Responsible for SCM (source code management) configuration and deployment and utilized Puppet to automate Configuration management to manage Web Applications, Config Files, Data Base, Users Mount Points and Packages.
- Configure dashboards in Splunk/Grafana to monitor server and application health.
- Use GITLAB as a Version Control with GIT branching strategy that included develop branches, feature branches, staging branches and master. Pull requests and code reviews were performed.
- Worked with Agile/Scrum methodology to deliver quality work in bi-weekly sprints.
- Configured Managed and controlled the source code repository, currently housed in Subversion, GIT.
Confidential, New Jersey, NJ
Java Developer
Responsibilities:
- Implemented Agile Development methodology of software development and design JSON requests according to the front-end pages.
- Developed various Java Bean components for implementing business logic.
- Extensively worked with Java Collection classes like List, Set etc.
- Involved in performance tuning of JVM and used Garbage Collection Algorithms for better performance.
- Involved in writing complex multi-table joins and conditional queries in Databases.
- Worked extensively in the combination of JavaScript, JSON and JQUERY.
- Involved in data base design and responsible for creating and maintaining data base objects.
- Involved in support QA team testing in various environments.
- Implemented OOPs and other core java concepts like multithreading, concurrency, inheritance in development.
Confidential, Roanoke, VA
Linux Engineer
Responsibilities:
- Responsible for installation, configuration and upgradation of Red Hat Linux, CentOS on remote servers.
- Installation, Configuration, Service start and troubleshooting of Apache Tomcat Server.
- Log management like monitoring and cleaning of old log files.
- Configuring and monitoring DHCP and DNS Server.
- Installation and management of packages through RPM and YUM in Red Hat Linux and CentOS.
- Experience in writing bash scripts for job automation.
- Worked in building scripts and responsible for installation procedures.
- Performance monitoring of Linux servers using native tools.
- Reviewing system log files for errors.
- Involved in creating new users, groups, and setup their home directories and appropriate access, restrictions to software in the Active Directory environment and NIS domains by using SUDO.
- Troubleshooting network, software and /or hardware problems, monitor and document of all Linux-based systems in centralized IT environment.