Cloud Devops Engineer Resume
CT
SUMMARY
- Having around 8 years of experience as Linux System Administrator, Cloud DevOps Engineer, build and release engineer,working on the server - based operating system; kernel configurations on Red Hat Enterprise Linux, CentOS, VMWare and Windows server models.
- Installation and upgradation of Packages and Patches configuration management, version control, service pack & reviewing security vulnerabilities.
- Experienced at DevOps Engineering in automating, building, deploying, and releasing of code from DEV env to PROD environment.
- Certified AWS Developer Associate with expertise in Amazon AWS Cloud Administration which includes services like EC2, S3, EBS, VPC, ELB, Route 53, Auto Scaling, Security Groups, SNS, SQS and Cloud Formation.
- Created AWS computing instance services like EC2 and Amazon Elastic Load Balancing in AWSCloudand managed AWS storage with S3, EBS and Amazon Cloud Front.
- Created lightweight containers using docker to run the micro services as part of the application.
- Used Kubernetes to orchestrate the deployment, scaling and management of Docker Containers.
- Proficient in Build & Release automation framework designing, Continuous Integration and Continuous Delivery automation.
- Good at documenting and implementing procedures related to build, deployment and release.
- Experienced with infrastructure as a code on AWS cloud platform using Terraform.
- Worked with Ansible playbooks for virtual and physical instance provisioning, configuration management, patching and software deployment.
- Extensive experience using MAVEN and ANT as a Build Tool for the build and deploy artifacts (war & ear) from source code.
- Experienced in branching, tagging and maintaining the version across the environments using SCM tools like GIT, Subversion (SVN) and TFS on Linux and windows platforms.
- Experience in integrating Ansible Tower to deploy the playbooks.
- Managed 4 AWS Environments holding 300+ Instances. Wrote several Lambda scripts to manage Instances as per the usage.
- Currently working on migrating backups from Veeam to Commvault and configuring Instance level backups for DB2 Servers.
- Developing and executing all the required Scripts to maintain the system security using the IRS remediation’s and to automate the new Linux builds to match and maintain the required restrictions and remediation’s.
- Created Python scripts for various System administration tasks to automate repetitive tasks.
- Integrated Python scripts into Ansible playbooks to automate the tasks on different environments.
TECHNICAL SKILLS
Operating Systems: Linux RHEL, CentOS, Windows
Virtualization: VMware, Hyper-V
Scripting Languages: Python, Shell, Power Shell
Configuration & Management Tools: Ansible, Chef, Puppet
Infrastructure Provisioning Tools: Terraform
Build Tools: Maven, Ant, Gradle
Release Tools: Jenkins
Version Control Tools: Git, SVN
Code analysis and:
Cloud Platforms: AWS, Azure, Open stack
Containerization Tools: Docker, Kubernetes, Openshift
Webservers: Apache, Tomcat
AWS Services: EC2, ELB, VPC, RDS, IAM, Cloud Formation, S3, Cloud Watch, Cloud trial, SNS, SQS, EBS, Lambda, Auto scaling, DynamoDB, Route53, RDS, Elastic Beanstalk
PROFESSIONAL EXPERIENCE
Confidential, CT
Cloud DevOps Engineer
Responsibilities:
- Our AHCT application majorly sits on AWS cloud platform and my responsibility is to maintain the environments and try to achieve complete end to end automation.
- Created launch configuration based on the requirement which can be used in autoscaling group to keep the application highly available and scalable accordingly
- Widely used the AWS provided services like EC2, S3, RDS, VPC, Route53, CodeCommit, Snowball, SQS, CloudWatch, CloudFront, CloudTrail, etc
- Migrated servers to AWS from On-prem. Created EC2 instances, VPC, Security groups, ELB, Auto-Scaling, IAM users/roles and S3 buckets using Terraform.
- Setting up production and staging clusters on AWS VPC.
- We coordinated with Terraform DSL for automating inside the AWS cloud and worked with Terraform key features such as Infrastructure as code, Execution plans, Resource Graphs, Change automation.
- Developed Terraform templates in AWS to automate the provision of the full stack of servers based on VPC, Subnets, Security Groups, Load Balancers, Policies for both IAM and S3 buckets, IAM role-based policies, auto-scaling and different requirements as per the application’s architectural design and security.
- Enhanced Terraform template by adding the resource block for creating additional network interface, so that while executing the template it will launch the instance with two NAC cards which routes traffic accordingly.
- Used Python, Ansible and Terraform for tasks such as encrypting EBS volumes backing AMIs and scheduling Lambda functions for routine AWS tasks.
- Developed Python scripts for key management to fetch keys from the vault to be used for various applications and pipelines
- Extensively worked with Python scripts to automate and improve the configuration time.
- Created Python scripts for various System administration tasks to automate repetitive tasks
- Established infrastructure and service monitoring using Prometheus and Grafana.
- Written Ansible playbooks for various DB configurations to modularize and optimize end-product configuration and ansible roles to deploy, configure and manage both Linux and Windows servers.
- Configured and integrated maven, Git in the global tool configuration of Jenkins so that jobs that are created will be using the same configuration while building the jobs
- Working with Jenkins workflow engine to manage CI/CD pipeline from source to delivery.
- Working with Jenkins jobs by polling source code from Git, and building the code using maven
- Integrated and configured AWS S3 publisher plugin in Jenkins so that the artifacts will get stored in the s3 buckets.
- Used Kubernetes as an open-source platform for automating deployment, scaling and operations of applications containers across clusters of hosts, providing container-centric infrastructure.
- Configured Kubernetes provider with Terraform, which is used to interact with resources supported by Kubernetes to create several services such as Config Map, Namespace, Volume, Autoscaler, etc.,
- Worked on application containers using docker using custom docker files and orchestrating using docker swarm and Kubernetes on both on-premises and cloud
- Used Kubernetes to orchestrate the deployment, scaling and management of Docker Containers.
- Worked on several Docker components like Docker engine, creating docker images by writing Docker files, maintained docker registry and handling multiple images primarily for middleware installations and domain configuration.
- Used Confluence for documenting the tasks that were done during the application provisioning and Administration related tasks
- Create projects on Jira and assigned Jira Agile boards.
- Worked closely with team which includes confluence configuration, setup and administration tasks like managing users & groups, add-ons, plugins, macros and templates
Environment: s: Terraform, AWS, Ansible, Kubernetes, Docker, Git, Jenkins, Jira, Confluence, Grafana, Apache Webserver, Maven, Python
Confidential, CT
Systems/DevOps Engineer
Responsibilities:
- Worked in maintenance of the version control system (branching model, security), creation and maintenance of build scripts, and the setup of work item areas and iterations in GIT.
- Worked with the groovy scripts in Jenkins to execute jobs for a continuous integration pipeline where 'Groovy Jenkins Plugin' and 'Groovy Post-Build Action Plugin' is used as a build step and post build actions
- Created and maintained build scripts in ANT and worked with dev team to migrate Ant scripts to Maven build tool.
- Worked in authoring pom.xml files, perform builds with Maven release plugin, and manage artifacts in Nexus repository.
- Designing and implementing CI (Continuous Integration) system: configuring Jenkins servers, Jenkins nodes, creating required scripts (Shell & Groovy), and creating/configuring VMs (Linux/Windows).
- WroteJenkinsfilecore pipeline and push to the source code management to create continuous integration/delivery.
- Worked on Amazon AWS EC2 cloud services for installing, configuring and troubleshooting on various Amazon images like Amazon Linux AMI, RedHat and Ubuntu.
- Configured S3 versioning and lifecycle policies to backup files and archive files in Glacier.
- Created templates to totally automateAWSservices which includes web servers, ELB, Cloud Front distribution, database, EC2 and database security groups, S3 bucket and application configuration, this script creates stacks, single servers, or joins web servers to stacks.
- Rolled out Ansible automation and wrote playbooks to update configurations of agents.
- Worked on installing the Veeam backup software and scheduled backup for 600+ servers, which we are currently using.
- Worked with the team in designing the network for the HP Nimble storage. We have done the POC and bought it to production.
- I am currently administering the AWS, Nimble storage, Veeam, Commvault and VMware.
- Expertise in managing the Nimble storage, creating LUNS and managing them. Currently working on 3 Nimble SAN’s with more than a 5 Petabyte Storage.
- Migrated VMs, Hosts and Clusters from VCenter 6.5 to 6.7 and 6.7 to 7.x.
- Currently working on migrating backups from Veeam to Commvault and configuring Instance level backups for DB2 Servers.
- Worked on TSM to schedule the backups for DB2 Servers.
- Integrate new RPMs to provide new features stability or security vulnerability mitigation.
- Supporting Windows Cluster servers running 2-4 nodes supporting applications like web servers, SharePoint and Active Directory environmentand Migrations from Windows 2012 R2 - 2016 environments.
- Maintaining the IBM Purescale Servers and supporting them 24/7 depending on schedule of the deployment jobs.
- Maintain virtual server environment using VMWARE ESXi, VSphere, VCenter and Cloud Technologies like AWS.
- Identifying the Critical applications for System resource utilization (CPU, Memory, Threads etc.) & JVM heap size was monitored using App-Dynamics.
- Built PRTG tool to monitor 2000+ sensors (Includes Stage, UAT and Prod) by sending the error notifications through Slack. This includes Data centers and call centers.
- Regular disk management like adding and replacing hard drives in existing servers and workstations.
- Creating partitions per requirements, creating new file system or expanding the existing one over the hard drives and managing the file system.
- Assisting capacity planning, space and data maintenance activities for database systems.
- Building the servers per the IRS remediation and Audits are conducted frequently.
- Developing and executing all the required Scripts to maintain the system security using the IRS remediation’s and to automate the new Linux builds to match and maintain the required restrictions and remediation’s.
- Ensure all the Patches are up to date, and take backups to rollback if any RPMs are non-supportive by application.
- Manage complex migrations on older versions of Axway products to the latest versions and monitor the traffic of the SFTP jobs that run all day.
- Work with Avaya teams to maintain IVR and monitor the call flows.
- Evaluate new hardware for OS compatibility and determine the requirements to integrate new hardware.
- Extensively worked on Dell Physical Servers (R720, R730, EMC R820, R 0 and R940) and can troubleshoot any issues.
- Improving security by controlling the user privileges/roles and implementing profiles to match the security standards.
Environment: s: Terraform, AWS, Ansible, Kubernetes, Python, Docker, Git, Jenkins, Jira, Confluence, Grafana, Apache Webserver, Maven, VMware, Nimble Storage, Dell Servers
Confidential, Seattle
AWS DevOps Engineer
Responsibilities:
- Worked on migrating IBM environment to AWS using Terraform, Jenkins, Ansible, Git, Nexus
- Initially did provisioned lower environment manually using AWS console. Made use of services like EC2, S3, RDS, EFS, VPC, Route53
- Provisioned infrastructure using golden image provided by AWS by installing and configuring application-specific requirements like Java, NFS, Nfs-utils, Pacemaker for clustering
- Created s3 buckets in Dev environment and copied the objects from POC VPC account by adding bucket policies and gave individual access to the users, so that they can list, create, modify, delete the objects in that specific bucket
- Created DB instances by creating subnet group adding security groups specific to the database.
- Gathered information from DBA’s and worked with them in parallel to migrate the database.
- Created Elastic File System, which comprises in three Availability zones, so that we can mount on the component servers and create security group rules accordingly to get access to EFS only from that component.
- Created Security groups and added rules, which are specific to the application’s components.
- Created Application Load Balancers to route traffic to the URL specified in the target groups.
- Created Network Load Balancers, which comprises in two availability zones internally and enable cross-zone load balancing, by taking the any of the IP provided by the Eni for that load balancer, traffic evenly distributed across all the instances residing in different availability zones.
- Created mount points of EFS in two different AWS accounts with VPC peering connection established and by configuring security group for the instances associated, EFS and made the communication happen between two accounts
- Instantiated VPC peering connection between two VPC accounts and added routes to the subnets where the communication is needed to happen, created security groups in Source and Destination and configured it accordingly
- Added above created security group to the instances where the connection should establish and perform tests accordingly.
- Created additional network interface for monitoring purposes in a box and configured accordingly to make it work as intended.
- Default NAC card will route application traffic and an additional NAC card would be routing management traffic
- Used Ansible for configuration management, configured all the components of application by writing playbooks and executed it accordingly.
- Wrote Ansible playbooks to start/stop AWS instances at a specific time of the day which would be cost-effective for the firm.
- Created Ansible roles for Apache Tomcat, Java and other package installations.
- Created Terraform templates for all the infrastructure services by using Modules, Data Sources and resources provided by it.
- Wrote custom reusable templates there by other applications in our project may using use those modules by making minor changes and use it to provision infrastructure as needed
- Used Jenkins to integrate terraform, Git, and Ansible to deliver the infrastructure as per the architectural design.
- Created jobs in Jenkins to pull saved state files of terraform, ansible scripts from GitHub and provision infrastructure for both staging and production environments.
- Created pipeline jobs in Jenkins which include gathering build requirements and triggering the build using maven plugin, provisioning AWS infrastructure using Terraform, configuring the environment using Ansible, deploying the build into different environments and performing tests for each application.
- Added users to specific groups to provide ssh privileges to the servers, sudo privileges to the service user created.
- Resolved Jira tickets as part of the sprint which would be routed to team, tickets would be resolved on priority basis.
- Documented the work done by team in confluence pages.
Environment: AWS, Linux, Ansible, Jenkins, Terraform, Git, Nexus, Jira, Confluence, Citrix, IBM, Maven, Apache, Nginix
Confidential - Seattle, WA
Build and Release Engineer
Responsibilities:
- Worked on Release management, testing small enhancements developed by developers in Staging environment and pushed that code to Production.
- Updated Configurations in bitbucket as per the config guide provided by the release manager before taking the deployments on staging Environment.
- Monitored the issues aroused during the new code deployments using Splunk Dashboard
- Created slave nodes on Jenkins to reduce the load on master server
- Integrated service now with Jenkins there by notifications to the developers will be sent through emails and they can request for automated deployments using Service Now.
- Created Docker images and integrated in Jenkins to choose the node where the build should take
- Configured Docker images to build Angular code which requires npm and node version and running the image every time when developer triggers a build
- Gave permissions to users for accessing the apps folder and packages in CRX manager there by developers can make temporary changes to the environment and to author the content to the publish servers.
- Automated CMS servers to create replication agents thereby if the content is activated on author instances it will push across all the publish instances
- Used ad-hoc commands in Ansible to edit, modify and delete the files in the infrastructure
- Updated LDAP certificates across all the nodes which is used for authorization using Ansible
- Used Bash Scripts in Jenkins which will invoke RUBY scripts and Ansible plays to maintain the intended behavior
- Created playbooks and roles which will deploy Angular 2 and 4 code to all environments.
- Maintained 500 nodes which includes both lower and production environments in AWS
- Used ASG rigorously to increase/decrease the instances required for lower and production environments, while patching the environments (which would be happened on quarterly basis)
- Used AWS to warm up the environment at peak times and doing the code and content sync accordingly as required
- Created Launch configurations to update the AMI which has the latest patching and updating that LC in Cloud Formation
- Used AWS SDK to write the ruby scripts which will fetch the complete information of lower and production environments to make the changes on the stacks accordingly
- Used Service Now for regular operations by assigning top priority tickets to myself and resolving them before the deadline
- Integrated Service now with Jenkins to generate emails on the requests made my developers, which will send emails to concerned team/person on every successful/failure deployment on pipeline and will create a ticket to our team on every failed deployment to debug the issue
- Used Rally to create User Stories on issues aroused on the process of automated deployments and worked on debugging them on Priority basis
- Used IAM and EUI for authentication and authorization of the user to enter the application
- Used commerce servers to store the information related to products
- Created stacks which will include one WEB, one CMS and one Publish and make it functional before handing it over to the concerned team, a Jenkins job has been created to do this work which will create a Cloud formation template and version of code passed in through the parameters thereby the application will be in the intended state
- Worked on Incapsula (Web Access Firewall) whenever user hits our application it will route traffic to AWS through Incapsula to switch the production stacks and configured public facing staging environments.
- Used Bitbucket to store environment specific configuration, ruby scripts, playbooks, and developers' raw code
- Created pages in confluence which will populate current version codes of CMS, WEB in production along with .htaccess
- Created confluence pages that will generate the information of code and .htaccess for each lower environment
Environment: s: Linux, AWS, Ansible, Jenkins, Maven, Git, Bash, Python, Docker, Apache, Jira, Confluence, Service Now, Splunk
Confidential
System Administrator
Responsibilities:
- Trained and worked Primarily on RHEL6 Operating Systems.
- Assisted senior-level administrators in various aspects of Linux (Red Hat) server administration including installing and maintaining the operating system software, performance monitoring, problem analysis and resolution and production support.
- Assisted other Linux/UNIX administrators when help was needed (i.e. creating Linux/UNIX accounts, writing scripts to perform system administrator functions, responding to trouble tickets, etc.)
- Involved in preparation of functional and system specifications. Estimated storage requirements for applications.
- Perform primary Linux Server administration tasks, including setup, installation, OS patching, data backup, user account management and access control.
- Disk Management like adding and replacing of hot swappable drives on existing servers.
- Established and maintained user accounts, assigned file permissions and established password and account policies.
- Troubleshoot and resolved basic level system hardware, software and communication problems.
- Improve system performance by working with the development team to analyze, identify and resolve issues quickly.
- Monitored server and application performance and tuned I/O, memory and Installation of SSH and configuring of keys base authentication.
- Created Local Yum Repositories to support Package Management with Yum and RPM and Installed and configured secure FTP daemon to support a FTP-based Yum repository.
- Scheduled jobs and automating processes using CRON and AT and Created and maintained file systems and performed RAID configuration on LINUX.
- Monitored everyday systems and evaluate availability of all server resources and perform all activities for Linux servers.
- Managed and maintained user accounts and Configured and managed network interfaces.
Environment: Linux, Red Hat 4.x, SAN, NAS, Samba, Jira, Apache, Tomcat Web Sphere.
