Cloud Engineer Resume
SUMMARY
- 14 years of overall experience in UNIX-based systems administration and application management, with project and people management skills
- Extensive Systems Administration & troubleshooting experience in Red Hat Enterprise Linux
- Experience in AWS Cloud environment
- Experience in VMware ESXi Administration
- Experience in Red Hat Cluster Suite Administration
- Experience in BASH Shell Scripting for Unix environment
- Experience in AWS CloudFormation, Docker containers, GitHub, Jenkins and Maven
- Intuitive analytical ability to attain network, system and application level solutions
- Committed to continuous skill development of self and team members for the best team results
- Good knowledge of ITIL processes
- Strong analytical, problem solving & organizational abilities
TECHNICAL SKILLS
Operating System: Red Hat Enterprise Linux
Cluster Software: Red Hat Cluster Suite with GFS
Virtualization/Cloud Technology: VMware and AWS
Configuration Management: Ansible
Scripting: Shell (BASH)
Webserver: Apache (httpd)
Container Technology: Docker
Continuous Integration: Git, Jenkins and Maven
Log Management/Monitoring: Splunk
Patch Management: Red Hat Satellite
PROFESSIONAL EXPERIENCE
Confidential
Cloud Engineer
Responsibilities:
- Launching Amazon EC2 cloud instances using Amazon Web Services (Linux/Ubuntu) and configuring launched instances for specific applications.
- Implemented AWS solutions using EC2, S3, EBS, Elastic Load Balancer and Auto Scaling groups.
- Leveraged AWS cloud services such as EC2, Auto Scaling and VPC to build secure, highly scalable and flexible systems that handled expected and unexpected load bursts.
- Designing and implementing 3-tier architecture with Web, Application and Database servers using Elastic Load Balancer (ELB), with DNS registered in Route 53.
- Designed roles and groups using AWS Identity and Access Management (IAM).
- Configured security groups, network ACLs, Internet Gateways and Elastic IPs to provide a secure environment for the organization in the AWS public cloud.
- Configuring alarms using the CloudWatch service to monitor server performance, CPU utilization, disk usage, etc., and to take recommended actions for better performance.
- Designed, Installed and Implemented Ansible configuration management system.
- Provided support for on-premises Linux servers in a VMware environment and performed security hardening of servers to meet PCI guidelines.
- Maintaining Red Hat Cluster Suite for NFS Mount point.
- Performing system administration and application-related OS-level tasks, and providing provisioning support for the application servers.
- Installing and Configuring ESXi Servers; Perform Network and Storage Configuration
- Setting up Host Profiles for configuration management of ESXi hosts, configuring HA/DRS, vMotion and Storage vMotion, and creating templates for Red Hat Linux and Windows VMs.
- Using VMware Update Manager to create or edit baselines and baseline groups for patch management.
- Involved in setting up the CI/CD pipeline using Jenkins, Maven, GitHub and Ansible on AWS.
- Installed Jenkins on a Linux machine and created a master and slave configuration to run multiple parallel builds through a build farm.
- Installed and Administered Jenkins CI for Maven Builds
- Used Ansible to manage web applications, environment configuration files, users and packages.
- Implemented a CloudFormation script for app server provisioning.
- Automated nightly builds and daily tasks using scripting and Jenkins.
- Coordinated with and assisted developers in establishing and applying appropriate branching and labeling/naming conventions using Git source control.
- Integrated Git into Jenkins to automate the code check-out process.
- Wrote shell and Maven scripts and installed Jenkins for end-to-end build and deployment automation.
- Monitoring Live Traffic, logs, Memory utilization, Disk utilization and various other factors which are important for deployment.
- Provide daily monitoring, management, troubleshooting and issue resolution to systems and services hosted on cloud resources
- Configured Splunk forwarders on all the Linux servers.
- Gathering new requirements and working with Splunk admin to test and deploy new apps.
Confidential
Application Support - Customer Engineer
Responsibilities:
- Worked as an application support engineer for applications that handle $90B worth of transactions every year for many public sector and government clients in a PCI-compliant environment.
- Provided enterprise application support for Java-based applications on Linux-based operating systems with Oracle and MySQL databases. This support covered deployments, resolution of incidents and problems based on ITIL processes, and handling of service requests.
- Worked on applications that process financial transactions such as payments, debits and adjustments. The system interacts with POS devices, third-party processors, banks, web services, etc. for transaction delivery and processing. These transactions use the ISO 8583 standard, so I have deep knowledge of the different implementations and customizations for the different products and LOBs.
- Responsible for providing software support for all customer-escalated issues and bugs, with ongoing research, debugging and subsequent release and deployment of product updates.
- Deploy and support applications into a high availability environment.
- As part of the system, the applications generate reports and files that have to be delivered to different clients. Managed and handled this delivery using a file movement tool.
- Installed and configured Java based application on Linux.
- Worked on implementations of new projects, migration of solutions to new systems and/or architectures, upgrades, certification of new functionality, setup of new environments, etc.
- Coordinated with multiple teams (Dev and QA) and ensured major and minor releases were successfully deployed to production.
- Prepared release documents and conducted pre-release meetings with the Dev team.
- Handled Change Approval Meetings with business stakeholders and LOB.
- Handled RCA for repeated issues and applied fixes to all affected programs.
- Monitored the ticket queue and provided solutions within the SLA.
- Involved in weekly release and maintenance builds for supported programs.
- Provided production support in an on-call rotation in a 24x7 environment. Excellent client relation skills and the drive to complete tasks effectively and efficiently where customer service and technical skills are demanded.
- Participated in special project teams for development or corporate initiatives as such projects arose.
- Set up the DR site and coordinated with the State on DR drill tests.
- Automated manual jobs using scripts wherever possible.
- Coordinated with and gathered updates from the offshore teams (China and India).
- Represented the application team in the PCI DSS audit.
- Worked with Splunk team to enable application monitoring.
- Developed Bash shell scripts for admin tasks such as customizing user environments and performance monitoring; responsible for cron job management.
- Queried and updated data using SQL statements as required.
- Worked continuously with the system support team to clean up unused virtual and physical servers, which helped the customer save $200K+ annually.
Confidential
Offshore Lead
Responsibilities:
- Development, installation, administration, and maintenance of Linux based infrastructure.
- Built Linux servers and configured networks, both physical and virtual.
- Performed user administration on RHEL and managed privileges using sudo.
- Configured and Managed Application High Availability using Red Hat Cluster Suite.
- Managed File systems using Logical Volume Manager.
- Upgraded the Red Hat Linux version to keep the system up-to-date.
- Automated the server / cluster health check reports.
- Monitored System Performance of Virtual memory, System events, Swapping, Disk utilization, CPU utilization by running scripts.
- Created Shell Scripts for automating repetitive System Administration tasks.
- Worked with Hardware team on fixing hardware related issues.
- Proactively identified areas of improvement and helped streamline operations.
- Closely monitored and controlled the production environment to ensure stability and uptime.
- Provided 24x7 on-call production support; provided extended availability as needed.
- Responsible for reviewing all open tickets and resolving and closing any existing tickets.
- Conducted Root Cause Analysis (RCA) on Severity 1 issues.
- Ensured Performance, Scalability, Availability and Reliability of servers as agreed.
- Documented solutions for any issues that had not been encountered previously.
- Scheduled and executed regular system management activities, including system reboot, backup, recovery, Patching (OS & Firmware level), archiving and restoration.
Confidential
Tech Lead
Responsibilities:
- Red Hat Linux System Administration, Configuration and Troubleshooting.
- Installed RHEL OS on Standalone servers using kickstart installation.
- Performed user administration on RHEL, managed using PowerBroker.
- Installed and configured NFS, NIS, FTP and CUPS.
- Installed Packages and Patches as per the requirement of projects and security needs.
- Scheduled automatic, repetitive jobs using crontab.
- Performed file system cleanup and disk utilization activities; extended file systems using Logical Volume Manager.
- Monitored System Performance of Virtual memory, System events, Swapping, Disk utilization, CPU utilization by running scripts.
- Maintained availability and coordinated with Hardware team to increase capacity & performance on production machines by upgrading their hardware (disks, CPU, memory, etc) & firmware.
- Maintained Shell Scripts for automating repetitive System Administration tasks.
- Worked through the iLO console for server reboots, maintenance and boot issues.
- Participated in on-call rotation.
- Managed system logging services, examined system log files for all system events, and worked with Red Hat through support tickets to resolve server issues.
- Responsible for reviewing all open tickets and resolving and closing any existing tickets.
- Conducted Root Cause Analysis (RCA) on Severity 1 issues.
- Documented technical configurations and procedures.