Devops Engineer/site Reliability Engineer Resume
PROFESSIONAL SUMMARY:
- 8+ years of experience in Systems Administration, DevOps methodologies, CI/CD, Configuration Management & Release management.
- Experienced in using Version Control Tools Git, SVN
- Expertise in Configuration Management tools with Puppet,Ansible & Chef
- Extensively used build tools like Maven,ANT for building of jar,war files.
- Experienced on Network monitoring with Nagios, Yamas & icinga in a large environments
- Experienced with working on Docker hub, Creating Docker images and handling multiple images primarily for middleware installations and domain configuration.
- Experienced in collaborating with other engineers on code reviews, internal infrastructure improvements and process enhancements.
- Experienced in managing artifacts generated by Maven in the Nexus repository.
- Have Good troubleshooting skills to find the Root causes, Experienced in using the logging tools like Splunk
- Excellent knowledge on implementation & monitoring of AWS cloud environment( EC2,S3,IAM,ELB)
- Good Knowledge of Networking (TCP/IP) protocols, Subnets interfacings.
- Well Experienced on LVM manager, supporting LAMP stack
- Excellent knowledge on Public Key Infrastructure (SSH Key Exchange, SSL Certificates, Cryptographic Principles)
- Experienced in Installing, Configuring and managing Redhat Enterprise Linux(RHEL)
- Well Experienced on performing Kickstart Installations
- Installed and configured DNS, DHCP, FTP and send mail servers.
- Experienced in Developing the Automation scripts using Shell & Python Scripts.
- Experienced with Job schedulers (Crontab, Autosys)
- Good Experience with Remedy, Jira, Servicenow, Confluence & Opsgenie
- Experienced with Agile Project Methodology
- Willing to work on flexible hours for on - call support for 24/7 environment.
- Good Communication Skill and self-motivated with a high degree of attention.
TECHNICAL SKILLS
Operating Systems: Redhat 5/6, SUSE 9/10, CentOS 4/5, Solaris 9/10, Windows
SCM Tools: Git,GitHub,SVN
DevOps tools: Ansible,Puppet,Chef, Maven,Ant & Jenkins
Scripting: Python,Perl,Shell
Middleware: Apache Tomcat, Web Logic, Kafka,Jetty
Language support: C,C++,Java,Ruby
Database: MySQL, Oracle, Hadoop
Cloud & VM Services: AWS, Openstack, VMWare
Other tools: Nagios,Icinga,Yamas,Confluence,Splunk,Jira,Servicenow,Opsgenie
PROFESSIONAL EXPERIENCE
Confidential
DevOps Engineer/Site Reliability Engineer
Responsibilities:
- Maintaining & Automating the Infrastructure of High Trafficking Environment & Maintain 99.999% uptime of LINUX production servers and follow proper escalation.
- Setup CI/CD environment with Puppet, Ansible, Jenkins, Maven,Nexus for different properties to automate application deployments for Linux platforms
- Created and updated Puppet manifests and modules, files, and packages stored in the GIT
- Used M-collective for Parallel jobs execution & monitored Puppet Dashboard
- Identifying user and revenue impacting incidents to resolution and using critical knowledge of Linux/Unix Processes & other technologies with in the OSI stack
- Worked on release deployment, branching, tagging for prod and non-prod deployment
- Co-ordinating daily Releases of new tags for different Components
- Experience with GIT to create repositories for multiple projects and create branches based on development model
- Monitoring Health Checks and code quality using SonarQube
- Using TCP/IP,HTTP, SMTP, Load-Balancers & networking concepts to resolve networking issues.
- Experience with Automation of Build Docker images,Have deployed various Applications through Docker Containers and maintaining different versions in private registries.
- Worked on infrastructure with docker containerization and Kubernetes.
- Automated new service packages installation using Ansible.
- Wrote Ansible Playbooks for Applications deployment
- Configuring Threshold levels to each component for Alert management
- Monitoring the AWS infrastructure, Created & terminated multiple EC2 instances
- Managing Amazon Web Services like S3 bucket, IAM through AWS Console
- Working on Hadoop clusters
- Hands on experience with web servers like httpd and Nginx.
- Spinning up new boxes(bare metal, VM) on Openstack using Nova tool, installing packages on new hosts and bringing them to rotation
- Using Splunk for monitoring System/Application logs to detect Prod/Non-Prod errors etc.
- Handling host level CMR’s for maintenance.
- Implementing scripts to Automate live environment processes using shell & python.
- Implemented Python script for host inventory tool purpose.
- Used Apache Kafka for building real-time streaming data pipelines that reliably get data between systems and applications
- Upgrading the Security Kernels on Ubuntu servers
- Monitoring of Linux, Solaris servers using tools like vmstat, iostat, SAR, top, free etc.
Environment: Linux, DevOps, Puppet, Ansible, Git, Yamas, icinga, Shell, Python, Openstack, AWS, Opsgenie, Jira,Nexus,SonarQube,Splunk,Servicenow.
Confidential, AR
DevOps Engineer
Responsibilities:
- Developed build and deployment processes for Pre-production environments
- Coordinating with development teams,QA teams,IT support teams to ensure smooth releases.
- Implemented build scripts,continuous integration infrastructure and deployment tools to build a continuous delivery pipeline.
- Used Puppet modules for server provisioning, configuration, release and deployment automation.
- Installed/Configured/Managed Puppet Master/Agent. Wrote custom Modules and Manifests, downloaded pre-written modules.
- Worked with Jenkins continuous integration tool and Github repository hosting services.
- Responsible for Release management activities, handling releases, close coordination in test execution, Defect management and control.
- Worked with JIRA for Issue Tracking..
- Branching, Merging, and tagging with best practices with release cadences.
- Coordinate/assist developers with establishing and applying appropriate branching, labelling/naming conventions using GIT source control.
- Used Ant and Maven as build tools and wrote build.xml and pom.xml respectively.
- Worked on automation of middleware configuration work through Chef.
- Managed Nodes, Run lists, roles,environments,data bags,cookbooks, recipes in Chef.
- Triage the Blockers and issues with respective teams and resolve the issues.
- Work closely with Developers and testers to improve the automated frame works like Junit with analysis of code coverage reports.
- Installed and configured Apache, MySQL and PHP products on the Linux servers
- Installation & Configuration of software packages in Redhat Linux, HP-UX, AIX, Solaris
- Applied patches and packages on Linux servers using rpm and yum tools
- Implemented the monitoring tool Nagios 3.0 to monitor services like CPU, Hard Drive, Memory, Users, HTTP, and SSH etc.
- Wrote shell scripts to automate the insertion of software usage and MySQL database systems
- Shell scripting to customize and automate system installations
- Responsible for setting up cron jobs scripts on production servers.
- Responsible for supporting LAMP Stack, TCP/IP, IPSec
- Modified Kernel parameters to improve the server performance in Linux.
- Responsible to implement the Firewall rules
- Performing security audits and remediating the Vulnerabilities.
- Familiar and Experienced with Agile Scrum Development
Environment: RHEL 5.x/6.x, Puppet,Chef,Jenkins,Nexus,Git,Maven,Ant,Shell,Python, Splunk, Nagios, Jira, venafi, Remedy.
Confidential, Atlanta, GA
Linux/Unix Systems Administrator
Responsibilities:
- Worked on Red hat Enterprise Linux 3/4/5, SUSE Linux 9/10/11, and Ubuntu (8.04 & 9.04). Centos 7.
- Worked on Applying patches, installed monitoring tools, and upgraded BIOS on HPC Clusters.
- Installation and configuration of Linux servers using kick start & support for various applications, which runs on Linux environment followed by performance tuning, patching and maintenance.
- Designing, Implementation, Configuring, & Managing RHEL server.
- Working knowledge on Apache, Tomcat and Jboss.
- Supporting 300+ Linux servers in production/stage/development environment.
- Creating VMware Lab setup as per Internal QA team's requirement using VMware ESXi servers.
- Worked on VMware Virtual Center Server for centralizing ESX and ESXi servers.
- Support and troubleshooting of Linux Apache, MySQL. Have knowledge on MySQL, Oracle Database Administration.
- Created SQL queries to perform insert,delete or update operations on Oracle 11g database.
- Setup and configured network TCP/IP at all levels on Linux servers
- Implemented Domain Name Server by Using Primary and Secondary DNS Servers.
- Installed and Configured DNS, DHCP, NFS, NIS, http, tftp and other network services.
- Worked with object oriented software development methodologies and practices across all phases of the Software Development Lifecycle.
- Worked on Unix/Linux user's creation, file/dir. level permissions, Sudo permissions etc.
- Worked on Updating/patching Linux Servers with latest rpm packages.
- Extensively Worked on Apache webservers and Nagios.
- 24x7 support for the LINUX servers.
Environment: Red hat Linux (3.x, 4.x, 5.x), SUSE (9.x, 10.x, 11.x), Ubuntu (8.04, 9.04),Tomcat, J boss, VMware, XEN, KVM, ESX, My SQL,Oracle 11g, Apache High Availability, CISCO Devices, Subversion, Nagios.
Confidentia
Systems Administrator
Responsibilities:
- Security and maintenance of UNIX, Solaris, HP-UX, RHEL Servers
- Performed LVM Administration (Working with Logical Volume Manager and creating of volume groups/logical)
- Installed and configured Apache / Tomcat web server.
- Working on daily Incident tickets, General requests (Memory, CPU performance, sudo Access)
- Repo management for patching RHEL and SUSE servers.
- Worked on SSL certificates
- Scheduling the Jobs using Autosys, monitored the jobs status, running batches
- Installation & Configuration of software packages in RedHat Linux Using both yum and rpm
- Used various networking tools such as ssh, sftp, scp, netstat, winscp, telnet, rlogin, ftp and ping to troubleshoot daily issues.
Environment: RHEL, Unix, LVM, Shell scripting, Remedy, Autosys, VMWare