Devops Engineer, Maintenance, Monitoring Resume
Irving, TX
PROFESSIONAL SUMMARY:
- Extensive experience in IT industry. Experienced as a Linux Systems Administrator work on Red Hat Linux, Solaris, SUSE Linux, CentOS, Fedora, Windows environment, VMware, vSphere and Software Configuration Management(SCM). And worked on DevOpsBuild/Release Management tools.
- Certified AWS Solutions Architect - Associate.
- Experience in Linux Administration ( Installation, Configuration, Tuning and Upgrades of Linux Red Hat and Oracle ) and Experience in Creation and managing user accounts, security, rights, disk space and process monitoring in Solaris and Red Hat Linux.
- Experience with Linux/UNIX environments and scripting for Build & Release automation.
- Good knowledge in AWS Cloud platform and its features which includes API's for Amazon (AWS) EC2, VPC, ELB, AMI, SNS, SQS, RDS, EBS, Cloud Watch, Cloud Trail, CloudFormation AWS Config, AutoScaling, CloudFront, IAM, Route S3 DNS, MangoDB, OPSwork with Cloud Sigma (EU) and GFS Storage, Openstack.
- Experienced in working on DevOps/Agile operations process and tools area (Code review, unit test automation, Build & Release automation, Environment, Service, Incident and Change Management) and specializing in Agile, Continuous Integration ( CI ), Continous Delivery ( CD ), Cloud, Infrastructure as Code, Infrastruture Provisioning, Orchestration, Monitoring, Alerting and Service level Dashboards.
- Extensively worked with automation tools like Hudson, Jenkins, Docker, Bamboo, Teamcity and Build Forge to implement the End-to-End Automation.
- Extensively worked with Version Control Systems GIT, CVS, SVN (Subversion),ClearCase Perforce, BitBucket and IBM Rational Team Concert.
- Provisioning and configuring servers running with Redhat Enterprise Linux, Ubuntu, CoreOS & OEL.
- Developing and deploying container orchestration platform using Apache mesos, Marathon, fleet cluster to deploy application containers and to manage applications running as Docker containers.
- Designing AWS cloud Formation templates for to deploy various AWS services which include EC2 instances, S3 buckets, VPC, Identity and access management (IAM).
- Creating Snapshots and amazon machine images (AMIs) of instances for backup and used them to create auto scaling and load balancing EC2 instances. Configuring Cloud watch alarms to monitor servers and application traffic.
- Expertise in using Build tools like MAVEN, ANT and GRADLE for the building of deployable artifacts such as war & ear from source code and NEXUS in Linux environment .
- Application Deployments & Environment configuration using Chef, Puppet and Ansible.
- Experience in integrating Unit Tests and Code Quality Analysis Tools like JUnit, Cobertura, Clover, PMD, find bugs and checkstyle and tracking systems like JIRA, Asana, Remedy and IBM ClearQuest.
- Experience in using Nexus and Artifactory Repository Managers for Maven builds.
- Experience writing and execution of SHELL, PHP, Python, BASH and PERL scripts to automate the deployments.
- Managed local deployments in Kubernetes, creating local cluster and deploying application containers .
- Good Experience in using APACHE TOMCAT, JBOSS, JETTY, Weblogic and WebSphere Application servers for deployments on Solaris and Linux boxes .
- Configured and Administered Nexus Repository Manager and JFrogArtifactory.
- Good communicative, Interpersonal Skills, team-working attitude, takes initiatives and very proactive in solving problems and providing best solutions. Able to work efficiently in both independent and team work environments.
- Experience supporting 24x7 production computing environments and providing on-call and weekend support.
TECHNICAL SKILLS:
Operating Systems: Linux (RedHat 5.x/6.x/7.x), UNIX, Solaris Release 2.10/2.9/2.8/2.7, IBM-AIX 5.X, zVM Windows 8,10 Windows 2000,2003, 2008, 2012, XP, VISTA and NT.
Version Control Tools: GIT, Stash, BitBucket and SVN.
Tools: Ansible, Docker, Mesos Marathon, Zabbix, Jenkins, Maven, ELK, Jira, Confluence, Kibana, NginxProgramming Languages: C, HTML, Bash and Python.
Web/ Application Server: Apache-Tomcat, My SQL.
Network Protocols: DNS, SNMP, SMTP, Ethernet, IOS, NFS, TCP/IP, tcpdump, FTP and TFTP, Wireshark.
Cluster Environment: Sun Cluster Servers 3.2, Redis, NDB, SwiftMQ, RabbitMQ.
Software: MS-Office/MS-Project.
Cloud: AWS - Ec2, S3, AMI, cloud Front, Dynamo DB, VPC, Cloudwatch, IAM, Opswork, SNS, SQS, SES, Workspace, EBS, RDS, CloudFormation AWS Config, AutoScaling.
Databases: My SQL, MariaDB, MongoDB, Gluster, Oracle, Hadoop, Cassandra.
PROFESSIONAL EXPERIENCE:
Confidential, IRVING, TX
DevOps Engineer, Maintenance, Monitoring
Envinorment: Python, Linux, Cassandra, MySql DB, MariaDB, Ansible, Marathon, Apache Mesos, DC/OS, RabbitMQ, GIT, JENKINS, TOMCAT, Nginx, Haproxy, AWS, Redis, DOCKER, Hadoop, Spark, Kubernetes, Zabbix, SwiftMQ, ELK Stack, CONFLUENCE, JIRA, Bash, Stash, BitBucket and JFrog Artifactory.
Responsibilities:
- Manage containerized cloud applications under Laniakea platform in production with more than 50+ micro services components with 40 TB of data ingest every day.
- Test and deploy latest builds of application. Create YAML’s and JSON’s required for the deployment to pull docker container images from antifactory and push them to mesosphere DC/OS to automate deployment.
- Orchestrate and mange containerized application through Marathon Mesosphere with underlying zookeeper and Mesos master/slave architecture. Scale applications as per the load in a timely manner.
- Create AWS Cloud formation templates to build instance, setup storage, DNS, Database, RabbitMQ and networking to analyzing images and Train on AWS Rekognition Machine Learning service.
- Used AWS Kubernetes Service to deploy a managed Kubernetes cluster in AWS and created an EKS cluster.
- Designing Ansible playbooks to manage fleet cluster, systemd and etcd service units. Creating Ansible playbooks for CoreOS and firmware upgrades, to deploy services on to the servers.
- Deploying and configuring ELK stack, Mesos agents, haproxy’s, server mounts and user administration using Ansible playbooks.
- Containerize portable software’s using technologies like Docker.
- Debug and analyze production issues and monitor applications in Zabbix.
- Debug application logs in real time and analyze historic logs stored in Hadoop using Spark whenever required.
- Maintain and upgrade containerized Redis cluster across all the datacenters.
- Administrating Oracle and MySQL NDB clusters running as Docker containers and Perform regular backups and fail over tests for databases to maintain stable environment for applications.
- Building Cassandra clusters across datacenters and Managing replication factors for storing and retrieving data from clusters and also working on replacement of failed drives in cluster without any data losses.
- Perform frequent health checks and troubleshooting the nodes in cluster in case of failure.
- Designed and deployed container-based production clusters using Docker, Kubernetes and OpenShift.
- Retrieve Java application stack trace using JStack utility and debug the tomcat logs.
- Manage and monitor Cassandra cluster. Query and retrieve data from Cassandra Keyspaces using cqlsh commands. Check cluster health status using nodetool and perform operations like enabling gossip, thrift, binary and add or remove node from the token ring.
- Manage MySQL database cluster configured with NDB engine. Use MySQL MGM to move MySQL node to healthy server whenever a node is the cluster in down.
- Manage python and bash shell script to perform activities such health checks ove REST API, enable trigger and perform application specifics backup, and schedule jobs in crontabs.
- Work on Nginx and HAProxy for load balancing and tcp proxies respectively.
- Work on managing Memcache, Hazelcast, SwiftMQ and RabbitMQ clusters.
- Create Kibana dashboards and elastalerts for the application logs on Beehive Platform applications.
- Perform load test on storage solutions such as glusterFS, NFS, Infinidat and Wasabi to migrate current storage for all the application components.
- Implementing Zabbix as a monitoring solution, depending on project requirements.
- Performs daily system monitoring, verifying the integrity and availability of all hardware, server resources, systems and key processes, reviewed system and application logs and verified completion of scheduled jobs such as backups.
- Perform regular security monitoring to identify any possible intrusions and find problems on systems well before impacting the Million users.
- Creating the automated build and deployment process for application, re-engineering setup for better user experience, and leading up to building a continuous integration system for all our products.
- Implementing new projects builds framework using Jenkins & maven as build framework tools.
- Implementing a Continuous Delivery framework using JIRA, Jenkins, Git, Bamboo, Ansible, Maven and Nexus in Linux environment.
- Implementing a globally distributed monitoring practices with Proxies and Agents to manage the monitoring load across globe.
- Built a new CI pipeline, testing and deployment automation with Docker and Jenkins.
- Building the source code using GIT and Jenkins and manage Git hub Projects and troubleshooting Git related problem.
- Designing, configuring and troubleshooting Elastic search clusters to Manage server logs and generate reports in Kibana using Logstash for to analyze issues with server hardware.
- Maintaining the 12 Data Centers and checking each Datacenter Health-Check Hourly.
- Created CM Plans and Build Documentation while maintaining source code.
- Collaborating with leads from DEV, QA, PM, DevOps and other functional groups to implement Agile SDLC and Continuous Integration (Build -> Test -> Deploy -> Report).
- Developed chef cookbooks and recipes for various services like ntp, ldap, dns, mysql, java, tomcat and tooling for encrypting chef data bags with out access to chef server.
- Doing Load Tests and Performance tests to identify the threshold limit and breaking point of Monitoring tool.
- Experience with Zabbix/Observium/NewRelic monitoring and alerting services for servers, switches, applications and services.
- Created recipes in cookbooks to automate system operations and Created multi AZ VPC instances.
- Integration of Maven/Nexus, Jenkins Urban Code Deploy with Patterns/Release, Git, Confluence, Jira and Cloud Foundry.
- Configuring the Monitoring tool for MySQL database.
- Container management using Docker by writing Docker files and set up the automated build on Docker HUB and installed and configured Kubernetes.
- Scripting in multiple languages on UNIX, LINUX and Windows - Batch, Shell script python and Java Script.
- Improvised Zabbix API calls to show the on-demand status for the requested machines on GE's internal application.
- Work with the offshore team to coordinate all the changes made and assist team in resolving the issue during their time when ever required.
- Participating in 24x7 production on-call support of Linux and provided technical support to users.
- Expertise in converting AWS existing infrastructure to server less architecture( AWS Lambda , Kinesis ) and deployed.
- Responsible for Installation of applications on AWS cloud and implemented AWS solutions using S3, EC2, RDS, EBS, Elastic Load Balancer, Auto-scaling groups.
- Involved in designing and deploying multi-tier applications using all the AWS services like ( EC2 , Route53 , S3 , RDS , Dynamo DB , SNS , SQS , IAM ) focusing on high-availability, fault tolerance, and auto-scaling in AWS Cloud formation.
- Provisioned AWS resources using management console as well as Command Line Interface ( CLI ).
- Used the AWS-CLI to suspend an AWS Lambdafunction processing an Amazon Kinesis stream, then to resume it again.
Confidential
Digital Media Technology On LINUX Environment
Envinorment: RedHat Linux (RHEL 4/5), HP-DL585, Oracle 9i/10g, Samba, VMware, Tomcat, Apache Server, Bash, Red Hat Cluster Servers, CentOS, MySQL, Jenkins, DNS, and NFS.
Responsibilities:
- Administered, maintained Red Hat 3.0/4.0/5.0, AS, ES, Troubleshooting Hardware, Operating System Application & Network problems and performance issues; Deployed latest patches for, Linux and Application servers, Performed Red Hat Linux Kernel Tuning.
- Installing, configuring and administering Jenkins CI tool on Linux machines.
- Administered Linux servers for several functions including managing Apache/Tomcat server, mail server, MySQL databases in both development and production.
- Experience in implementing and configuring network services such as HTTP, DHCP and TFTP.
- Install and configure DHCP, DNS (BIND, MS), web (Apache, IIS), mail (SMTP, IMAP, POP3), and file servers on Linux servers.
- Worked closely with software developers and DevOps to debug software and system problems.
- Installation of patches and packages using RPM and YUM in RedHat Linux.
- Installed and configured SAMBA server for Windows and Linux connectivity.
- Installing and setting up Oracle9i on Linux for the development team.
- Migrated database applications from Windows 2000 Server to Linux server.
- Linux kernel, memory upgrades and swaps area. Red hat Linux Kickstart Installation.
- Installed and configured Virtual Network Computing (VNC) server/client.
- Monitored System Activities like CPU, Memory, Disk and Swap space usage to avoid any performance issues.
- Provided periodic feedback of status and scheduling issues to the management.
- Worked with DBA team on installation of RDBMS database, network related issues on Linux Servers and on restoration and log generation.
- Creation of Logical Volume Manager (LVM) for Linux operating systems.
- Involved in design, configuration, installation, implementation, management, maintain and support for the Corporate Linux servers RHEL 4x / 5 . x, SLES 9, CENTOS 5.x .
- Maintained proper documentation of all the activities carried out during the project.
- Sole person to be responsible for Linux environment control process.
- Involved in periodic archiving and storage of the source code for disaster recovery.
- Coordinating users for any server activities which may involve major changes on software or any hardware related issues.
- Developed, maintained, and distributed release notes for each scheduled release.
- Coordinating with 24x7 on-call support personnel in debugging.