Sr. DevOps Analyst Resume
Chicago, IL
SUMMARY:
- DevOps Engineer who designed and built private cloud infrastructures and container platforms.
- Migrated applications between bare metal, AWS, Azure and OpenStack.
- Planned, deployed and managed Infrastructure Automation from ground up.
- Full Stack Development, Application Builds, CI and CD.
- Operational support for AWS, Azure, OpenStack cloud and Kubernetes platforms.
- Built and managed monitoring and metric collection frameworks.
- Expertise in Object Oriented Programming, Server Virtualization, Operating System Administration, Physical and Virtual Networking.
- Expertise in IaaS and PaaS.
- Expertise in DevOps methodology, Application Management and Support.
- Project management activities in ITIL incident, problem, change and release management and Agile software development lifecycle.
- Good business knowledge in healthcare, finance, banking, trading and e-commerce domains.
TECHNICAL SKILLS:
Cloud Platforms: AWS, Azure and OpenStack
Container Platforms: OpenShift, Kubernetes (K8s) and Docker
Automation: Chef, Puppet, Azure ARM, CloudFormation, Cobbler and Ansible
Build and Deploy: Git, Stash (Bitbucket), Jenkins, Artifactory
Operating Systems: RHEL, CentOS, Ubuntu, Solaris and Windows
Programming Languages: Ruby, Python, Perl, UNIX Shell Scripting (Bash), PowerShell and SQL
Virtualization: VMware vSphere, KVM and VMware NSX
Databases: MongoDB, MariaDB, Couchbase and SQL Server
Storage Technologies: Ceph, Swift, NAS, SAN and Linux LVM
Hardware Platforms: Intel, Sun SPARC and IBM Power
Networking: VMware NSX, Open vSwitch, F5 BIG-IP load balancing
Monitoring and Metrics: Sensu, Nagios, Datadog, Prometheus, Observium, Graphite, Grafana, Collectd, FogLight, AppResponse Expert, App Internals Expert, SCOM, Tealeaf, Gomez
Logging: Splunk, Elasticsearch, Logstash and Kibana
Project Management: Microsoft Team Foundation Server (TFS) and Jira
PROFESSIONAL EXPERIENCE:
Sr. DevOps Analyst
Confidential, Chicago, IL
Responsibilities:
- Migrate MyQ IoT web application infrastructure to Azure.
- Implement new monitoring solutions like Prometheus and Grafana
- Implement new configuration management tools like ARM templates and Ansible.
- Improve core infrastructure components like RabbitMQ, Couchbase and MSSQL.
- Day-to-Day operational support and periodic maintenance.
- Training peers on various new implementations on monitoring, logging, configuration management etc.
- Stabilize Azure infrastructure by implementing custom metric collection with Log Analytics and Application Insights, and custom alerting around critical application components.
Cloud Platform Engineer
Confidential, Minneapolis, MN
Responsibilities:
- Build and operate OpenStack private cloud and OpenShift Kubernetes container platform on AWS.
- Infrastructure automation using Terraform, Cloud Foundry, Chef and Ansible.
- Build monitoring and metric gathering systems around OpenShift and OpenStack using Sensu, Grafana and Prometheus.
- Build Docker images; create build and deployment configurations to deploy applications on OpenShift.
- Plan and migrate applications from bare metal and AWS to OpenShift.
- Migrate Java, Tomcat, Node.js, Ruby and Python applications into Kubernetes.
- Patch, upgrade and scale OpenShift environment.
- Onboarding applications for Test, Stage and Prod environments.
- Build and operate an OpenStack private cloud from the ground up.
- Plan, design and migrate applications from bare metal and Azure to OpenStack.
- Build OpenStack control plane hosts on VMware.
- Build OpenStack KVM Hypervisors on physical hardware.
- Configure and maintain physical and virtual networks for OpenStack.
- Install and upgrade OpenStack components like Nova, Neutron, Keystone, Glance, Ceph and Swift.
- Implement a continuous integration system with Git, Stash and Jenkins.
- Install and maintain OpenStack testing and benchmarking frameworks using Rally and Tempest.
- Install, patch and upgrade VMware vCenter and ESXi hosts.
- Install and manage Sensu to monitor and gather metrics for all hosts.
- Install and manage Graphite and Grafana for metric collection.
- Install and manage Observium for monitoring network devices, Rack management controllers, ESXi and all other hosts that support SNMP.
- Write Sensu plugins, Collectd plugins for monitoring OpenStack performance and functionality.
- Maintain core components of OpenStack like RabbitMQ and MariaDB.
- Install and upgrade VMware NSX SDN.
- Manage package repositories on Artifactory and deploy them with Puppet.
- Configuration management using vCenter Orchestrator, Chef and Ansible.
- Provide a CI/CD platform using Jenkins for app teams to build and deploy Java, Node.js, Apache Tomcat, Ruby and Python apps.
- Handle releases and take part in on call rotation.
Application Management Advisor
Confidential, Round Rock, TX
Responsibilities:
- Providing L3 support for high and critical incidents for applications running on 4000+ physical servers and Azure virtual machines.
- Partnering with L2 in actively restoring the services.
- Working with various tools including Tealeaf, Gomez, ARX, AIX, Empirix, Splunk, SCOM, FogLight, SquaredUp, Riverbed, etc. for monitoring, troubleshooting and reporting on the production environment.
- Checking the production stability when data centers are switched during deployments and making sure that there are no monitor fallouts.
- Auditing applications periodically in areas like user access, OS and IIS versions, hardware infrastructure, network, security settings, certificates etc.
- Reporting weekly status on team progress, proactive goals and continuous improvement.
- Creating runbooks for disaster recovery, risk mitigation and critical incident handling.
- Working on process improvement in the areas of production support engagement, incident and defect handling, and converting reactive actions into proactive ones.
- Automating regular and time-consuming tasks using PowerShell.
- Improving builds using Jenkins build and deployment pipelines.
- Controlling and deploying releases and monitoring post-deployment stability as part of release management.
- Reviewing all install/rollback documentation for technical accuracy and providing approvals for change requests.
- Reviewing the test results of off-the-layer and on-the-layer applications and providing sign-off.
- Providing sign off for the changes over Go/No-Go calls after reviewing business and functional scope and impacts.
- Providing sign-off for operational readiness tracker items to move the CRQs towards implementation.
- Participating in Launch Checkpoint Calls to determine if there are any deviations from LOD and if rollback is warranted.
- Reviewing RCA and PKE (Problem Known Errors) drafts for High and Critical INCs.
- Providing approvals for closure of PKEs after validating the corrective actions.
- Working on logging standardization, automated alerting and other automations to monitor servers and apps.
Application Support Engineer
Confidential, San Jose, CA
Responsibilities:
- Providing Level 1 and Level 2 support to multiple applications related to healthcare and chemical engineering.
- Monitoring multiple applications running on Windows and Linux servers.
- Accessing Linux and Windows servers to fetch log files and configuration files.
- Worked with Remedy, HP Service Desk, WebEx, FileZilla, SQL tools, Windows RDP and Citrix.
- Maintaining KEDB and monitoring ticket queue for proper resolutions and updates.
- Automating monitoring and alerting systems using Bash scripts, VB and Perl.
- Perform application upgrades as per the given schedules with proper approvals.
- Administer and implement all new systems and ensure transition of plans to production.
- Monitor performance metrics for production systems, identify the root cause of technical issues and recommend solutions.
- Automated server build, management, monitoring and deployment solutions spanning multiple platforms, tools and technologies including Jenkins nodes/agents, Puppet and Ansible.
- Maintain scheduled jobs, troubleshoot processes and resolve issues.
- Troubleshooting issues and incidents raised by end users and requesting configuration changes.
- Implementing the code and configuration changes suggested by developers.
- Recreating issues reported by end users and recording them in detail in log files.
- Forwarding detailed log files to appropriate teams for further investigation.
- Performing outage activities, like log clean-ups, application recycle and running test transactions.
Operations Engineer
Confidential
Responsibilities:
- Managing the application for 6 different clients, running on 200+ UNIX, Linux and Windows servers.
- Providing support for Production, QA, Test and Dev environments for all 6 clients.
- Providing second and third level support in troubleshooting Eagle application and application server issues.
- Managing file system utilization on UNIX/Linux and Windows servers.
- Managing CPU and memory utilization on UNIX/Linux and Windows servers.
- Managing over 120 websites running on Apache and IIS over UNIX and Windows servers.
- Monitored, maintained, and controlled hardware and software configurations in classified network environment.
- Established and oversaw change request and implementation procedures for controlled items.
- Worked extensively on Eagle PACE, STAR, Message Centre and SRM applications.
- Deep knowledge of database, system design, and technical architecture of the application.
- Interacted with client implementation team to understand the requirements and create technical design for inward and outward interfaces to/from Eagle.
- Built server and application monitoring tools with Shell and Perl and configured alerting systems.
- Root cause analysis at application level and interacting with other teams for further investigations.
- Recycle web services to pick up changes made to configuration and related files.
- Monitoring different server status, events during critical business
- Recycling environments on a periodic basis.
- Troubleshooting external and internal ftp file transfers.
- File drop, retrieval, FTP etc.
- Worked with the ticketing tools BMC Remedy and HP ServiceCenter.
- Worked on Precise, Toad, SQL tools for monitoring and accessing databases.
- Worked on HPOV, Tivoli alerting systems.
- Worked on Harvest Version control system for code implementations and migrations.
- Certifying the application modules after upgrades and migrations.
- Coordinating with UNIX, Windows and network admin teams in case of core server issues.
- Handling change requests for various components of the Eagle application.
- Providing support for component migration for Eagle Infrastructure users and application developers.
- Preparation and updating of Knowledge Repository.
- Ensure team’s adherence to ITIL processes in the scope of the project viz. event, incident, problem management, request fulfilment, IT service continuity management, knowledge management, and risk management.
- Performing trend analysis of issues and working towards minimizing the number of incidents and/or defects through various methods.
- Constantly encouraging the team and initiating value-adds for the customer in the form of automation or other service improvement/transformation plans.
- Auditing tickets on a daily basis; preparing daily, weekly and monthly reports to track SLA adherence and to facilitate the monthly project review meeting with the stakeholders.
- Preparing the monthly SLA report, which is discussed during the monthly review call with management and the client team.
- Studying incident patterns and trends and working towards value-add initiatives for the customer and the company.
- Conducted knowledge sharing sessions within the project on ITIL framework, Six Sigma concepts and continuous service improvement model.
System and Network Administrator
Confidential
Responsibilities:
- System Administration on all Sun and RHEL servers in the Software Development Department.
- Performed all software installation and configuration.
- Tuned servers for maximum efficiency for Oracle and DB2 databases.
- Aided Oracle and DB2 database administrators in installing, building, refreshing, overlaying, removing and backing up databases.
- Wrote shell scripts in the Bourne, Korn, and bash shells.
- Supported internal users in mapping PCs and laptops to databases.
- Installing, configuring and maintaining the servers (DNS, NIS, NFS and SAMBA).
- Using Solaris and Linux LVM for dynamically extending or reducing partition size.
- User management with disk quotas and security settings.
- Ensuring maximum uptime of servers by timely installation of patches and service packs.
- Planning and implementation of disaster recovery for major servers.
- Configuring database servers and application servers for the software development teams.