Sr. DevOps Analyst Resume

Chicago, IL

SUMMARY:

  • DevOps Engineer who designed and built private cloud infrastructures and container platforms.
  • Migrated applications between bare metal, AWS, Azure and OpenStack.
  • Planned, deployed and managed infrastructure automation from the ground up.
  • Full Stack Development, Application Builds, CI and CD.
  • Operational support for AWS, Azure, OpenStack cloud and Kubernetes platforms.
  • Built and managed monitoring and metric collection frameworks.
  • Expertise in object-oriented programming, server virtualization, operating system administration, and physical and virtual networking.
  • Expertise in IaaS and PaaS.
  • Expertise in DevOps methodology, Application Management and Support.
  • Project management activities in ITIL incident, problem, change and release management and Agile software development lifecycle.
  • Good business knowledge in healthcare, finance, banking, trading and e-commerce domains.

TECHNICAL SKILLS:

Cloud Platforms: AWS, Azure and OpenStack

Container Platforms: OpenShift, Kubernetes (K8s) and Docker

Automation: Chef, Puppet, Azure ARM, CloudFormation, Cobbler and Ansible

Build and Deploy: Git, Stash (Bitbucket), Jenkins, Artifactory

Operating Systems: RHEL, CentOS, Ubuntu, Solaris and Windows

Programming Languages: Ruby, Python, Perl, UNIX Shell Scripting (Bash), PowerShell and SQL

Virtualization: VMware vSphere, KVM and VMware NSX

Databases: MongoDB, MariaDB, Couchbase and SQL Server

Storage Technologies: Ceph, Swift, NAS, SAN and Linux LVM

Hardware Platforms: Intel, Sun SPARC and IBM Power

Networking: VMware NSX, Open vSwitch, F5 BIG-IP load balancing

Monitoring and Metrics: Sensu, Nagios, Datadog, Prometheus, Observium, Graphite, Grafana, collectd, Foglight, AppResponse Xpert, AppInternals Xpert, SCOM, Tealeaf, Gomez

Logging: Splunk, Elasticsearch, Logstash and Kibana

Project Management: Microsoft Team Foundation Server (TFS) and Jira

PROFESSIONAL EXPERIENCE:

Sr. DevOps Analyst

Confidential, Chicago, IL

Responsibilities:

  • Migrate MyQ IoT web application infrastructure to Azure.
  • Implement new monitoring solutions like Prometheus and Grafana.
  • Implement new configuration management tools like ARM templates and Ansible.
  • Improve core infrastructure components like RabbitMQ, Couchbase and MSSQL.
  • Day-to-Day operational support and periodic maintenance.
  • Train peers on new implementations in monitoring, logging, configuration management, etc.
  • Stabilize Azure infrastructure by implementing custom metric collection with Log Analytics and Application Insights, and custom alerting around critical application components.

Cloud Platform Engineer

Confidential, Minneapolis, MN

Responsibilities:

  • Build and operate OpenStack private cloud and OpenShift Kubernetes container platform on AWS.
  • Infrastructure automation using Terraform, Cloud Foundry, Chef and Ansible.
  • Build monitoring and metric gathering systems around OpenShift and OpenStack using Sensu, Grafana and Prometheus.
  • Build Docker images; create build and deployment configurations to deploy applications on OpenShift.
  • Plan and migrate applications from bare metal and AWS to OpenShift.
  • Migrate Java, Tomcat, Node.js, Ruby and Python applications into Kubernetes.
  • Patch, upgrade and scale OpenShift environment.
  • Onboarding applications for Test, Stage and Prod environments.
  • Build and operate OpenStack private cloud from the ground up.
  • Plan, design and migrate applications from bare metal and Azure to OpenStack.
  • Build OpenStack control plane hosts on VMware.
  • Build OpenStack KVM hypervisors on physical hardware.
  • Configure and maintain physical and virtual networks for OpenStack.
  • Install and upgrade OpenStack components like Nova, Neutron, Keystone, Glance, Ceph and Swift.
  • Implement a continuous integration system with Git, Stash and Jenkins.
  • Install and maintain OpenStack testing and benchmarking frameworks using Rally and Tempest.
  • Install, patch and upgrade VMware vCenter and ESXi hosts.
  • Install and manage Sensu to monitor and gather metrics for all hosts.
  • Install and manage Graphite and Grafana for metric collection.
  • Install and manage Observium for monitoring network devices, rack management controllers, ESXi and all other hosts that support SNMP.
  • Write Sensu and collectd plugins for monitoring OpenStack performance and functionality.
  • Maintain core components of OpenStack like RabbitMQ and MariaDB.
  • Install and upgrade VMware NSX SDN.
  • Manage package repositories on Artifactory and deploy them with Puppet.
  • Configuration management using vCenter Orchestrator, Chef and Ansible.
  • Provide a CI/CD platform using Jenkins for app teams to build and deploy Java, Node.js, Apache Tomcat, Ruby and Python apps.
  • Handle releases and take part in on-call rotation.

Application Management Advisor

Confidential, Round Rock, TX

Responsibilities:

  • Providing L3 support for high and critical incidents for applications running on 4000+ physical servers and Azure virtual machines.
  • Partnering with L2 in actively restoring the services.
  • Working with tools including Tealeaf, Gomez, ARX, AIX, Empirix, Splunk, SCOM, Foglight, SquaredUp and Riverbed for monitoring, troubleshooting and reporting on the production environment.
  • Checking the production stability when data centers are switched during deployments and making sure that there are no monitor fallouts.
  • Auditing applications periodically in areas like user access, OS and IIS versions, hardware infrastructure, network, security settings, certificates etc.
  • Reporting weekly status on team progress, proactive goals and continuous improvement.
  • Creating runbooks for disaster recovery, risk mitigation and critical incident handling.
  • Working on process improvement in the areas of production support engagement, incident and defect handling, and converting reactive actions to proactive ones.
  • Automation of regular and time-consuming tasks using PowerShell.
  • Improved builds using Jenkins build and deployment pipelines.
  • Controlling and deploying releases and monitoring post-deployment stability as part of release management.
  • Reviewing all install/rollback documentation for technical accuracy and providing approvals for change requests.
  • Reviewing the test results for off-the-layer and on-the-layer applications and providing sign-off.
  • Providing sign-off for changes on Go/No-Go calls after reviewing business and functional scope and impacts.
  • Providing sign-off for operational readiness tracker items to move the CRQs towards implementation.
  • Participating in Launch Checkpoint Calls to determine if there are any deviations from LOD and if rollback is warranted.
  • Reviewing RCA and PKE (Problem Known Errors) drafts for High and Critical INCs.
  • Providing approvals for closure of PKEs after validating the corrective actions.
  • Working on logging standardization, automated alerting and other automations to monitor servers and apps.

Application Support Engineer

Confidential, San Jose, CA

Responsibilities:

  • Providing Level 1 and Level 2 support to multiple applications related to healthcare and chemical engineering.
  • Monitoring multiple applications running on Windows and Linux servers.
  • Accessing Linux and Windows servers to fetch log files and configuration files.
  • Worked with Remedy, HP Service Desk, WebEx, FileZilla, SQL tools, Windows RDP and Citrix.
  • Maintaining KEDB and monitoring ticket queue for proper resolutions and updates.
  • Automating monitoring and alerting systems using Bash scripts, VB and Perl.
  • Perform application upgrades as per the given schedules with proper approvals.
  • Administer and implement all new systems and ensure transition of plans to production.
  • Monitor all performance metrics for various production systems, identify root causes of technical issues and recommend solutions.
  • Automated server build, management, monitoring and deployment solutions spanning multiple platforms, tools and technologies, including Jenkins nodes/agents, Puppet and Ansible.
  • Maintain scheduled jobs, troubleshoot processes and resolve all issues.
  • Troubleshooting the issues and incidents raised by end users and requesting configuration changes.
  • Implementing the code and configuration changes suggested by developers.
  • Recreating the issues reported by end users and recording them in detail in log files.
  • Forwarding detailed log files to appropriate teams for further investigation.
  • Performing outage activities, like log clean-ups, application recycle and running test transactions.

Operations Engineer

Confidential

Responsibilities:

  • Managing applications for 6 different clients running on 200+ UNIX, Linux and Windows servers.
  • Providing support for Production, QA, Test and Dev environments for all 6 clients.
  • Providing second and third level support in troubleshooting Eagle application and application server issues.
  • Managing file system utilization on UNIX/Linux and Windows servers.
  • Managing CPU and memory utilization on UNIX/Linux and Windows servers.
  • Managing over 120 websites running on Apache and IIS over UNIX and Windows servers.
  • Monitored, maintained, and controlled hardware and software configurations in classified network environment.
  • Established and oversaw change request and implementation procedures for controlled items.
  • Extensively worked on Eagle PACE, STAR, Message centre and SRM applications.
  • Deep knowledge of database, system design, and technical architecture of the application.
  • Interacted with client implementation team to understand the requirements and create technical design for inward and outward interfaces to/from Eagle.
  • Built server and application monitoring tools with Shell and Perl and configured alerting systems.
  • Root cause analysis at application level and interacting with other teams for further investigations.
  • Recycle web services to pick up changes made to configuration and related files.
  • Monitoring different server status, events during critical business
  • Recycling of environments on a periodic basis.
  • Troubleshooting external and internal ftp file transfers.
  • File drop, retrieval, FTP etc.
  • Worked with ticketing tools BMC Remedy and HP ServiceCenter.
  • Worked with Precise, Toad and SQL tools for monitoring and accessing databases.
  • Worked with HPOV and Tivoli alerting systems.
  • Worked with Harvest version control system for code implementations and migrations.
  • Certifying the application modules after upgrades and migrations.
  • Coordinating with UNIX, Windows and network admin teams in case of core server issues.
  • Handling change requests for various components of the Eagle application.
  • Providing support for component migration for Eagle Infrastructure users and application developers.
  • Preparation and updating of Knowledge Repository.
  • Ensure team’s adherence to ITIL processes in the scope of the project viz. event, incident, problem management, request fulfilment, IT service continuity management, knowledge management, and risk management.
  • Performing trend analysis of issues and working towards minimizing the number of incidents and/or defects through various methods.
  • Encouraging the team and initiating value-adds for the customer in the form of automation or other service improvement/transformation plans.
  • Auditing tickets daily; preparing daily, weekly and monthly reports to track SLA adherence and to facilitate the monthly project review meeting with stakeholders.
  • Preparing the monthly SLA report, which is discussed during the monthly review call with management and the client team.
  • Studying incident patterns and trends and working towards value-add initiatives for the customer and the company.
  • Conducted knowledge sharing sessions within the project on ITIL framework, Six Sigma concepts and continuous service improvement model.

System and Network Administrator

Confidential

Responsibilities:

  • System Administration on all Sun and RHEL servers in the Software Development Department.
  • Performed all software installation and configuration.
  • Tuned servers for maximum efficiency for Oracle and DB2 databases.
  • Aided Oracle and DB2 database administrators in installing, building, refreshing, overlaying, removing and backing up databases.
  • Wrote shell scripts in the Bourne, Korn, and bash shells.
  • Supported internal users in mapping PCs and laptops to databases.
  • Installing, configuring and maintaining servers (DNS, NIS, NFS and Samba).
  • Using Solaris and Linux LVM for dynamically extending or reducing partition size.
  • User management with disk quotas and security settings.
  • Ensuring maximum uptime of servers through timely installation of patches and service packs.
  • Planning and implementation of disaster recovery for major servers.
  • Configuring database servers and application servers for the software development teams.