Site Reliability Engineer Resume
5.00/5 (Submit Your Rating)
Charlotte N, C
SUMMARY
- Engineer with 13+ years of professional experience which includes planning, building, deploying, managing, and monitoring server and network environments of varying size and complexity. Responsible for designing data contingency plans or data disaster recovery plans that meet specific industry needs. Strong background in providing in depth technical support, troubleshooting, and customer help desk support. Demonstrated ability providing guidance and applying current technologies and concepts for multi - term business goals.
- Heading project to replace neglected Linux infrastructure with current Linux offerings and high availably clustering.
- Engineering and implementing a three-month staged deployment to replace failing VMware 5.0 hosts with up-to-date VMware 6.5 HA cluster, while providing minimal downtime.
- Performing server infrastructure audits to find out-of-compliance server operating systems and plan remediation/upgrade plans for each system/service; Primarily Windows Server 2008R2/2012 to Windows 2016.
- Planning and performing on-prem Exchange to Exchange Online (Office 365) migrations with no downtime to users.
TECHNICAL SKILLS
- Windows and Linux server support
- Enterprise LAN & WAN management
- VMware & Hyper-V Virtualization
- Cloud deployment/architecture (AWS, Azure)
- System Monitoring (OpsView, SolarWinds, SCCM)
- SAN/NAS storage technologies
- Application Performance Management (Dynatrace, AppDynamics, SentryOne)
- Confidential Exchange
- High Availability Systems (clustering, load balancing, failover, disaster recovery)
- Deployment systems (Ansible, Chef, Puppet, Octopus, Azure DevOps SCCM)
- Enterprise Apple solutions (JAMF, DEP, VPP)
- Programming/Scripting (C#, PowerShell, Python, Bash)
- ITSM incident management (ServiceNow, Remedy)
- VPN, IPsec Metro-E, MPLS
PROFESSIONAL EXPERIENCE
Confidential, Charlotte N.C.
Site Reliability Engineer
Responsibilities:
- Production operational and administration duties to maintain the health and reliability of SaaS platform.
- Participate in an Agile work environment, to produce, deliver and test cases using Agile methodologies.
- Release management & deployment tool experience in Azure DevOps, Octopus Deploy, TeamCity, etc.
- Monitor Service Level Indicators (SLI) for application performance issues and analyze poor performing SQL queries with Dynatrace and coordinate with DBAs and Dev teams to meet Service-Level Objectives (SLO).
- Collaborate with DevOps teams to implement CICD performance testing, monitoring, and reliability strategy using Visual Studio Team Services and Azure DevOps.
- Develop system for automated reporting of SLIs via synthetic and real-time user interactions utilizing Dynatrace, Azure Log Analytics, Solarwinds, and SentryOne.
- Primary Linux support engineer for all production SaaS operations.
- Deployment and maintenance of micro services with Docker and Azure Services
- Automate deployment and operational tasks via a variety of scripting languages. (Powershell, Python, Bash, etc.)
- Perform incident/problem response and root cause analysis while collaborating with other teams to resolve production issues in a timely and organized manner.
- Assist development teams with diagnosing and prioritizing existing backlog of production issues and tech debt.
- Create framework for automated testing of applications with Selenium through lower environments.
- Participate in on-call rotation and perform weekly incident review with pertinent teams and individuals.
- Standardize documentation into a uniform runbook format broken down by major applications.
Confidential, Charlotte N.C.
Systems Engineer
Responsibilities:
- Assist managers and team leads in identifying test scope, identifying project risks, and implementing process changes and efficiencies.
- Lead standardized design of new and existing processes and policies as needed.
- Work with team leads to automate installation steps with a variety of scripts and BladeLogic to ensure consistency between systems and environments.
- Document new system architecture into standard runbooks.
- Acted as a liaison between internal bank security management and LOB managers to quickly and efficiently
- Automate or improve existing automation to monitor and resolve configuration issues.
- Release management & deployment pipeline tool experience in TeamCity, Octopus Deploy.
- Collaborate with Development, QA, and DevOps teams to help ensure designs and implementations meet security standards.
Confidential, Charlotte, N.C.
Support Escalations Engineer
Responsibilities:
- Act as the final point of escalation and resolution for technical issues ranging from basic advisory questions to large scale outages affecting thousands of users.
- Diagnose mail flow in enterprise environments through a variety of systems including network load balancers (NLBs), security appliances, and third-party spam filtering services.
- Spearheaded development of a PowerShell toolkit to streamline diagnostic processes of Exchange environments.
- Utilized code management tools such as Git and Azure DevOps\Visual Studio Team Services
- Provide advanced technical support to developers, corporate customers and end users for Confidential Exchange and related products including highly escalated issues.
- Perform in-depth analysis of diagnostic data to write reports of varying degrees of detail for presentation to engineers, managers, or executives of corporations and organizations.
- Act as technical lead and provide technical training, coaching, and mentoring to other engineers.
Confidential, Charlotte, N.C.
Senior Systems Engineer
Responsibilities:
- Automated deployment of applications and services via change management systems and infrastructure as code (IaC) frameworks with a variety of scripting languages and tools including PowerShell, Bash, Python, Puppet and Chef
- Utilized Ansible for packaging and deployment of software updates and configuration changes to streamline existing workflows to enable junior technicians to perform work that was previously outside of their skill set.
- Implement and maintain Linux infrastructure to provide applications and services to 4000+ end users.
- Maintain monitoring and logging systems for Windows and Linux servers and infrastructure with SNMP and syslog tools such as SCCM, SolarWinds NPM, Splunk, OpsView, Nagios, Zabbix, and PRTG.
- Use historical data to create growth projections for services and plan expansion projects accordingly.
- Rapid troubleshooting of issues with software and hardware to create short-term and long-term solutions intended to prevent interruptions of day-to-day business operations.
- Provide secure remote access assistance to companies via a variety of technologies such as RDS (Remote Desktop Services), VPN, SSH (Secure Shell), HTTPS tunneling.
- Responsible for design, implementation, and maintenance of cross-platform networks to deliver stable, secure, and redundant environments using a variety of technologies including Demilitarized Zones (DMZ’s), Virtual Private Networks (VPN’s), VLAN’s, Intrusion Detection Systems (IDS’s), and Certificates.
- Documenting, assigning, tracking and monitoring tickets to ensure a timely resolution of client issues.
- Perform migrations of Active Directory (AD) domains to newer Windows server versions including Server 2012, 2016, and Azure AD.
- Utilized Azure AD Connect to connect on-prem AD environments with Azure Cloud services to allow for single sign-on experience for end users.
- Deploy and manage Exchange platforms, including hybrid Office 365 deployments to meet company’s needs.
- Manage mobile devices with MDM solutions such as AirWatch and JAMF.
- Implement and support of virtual environments (vSphere and Hyper-V) and redundant SAN/NAS appliances (Dell EMC, EqualLogic, and Compellent) to provide high availability systems and disaster recovery solutions.