Senior Infrastructure Engineer Resume

Portland, ME

CAREER OBJECTIVE:

Seeking a Systems Engineer role with responsibility for the installation, configuration, migration, monitoring, patching, incident resolution, maintenance and administration of systems and users. Seeking employment with a company that will allow me to address and solve its technical needs by applying my experience in systems engineering. Able to complete deadline-driven projects.

SUMMARY:

  • Over 8 years of IT industry experience in Red Hat Linux system administration and global incident management. Good team player, energetic and willing to learn, with good knowledge of OS concepts and the ability to reach clear-cut solutions through technical analysis and problem resolution. Expertise in patching OS base servers, system backup and restore, storage administration, TCP/IP protocols, Confidential troubleshooting, Unix/Linux-based operating systems and virtual environments, data center migration, deployment, and staging, development and production support. Basic knowledge of MySQL administration, Java, Apache (Tomcat) applications, AWS and OpenStack.
  • Familiar with networking and protocols, and with the maintenance, support, management, design and development of applications in the cloud, including creating IAM roles and policies.
  • Experienced in raising change requests (CRs), incident tickets and Itasks.
  • Good knowledge of building servers and of configuring and maintaining system application software, DNS, NFS, DHCP and FTP.
  • Strong knowledge of monitoring with tools such as Opnet, AMP, SiteScope, Splunk, CA Spectrum and Nagios.
  • Effective combination of systems administration and incident management: strong technical inclination, clear communication during bridge calls, proper documentation, paging team members using xMatters and performing root cause analysis (RCA) of incidents.
  • Able to work remotely and in teams both small and large, emphasizing team goals over personal goals.
  • Self-motivated and driven, able to absorb technical information quickly while maintaining a positive and professional attitude.
  • Strong experience in OS patching and updating of servers, either manually or with automation tools such as BigFix, Puppet, Chef and Ansible.
  • Experience installing and configuring the Splunk and Nagios monitoring tools on Unix/Linux-based operating systems.
  • Proficient with the ServiceNow and Cherwell ticketing systems.
  • Experience with software such as Rapid SQL, CA Spectrum, AppInternals, Oracle WebLogic Server, BigFix, NoMachine, NICE, Genesys, WinSCP, Citrix, PuTTY, SecureCRT, HP Service Manager, SST, Confidential SecurID Help Desk Admin portal, ServiceNow, BIG-IP Edge Client, Cherwell, Nutanix, Oracle Enterprise Manager (OEM), Remedy and ETM for handling tickets, and with Confidential devices and tools such as Juniper, Cisco, Asset Management, Infoblox Grid Manager, IBM/UCS Manager and Commvault.
  • Able to manage contingency plan execution requests and all facilitation requirements of running incident bridges of different severities, and to design, plan and manage cloud environments, including maintaining EC2 instances and S3 storage.
  • Attending CAB meetings to get CRs approved; creating, assigning, resolving and closing aged tickets in ServiceNow (SNOW); updating SharePoint and KBAs with new documentation on resolving issues; self-assigning basic Linux tickets; and monitoring applications.

TECHNICAL SKILLS:

Platforms: Linux (RHEL) Enterprise, Ubuntu, CentOS 6 and 7, Linux 5.11, Windows NT/2003, 2010, 2012 Server, Azure, AWS, OpenStack API (software)

Virtual: VMware ESXi, vSphere 4.0, vSphere 5.0, vSphere 6.0, vCenter SRM

Monitoring: Nagios, Splunk, SiteScope, Opnet, Oracle WebLogic Server, Nutanix, Spectrum

Automation: Puppet, Ansible, Control-M, Chef, Snort, BigFix, CA Workload Automation web client, Riverbed, AMP

Protocols: HTTP, FTP, DHCP, DNS

PROFESSIONAL EXPERIENCE:

Confidential, Portland, ME

Senior Infrastructure Engineer

Responsibilities:

  • Patching Linux OS servers (manually or by running scripts) using Ansible, BigFix or Oracle Enterprise Manager (OEM) for CRS and Oracle patching. Working with the DBA team on updating kernel defaults (rolling back/forward) and installing/uninstalling the ACFS filesystem.
  • Using UCS Manager to convert boot-from-SAN LUNs from VMAX to XIO.
  • Restoring multiple files and Unix backup/archive files using Commvault.
  • Performing migrations in either Linux or cloud environments (moving database storage to a cloud-based system).
  • Troubleshooting patching errors, for example by updating the repo file and the newly installed kernel.
  • Using the Nutanix web console to run NCC health checks directly from Prism.
  • Working on basic AWS tasks such as creating users and groups, setting up policies and working with S3 buckets.
  • Responsible for Asset and Infrastructure Change Management process within all Data Centers.
  • Experience with blade chassis infrastructure (IBM BladeCenter and UCS chassis) in troubleshooting servers.
  • Setting up AD on Linux servers.
  • Doing automation and containerization with Docker and Kubernetes.
  • Using Nutanix to build and operate multi-cloud architectures.
  • Building Linux servers using VMware; configuring and troubleshooting Confidential issues using commands such as ping, netstat, traceroute, tcpdump, ifconfig, telnet and dig.
  • Monitoring file system usage, CPU, memory and disk utilization using top and vmstat, and taking proactive steps by housekeeping file systems as necessary.
  • Using Linux/Unix command-line tools to check log files, space issues and CPU usage issues; removing decommissioned servers; performing migrations; scripting in Bash and Perl.
  • Troubleshooting Confidential issues and working on assigned and unassigned tickets such as file migration, deployment, changing file ownership and permissions, setting up AD on Linux servers, and updating the yum repo file and kernel (resetting it to the default) in case of a patching error.
  • Managing and triaging service request, incident request and change request tickets for deployment.
  • Attending CAB meetings to get CRs approved.
  • Finding ways to automate processes, finding cost and performance optimizations and responding to customer inquiries.
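
To illustrate the file system monitoring and housekeeping described in this role, here is a minimal Python sketch. It is not the script actually used (scripting in this role is listed as Bash and Perl); the function names, the checked path and the 90% threshold are all assumptions chosen for the example.

```python
import shutil


def usage_percent(path="/"):
    """Return the percentage of the file system holding `path` that is in use."""
    usage = shutil.disk_usage(path)
    return 100.0 * usage.used / usage.total


def over_threshold(paths, threshold=90.0):
    """Return (path, percent) pairs for file systems at or above `threshold`."""
    results = []
    for path in paths:
        percent = usage_percent(path)
        if percent >= threshold:
            results.append((path, round(percent, 1)))
    return results


if __name__ == "__main__":
    # "/" and the 90% threshold are illustrative defaults, not values from this role.
    for path, percent in over_threshold(["/"], threshold=90.0):
        print(f"WARNING: {path} is {percent}% full")
```

A check like this would typically run on a schedule, with any warning prompting manual or scripted housekeeping of the affected file system.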

Confidential, Tysons Corner, VA

L3/LP/LQA/HVE/PE Application Support

Responsibilities:

  • 24/7 monitoring of servers and applications using Opnet, SiteScope, Splunk, SteelCentral, Nagios and CA Spectrum.
  • Using the OpenStack API/CLI to create and manage domains, groups, projects, users and roles for the environment. Managing compute instance actions such as launch, shutdown and terminate.
  • Patching servers and creating tickets and tasks.
  • Working at the data center, with experience of blade chassis infrastructure (IBM BladeCenter and UCS chassis) in troubleshooting servers. Familiar with uninterruptible power supplies (UPS), which keep servers up if the power goes off.
  • Maintaining EC2 instances (virtual machines), monitoring incoming transaction queues, maintaining S3 storage and creating Identity and Access Management (IAM) roles and policies.
  • Monitoring file system usage, CPU, memory and disk utilization.
  • Maintaining dashboards and alerting functions in AppInternals (Riverbed).
  • Installing and configuring monitoring tools such as SiteScope, Nagios and Splunk.
  • Addressing Spectrum and Nimsoft alarms; using CA Spectrum to capture, monitor and track immediate configuration changes and application errors on the dashboard.
  • Setting up IP addresses, DNS, gateways and firewall rules using iptables, and working with Confidential protocols such as TCP/IP.
  • Using Rapid SQL to run queries, running smoke tests in SiteScope, using the WebLogic console to check Selling System application health and test PRC-pools, and using Opnet and AppInternals to find transaction segments.
  • Updating the company’s wiki (knowledge base) with approved documentation for internal use and for basic customer-facing issues.
  • Creating escalation requests (using ServiceNow) and submitting product defect reports to management. Raising incident and change requests and resolving assigned tickets within the service level agreement (SLA). Conducting monthly and annual incident trend analysis.
  • Sending daily end-of-shift reports and incident reports documenting the incident, its root cause analysis (RCA) and its resolution.
  • Managing and triaging service request, incident request and change request tickets for deployment.
  • Handling bridge calls, working an on-call schedule, paging teams using xMatters and working with the Operation Center and Web Infrastructure Operations (WIO) team to resolve critical incidents.
  • Creating, executing and running AutoSys jobs; installing and upgrading security and critical patches on both Linux and Windows OS.
  • Leading, planning, coordinating and monitoring project activities, and sharing knowledge with new team members.
  • Performing ongoing system and cloud-related administration tasks.
  • Working with load and capacity testing teams to develop profiles of application performance.
  • Performing change implementations and system configuration, and supporting performance metrics.

Confidential, Ashburn, VA

System and Application Engineer/Incident Management

Responsibilities:

  • Confidential and server monitoring using SiteScope, SCOM, Service Manager and SCCM.
  • Using OpenStack to create and manage volume groups for block storage and to monitor reserve capacity of block storage devices.
  • AWS cloud platform design and administration (compute, storage, DB, migration, Confidential and content delivery services).
  • Converted queries and matched data with Genesys callcon.
  • Using CA Spectrum for root cause analysis (RCA).
  • Managing Confidential resources (routers, subnets) in the OpenStack environment.
  • Connecting Confidential to the switch at the data center.
  • Genesys reporting; experience working with ticketing systems (ETMS, ServiceNow) for support issues.
  • Identifying and assisting with implementing routing changes as required, achieving end-state reporting.
  • Led the patching team, updated the OS patching document and attended CAB meetings to get patching change requests approved.
  • Ran the patching process by assigning servers to team members for patching, worked with the DBA/Middleware team during patching and updated the patching Excel sheet with the status of each server.
  • Performed standard changes and provided escalation support for complex technical issues.
  • Provided recommendations on process to Tier 3, the Service Delivery team and management.
  • Performed change implementation (as implementer, verifier or checker), updated MOP documents for future changes and performed migrations from one environment to another.
  • Communicated with clients and provided management with reports.
  • Performing testing, debugging and documentation for new and existing systems.
  • Worked with client alerts and associated trouble tickets to meet all contractual SLAs.
  • Performed validations and test calls; familiar with NICE and Citrix applications; did basic scripting using PowerShell (Windows) and Bash (Linux).
  • Developed incident management for staff. Managed and triaged service request, incident request and change request tickets. Acted as primary technical contact in critical event management situations, including providing support on technical and customer bridges.
  • Handling tasks such as changing the style of HTML elements using JavaScript on Windows servers.
  • Automating processes such as backup, scheduling, updating and synchronization of files on Linux using Perl and cron.
  • Monitored requests and key metrics for cloud resources (AWS).
  • Monitoring systems and services on servers using Splunk.
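
The cron-driven backup automation mentioned in this role was done in Perl. As a rough illustration of the same idea, here is a minimal Python sketch. The function name, archive naming scheme and command-line usage are assumptions made for this example, not details from the role.

```python
import os
import sys
import tarfile
import time


def backup_directory(source_dir, dest_dir, timestamp=None):
    """Write a gzip-compressed tar archive of `source_dir` into `dest_dir`.

    Returns the path of the archive that was created.
    """
    if timestamp is None:
        timestamp = time.strftime("%Y%m%d-%H%M%S")
    os.makedirs(dest_dir, exist_ok=True)
    base = os.path.basename(os.path.normpath(source_dir))
    archive_path = os.path.join(dest_dir, f"{base}-{timestamp}.tar.gz")
    with tarfile.open(archive_path, "w:gz") as archive:
        # arcname stores entries under the source directory's name
        # instead of its full absolute path.
        archive.add(source_dir, arcname=base)
    return archive_path


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage (hypothetical): python3 backup.py <source_dir> <dest_dir>
    print(backup_directory(sys.argv[1], sys.argv[2]))
```

Scheduling would be left to cron. A crontab entry such as `0 2 * * * /usr/bin/python3 /opt/scripts/backup.py /var/www /backups` would run it nightly at 02:00; the paths in that entry are hypothetical.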

Confidential, Richmond, VA

Incident Manager/Linux System Administrator

Responsibilities:

  • Facilitated executive bridge calls under the guidance of the supervisor, updated the incident records with appropriate notes and engaged appropriate resources on the call per defined practice and protocol.
  • Backed up the database used by an OpenStack instance and deployed a new image to an OpenStack instance.
  • Updated the critical incident dashboard tool and monitored activity on servers using CA Spectrum OneClick, Splunk, SiteScope, Control-M, etc.; kept track of events within applications.
  • Paged other teams with MIR3 software, called vendors as requested by the supervisor and handled phone calls with clients.
  • Received information from Application Service Providers (Confidential, Confidential, Confidential, Confidential) and worked to assess what action needed to be taken.
  • Managed approvals to implement urgent changes and contingency plan execution requests.
  • Did morning validations, monitored the queue (HPSM) and the mailbox, and provided Service Center updates as required.
  • Managed approvals to escalate incidents to High Severity status (3+) and maintained the contact system of record for accuracy.
  • Confidential and server monitoring with SiteScope.
  • Used tools like Remedy for Incident and Configuration management.
  • Supported Java-based applications.
  • Retrieved all users from a SharePoint group and added users to a SharePoint group with JavaScript on Windows servers.
  • Installed and configured OpenStack on Linux servers.
  • Handled basic Linux tasks such as creating user accounts, running cron jobs, migration, and space and health issues.

Confidential, Fairfax, VA

Linux System Administrator/Incident Management

Responsibilities:

  • Experienced in software and patch installation for Linux using the YUM package manager.
  • Verified operation of the Image service (managing images: add, update, remove) in an OpenStack instance.
  • Managed patching and performed changes in scheduled windows.
  • Installed and configured printers on Linux-based operating systems.
  • Worked scheduled on-call shifts (PagerDuty), resolved incidents on services such as AMS Luminis, UNIX and the AWS cloud environment, and helped team members as and when required.
  • Monitored file system usage, CPU, memory and disk utilization using top and vmstat, and took proactive steps by housekeeping file systems as necessary.
  • Worked with Linux logical volumes: created volume groups, logical volumes and filesystems and fixed any issues.
  • Logged into remote systems, executed commands and performed migrations.
  • Created partitions and monitored /var and log files to check the health status of cluster nodes.
  • Worked with firewalls, VPN and TCP/IP, used TCP Wrappers for traffic monitoring and proactively searched for and filtered spammers.
  • Experienced with the Linux family (Red Hat Enterprise Server, CentOS 5 and 6) and VMware vSphere 6.0.
  • Familiar with tools such as SecureCRT for secure remote access, file transfer, patching and data tunneling.
  • Took part in regular team meetings, shift management, team activities and one-on-one discussions on team concerns; documented new projects and occasionally updated the company’s wiki on solving certain Linux tasks.
  • Responsible for operating system backup configuration, system maintenance and support, and general data center support. 24x7 on-call rotation.
  • Worked with new technology and Applications (Java, Apache Tomcat).
  • Handled tasks such as building Active Directory on Linux servers.
  • Installed Splunk Enterprise on Linux servers using RPM or DEB packages or a tar file.
  • Performed daily administrative tasks such as troubleshooting automation equipment and unit maintenance on multi-functional process systems.

Confidential, Washington, DC

Linux System Administrator

Responsibilities:

  • Set up and administered user and group accounts, set permissions on web servers, file servers, firewalls and directory services, and diagnosed basic Apache issues.
  • Built and installed multiple Linux machines (CentOS 6 and 7).
  • Managed the Linux infrastructure using Puppet, Chef and VMware for virtualization.
  • Troubleshot systems and applications; administered and tuned Java-based applications.
  • Managed system processes in areas such as boot, startup and shutdown.
  • Installed, configured, maintained and administered Linux / UNIX operating systems and components.
  • Ensured change management policies and procedures were followed.
  • Managed disks and implemented RAID levels using parted and mdadm.
  • Planned and executed package and update installations necessary for optimal system performance.
  • Diagnosed and resolved problems associated with DNS, DHCP, VPN, NFS, and Apache.
  • Used Bash and some Perl scripting to automate disk space management, deletion of old logs and cron job scripts.
  • Hands-on experience supporting Red Hat 6 and CentOS 6 and 7 in a large environment (more than 200 servers, 70% virtual, 30% physical).
  • Provided assistance with software and hardware issues, in addition to troubleshooting application errors and Confidential connection problems with WAN and LAN. Set up cron jobs for automated processes and used Kickstart for installation.
  • Worked with the Scrum team on a daily basis developing software.
  • Installed, configured, maintained and administered the Java application server (Tomcat).
  • Performed kernel update patching and tuning, monitored event logs and transaction logs, and provided support for internal customers.
  • Worked with databases (basic SQL querying) and used PuTTY to work on some Windows servers.
  • Used Nagios and Splunk for monitoring and Secure Sockets Layer (SSL) for secure access. Also configured firewalls and Confidential switching and routing settings for extra client security. Configured a certificate authority to sign SSL requests and configured secure communication for LDAPS, HTTPS and SMTPS.
  • Reviewed and deployed Linux system releases and vendor-supplied patches according to best practices.
  • On-call storage resource, responsible for troubleshooting storage-related issues and performing corrective actions.
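
The old-log cleanup mentioned in this role was scripted in Bash and Perl. As an illustration only, the same idea can be sketched in Python. The function names, the `.log` suffix and the dry-run default are assumptions introduced for this example.

```python
import os
import time


def find_old_files(directory, max_age_days, suffix=".log", now=None):
    """Return sorted paths of files in `directory` ending in `suffix`
    whose modification time is more than `max_age_days` days ago."""
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 86400  # 86400 seconds per day
    old = []
    for entry in os.scandir(directory):
        if entry.is_file() and entry.name.endswith(suffix):
            if entry.stat().st_mtime < cutoff:
                old.append(entry.path)
    return sorted(old)


def delete_files(paths, dry_run=True):
    """Delete the given files unless `dry_run` is set; return how many were processed."""
    for path in paths:
        if not dry_run:
            os.remove(path)
    return len(paths)
```

Defaulting to a dry run lets the list of candidate files be reviewed before anything is removed, which is a sensible precaution for a job that runs unattended from cron.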

Confidential

Linux System Administrator

Responsibilities:

  • Worked as a junior Linux administrator consultant assisting private businesses with configuring mail, DNS and web servers on Windows and Linux platforms. Troubleshot DNS and Confidential issues using Confidential tools and monitoring graphs: telnet, ping, dig, netstat, etc.
  • Installed, configured, secured and patched servers for clients using the latest Linux platform.
  • Managed and worked in Unix operating system and services environments.
  • Installed web servers using Apache.
  • Used Splunk to check impacts before they became problems that threatened server uptime.
  • Worked side by side with QA, BA and account managers; kept track of events within applications.
  • Configured LDAP setup for verification of a mail server. Used the Postfix and OpenSMTPD mail transfer agents.
  • Installed and configured virtualization for both Windows and Linux platforms using VMware.
  • Installed and set up volume management and hardware/software RAID for data backup and storage; used logical volume management (LVM) on Linux platforms.
  • Responded to security alerts with risk evaluation, and monitored and reported on unauthorized access attempts.
  • Monitored security logs to determine security problems.
  • Managed the Linux infrastructure using Puppet and Chef.

Confidential

System Administrator/Incident Management

Responsibilities:

  • Performed software and hardware upgrades and routine system maintenance by installing current patches and packages.
  • Wrote scripts to automate various tasks and used cron to schedule jobs.
  • Set up Puppet and Ansible for automation and managed host servers.
  • Resolved assigned incident tickets.
  • Assisted in various aspects of server administration, including installing and maintaining operating system software, performance monitoring, and problem analysis and resolution of production issues. Created and managed user accounts and set file permissions to provide administrative support for various departments.

Confidential

Linux System Administrator

Responsibilities:

  • Monitored web servers using the Nagios monitoring tool.
  • Authored and modified scripts for application deployment and system monitoring.
  • Administered local and remote servers on a daily basis and provided weekly status reports to management.
  • Supported data management through on-site and off-site storage and retrieval services.
  • Troubleshot and resolved software and hardware problems, interfaced with vendor technical support to resolve problems and worked with other technical staff to support their needs.
  • Optimized system performance by tracking daily system utilization to determine whether problems were imminent.
  • Monitored and provided daily reports on system performance.
  • Built, installed and configured Red Hat Linux servers (RHEL 5) in a data center environment.
