DevOps Engineer Resume
Palo Alto, CA
SUMMARY:
- Unix Systems Engineer offering over fifteen years of experience solving complex problems with creative solutions, supporting development and operations environments from young startups to large enterprises.
- Hard-working, results-oriented leader who believes in adding value to the company, making the team better, and always striving to understand and exceed customer needs.
TECHNICAL SKILLS:
O/S: Linux (RHEL, CentOS, Ubuntu/Debian, SuSE), Unix (Solaris, HPUX, BSDI)
Code, Config, Builds, etc.: Chef, cfEngine, Git, Perforce, CVS, Subversion, Bamboo, Jenkins, Jira, Confluence, Wiki
Apps/DBs/Monitoring: OpenStack, Tomcat, MySQL, PostgreSQL, Sensu, Nagios, Opsview, check_mk
Scripting, editors: Bash, Python, Ruby, Perl, awk/sed, rudimentary Java, vi, ed
Cloud-based providers: Google Compute, AWS, RackSpace, Joyent, DigitalOcean, Azure
Internet Services: DNS/BIND, NFS, FTP, Sendmail/postfix, Apache, nginx, NTP
Security: PCI, Security audits/hardening, sudo, lsof, nmap, pki, crack, nessus, snort, tcpdump
Networks, L/B, etc.: DHCP, PXE, tftp, NICs/switches, Cyclades, Consoles, Wireshark, F5 L/B, NetScaler
Other: HTML, kickstart, yast, yum, VMWare, KVM, Hadoop, LVM, HW RAID, NetBackup, bacula
PROFESSIONAL EXPERIENCE:
Confidential, Palo Alto, CA
DevOps Engineer
Responsibilities:
- Member of three-person DevOps team, responsible for maintaining physical and virtual systems with maximum uptime, as well as the build release process. Deploy and manage cloud-based instances in Google Compute, AWS, RackSpace and DigitalOcean. Share in both 3-person Ops and 12-person enterprise support rotations.
- Live in Git, committing Bash, Python and Ruby to GitHub. Code review all Ruby, Python and Bash.
- Chef automation: Web-based doc servers, release cluster, Bamboo agent capabilities and server, MySQL master-master replication, Hadoop components, deploying public Crowd, Jira, Wiki and Bamboo instances to cloud, etc.
- Automate: Artifacts upload with scripts run in Bamboo; syncing from Nexus to staging servers and S3 buckets.
- Help maintain Coopr and Coopr plugins and templates, and write DigitalOcean and Azure plugins.
- Create and present design docs and implement various automation projects (e.g. Hadoop cluster testing).
- Review engineering design docs and company technical blog posts. Write technical blog articles.
- Narrate and record audio-based instructions for company products.
Confidential, Palo Alto, CA
OpenStack Cloud Architect and Senior Linux Engineer
Responsibilities:
- Responsible for all aspects of OpenStack cloud: Proof of concept, architecture/design, planning, implementation (nova, glance, cinder, keystone, ceilometer, heat), capacity planning, consolidation, physical migration, documentation and labs support. Active member of OpenStack community (submit and keep track of bugs).
- Senior Linux Engineer and top escalation point for all Linux-related issues in three US regions.
- Use Chef to automate app deployment and system management of VMware and OpenStack instances.
Confidential
Operations Engineer
Responsibilities:
- Member of 24x7 Ops team responsible for 500 virtual nodes in 19 environments in Chef, and 60 physical servers.
- Planned, performed and documented system/application upgrades in Prod and Beta environments.
- Created, maintained and fixed Chef cookbooks, roles and data bags. Wrote and ran Jenkins jobs.
- Researched, documented and implemented system and application log shipping (Logstash/Elasticsearch).
- Implemented new and maintained old Nagios monitoring systems. Wrote Nagios plug-ins (Bash and Perl) for production apps and DBs. Assisted Dev and QA engineers in troubleshooting day-to-day technical issues.
Confidential, Palo Alto, CA
Operations Engineer
Responsibilities:
- Member of two-person DevOps team (400+ Linux systems in one physical and four virtual data centers).
- Personally responsible for end-to-end installation and maintenance of Hadoop clusters, Flume-ng configuration, automation (Perl, Python) and troubleshooting, and pushing syslog data to HDFS and Amazon S3.
- Wrote Chef recipes (e.g. Edge servers running nginx) as necessary and pushed to servers.
Confidential
Unix System Administrator
Responsibilities:
- Personally responsible for Trymedia (San Jose) 24x7: 160+ servers in 16 racks at Equinix SV4 data center, network gear, hardware, O/S, applications, databases, security, incidents, upgrades.
- Coordinated and implemented functional migration of full Trymedia stack to Seattle data center.
- Remote member of 4-person Seattle-based Games TechOps team; managed 400+ servers and design decisions.
- Performed PCI compliance security audit & remediation of Trymedia environment.
- Completed alert/monitoring migration (SiteScope & Nagios to Nagios/check_mk) of all servers.
- Executed deployments, migrations, upgrades and updates to all systems, within customer SLAs.
- Redesigned kickstart environment for new servers, built systems using KVM and other tools.
- Reverse engineered and documented Trymedia processes in Wiki.
- Supported systems, websites and apps in Spain and Holland, and worked with remote staff.
Confidential
Senior Systems Engineer
Responsibilities:
- Wrote/modified Python, Bash & ant scripts for deployment automation of Confidential.
- Deployed updates for RPG games to all environments and tested game functionality/performance.
- Worked with Ops group, helping troubleshoot all performance issues and outages end to end.
- Consulted with game teams and made recommendations to help maximize user experience and retention.
Confidential, Palo Alto, CA
Senior Systems Engineer
Responsibilities:
- Solely responsible for Unix Ops: Set up and maintain production customer-facing servers and infrastructure, automate processes, define/act on security policies. Design, test and deploy new builds and OS. Fully configure and install applications and DBs on Prod and Dev systems (AWS). Define Ops, QA and IT requirements.
Confidential
Automation Engineer
Responsibilities:
- Member of Ops team responsible for all Unix systems, build release/deployment and app continuity/testing.
- Designed and implemented a test framework using Selenium RC; automated testing of company's websites.
- Took over build release deployment and coordination responsibility for production servers. Within two months, increased average number of weekly builds from 2.5 to almost 4.
- Took over web application continuity management and troubleshooting of app outages and performance issues.
Confidential
Senior Unix Systems Engineer
Responsibilities:
- NOC Ops: Installed and managed RHEL and Solaris systems. Maintained Nagios, administered MySQL DBs, deployed builds, set up backup system, did minor network administration.
- NOC Eng.: Supported NOC Dev efforts: managed 60+ systems and new environment setup; handled app and DB installation, configuration and support; reproduced and fixed NOC app, DB, UI, performance and build-related issues. Authored project and process designs and reviewed team members’ design docs. Designed and automated unattended system OS and virtualization (VMware) installations (PXE, DHCP, tftp). As a result, the company saved 9 hands-on hours of installation on appliances in the field. Helped QA reduce deployment times from 8+ hours to <1 hour.