Cloud Techops Engineer Resume
Tampa, FL
PROFESSIONAL SUMMARY:
- Seasoned Cloud TechOps Engineer with 6+ years of extensive experience in management and configuration of the mission critical solutions with responsibility for installation, planning, support and maintenance.
- Conscientious and resourceful with diversified experience in AWS and Postgres field. I can create and automate a tool to monitor and support software development, release processes and repetitive tasks.
- My greatest strength is attention to detail which helps me to ensure that the product I deliver conforms to the client's standards and quality together with cost effectiveness. Positive in word, act & feeling.
TECHNICAL SKILLS:
Cloud Management: - AWS (EC2, EMR, Lambda, S3, ELB, RDS, DMS, VPC, Route53, Security Groups, CloudWatch, Cloud Trail, IAM Rules, SNS)
Programming language: - Python, HTML, CSS, Bash, Shell Scripting
Monitoring Tools: - CloudWatch, Nagios, New Relic, Evident. IO, SpotFire, DynaTrace
Big Data Ecosystems: Hadoop, YARN, MapReduce, HDFS, HBase, Hive, Impala, Cloudera manager
Development Tools: - PyCharm, PgAdmin
Database Management: - Postgres, Hive, MYSQL, Beeline
CI/CD Management/Containerization Tools: - Jenkins, Jira, Docker
SCM/Documentation/Project Management: - BitBucket, Git, Confluence, Hip Chat, Slack, Serive-Now
Configuration Management: - Ansible, Terraform, Cloud Formation
WORK EXPERIENCE:
Confidential, Tampa, FL
Cloud TechOps Engineer
Responsibilities:
- Working directly under the vice-president to streamline and enhance the cloud adaptation across the company.
- I’m responsible to for migrating on-Prem hybrid clusters (PostgreSQL, Hadoop, Sun Grid) to the AWS.
- Working on designing and deploying a multitude application utilizing almost all of the main services of the AWS stack focused on high availability, fault tolerance environment. Utilizing Github for version control.
- Researched and implemented New Relic proof of concept that provides an efficient way of monitoring both On-Prem and AWS infrastructure.
- Deploying Docker images to support various projects; distributed Jenkins, Postgres, Terraform etc.
- Working on designing the Terraform playbooks for building the AWS infrastructure from scratch.
- Managing multiple AWS accounts with multiple VPC's for both production and non-prod where primary objective included automation using Serverless Lambda (FAAS) on Python, built out, integration and cost control. Managed to reduce monthly expenditure of the company by 70%.
- Using Boto3 the AWS SDK for Python in analyzing and monitoring performance bottlenecks and key metrics to optimize software and system performance by developing dashboard.
- Installing and maintaining Jenkins for CI/CD as well as automating changes to a CSM tool like Ansible.
- Monitoring security aspect of AWS with Evident. IO. Implemented SNS topic to post alerts to Slack/Hipchat and then trigger Lambda to automatically fix the noncompliant resources.
- Creating an S3 bucket and also managing policies for S3 buckets and utilized s3 bucket and glacier for storage and backup on AWS. Developing both the user and the bucket level policy.
- Creating Cloudwatch alerts for instances and using them in the auto-scaling launch configuration.
- Planned and migrated on-Prem Postgres to AWS using DMS (Data Migration Service).
- Designing Ansible playbooks to create custom sized VPC, subnets, and database templates.
- Stay current with the latest features/capabilities of the AWS platform.
- Good and very co-operative Team player, fluent communication skills and innovative, self-starter and self-motivated.
Confidential
Hadoop Production Support
Responsibilities:
- Solving problems, cutting costs, improving efficiencies, adding revenue for the clients.
- I’m responsible for maintaining PostgreSQL DBA team and involved in several DBA activities.
- Conducting data analysis on TBs of data using advanced SQL, Excel function and Python modules to identify the gaps and provide trend reports to the clients
- Good hands-on knowledge in Cloudera Hadoop ecosystem and its components.
- Experience monitoring and troubleshooting issues with Linux memory, CPU, OS, storage and network.
- Leveraging internal technical expertise, including development engineers, knowledge base, and other internal tools to provide the most effective solutions to customer issues.
- Providing 24*7 support to coordinate with the teams and customers for delivery purposes and maintain hybrid infrastructure that includes Hadoop cluster and Sun Grid Engine.
- Performing disaster recovery tests to ensure system availability and functionality.
- Successful planning and execution of DC migration of over 1200 Linux and Virtual servers to achieve more than 75 percent growth in application performance.
- Highly productive, and results-driven in team and individual projects, strong research and time management skills. Interactive communication skills with clients in core decision making meetings.
- Creating postmortem document once the issue is fixed, problem ticket for permanent fixing.
- Created and wrote shell scripts (Bash), Python for setting up automation process across the environments.
Confidential
PostgreSQL DBA
Responsibilities:
- Preparing the disaster recovery plan of the production DB with an approx size of 2TB.
- Writing Ansible playbooks to deploy Postgres cluster on RedHat Linux machines across the prod and non-prod environment. Upgrading earlier releases of PostgreSQL to version 9+ on servers running on Redhat.
- Insure nightly cron jobs for backups, re-indexing and vacuuming and materialized views executed properly on 24/7/365 high availability Postgres production and development database.
- Reviewing and administering database schema & structure for design improvements and data normalizations. Defining and managing roles, privileges by Ansible playbooks.
- Implementing PITR, Streaming/Hot-standby replication for Dev, QA and Prod databases. Performing database failover to the standby server. Maintaining detailed technical documentation.
- Facilitated Postgres and Python Web UI to store both small and large information to gain control and in sight of application.
- Create, Backup, Restore & Drop databases per developer’s requirements using pg dump, pg restore.
- Excellent problem solving skills, including the ability to foresee problems before they occur.
