Aws Devops Engineer Resume
Reston, VA
SUMMARY
- Over 10 years of substantial IT experience with expertise in AWS & Azure cloud services, DevOps, Build and release Engineering, Software configuration Management (SCM), Cloud Infrastructure, Automation...
- Heavy experience using AWS Services and DevOps tools
- Experience in Infrastructure Development and Operations involving AWS Cloud platforms, EC2, EBS, S3, VPC, RDS, SQS, ELB, Auto scaling, Cloud Front, Cloud Formation, Cloud Watch, SNS, Route 53, AWS workspaces.
- Worked on Multiple AWS EC2 instances, set the security groups, Elastic Load Balancer and AMIs, Auto scaling to design cost effective, fault tolerant and highly available systems.
- Expertise in automation tools like Git, Jenkins, Chef, Ansible, Subversion, Nagios and Docker.
- Worked with Docker and created multiple containers and images (Tomcat, Ubuntu, Etc.). Configured these containers to communicate with each other.
- Strong background in Branching, Merging, Tagging, and maintaining the version across the environments using SCM tools like Subversion (SVN) and GIT (GitHub, GitLab).
- Ability to lead teams with strong knowledge of AWS
- Having knowledge of AWS Lambda Serverless service
- Worked on Jenkins for continuous integration and for End - to-End automation for all build and deployments.
- Experienced with Ansible playbooks for virtual and physical instance provisioning, configuration management, patching and software deployment.
- Able to set up and provision databases (RDS) in AWS
- Able to set up Compute (Kubernetes cluster), storage, secrets management (Team Password Manager tool)
- Able to set up accounts in SCM (Source Control Management) & CI CD tools (DevOps process)
- Able to set up Container Repos
- CI CD Pipelines, DevOps - Terraform modules for infra - EC2, EKS, RDS, S3 etc.
- Had experience with logging infrastructure and tools such as Logstash, Elasticsearch, Kibana, Splunk, DevOps
- Had automation experience with at configuration /deployment management systems such as Terraform, Chef, AWS Cloud.
- Had knowledge on Terraform concepts like Workspaces, modules, States etc.
- Worked on creation of windows AMIs using Packer in AWS.
- Knowledge of the Data Analytics services on AWS
- Jenkins/Ansible and thorough hands-on experience on CI/CD
- Knowledge of various DevOps Toolsets in the CI/CD pipeline
- Having experience on deploying code on Cloud based Infrastructure (AWS, Azure, OCI (oracle cloud infrastructure))
- Proficient in AWS services like VPC, EC2, S3, ELB, Auto Scaling Groups (ASG), EBS, EFS, RDS, IAM, Cloud Formation, Route53, Cloud Watch, Cloud Front, Cloud Trail. Web hosting services.
- Working experience in Cloud Formation scripts and Infrastructure as Code scripting
- Experience in Continuous Integration, Continuous Delivery, and Continuous Deployment software tools
- Knowledge on automation to reduce the number of incidents or repetitive incidents and also on DevOps.
- Able to identify and gather requirements to define a solution to be built and operated on AWS, perform high level and low-level design for the AWS platform.
- Capable to provide AWS operations and deployment guidance and best practices throughout the lifecycle of a project.
- Understands the significance of the different metrics under the monitoring, their threshold values and be able to take necessary corrective measures based on the thresholds.
- Developed automated pipelines, processes & scripts to support the deployment of applications in cloud environments
- Supported production issues and involved in problem-solving and troubleshooting of infrastructure and applications
- Researched and implemented new technologies to improve the environment
- Provided cross-training to other engineers across the organization
- Solid in-depth automation experience using configuration management tools, such as Chef, Ansible
- Experience with Linux and Windows platforms
- Administering and troubleshooting experience on Linux based systems
- Able to perform infrastructure cost analysis and optimization
- Documented the project guidelines and run training programs for further development progress in internal project.
- Automated the administration of users in Windows AD using Power shell.
TECHNICAL SKILLS
SCM Tools: Subversion (SVN), GitHub
CI Tools: Jenkins
Configuration Tools: Chef, Ansible, Terraform, Packer
Containerization: Docker, Kubernetes
Monitoring Tools: Splunk, Nagios, Cloud Watch, Logstash
Bug Tracking Tools: Jira, ServiceNow, GotoAssist
Cloud Platforms: AWS, Azure, OCI (Oracle Cloud Infrastructure)
AWS Services: EC2, ELB, VPC, RDS, IAM, CloudFormation, S3, CloudWatch, CloudTrail, SNS, SQS, EBS, EMR, Lambda
AWS RDS: MySQL
Database System: SQL Server 2000/2005, Dynamo DB, AWS snowflake
Scripting Languages: Python, Shell Scripting, Power Shell
Servers: WebLogic, WebSphere, JBOSS, Apache Tomcat
Network Services: FTP-Vsftpd, SSH, TELNET, TCP/IP, HTTP, DHCP, SNMP, FTP, SMTP, NFS, WinSCP, FileZilla
Platforms: Linux, HP-UX, Centos 6.5,7, Windows NT/2003/2008, Microsoft windows Vista/XP/2000, Red Hat
PROFESSIONAL EXPERIENCE
Confidential - Reston, VA
AWS DevOps Engineer
RESPONSIBILITIES:
- Used multiple EC2 instances at the same time and provided highly durable and available data by using S3 data store, versioning, lifecycle policies, & create AMIs for mission-critical production server’s backup.
- Provided highly durable and available data by using S3 data store, versioning, lifecycle policies.
- Performed the automation deployments using AWS by creating the IAMs and used the code pipeline plugin to integrate Jenkins with AWS and created the EC2 instances to provide the virtual servers.
- Worked on Ansible Playbooks with Ansible roles. Created inventory in Ansible for automating the continuous deployment. Configured the servers, deployed software, and orchestrated continuous deployments or zero downtime rolling updates.
- Composed Ansible Playbooks with Python SSH as the Wrapper to Administer Configurations of AWS infrastructure and Test Playbooks on AWS Ec2 instances using Python.
- Configured nodes express applications in production and pre-production environment by configuring with Nginx and Ansible Web servers.
- Use Ansible, Chef, Jenkins, Git, for implementing Continuous Integration from scratch, optimize the Continuous Integration using Jenkins, and troubleshoot the deployment build issues using the triggered logs.
- Used Ansible Tower, which provides an easy-to-use dashboard and role-based access control, to make it easier in allowing individual teams access to use Ansible for their deployments.
- Setting up Kubernetes (k8s) Clusters for running microservices and pushed microservices into production with Kubernetes backed Infrastructure. Development of automation of Kubernetes clusters via playbooks in Ansible.
- Created virtual machines, created serverless API using AWS Lambda, AWS Elastic Bean Stalk, manage scalable containers, and was responsible for connecting aws to on-premises data center using AWS Gateway.
- Worked to setup Jenkins as a service inside the docker swarm cluster to reduce the failover downtime to minutes and to automate the docker containers deployment without using configuration management tool.
- Implemented a Continuous Delivery pipeline with Docker, Jenkins, and GitHub. Responsible for installation & configuration of Jenkins to support various Java builds and Jenkin plugins to automate continuous builds and publishing Docker Images to the Docker Hub Repository.
- Wrote scripts using Azure services template available with Azure SDK in Visual Studio to automate creation of nodes on Azure. Deployed and redeployed VNET, NIC, Public IP, Storage Account and VM on Azure using this automation.
- Set up GCP Firewall rules to allow or deny traffic to and from the VM's instances based on specified configuration and used GCP cloud CDN (content delivery network) to drastically improve user experience and deliver content from GCP cache locations latency.
- Written Terraform/ CloudFormation templates and deployed AWS resources using it.
- Lead implementation of changes to existing processes, automations of process as per client requirement and effective services.
- Leading the ITIL Incident, Change and Management Mutualization.
- Managed Ansible Playbooks with Ansible modules, implemented CD automation using Ansible, managing existing servers and automation of build/configuration of new servers.
- Responsible for implementing containerized based applications on Azure Kubernetes by using Azure Kubernetes service (AKS), Kubernetes Cluster, which are responsible for cluster management, Virtual Network to deploy agent nodes, Ingress API Gateway, MySQL Databases and Cosmo DB for stateless storage of external data, and setup reverse proxy Nginx in the cluster.
- Monitored and analyzed various network and performance data with the Splunk tool.
- Used Git for source code version control and integrated with Jenkins for CI/CD pipeline, code quality tracking and user management with build tools Maven and Ant and written Maven pom.xml build script.
- Created terraform templates for provisioning virtual networks, subnets, VM Scale sets, Load balancers and NAT rules and used terraform graph to visualize execution plan using the graph command.
- Managed Azure Infrastructure Azure Web Roles, Worker Roles, VM Role, Azure SQL, Azure Storage, Azure AD Licenses, Virtual Machine Backup and Recover from a Recovery Services Vault using Azure PowerShell and Azure Portal.
- Highly motivated team lead with strong communication, analytical and organizational skills.
- Maintained and developed a production-level cloud-based architecture in AWS, including creating machine images like AMI and helped migrating and managing multiple applications from on premise to cloud using AWS services like S3, Glacier, VPC, EFS, EC2, RDS, SQS, SNS, S3, IAM, EBS, ELB, Security Group
- Installed, Configured, and Managed Monitoring Tools such as Nagios for Resource Monitoring/ Network Monitoring.
- Auto Scaling, Elastic Cache, CloudFront, CloudWatch, SNS and RDS in Cloud Formation JSON templates.
- Used Ansible as a configuration management tool to mainly manage Infrastructure as Code (IaC).
- Created AWS workspaces with both Linux and windows flavors.
- Installed, configured, and maintained web servers like Apache Web Server and WebSphere Application Server on Red Hat Linux.
- ManagedDocker orchestration and Docker containerizationusingKubernetes.
- UsedKubernetes, EKSto orchestrate the deployment, scaling and management of Docker Containers.
- Used Jenkins pipelines to drive all micro services builds out to the Docker registry and then deployed to Kubernetes, Created Pods and managed using Kubernetes.
- Building/Maintaining Docker container clusters managed by Kubernetes Linux, Bash, GIT, Docker, on GCP (Google Cloud Platform). Utilized Kubernetes and Docker for the runtime environment of the CI/CD system to build, test deploy.
- Involved in development of test environment on Docker containers and configuring the Docker containers using Kubernetes.
- Experience with writing Python scripts to automate some of the IAM user administration tasks.
- Documented the coding guidelines and ran training programs for future references and knowledge transfer sessions.
- I did plan and execution of applications on our existing infrastructure in the data center. Identified what type of services can be used and what configuration to better our servers. Right from identifying Availability Zones and regions and eventually migrating these applications by programmatically creating temporary infrastructure and once it is created, we configured Chef to deploy these applications to be running in cloud.
- Initiated alarms using CloudWatchservice for monitoring the performance of compute instances,CPU Utilization,disk usage, etc.
- Used Ansible Playbooks to setup Continuous Delivery Pipeline. Deployed micro services, including provisioning AWS environments using Ansible Playbooks.
- Configured Security Groups and NACL’s for various AWS services to limit Inbound and Outbound traffic based on the security recommendations.
- Design AWSCloud Formationtemplates to create VPC,subnets,security groups to ensure successful deployment of Web applications and database.
- Created AWS Multi-Factor Authentication(MFA),for instance,RDP/SSHlogon, worked with teams to lockdownsecurity groups.
- Create develop and test environments of different applications by provisioning Kubernetes clusters on AWS using Ansible and Terraform.
- Provide monitoring, reporting, troubleshooting, and remediation for all areas of our datacenter infrastructure, including handling incoming trouble tickets.
- Datacenter Hard drives replacements, coordinating with HD vendor.
- Involvement in replacing a failed hard drive in a datacenter right from HD failure until rebooting the replaced HD.
- Kickstart of Linux servers using Chef tool.
- Provision servers: AWS, Azure, OCI (Oracle Cloud Infrastructure) using Gitlab
- Administered and maintained the users in Oracle Cloud Infrastructure.
- Provision accounts within AD (Manually and using power shell script)
- Provide metric reports on the data center in AWS and in JIRA ticketing tool.
- Work with team members and IT stakeholders to design and deploy an automated, scalable dev/test environment infrastructure.
- Administer and Maintain the users and services in AWS.
- Coordinated with Tier 3 support teams to establish and manage clear escalation guidelines for supported system components.
Environment: AWS (EC2, S3, SQS, SNS, Route 53, VPC, Lambda, Glacier, EFS, VPC, EBS, RDS, ELB, Cloud Front, CloudWatch), ELK, GIT 1.7, Maven 3.2.x/3.1, Jenkins 1.5x, Cloud Formation, DynamoDB, Azure, Sql Server, Chef, Ansible, PostgreSQL, Docker 0.x, Kubernetes, Terraform, Ubuntu, EKS, Nagios 4.0, Python 3.4/3.3, Shell Scripts, Bash, CI/CD, Unix/ Linux environment, Jira, Confluence.
Confidential - Peoria, IL
AWS Cloud/ DevOps Engineer
Responsibilities:
- Create IAM user accounts and role-based policies for access to AWS services. Perform automated snapshots using CloudWatch rules and create EBS volumes from snapshots.
- Implement and maintain monitors, alarms, and notifications for EC2 instances using CloudWatch and SNS. Troubleshoot production issues pertaining to AWS Cloud Resources and Application Infrastructure point of view.
- Performed as System Administrator on AWS to include security, load balancing, provisioning, and metrics.
- Assisted in cloud provisioning tools and Devops tools such as Terraform and CloudFormation.
- Worked with container-based deployments using Docker, working with Docker images, Docker hub and Docker registries.
- Designed and implemented fully automated Continuous Integration and Continuous Delivery (CI/CD) process. Introduced Automation into the CI/CD pipeline using Chef.
- Implemented AWS Code Pipeline and Created Cloud formation JSON templates in Terraform for infrastructure as code.
- Set up build pipelines in Jenkins by using various plug-ins like Maven plug-in, EC2 plug-in, Docker, Terraform, JDK.
- Created multiple terraform modules to manage configurations, applications, services, and automate the installation process for web server and AWS instances.
- Implemented and maintained the branching and build/release strategies utilizing GIT on AWS.
- Experienced in automating infrastructure by writing scripts in JSON, YAML, Terraform.
- Installing and configuring SSH (Secure Shell) encryption to access securely on QNX and RedHat Linux.
- Implemented Docker to provision slaves dynamically as needed. Created and maintained Docker files in Source Code Repository build images. Handled multiple Docker images primarily for middleware installations and domain configurations.
- Worked on AWS Cloud Trail to setup auditing for our AWS account.
- Implemented a continuous Delivery Pipeline with Docker, Jenkins, and GitHub to build a new Docker container automatically using Jenkins when a new GitHub branches are created.
- Installed and supervised Middleware tools through Chef such as WebLogic Application Server, Apache, Tomcat, and other web-oriented applications.
- Configured Security Groups for various AWS services to limit Inbound and Outbound traffic based on the security recommendations.
- Extensively worked on Jenkins/Hudson, Build Forge for continuous integration and End to End automation for all build and deployments.
- Used Identify and Access Management (IAM) to assign roles and to create and manage AWS users and groups and use permissions to AWS resources.
- Setup GIT repo server to keep the source code for code management. Implemented rapid-provisioning and life-cycle management for Ubuntu Linux using Chef, and custom Ruby/Bash scripts.
- Develop metrics, dashboards, and advanced filters in Jira to provide end-users and business leadership with performance metrics and status reports. Atlassian Confluence was primarily used for documenting the work we do and publishing the release notes.
- Building/Maintaining Docker container clusters managed by Kubernetes Linux, Bash, GIT, Docker, on AWS.
- Used Chef as a configuration management tool to mainly manage Infrastructure as Code (IaC).
- Worked with AWS platform in managing Docker containers and Kubernetes Clusters.
- Building/MaintainingDockercontainer clusters managed byKubernetes Linux, Bash, GIT,Docker, on AWS.
- UtilizedKubernetes, EKSfor the runtime environment of theCI/CDsystem to build, test deploy.
- Changing the AWS infrastructure Elastic Beanstalk to Docker withKubernetes.
- Security Groups and Key pairs management in AWS
- Initiated alarms using CloudWatchservice for monitoring the performance of compute instances,CPU Utilization,disk usage, etc.
- Hands-on experience in alert monitoring using Logic Monitor, CloudWatch, Splunk and Nagios.
- Worked on automation of IAM user administration using Python scripting.
- Having work experience on Performance monitoring, User management, File System Administration.
- Experience in handling customer queries and coordinating with internal teams in troubleshooting production issues.
- Scripted in multiple languages on UNIX, LINUX, and Windows - Bash, Shell, PowerShell
- Resizing the volumes of EC2 instances on AWS
Environment: AWS (EC2, S3, SNS, SQS, Glacier, VPC, EBS, RDS, ELB, Cloud Front, CloudWatch), ELK, GIT, Maven, Jenkins, Route53, Cloud Formation, DynamoDB, Sql Server, Azure, Chef, Ansible, PostgreSQL, Docker, Nginx, Kubernetes, Terraform, Ubuntu, EFS, Nagios, Python, Shell Scripts, Bash, CI/CD, Unix/ Linux environment, Jira, Confluence.
Confidential - Reston, VA
Site Reliability Engineer
Responsibilities:
- Created Group Policies and implemented efficient procedures for problem identification, tracking & documentation.
- Design, Implement and maintain Windows, UNIX, Linux, CentOS, and Ubuntu systems in a mixed environment.
- Create, maintain and administrating GitHub and Subversion repositories as source code managements
- Evaluated and controlled shutdown and restart of services & dependencies.
- Monitored the performance of the existing servers on the network and ensuring technical excellence in service delivery to optimize the performance of the network and to reduce the downtime.
- Configured secure HTTP for SVN source control, and back-end communication with SQL DB and manage user access to SVN repositories.
- Build and Release software baselines, code merges, branch and label creation in Subversion and GIT, worked as migration lead to perform migration from Subversion to Git via Bit Bucket (DVCS).
- Installed, configured & managed:
- AD / DHCP / DNS Services
- Implemented Maven Release Plug-in through Jenkins Jobs for deploying the artifacts to GIT repository.
- Initiated responsibility for administering the GIT servers which included install, upgrade, backup, adding, users, creating repository/branches, troubleshooting merging, tagging issues & maintenance.
- Configured JIRA as defect tracking system and configured various workflows, customizations, and plugins for the JIRA bug/issue tracker.
- Utilized Jenkins for enormous business scale establishment and application organizations and pulling the code from GIT and utilizing Ant to construct Jar, war, and ear ancient rarities.
- OS, Tooling, Backup Software & System Management Components
- Liable for Shift Handover at the end of SE shift
- Resolved network faults on client’s side, analyzing the faults through root-cause analysis and suggesting technical solutions for overcoming the same; performed capacity planning and reviewed change order implementation plan
- Supervised server administration including user administration, malware removal/antivirus management, license management and software compliance
- Executed administrative tasks such as creating, maintaining, and reviewing admin accounts (if more than 10), managing user accounts, monitoring workspace activity, and viewing log files
- Conducted timely patch management including acquiring, testing & installing monthly MS fixes / patches to administered computer; controlled reboot of all systems after patch application & removal of patches if issue occurred.
Environment: Jenkins, Web logic, JIRA, Maven, Java, Linux, Apache TOMCAT, GIT, LDAP, NFS, NAS, XML, Windows, Shell Scripts
Confidential - Reston, VA
Unix/ Linux System Administrator
Responsibilities:
- Perform hardware configuration, operating system loads, and assist with troubleshooting installation issues.
- Maintained and monitored all of company servers, operating system and application patch level, disk space and memory usage, user activities on day-to-day basis.
- Installations of all the required patches, drivers, third-party software. Supports multiple customers and internal teams on an as-needed basis
- Collaborates with other teams on tools for systems automation
- Perform as System Administrator; environment includes Linux System Administration, Cloud hosting, and server-side scripting.
- Expertise in managing shared NFS files system, mounting and un-mounting NFS server, NFS client on remote machine, sharing remote file folder, starting, and stopping the NFS services.
- Collaborate with development/test teams to design, build, and deploy an automated platform.
- Installed, configured, and maintained DNS, FTP, TCP/IP, and Samba on Red Hat Linux.
- Co-ordinate with web administrator's team in installation of Web Logic Application Server.
- Creating user accounts, user administration, local and global groups on Solaris and Red Hat Linux platform.
- Internal Applications Defect Management by Logging and Tracking the issues in Service Now
- Confluence Administration: Managing users and groups, managing pages, and providing access related permissions.
- Implemented System policies and hardware profiles, hard disk configuration for fault tolerance, disk mirroring and back up the data using BACKUP devices.
- Set up Axway (S)FTP transfer sites for product delivering, averaging 10 per week.
Environment: Sun E250, Sun E3000, Sun Fire v440, Solaris 8, SDS 4.1, RHEL, WebLogic, Axway.
