Hadoop Admin Resume
San Ramon, CA
SUMMARY:
- Over 8 years of Information Technology experience, with expertise in administration and operations of Big Data and Cloud Computing technologies.
- Expertise in setting up fully distributed, multi-node Hadoop clusters with Apache and Cloudera Hadoop.
- Expertise in AWS services such as EC2, Simple Storage Service (S3), Auto Scaling, EBS, Glacier, VPC, ELB, RDS, IAM, CloudWatch, and Redshift.
- Expertise in MIT Kerberos, High Availability, and integration of Hadoop clusters.
- Experience in upgrading Hadoop clusters.
- Strong knowledge of installing, configuring, and using ecosystem components such as Hadoop MapReduce, Oozie, Hive, Sqoop, Pig, Flume, ZooKeeper, and Kafka, along with NameNode recovery and HDFS High Availability; experienced with Hadoop shell commands and with verifying, managing, and reviewing Hadoop log files.
- Designed and implemented CI/CD pipelines achieving end-to-end automation; supported server/VM provisioning, middleware installation, and deployment activities via Puppet.
- Wrote Puppet manifests to provision several pre-prod environments.
- Wrote Puppet modules to automate the build/deployment process and improve remaining manual processes.
- Designed, installed, and implemented Puppet; good knowledge of automation using Puppet.
- Implemented AWS architectures for web applications.
- Experience with EC2, S3, ELB, IAM, CloudWatch, and VPC in AWS.
- Experience in understanding the security requirements for Hadoop and integrating with Kerberos authentication and authorization infrastructure.
- Extensive experience performing administration, configuration management, monitoring, debugging, and performance tuning in Hadoop clusters.
- Performed AWS EC2 instance mirroring, WebLogic domain creation, and several proprietary middleware installations.
- Worked on Agile projects delivering an end-to-end continuous integration/continuous delivery pipeline by integrating tools such as Jenkins, Puppet, and AWS for VM provisioning.
- Evaluated EC2 instance performance (CPU and memory usage) and set up EC2 Security Groups and VPCs.
- Configured and managed Jenkins in various environments, Linux and Windows.
- Administered version control systems (Git) to create daily backups and checkpoint files.
- Created various branches in Git, merged from the development branch to the release branch, and created tags for releases.
- Experience creating, managing, and performing container-based deployments using Docker images containing middleware and applications together.
- Enabled/disabled passive and active checks for hosts and services in Nagios.
- Good knowledge of installing, configuring, and maintaining Chef server and workstation.
- Expertise in provisioning clusters and building manifest files in Puppet for required services.
- Excellent knowledge of importing/exporting structured and unstructured data from sources such as RDBMS, event logs, and message queues into HDFS, using tools such as Sqoop and Flume (see the Sqoop sketch after this list).
- Expertise in converting non-Kerberized Hadoop clusters to Kerberized Hadoop clusters.
- Administration and Operations experience with Big Data and Cloud Computing Technologies
- Hands-on experience setting up fully distributed, multi-node Hadoop clusters with Apache Hadoop on AWS EC2 instances.
- Hands-on experience with AWS services such as EC2, Simple Storage Service (S3), Auto Scaling, EBS, ELB, RDS, IAM, and CloudWatch.
- Performed administration, configuration management, monitoring, debugging, and performance tuning of Hadoop clusters.
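A minimal, hypothetical sketch of the kind of Sqoop import referenced above (RDBMS data into HDFS); the host, database, table, and target directory are illustrative placeholders, not details of an actual engagement:

  # Hypothetical example: import one MySQL table into HDFS with Sqoop
  sqoop import \
    --connect jdbc:mysql://db-host:3306/salesdb \
    --username etl_user -P \
    --table orders \
    --target-dir /data/raw/orders \
    --num-mappers 4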
TECHNICAL SKILLS:
Hadoop/Big Data: HDFS, HBase, Pig, Hive, Zookeeper, Sqoop, MapReduce, Kafka, Spark, Oozie, Hortonworks and Cloudera
Languages: Unix Shell Script, JavaScript, Python, Java, Pig, MySQL, HiveQL, CSS, HTML
DevOps Tools: Puppet, Jira, Jenkins, Docker, GIT, GitHub
Frameworks: Apache and Cloudera Hadoop, Amazon Web Services
Platforms: Ubuntu, Centos, Redhat, Amazon Linux
Web Servers: Apache/HTTPD, Nginx
Operating Systems: Windows, Macintosh, Ubuntu (Linux).
Monitoring Tools: Nagios, PagerDuty
RDBMS: MySQL, SQL Server
Data Warehousing: Hive
NoSQL Databases: MongoDB, HBase, Cassandra
Log Collector & Aggregation: Flume, Kafka
Source Control Tools: GitHub
Team Communication: Slack
Defect Tracking Tools: FogBugz, Jira
AWS Components: EC2, Simple Storage Service (S3), EBS, VPC, ELB, RDS, IAM, CloudWatch
Other Tools: S3Organizer, Sqoop, Oozie, Zookeeper, Hue, DBVisualizer, Google authentication to Services
PROFESSIONAL EXPERIENCE:
Confidential, San Ramon, CA
Hadoop Admin
Roles and Responsibilities:
- Administration and monitoring of Hadoop clusters.
- Worked on the Hadoop upgrade from version 4.5 to 5.2.
- Monitored Hadoop cluster job performance and performed capacity planning.
- Removed retired nodes of particular security groups from Nagios monitoring.
- Responsible for managing and scheduling jobs on Hadoop Cluster
- Replaced retired Hadoop slave nodes through the AWS console and Nagios repositories.
- Performed dynamic updates of Hadoop YARN and MapReduce memory settings.
- Worked with the DBA team to migrate the Hive and Oozie metastore database from MySQL to RDS.
- Worked with the Fair and Capacity schedulers: created new queues, added users to queues, increased mapper and reducer capacity, and administered permissions to view and submit MapReduce jobs.
- Experience in administration and maintenance of source control management systems such as Git and GitHub.
- Hands-on experience installing and administering CI tools like Jenkins.
- Experience integrating shell scripts with Jenkins.
- Installed and configured the automation tool Puppet, including installation and configuration of the Puppet master, agent nodes, and an admin control workstation.
- Worked with modules, classes, and manifests in Puppet.
- Experience in creating Docker images
- Used containerization technologies like Docker to build clusters and orchestrate container deployments.
- Operations: custom shell scripts, VM and environment management.
- Experience working with Amazon EC2, S3, and Glacier.
- Created multiple groups and set permission policies for various groups in AWS.
- Experience creating lifecycle policies in AWS S3 to back data up to Glacier.
- Experience in maintaining, executing, and scheduling build scripts to automate DEV/PROD builds.
- Configured Elastic Load Balancers with EC2 Auto Scaling groups.
- Created monitors, alarms, and notifications for EC2 hosts using CloudWatch (see the CloudWatch sketch after this list).
- Launched Amazon EC2 instances using Amazon Machine Images (Linux/Ubuntu) and configured launched instances for specific applications.
- Worked with the IAM service: created new IAM users and groups and defined roles, policies, and identity providers.
- Experience enabling MFA in AWS using IAM and on S3 buckets.
- Defined AWS Security Groups, which acted as virtual firewalls controlling the traffic allowed to reach one or more AWS EC2 instances.
- Used Amazon Route 53 to manage DNS zones and provide public DNS names mapped to Elastic Load Balancer IPs.
- Used default and custom VPCs to create private cloud environments with public and private subnets.
- Loaded data from Oracle, MS SQL Server, MySQL, and flat-file databases into HDFS and Hive.
- Fixed NameNode partition failures, un-rotated fsimage issues, and MR jobs failing with too many fetch failures, and troubleshot common Hadoop cluster issues.
- Implemented manifest files in Puppet for automated orchestration of Hadoop and Cassandra clusters.
- Maintained GitHub repositories for configuration management.
- Configured the distributed monitoring system Ganglia for Hadoop clusters.
- Managed cluster coordination services through ZooKeeper.
- Configured and deployed a NameNode High Availability Hadoop cluster with SSL and Kerberos.
- Handled restarts of several services and killed processes by PID to clear alerts.
- Monitored log files of several services and cleared files in case of disk space issues on sharethis nodes.
- Provided 24x7 production support on a weekly schedule with the Ops team.
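As a hypothetical illustration of the CloudWatch alarms described above, the sketch below uses the AWS CLI; the alarm name, instance ID, threshold, and SNS topic ARN are assumed placeholders:

  # Hypothetical example: alarm on sustained high CPU for one EC2 host, notify an SNS topic
  aws cloudwatch put-metric-alarm \
    --alarm-name hadoop-node-high-cpu \
    --namespace AWS/EC2 \
    --metric-name CPUUtilization \
    --dimensions Name=InstanceId,Value=i-0123456789abcdef0 \
    --statistic Average \
    --period 300 \
    --evaluation-periods 2 \
    --threshold 80 \
    --comparison-operator GreaterThanThreshold \
    --alarm-actions arn:aws:sns:us-west-2:123456789012:ops-alerts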
Environment: CentOS, CDH4, Hive, Sqoop, Flume, HBase, MySQL, Cassandra, Oozie, Puppet, PagerDuty, Nagios, AWS (S3, EC2, IAM, CloudWatch, RDS, ELB, Auto Scaling, EBS, VPC, EMR), GitHub.
Confidential, SFO, CA
Hadoop Administrator
Roles and Responsibilities:
- Maintained a Hortonworks Hadoop distribution cluster.
- Recovered lost EC2 key pairs.
- Loaded databases using Sqoop.
- Scheduled database backups to AWS S3 (see the backup sketch after this list).
- Provided data to the Sales and Reporting teams per their needs.
- Migrated reports as per requirements.
- Responsible for ongoing administration of data and analytics infrastructure.
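One possible shape for the scheduled database backups to S3 mentioned above; the database name, S3 bucket, and cron schedule are illustrative assumptions rather than the actual configuration, and MySQL credentials are assumed to come from ~/.my.cnf:

  # Hypothetical nightly backup: dump MySQL and copy the archive to S3
  # example cron entry:  0 2 * * *  /opt/scripts/mysql_backup_to_s3.sh
  DATE=$(date +%F)
  mysqldump --single-transaction salesdb | gzip > /tmp/salesdb-$DATE.sql.gz
  aws s3 cp /tmp/salesdb-$DATE.sql.gz s3://example-db-backups/salesdb/
  rm -f /tmp/salesdb-$DATE.sql.gz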
Environment: CentOS, Python, Shell Script, AWS (EC2, S3, IAM, CloudWatch), MySQL, Nagios, Kafka, Jira.
Confidential
Hadoop Administrator
Roles and Responsibilities:
- Modified the existing architecture using Auto Scaling and Elastic Load Balancer.
- Implemented Auto Scaling and Elastic Load Balancer.
- MySQL basic administration (backups, user privileges, etc.).
- Nagios and PagerDuty setup.
- Created custom metrics in CloudWatch (see the sketch after this list).
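A minimal sketch of publishing a custom CloudWatch metric with the AWS CLI, as referenced above; the namespace, metric name, and disk-space source are hypothetical:

  # Hypothetical example: publish free root-disk space (MB) as a custom metric
  FREE_MB=$(df -m / | awk 'NR==2 {print $4}')
  aws cloudwatch put-metric-data \
    --namespace "Custom/Hosts" \
    --metric-name RootFreeDiskMB \
    --unit Megabytes \
    --value "$FREE_MB" \
    --dimensions InstanceId=i-0123456789abcdef0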
Confidential
Hadoop Administrator
Roles and Responsibilities:
- Migrated the data from MySQL to HDFS using Sqoop.
- Wrote ETL scripts in Pig and wrote UDFs for Pig.
- Installed and configured multiple Hadoop nodes
- Moved the insight data into Teradata using scripts.
Confidential
Hadoop/AWS Trainee
Roles and Responsibilities:
- Prepared detailed schema and program specifications from which the database was modified.
- Implemented Auto Scaling and Elastic Load Balancer.
- MySQL basic administration.
- Nagios setup and creation of custom metrics in CloudWatch.
- Utilized UNIX shell scripting/programming to build re-usable utilities
- Trained on AWS, NoSQL databases, and cloud operations; implemented PoC projects.