We provide IT Staff Augmentation Services!

Hadoop Administrator Resume

2.00/5 (Submit Your Rating)

FL

SUMMARY

  • 8 years of professional experience in analysis, design, development, implementation, integration and testing of Client - Server applications using Object Oriented Analysis Design (OOAD) with 3+ years of experience in deploying, maintaining, monitoring and upgrading Hadoop Clusters. (Apache Hadoop, Cloudera, Hortonworks).
  • Experience with managing, troubleshooting and security networks.
  • Experience working with, extending, and enhancing monitoring systems like Nagios.
  • Storage experience with JBOD, NFS, SAN and RAID
  • Hands on experience using Hadoop ecosystem components like Hadoop Map Reduce, HDFS, ZooKeeper, Oozie, Hive,Sqoop, Pig.
  • Worked on Multi Clustered environment and setting up Cloudera Hadoop echo-System.
  • In-depth understanding of Data Structure and Algorithms.
  • Experience in setting up monitoring tools like Nagios and Ganglia for Hadoop.
  • Experience in Importing and exporting data from different databases like MySQL, SQL Server, Oracle into HDFS and Hive using Sqoop.
  • Experience in configuring Zookeeper to provide Cluster coordination services.
  • Experience in providing security for Hadoop Cluster with Kerberos.
  • Troubleshooting and Transform data from RDBMS to HDFS.
  • Setup and manage HA on nodes to avoid single point of failures in large clusters.
  • Experience in setting up the High-Availability Hadoop Clusters.
  • Experience in writing UDFs for Hive and Pig.
  • Experience in developing Shell Scripts for system management.
  • Experience in upgrading the existing Hadoop cluster to latest releases.
  • Setting up and maintaining NoSQL Databases like Cassandra,MongoDB.
  • Experience is deploying, Cassandra cluster (Apache Cassandra, Datastax).
  • Monitoring a Cassandra Cluster using OpsCenter
  • Ability to diagnose network problems
  • Installing and Administration Name node and Job tracker High Availability
  • Understanding of TCP/IP networking and its security considerations.
  • Excellent in communicating with clients, customers, managers, and other teams in the enterprise at all levels.
  • Effective problem solving skills and outstanding interpersonal skills. Ability to work independently as well as within a team environment. Driven to meet deadlines.
  • Motivated to produce robust, high-performance software.
  • Benchmarking and Stress Testing on Hadoop Cluster.
  • Ability to learn and use new technologies quickly.

TECHNICAL SKILLS

Bigdata Technologies: - MapReduce, Hive, Pig, Zookeeper, Sqoop, Oozie, Flume, Hbase, AWS

Bigdata Frameworks: - HDFS, YARN, Storm, Kafka

Hadoop Distributions: Cloudera(CDH3, CDH4, CDH5), Hortonworks HDP(2.7.3)

Programming Langauges: - Java SE 7, Shell Scripting, PowerShell,Python-2.7

Operating Systems: - Windows 98/2000/XP/Vista/NT/ 8.1, Redhat Linux/Centos 4, 5, Unix

Database: - Oracle 10g/11g, T-SQL, MongoDB, PL/SQL

ETL Stack: - Informatica 8x/9x, SSIS,Tableu

Business Modeling tools: - UML, MS office, Remedy, Service Now, MS-Visio

PROFESSIONAL EXPERIENCE

Confidential, FL

Hadoop Administrator

Responsibilities:

  • Responsible for architecting Hadoop clusters Translation of functional and technical requirements into detailed architecture and design.
  • Installed and configured multi-nodes fully distributed Cloudera Hadoop cluster of large number of nodes.
  • Provided Hadoop, OS, Hardware optimizations.
  • Setting up the machines with Network Control, Static IP, Disabled Firewalls, Swap memory.
  • Installed and configured Hadoop ecosystem components like MapReduce, Hive, Pig, Sqoop, HBase, ZooKeeper and Oozie.
  • Involved in testing HDFS, Hive, Pig and MapReduce access for the new users.
  • Cluster maintenance as well as creation and removal of nodes using Hortonworks Manager Enterprise.
  • Worked on Apache Storm in combination with Kafka for Website Activity, Tracking Metrics Collection & Monitoring.
  • Implemented best income logic using Pig scripts and UDFs.
  • Worked on setting up high availability for major production cluster and designed automatic failover control using zookeeper and quorum journal nodes.
  • Implemented Fair scheduler on the job tracker to allocate fair amount of resources to small jobs.
  • Set up automated processes to archive/clean the unwanted data on the cluster, in particular on Name node and Secondary name node.
  • Performed operating system installation, Hadoop version updates using automation tools.
  • Configured Oozie for workflow automation and coordination.
  • Implemented rack aware topology on the Hadoop cluster.
  • Importing and exporting structured data from different relational databases like MySQL into HDFS and Hive using Sqoop.
  • Configured ZooKeeper to implement node coordination, in clustering support.
  • Configured Flume for efficiently collecting, aggregating and moving large amounts of log data from many different sources to HDFS.
  • Involved in collecting and aggregating large amounts of streaming data into HDFS using Flume and defined channel selectors to multiplex data into different sinks.
  • Worked on developing scripts for performing benchmarking with Terasort/Teragen.
  • Implemented Kerberos Security Authentication protocol for existing cluster.
  • Good experience in troubleshoot production level issues in the cluster and its functionality.
  • Backed up data on regular basis to a remote cluster using DistCp.
  • Regular Commissioning and Decommissioning of nodes depending upon the amount of data.

Environment: - Hadoop HDFS, Mapreduce, Hive, Pig, Oozie, Sqoop, Cloudera Manager

Confidential, FL

Hadoop Administrator

Responsibilities:

  • Hands on experience with Apache & Hortonworks Hadoop Ecosystem components such as Scoop, Hbase and Mapreduce.
  • Instillation/configuration and troubleshooting to Apache Hadoop cluster (20+nodes) for application development.
  • Installation, monitoring, managing, troubleshooting, applying patches in different environments such as Development Cluster, Test Cluster and Production environments.
  • Excellent understanding / knowledge of Hadoop architecture and various components such as HDFS, Job Tracker, Task Tracker, Name Node, Data Node and Map Reduce.
  • Experience in managing Hadoop infrastructure like commissioning, decommissioning, rack topology implementation.
  • Experience in managing the cluster resources by implementing fair scheduler and capacity scheduler.
  • Experience in Implementing High Availability of Name Node and Hadoop Cluster capacity planning.
  • Developed automated scripts using Unix Shell for running Balancer, file system health check and User/Group creation on HDFS.
  • Experience in managing and reviewing Hadoop log files.
  • Experienced in Kafka in handling the real-time data feeds into storm and from there to Hadoop Cluster on HDFS.
  • Experience in upgrading Hadoop cluster from current version to minor version upgrade as well as to major versions.
  • Experience in designing and building disaster recovery planning across the data centers to provide business continuity.
  • Demonstrate and understanding of concepts, best practices and functions to implement a Big Data solution in a corporate environment.
  • Help design of scalable Big Data clusters and solutions.
  • Implemented Fair schedulers to share the resources of the cluster for the map reduce jobs given by the users.
  • Experience on writing automation scripts and setting up cron jobs to maintain cluster stability and healthy.
  • Monitoring and controlling local file system disk space usage, local log files, cleaning log files with automated scripts.
  • As a Hadoop admin, monitoring cluster health status on daily basis, tuning system performance related configuration parameters, backing up configuration XML files.
  • Work with Hadoop developers, designers in troubleshooting map reduce job failures and issues and helping to developers.
  • Work with network and Linux system engineers/admin to define optimum network configurations, server hardware and operating system.
  • Evaluate and propose new tools and technologies to meet the needs of the organization.
  • Production support responsibilities include cluster maintenance and on call support on weekly rotation 24/7.

Environment: RHEL, CentOS, Ubuntu, CDH3, Apache Hadoop, HDFS, Map, Reduce, Hbase, Shell Scripts

Confidential

Unix &Hadoop Administrator

Responsibilities:

  • Worked on Administration, monitoring and fine tuning on an existing Cloudera Hadoop Cluster used by internal and external users as a Data and Analytics as a Service Platform.
  • Worked on Cloudera Cluster backup and recovery, performance monitoring, load balancing, rebalancing and tuning, capacity planning, and disk space management
  • Assisted in designing, development and architecture of HADOOP and HBase systems.
  • Coordinated with technical teams for installation of HADOOP and third related applications on systems.
  • Formulated procedures for planning and execution of system upgrades for all existing HADOOP clusters.
  • Supported daily operations and helped to develop strategies in-order to improve availability and utilization of Unix environments.
  • Worked on system administration, user creation, file/directory permissions, LVM, loading of software and system patches.
  • Supported technical team members for automation, installation and configuration tasks.
  • Involved in troubleshooting problems and issues related to the efficient, secure operation of the Linux operating system.
  • Worked closely in designing and optimizing the configuration of Linux to meet the Service Level Agreements of our applications and services.

Environment: Cloudera 4.X, Java, HDFS, Hive, and Hbase, Redhat Linux 4, LVM, Firewall

Confidential 

LINUX Administrator

Responsibilities:

  • Installing and upgrading OE & Red hat Linux and Solaris 8/ & SPARC on Servers like HP DL 380 G3, 4 and 5 & Dell Power Edge servers.
  • Experience in LDOM's and Creating sparse root and whole root zones and administered the zones for Web, Application and Database servers and worked on SMF on Solaris 5.10.
  • Implemented and administered VMware ESX 3.0, for running the Windows, Centos, SUSE and Red hat Linux Servers on development and test servers.
  • Installed and configured Apache on Linux and Solaris and configured Virtual hosts and applied SSL certificates.
  • Implemented Jumpstart on Solaris and Kick Start for Red hat environments.
  • Experience working with HP LVM and Red hat LVM.
  • Experience in implementing P2P and P2V migrations.
  • Involved in Installing and configuring Centos & SUSE 11 & 12 servers on HP x86 servers.
  • Implemented HA using Red hat Cluster and VERITAS Cluster Server 4.0 for Web Logic agent.
  • Managing DNS, NIS servers and troubleshooting the servers.
  • Troubleshooting application issues on Apache web servers and also database servers running on Linux and Solaris.
  • Experience in migrating Oracle, MYSQL data using Double take products.
  • Used Sun Volume Manager for Solaris and LVM on Linux & Solaris to create volumes with ayouts like RAID 1, 5, 10, 15.
  • Re-compiling Linux kernel to remove services and applications that are not required.
  • Performed performance analysis using tools like prstat, mpstat, iostat, sar, vmstat, truss and Dtrace.
  • Experience working on LDAP user accounts and configuring LDAP on client machines.
  • Worked on patch management tools like Sun Update Manager.
  • Experience supporting middle ware servers running Apache, Tomcat and Java applications.
  • Worked on day to day administration tasks and resolve tickets using Remedy.
  • Used HP Service center and change management system for ticketing.

Environment: Redhat Linux/CentOS 4, 5, Logical Volume Manager, Hadoop, VMware ESX 3.0, Apache and Tomcat Web Server, Oracle 9g, Oracle RAC, HPSM, HPSA.

Confidential

System Administrator

Responsibilities:

  • Installation, Configuration & Upgrade of CentOS, Red-hat Linux.
  • Administered DNS, NIS, NIS+ and NFS, Send Mail and involved in troubleshooting.
  • Extensive experience in building servers using Jumpstart and Kick-start Process.
  • Expertise in package management using Red Hat RPM used in several Linux distributions such as Red Hat Enterprise Linux, SUSE Linux Enterprises and Fedora.
  • Disk and File system management through EXT and Logical Volume manager.
  • Performed Change Management, Problem Management, cloning, Operating system and data backups and recovery Strategies.
  • Configuring & handling Samba servers on Linux, Managing file system.
  • Installation, Configuration and Administration on Apache Servers
  • Installed and monitored VMware Virtual environments
  • Added Service Groups and resources based on the requirement with appropriate dependencies, documented change design/Architecture diagram of VERITAS Cluster Servers
  • Configured Users & Security administration, backup, recovery and maintenance of various activities.
  • Planning and implemented Disaster Recovery sites from the scratch; Involved in disaster recovery testing every quarter
  • Interaction with vendors for Hardware and software supports
  • Involved in on-call pager rotation for production support.
  • Demonstrated skill in supporting end users and proven ability to plan, organize and work as a member of a technical project team, both in small groups as well as large corporate areas.

Environment: -Redhat Linux/CentOS 4, 5, Logical Volume Manager, VMware ESX 3.0, Oracle 9g, Oracle RAC, HPSM, HPSA.

We'd love your feedback!