Hadoop Administrator Resume
Florida
SUMMARY
- 8 years of professional experience in analysis, design, development, implementation, integration and testing of client-server applications using Object-Oriented Analysis and Design (OOAD), with 3+ years of experience in deploying, maintaining, monitoring and upgrading Hadoop clusters (Apache Hadoop, Cloudera, Hortonworks).
- Experience in managing, troubleshooting and securing networks.
- Experience working with, extending, and enhancing monitoring systems like Nagios.
- Storage experience with JBOD, NFS, SAN and RAID
- Hands-on experience using Hadoop ecosystem components like Hadoop MapReduce, HDFS, ZooKeeper, Oozie, Hive, Sqoop and Pig.
- Worked in a multi-cluster environment and set up the Cloudera Hadoop ecosystem.
- In-depth understanding of Data Structure and Algorithms.
- Experience in setting up monitoring tools like Nagios and Ganglia for Hadoop.
- Experience in importing and exporting data from different databases like MySQL, SQL Server and Oracle into HDFS and Hive using Sqoop.
- Experience in configuring Zookeeper to provide Cluster coordination services.
- Experience in providing security for Hadoop Cluster with Kerberos.
- Troubleshooting and transforming data moved from RDBMS into HDFS.
- Set up and managed HA on nodes to avoid single points of failure in large clusters.
- Experience in setting up the High-Availability Hadoop Clusters.
- Experience in writing UDFs for Hive and Pig.
- Experience in developing Shell Scripts for system management.
- Experience in upgrading the existing Hadoop cluster to latest releases.
- Setting up and maintaining NoSQL databases like Cassandra and MongoDB.
- Experience in deploying Cassandra clusters (Apache Cassandra, DataStax).
- Monitoring a Cassandra cluster using OpsCenter.
- Ability to diagnose network problems.
- Installing and administering NameNode and JobTracker High Availability.
- Understanding of TCP/IP networking and its security considerations.
- Excellent in communicating with clients, customers, managers, and other teams in the enterprise at all levels.
- Effective problem solving skills and outstanding interpersonal skills. Ability to work independently as well as within a team environment. Driven to meet deadlines.
- Motivated to produce robust, high-performance software.
- Benchmarking and Stress Testing on Hadoop Cluster.
- Ability to learn and use new technologies quickly.
TECHNICAL SKILLS
Bigdata Technologies: - MapReduce, Hive, Pig, ZooKeeper, Sqoop, Oozie, Flume, HBase, AWS
Bigdata Frameworks: - HDFS, YARN, Storm, Kafka
Hadoop Distributions: - Cloudera (CDH3, CDH4, CDH5), Hortonworks HDP (2.7.3)
Programming Languages: - Java SE 7, Shell Scripting, PowerShell, Python 2.7
Operating Systems: - Windows 98/2000/XP/Vista/NT/8.1, Red Hat Linux/CentOS 4, 5, Unix
Database: - Oracle 10g/11g, T-SQL, MongoDB, PL/SQL
ETL Stack: - Informatica 8x/9x, SSIS, Tableau
Business Modeling tools: - UML, MS Office, Remedy, ServiceNow, MS Visio
PROFESSIONAL EXPERIENCE
Confidential, Florida
Hadoop Administrator
Responsibilities:
- Responsible for architecting Hadoop clusters and translating functional and technical requirements into detailed architecture and design.
- Installed and configured a multi-node, fully distributed Cloudera Hadoop cluster with a large number of nodes.
- Provided Hadoop, OS and hardware optimizations.
- Set up the machines with network control, static IPs, disabled firewalls and swap memory.
- Installed and configured Hadoop ecosystem components like MapReduce, Hive, Pig, Sqoop, HBase, ZooKeeper and Oozie.
- Involved in testing HDFS, Hive, Pig and MapReduce access for the new users.
- Performed cluster maintenance as well as addition and removal of nodes using Cloudera Manager Enterprise.
- Worked on Apache Storm in combination with Kafka for website activity tracking, metrics collection and monitoring.
- Implemented best income logic using Pig scripts and UDFs.
- Worked on setting up high availability for a major production cluster and designed automatic failover control using ZooKeeper and quorum journal nodes.
- Implemented the Fair Scheduler on the JobTracker to allocate a fair share of resources to small jobs.
- Set up automated processes to archive and clean unwanted data on the cluster, in particular on the NameNode and Secondary NameNode.
- Performed operating system installation and Hadoop version updates using automation tools.
- Configured Oozie for workflow automation and coordination.
- Implemented rack-aware topology on the Hadoop cluster.
- Imported and exported structured data from different relational databases like MySQL into HDFS and Hive using Sqoop.
- Configured ZooKeeper to implement node coordination in support of clustering.
- Configured Flume for efficiently collecting, aggregating and moving large amounts of log data from many different sources to HDFS.
- Involved in collecting and aggregating large amounts of streaming data into HDFS using Flume and defined channel selectors to multiplex data into different sinks.
- Worked on developing scripts for performing benchmarking with TeraSort/TeraGen.
- Implemented the Kerberos authentication protocol to secure an existing cluster.
- Troubleshot production-level issues in the cluster and its functionality.
- Backed up data on a regular basis to a remote cluster using DistCp.
- Regularly commissioned and decommissioned nodes depending on the amount of data.
Environment: - Hadoop HDFS, MapReduce, Hive, Pig, Oozie, Sqoop, Cloudera Manager
Confidential, FL
Hadoop Administrator
Responsibilities:
- Hands-on experience with Apache & Hortonworks Hadoop ecosystem components such as Sqoop, HBase and MapReduce.
- Installation, configuration and troubleshooting of an Apache Hadoop cluster (20+ nodes) for application development.
- Installation, monitoring, managing, troubleshooting and applying patches in different environments such as development, test and production clusters.
- Excellent understanding of Hadoop architecture and its various components such as HDFS, JobTracker, TaskTracker, NameNode, DataNode and MapReduce.
- Experience in managing Hadoop infrastructure, including commissioning, decommissioning and rack topology implementation.
- Experience in managing cluster resources by implementing the Fair Scheduler and Capacity Scheduler.
- Experience in implementing NameNode High Availability and in Hadoop cluster capacity planning.
- Developed automated scripts using Unix Shell for running Balancer, file system health check and User/Group creation on HDFS.
- Experience in managing and reviewing Hadoop log files.
- Experienced in using Kafka to handle real-time data feeds into Storm and from there into HDFS on the Hadoop cluster.
- Experience in upgrading Hadoop clusters across both minor and major versions.
- Experience in designing and building disaster recovery plans across data centers to provide business continuity.
- Demonstrated an understanding of the concepts, best practices and functions needed to implement a Big Data solution in a corporate environment.
- Helped design scalable Big Data clusters and solutions.
- Implemented Fair Schedulers to share cluster resources among the MapReduce jobs submitted by users.
- Experience in writing automation scripts and setting up cron jobs to maintain cluster stability and health.
- Monitored and controlled local file system disk space usage and local log files, cleaning up log files with automated scripts.
- As a Hadoop admin, monitored cluster health status on a daily basis, tuned performance-related configuration parameters and backed up configuration XML files.
- Worked with Hadoop developers and designers to troubleshoot MapReduce job failures and issues.
- Worked with network and Linux system engineers/admins to define optimum network configurations, server hardware and operating system.
- Evaluated and proposed new tools and technologies to meet the needs of the organization.
- Production support responsibilities included cluster maintenance and 24/7 on-call support on a weekly rotation.
Environment: RHEL, CentOS, Ubuntu, CDH3, Apache Hadoop, HDFS, MapReduce, HBase, Shell Scripts
Confidential
Unix & Hadoop Administrator
Responsibilities:
- Worked on administration, monitoring and fine-tuning of an existing Cloudera Hadoop cluster used by internal and external users as a Data and Analytics as a Service platform.
- Worked on Cloudera cluster backup and recovery, performance monitoring, load balancing, rebalancing and tuning, capacity planning, and disk space management.
- Assisted in the design, development and architecture of Hadoop and HBase systems.
- Coordinated with technical teams for installation of Hadoop and related third-party applications on systems.
- Formulated procedures for planning and execution of system upgrades for all existing Hadoop clusters.
- Supported daily operations and helped develop strategies to improve availability and utilization of Unix environments.
- Worked on system administration, user creation, file/directory permissions, LVM, loading of software and system patches.
- Supported technical team members for automation, installation and configuration tasks.
- Involved in troubleshooting problems and issues related to the efficient, secure operation of the Linux operating system.
- Worked closely in designing and optimizing the configuration of Linux to meet the Service Level Agreements of our applications and services.
Environment: Cloudera 4.x, Java, HDFS, Hive, HBase, Red Hat Linux 4, LVM, Firewall
Confidential
Linux Administrator
Responsibilities:
- Installed and upgraded OE & Red Hat Linux and Solaris 8/ & SPARC on servers like HP DL380 G3, G4 and G5 and Dell PowerEdge servers.
- Experience in LDOMs and in creating sparse-root and whole-root zones; administered the zones for web, application and database servers and worked on SMF on Solaris 5.10.
- Implemented and administered VMware ESX 3.0 for running Windows, CentOS, SUSE and Red Hat Linux servers on development and test servers.
- Installed and configured Apache on Linux and Solaris and configured Virtual hosts and applied SSL certificates.
- Implemented JumpStart on Solaris and Kickstart for Red Hat environments.
- Experience working with HP LVM and Red Hat LVM.
- Experience in implementing P2P and P2V migrations.
- Involved in installing and configuring CentOS and SUSE 11 & 12 servers on HP x86 servers.
- Implemented HA using Red Hat Cluster and VERITAS Cluster Server 4.0 for the WebLogic agent.
- Managing DNS, NIS servers and troubleshooting the servers.
- Troubleshooting application issues on Apache web servers and also database servers running on Linux and Solaris.
- Experience in migrating Oracle and MySQL data using Double-Take products.
- Used Sun Volume Manager for Solaris and LVM on Linux & Solaris to create volumes with layouts like RAID 1, 5, 10, 15.
- Re-compiling Linux kernel to remove services and applications that are not required.
- Performed performance analysis using tools like prstat, mpstat, iostat, sar, vmstat, truss and DTrace.
- Experience working on LDAP user accounts and configuring LDAP on client machines.
- Worked on patch management tools like Sun Update Manager.
- Experience supporting middleware servers running Apache, Tomcat and Java applications.
- Worked on day-to-day administration tasks and resolved tickets using Remedy.
- Used HP Service Center and its change management system for ticketing.
Environment: Red Hat Linux/CentOS 4, 5, Logical Volume Manager, Hadoop, VMware ESX 3.0, Apache and Tomcat Web Server, Oracle 9g, Oracle RAC, HPSM, HPSA.
Confidential
System Administrator
Responsibilities:
- Installation, configuration and upgrade of CentOS and Red Hat Linux.
- Administered DNS, NIS, NIS+, NFS and Sendmail, and was involved in troubleshooting.
- Extensive experience in building servers using the JumpStart and Kickstart processes.
- Expertise in package management using Red Hat RPM, used in several Linux distributions such as Red Hat Enterprise Linux, SUSE Linux Enterprise and Fedora.
- Disk and file system management through EXT and Logical Volume Manager.
- Performed change management, problem management, cloning, and operating system and data backup and recovery strategies.
- Configured and managed Samba servers on Linux and managed file systems.
- Installation, configuration and administration of Apache servers.
- Installed and monitored VMware virtual environments.
- Added service groups and resources with appropriate dependencies based on requirements, and documented the change design/architecture diagram of VERITAS Cluster Servers.
- Handled user and security administration, backup, recovery and various maintenance activities.
- Planned and implemented disaster recovery sites from scratch; involved in disaster recovery testing every quarter.
- Interacted with vendors for hardware and software support.
- Involved in on-call pager rotation for production support.
- Demonstrated skill in supporting end users and proven ability to plan, organize and work as a member of a technical project team, both in small groups as well as large corporate areas.
Environment: Red Hat Linux/CentOS 4, 5, Logical Volume Manager, VMware ESX 3.0, Oracle 9g, Oracle RAC, HPSM, HPSA.
