We provide IT Staff Augmentation Services!

Hadoop Administrator Resume

2.00/5 (Submit Your Rating)

NJ

SUMMARY

  • Around 4+ years’ experience in Hadoop, and 9+ Yrs total IT experience in UNIX, Solaris and Red hat Linux Administration.
  • Working on Hortonworks (HDP 2.4) & Cloudera (CDH5) cluster setup
  • Providing hardware architectural guidance, planning and estimating cluster capacity, and creating roadmaps for Hadoop cluster deployment
  • Build the unix servers for hadoop clusters as per project requirement.
  • Managing the Cluster using Ambari and restarting the services when require
  • Installed and configured multi - nodes fully distributed Hadoop cluster.
  • Troubleshooting the issues like job failures and performance issues in Hadoop.
  • Participated in client Audits and lead the team to successful completion
  • Automate the repeated tasks using python.
  • Having knowledge in shell and python programing language.
  • Installation, configuration of RedHat Linux operating system on VMware, HP server
  • Installation, Configuration & troubleshooting of Administration of Logical Volume Manager
  • Linux kernel patching and rollback if new kernel failure occurs
  • Installing the packages using rpm, yum & up2date commands
  • Working on oracle and DB2 backup and restore.
  • Performance tuning of Oracle DB in memory and space from OS end.
  • Implementing RAID 0,1 and 5 levels using SVM, LVM & VxVM.
  • Implementing and managing NFS and NTP services
  • Experience in ITILV3 knowledge for better delivery to the customer
  • On call support in off business hours and make sure the availability 24*7

TECHNICAL SKILLS

Big Data: Hadoop Horton Works 2.4,Cloudera distributions

Eco System tools: Ambari, PIG, Hive, Sqoop, Hbase, knox, Flume,oozie,kafka

Operating systems: Sun Solaris 9, 10 &11 and Linux 5,6 and HP-UX, AIX

Hardware: V240,V480,T2000,M4000,Dell & HP Proliant Generation 5/6

Virtualization: Vmware,Solaris Zones and Ldoms

Disk Layout: Solaris Volume Manager, VERITAS Volume Manager,LVM

High Availability: VERITAS Cluster, Redhat Cluster, Sun Cluster&ServiceGuard

Monitoring Tools: Big Brother, Tivoli,Nagios,Ganglia

Ticketing Tools: SM9,SM7 and BMC Remedy

Languages: Shell scripting, Python

Storage Related: EMC (Clariion/Symmetrix )

Backup Related: Tivoli (TSM), EMC-Networker

Access Management: NIS, AD,CyberArk, Kerberos,LDAP,Rangers

Protocols: TCP/IP, FTP, SSH, Telnet, SCP,RSH,ARP and RARP

Storage Area n/w: RAID’s, SCSI, iSCSI, DAS,NAS and SAN

SMTP: SMTP mail Relay servers,Clear Swift tool

Configuration Tools: Puppet, IBM-TEM tool

Database: Oracle, DB2, My Sql

PROFESSIONAL EXPERIENCE

Confidential, NJ

Hadoop Administrator

Responsibilities:

  • Working for Confidential and financial services
  • Providing hardware architectural guidance, planning and estimating cluster capacity, and creating roadmaps for Hadoop cluster deployment.
  • Working on Hortonworks & Cloudera cluster setup
  • Automate the repeated tasks using python.
  • Having knowledge in shell and python programing language
  • Installing, Configuring, Maintaining, and Troubleshooting Standalone systems
  • Configuring and updating parameters in servers using python.
  • Analyze the shell/python script code during the migrations.
  • Troubleshooting the issues like job failures and performance issues
  • Hadoop user administration using Sentry.
  • Upgrade from CDH from 5.2 to 5.3
  • Responsible to manage data coming from different sources.
  • Knowledge on snapshots.
  • User administration via Ldap, Kerberos mechanism.
  • Involved in Hadoop Cluster environment administration that includes adding and removing cluster nodes, cluster capacity planning, performance tuning, cluster Monitoring, Troubleshooting.
  • Adding new nodes to an existing cluster, recovering from a Name Node failure.
  • Decommissioning and commissioning the Node on running cluster
  • Supported Map Reduce Programs those are running on the cluster.
  • Configured Fair Scheduler to provide service-level agreements for multiple users of a cluster.
  • Managing nodes on Hadoop cluster connectivity and security
  • Experienced in managing and reviewing Hadoop log files
  • Maintaining Backup for name node.
  • Knowledge in Name node recoveries from previous backups
  • Importing and exporting data into HDFS using Sqoop
  • Depth conceptual and functional understanding of Map Reduce and
  • Hadoop eco-system Infrastructure (Both MRv1 and MRv2)
  • Deploy & scale-out multi-node Hadoop cluster components including MapReduce, PIG, Hive, Hbase
  • Design/implement workflow and coordinator jobs using Oozie tool
  • Functional knowledge of flume, sqoop
  • Experience in trouble shooting, optimization& performance tuning
  • Experience in change management.
  • Follow the functional spec analysis and develop ETL pipeline Develop MapReduce/PIG application to transform the data available in HDFS
  • Generate reporting data using PIG/Hive to serve business team’s ad-hoc requests
  • Import data to Hive from RDBMS sources, process it, write resulted data to HDFS
  • Cluster management, troubleshoot, share best practices to the team
  • Schedule the jobs using Oozie workflow
  • Built hadoop cluster from scratch in a “start small and scale quickly” approach
  • Well versed with the security issues like Quotas, RBAC, ACL, setuid and sticky bit.
  • Using Kerberos, LDAP,Rangers for Access identification management.

Confidential

Hadoop Administrator

Responsibilities:

  • working for insurance Client DLG from RBS group
  • Providing hardware architectural guidance, planning and estimating cluster capacity, and creating roadmaps for Hadoop cluster deployment.
  • Working on Hortonworks & Cloudera cluster setup
  • Having knowledge in shell and python programing language
  • Installing, Configuring, Maintaining, and Troubleshooting Standalone systems
  • Troubleshooting the issues like job failures and performance issues
  • Hadoop user administration using Sentry.
  • Upgrade from CDH from 5.2 to 5.3
  • Responsible to manage data coming from different sources.
  • Knowledge on snapshots.
  • User administration via Ldap, Kerberos mechanism.
  • Involved in Hadoop Cluster environment administration that includes adding and removing cluster nodes, cluster capacity planning, performance tuning, cluster Monitoring, Troubleshooting.
  • Adding new nodes to an existing cluster, recovering from a Name Node failure.
  • Decommissioning and commissioning the Node on running cluster
  • Supported Map Reduce Programs those are running on the cluster.
  • Configured Fair Scheduler to provide service-level agreements for multiple users of a cluster. Managing nodes on Hadoop cluster connectivity and security
  • Experienced in managing and reviewing Hadoop log files
  • Maintaining Backup for name node. Knowledge in Name node recoveries from previous backups, Importing and exporting data into HDFS using Sqoop
  • Depth conceptual and functional understanding of Map Reduce and
  • Hadoop eco-system Infrastructure (Both MRv1 and MRv2)
  • Deploy & scale-out multi-node Hadoop cluster components including Map Reduce, PIG, Hive, Hbase
  • Design/implement workflow and coordinator jobs using Oozie tool
  • Functional knowledge of flume, sqoop Experience in trouble shooting, optimization& performance tuning Experience in change management.
  • Follow the functional spec analysis and develop ETL pipeline Develop
  • Map Reduce/PIG application to transform the data available in HDFS
  • Generate reporting data using PIG/Hive to serve business team’s ad-hoc requests
  • Import data to Hive from RDBMS sources, process it, write resulted data to HDFS
  • Cluster management, troubleshoot, share best practices to the team
  • Schedule the jobs using Oozie workflow
  • Built hadoop cluster from scratch in a “start small and scale quickly” approach
  • Well versed with the security issues like Quotas, RBAC, ACL, setuid and sticky bit.
  • Using Kerberos, LDAP, and Rangers for Access identification management.

Confidential

Hadoop Administrator

Responsibilities:

  • Installing, Configuring, Maintaining, and Troubleshooting Standalone systems
  • Troubleshooting the issues like job failures and performance issues
  • Automate the repeated tasks using python
  • Hadoop user administration using Sentry.
  • Upgrade from CDH from 5.2 to 5.3
  • Performing data analysis by using Apache Spark, Hive queries and Pig scripts; importing data from various sources and uploading into HDFS; utilizing Sqoop for extracting data from Oracle.
  • Experience in trouble shooting, optimization& performance tuning
  • Experience in change management.
  • Follow the functional spec analysis and develop ETL pipeline Develop Map Reduce/PIG application to transform the data available in HDFS
  • Generate reporting data using PIG/Hive to serve business team’s ad-hoc requests
  • Import data to Hive from RDBMS sources, process it, write resulted data to HDFS
  • Cluster management, troubleshoot, share best practices to the team
  • Schedule the jobs using Oozie workflow
  • Built hadoop cluster from scratch in a “start small and scale quickly” approach
  • Implemented capacity scheduler for efficient utilization of cluster resources
  • Analyze large historical data sets by Hive queries & Pig scripts to generate reports
  • Cluster management, enhance cluster resources capacity by adding nodes
  • Installing, Configuring, Maintaining, and Troubleshooting Standalone systems.
  • Applying Packages and Patches. Disk configuration & Managing file systems.
  • Configuring, Maintaining and Troubleshooting Server and Client Systems enabled with NFS with Auto mount services.
  • Administering and monitoring System Performance, disk space and memory
  • File system management, user accounts, Quotas and Job automation.
  • Installing and maintaining the Solaris Jumpstart Environment.
  • Knowledge on configuring root mirror, implementing Raid’s Using SVM
  • Upgrading the HBA firmware and Fcode during the VMAX storage migration
  • Installation of VERITAS Volume Manager in sun server environment

We'd love your feedback!