Hadoop Administrator Resume
NJ
SUMMARY
- Around 4+ years’ experience in Hadoop, and 9+ Yrs total IT experience in UNIX, Solaris and Red hat Linux Administration.
- Working on Hortonworks (HDP 2.4) & Cloudera (CDH5) cluster setup
- Providing hardware architectural guidance, planning and estimating cluster capacity, and creating roadmaps for Hadoop cluster deployment
- Build the unix servers for hadoop clusters as per project requirement.
- Managing the Cluster using Ambari and restarting the services when require
- Installed and configured multi - nodes fully distributed Hadoop cluster.
- Troubleshooting the issues like job failures and performance issues in Hadoop.
- Participated in client Audits and lead the team to successful completion
- Automate the repeated tasks using python.
- Having knowledge in shell and python programing language.
- Installation, configuration of RedHat Linux operating system on VMware, HP server
- Installation, Configuration & troubleshooting of Administration of Logical Volume Manager
- Linux kernel patching and rollback if new kernel failure occurs
- Installing the packages using rpm, yum & up2date commands
- Working on oracle and DB2 backup and restore.
- Performance tuning of Oracle DB in memory and space from OS end.
- Implementing RAID 0,1 and 5 levels using SVM, LVM & VxVM.
- Implementing and managing NFS and NTP services
- Experience in ITILV3 knowledge for better delivery to the customer
- On call support in off business hours and make sure the availability 24*7
TECHNICAL SKILLS
Big Data: Hadoop Horton Works 2.4,Cloudera distributions
Eco System tools: Ambari, PIG, Hive, Sqoop, Hbase, knox, Flume,oozie,kafka
Operating systems: Sun Solaris 9, 10 &11 and Linux 5,6 and HP-UX, AIX
Hardware: V240,V480,T2000,M4000,Dell & HP Proliant Generation 5/6
Virtualization: Vmware,Solaris Zones and Ldoms
Disk Layout: Solaris Volume Manager, VERITAS Volume Manager,LVM
High Availability: VERITAS Cluster, Redhat Cluster, Sun Cluster&ServiceGuard
Monitoring Tools: Big Brother, Tivoli,Nagios,Ganglia
Ticketing Tools: SM9,SM7 and BMC Remedy
Languages: Shell scripting, Python
Storage Related: EMC (Clariion/Symmetrix )
Backup Related: Tivoli (TSM), EMC-Networker
Access Management: NIS, AD,CyberArk, Kerberos,LDAP,Rangers
Protocols: TCP/IP, FTP, SSH, Telnet, SCP,RSH,ARP and RARP
Storage Area n/w: RAID’s, SCSI, iSCSI, DAS,NAS and SAN
SMTP: SMTP mail Relay servers,Clear Swift tool
Configuration Tools: Puppet, IBM-TEM tool
Database: Oracle, DB2, My Sql
PROFESSIONAL EXPERIENCE
Confidential, NJ
Hadoop Administrator
Responsibilities:
- Working for Confidential and financial services
- Providing hardware architectural guidance, planning and estimating cluster capacity, and creating roadmaps for Hadoop cluster deployment.
- Working on Hortonworks & Cloudera cluster setup
- Automate the repeated tasks using python.
- Having knowledge in shell and python programing language
- Installing, Configuring, Maintaining, and Troubleshooting Standalone systems
- Configuring and updating parameters in servers using python.
- Analyze the shell/python script code during the migrations.
- Troubleshooting the issues like job failures and performance issues
- Hadoop user administration using Sentry.
- Upgrade from CDH from 5.2 to 5.3
- Responsible to manage data coming from different sources.
- Knowledge on snapshots.
- User administration via Ldap, Kerberos mechanism.
- Involved in Hadoop Cluster environment administration that includes adding and removing cluster nodes, cluster capacity planning, performance tuning, cluster Monitoring, Troubleshooting.
- Adding new nodes to an existing cluster, recovering from a Name Node failure.
- Decommissioning and commissioning the Node on running cluster
- Supported Map Reduce Programs those are running on the cluster.
- Configured Fair Scheduler to provide service-level agreements for multiple users of a cluster.
- Managing nodes on Hadoop cluster connectivity and security
- Experienced in managing and reviewing Hadoop log files
- Maintaining Backup for name node.
- Knowledge in Name node recoveries from previous backups
- Importing and exporting data into HDFS using Sqoop
- Depth conceptual and functional understanding of Map Reduce and
- Hadoop eco-system Infrastructure (Both MRv1 and MRv2)
- Deploy & scale-out multi-node Hadoop cluster components including MapReduce, PIG, Hive, Hbase
- Design/implement workflow and coordinator jobs using Oozie tool
- Functional knowledge of flume, sqoop
- Experience in trouble shooting, optimization& performance tuning
- Experience in change management.
- Follow the functional spec analysis and develop ETL pipeline Develop MapReduce/PIG application to transform the data available in HDFS
- Generate reporting data using PIG/Hive to serve business team’s ad-hoc requests
- Import data to Hive from RDBMS sources, process it, write resulted data to HDFS
- Cluster management, troubleshoot, share best practices to the team
- Schedule the jobs using Oozie workflow
- Built hadoop cluster from scratch in a “start small and scale quickly” approach
- Well versed with the security issues like Quotas, RBAC, ACL, setuid and sticky bit.
- Using Kerberos, LDAP,Rangers for Access identification management.
Confidential
Hadoop Administrator
Responsibilities:
- working for insurance Client DLG from RBS group
- Providing hardware architectural guidance, planning and estimating cluster capacity, and creating roadmaps for Hadoop cluster deployment.
- Working on Hortonworks & Cloudera cluster setup
- Having knowledge in shell and python programing language
- Installing, Configuring, Maintaining, and Troubleshooting Standalone systems
- Troubleshooting the issues like job failures and performance issues
- Hadoop user administration using Sentry.
- Upgrade from CDH from 5.2 to 5.3
- Responsible to manage data coming from different sources.
- Knowledge on snapshots.
- User administration via Ldap, Kerberos mechanism.
- Involved in Hadoop Cluster environment administration that includes adding and removing cluster nodes, cluster capacity planning, performance tuning, cluster Monitoring, Troubleshooting.
- Adding new nodes to an existing cluster, recovering from a Name Node failure.
- Decommissioning and commissioning the Node on running cluster
- Supported Map Reduce Programs those are running on the cluster.
- Configured Fair Scheduler to provide service-level agreements for multiple users of a cluster. Managing nodes on Hadoop cluster connectivity and security
- Experienced in managing and reviewing Hadoop log files
- Maintaining Backup for name node. Knowledge in Name node recoveries from previous backups, Importing and exporting data into HDFS using Sqoop
- Depth conceptual and functional understanding of Map Reduce and
- Hadoop eco-system Infrastructure (Both MRv1 and MRv2)
- Deploy & scale-out multi-node Hadoop cluster components including Map Reduce, PIG, Hive, Hbase
- Design/implement workflow and coordinator jobs using Oozie tool
- Functional knowledge of flume, sqoop Experience in trouble shooting, optimization& performance tuning Experience in change management.
- Follow the functional spec analysis and develop ETL pipeline Develop
- Map Reduce/PIG application to transform the data available in HDFS
- Generate reporting data using PIG/Hive to serve business team’s ad-hoc requests
- Import data to Hive from RDBMS sources, process it, write resulted data to HDFS
- Cluster management, troubleshoot, share best practices to the team
- Schedule the jobs using Oozie workflow
- Built hadoop cluster from scratch in a “start small and scale quickly” approach
- Well versed with the security issues like Quotas, RBAC, ACL, setuid and sticky bit.
- Using Kerberos, LDAP, and Rangers for Access identification management.
Confidential
Hadoop Administrator
Responsibilities:
- Installing, Configuring, Maintaining, and Troubleshooting Standalone systems
- Troubleshooting the issues like job failures and performance issues
- Automate the repeated tasks using python
- Hadoop user administration using Sentry.
- Upgrade from CDH from 5.2 to 5.3
- Performing data analysis by using Apache Spark, Hive queries and Pig scripts; importing data from various sources and uploading into HDFS; utilizing Sqoop for extracting data from Oracle.
- Experience in trouble shooting, optimization& performance tuning
- Experience in change management.
- Follow the functional spec analysis and develop ETL pipeline Develop Map Reduce/PIG application to transform the data available in HDFS
- Generate reporting data using PIG/Hive to serve business team’s ad-hoc requests
- Import data to Hive from RDBMS sources, process it, write resulted data to HDFS
- Cluster management, troubleshoot, share best practices to the team
- Schedule the jobs using Oozie workflow
- Built hadoop cluster from scratch in a “start small and scale quickly” approach
- Implemented capacity scheduler for efficient utilization of cluster resources
- Analyze large historical data sets by Hive queries & Pig scripts to generate reports
- Cluster management, enhance cluster resources capacity by adding nodes
- Installing, Configuring, Maintaining, and Troubleshooting Standalone systems.
- Applying Packages and Patches. Disk configuration & Managing file systems.
- Configuring, Maintaining and Troubleshooting Server and Client Systems enabled with NFS with Auto mount services.
- Administering and monitoring System Performance, disk space and memory
- File system management, user accounts, Quotas and Job automation.
- Installing and maintaining the Solaris Jumpstart Environment.
- Knowledge on configuring root mirror, implementing Raid’s Using SVM
- Upgrading the HBA firmware and Fcode during the VMAX storage migration
- Installation of VERITAS Volume Manager in sun server environment
