Hadoop Administrator Resume
Hartford, CT
SUMMARY
- Multi-talented, cross-functional technical professional with progressive experience in Linux system, security, and Hadoop cluster administration and in technical support within large-scale IT portfolios and service projects.
- Nine years of extensive IT experience, including application development, data warehousing, systems administration, monitoring, and troubleshooting on UNIX, Linux, Hadoop, and Teradata environments.
- Around 3 years of experience in Big Data environments, including 2+ years in Hadoop cluster administration.
- Skilled in Hadoop cluster installation, administration, and maintenance, and in Shell/Bash scripting automation.
- Excellent troubleshooting skills in Hardware, Software, Application and Network.
- In-depth understanding of Hadoop architecture and its components, such as HDFS, JobTracker, TaskTracker, NameNode, and DataNode, and of MapReduce concepts.
- Extensively worked on commissioning and decommissioning of cluster nodes, replacing failed disks, file system integrity checks and maintaining cluster data replication.
- Installation and administration of Hadoop distributions such as Cloudera and Hortonworks.
- Hadoop integration with business intelligence tools (Tableau).
- NameNode and JobTracker High Availability.
- General system performance monitoring with Nagios for machines running a Hadoop cluster (monitoring parameters such as disk space, disk partitions, etc.); managing Kerberos authentication for Hadoop.
- Management tools: Ambari, Cloudera Manager.
- Resource Allocation in Hadoop Cluster
- Participate in the research, design, and implementation of new technologies for scaling our large and growing data sets, for performance improvement, and for analyst workload reduction.
- End-to-end performance tuning of Hadoop clusters
- Monitor Hadoop cluster job performance and capacity planning
- HDFS support and maintenance
- Grew and shrank Hadoop clusters by adding and removing nodes, tuned servers for optimal cluster performance, and rebalanced cluster load using the NameNode and JobTracker UIs.
- Adding and removing nodes, data rebalancing, and maintaining backups for the NameNode.
- Hadoop Upgrades
TECHNICAL SKILLS
Hadoop/BIG Data: HDFS, HBase, Hive, Sqoop, Oozie, Flume, Pig and MapReduce.
BI tools: Ab Initio.
NoSQL Databases: HBase, MongoDB
Database: Oracle 9i/10g/11g, DB2, SQL Server, MySQL, Teradata.
Operating Systems: HP-UX, RedHat Linux, Ubuntu Linux and Windows XP/Vista/7/8
Scheduling tools: Autosys, Tivoli, Control-M, CA7.
Languages: SQL, C, C++, Core Java, AWK, Shell Scripting.
Host: COBOL, JCL, VSAM, Flat Files.
PROFESSIONAL EXPERIENCE
Confidential, Hartford, CT
Hadoop Administrator
Responsibilities:
- Responsible for setting up 24/7 Support on Big Data Clusters.
- Involved in log file management: Hadoop logs older than 7 days were removed from the log folder, loaded into HDFS, and retained for 2 years for audit purposes.
- Responsible for the availability of clusters and the onboarding of projects onto Big Data clusters.
- Automate common maintenance and BAU activities.
- Collaborate with cross-functional teams to ensure that applications are properly tested, configured, and deployed.
- Cluster maintenance including adding and removing cluster nodes; cluster Monitoring and Troubleshooting
- Extensively worked on cluster configuration files such as hadoop-env.sh, core-site.xml, hdfs-site.xml, mapred-site.xml, etc.
- Strong knowledge of open source system monitoring and event handling tools like Nagios and Ganglia.
- Experience in troubleshooting cluster problems by analyzing logs and setting log levels, and in fixing misconfiguration and resource exhaustion problems.
- Worked on troubleshooting performance issues and tuning Hadoop cluster.
- Managing the cluster and troubleshooting issues using Cloudera Manager.
Environment: Hadoop, MapReduce, HDFS, Hive, Pig, Sqoop, Oozie, Flume, Java (jdk1.6), Cloudera CDH 4.4, RHEL, UNIX Shell Scripting.
Confidential, Detroit
Ab Initio (ETL) and Hadoop Developer
Responsibilities:
- Gathering business requirements from the Business Partners and Subject Matter Experts.
- Importing data from DB2 into Hive and HDFS using Sqoop for one-time and daily solutions.
- Experience with UNIX commands and Shell Scripting.
- Worked in a multi-node Big Data Hadoop environment.
- Worked on integrating Hive for online transaction processing support.
- Providing solutions to the business requirements (Systems Design Document - SDR) for areas such as Authorizations, Product-to-Product and Account Transfer, Customer Service, etc.
- Providing estimates for preparing Program Specification Document (PSD) and Build & Unit Testing.
- Preparing the Impact Analysis Document and providing the Program Specification Document (PSD).
- Mentoring relatively new resources in the project to understand the system and deliver the assigned tasks.
- Reviewing Test Cases prepared for Unit testing, System Testing, Regression Testing and End-to-End Testing.
- Preparing test data for System Testing, Regression Testing.
Environment: Hadoop, MapReduce, HDFS, Hive, Ab Initio 1.15 GDE and 3.0 GDE, 2.15 and 3.0 Co-Operating System, SQL Server, Linux, Autosys, UNIX Shell Scripting
Confidential
ETL Developer
Responsibilities:
- Understanding the business data model and customer requirements.
- Preparing low level designs of the system along with build strategy.
- Building various components in DB2, Oracle, SQL, UNIX, and Ab Initio.
- Testing the changed and impacted components.
- Creating the test cases for the various modules that have been handed over to my team for enhancement.
- Communicating timely status updates to the client on the progress of the enhancement.
Environment: Ab Initio 1.15 GDE and 3.0 GDE, 2.15 and 3.0 Co-Operating System, SQL Server, Linux, Autosys, UNIX Shell Scripting
Confidential
Application Developer (Mainframe)
Responsibilities:
- Providing 24x7 support to business users.
- Monitoring batch jobs.
- Solving production job failures and batch abends.
- Communicating with the front office and business users to resolve business queries.
- Investigating complex errors and production abends and making changes on the production server (break fix).
Environment: COBOL, JCL, DB2, FILE-AID, TSO/ISPF, CHANGEMAN, SPUFI, NDM, EASYTRIEVE and IBM UTILITIES.
