Hadoop Administrator Resume

Hartford, CT

SUMMARY

  • Multi-talented, cross-functional technical professional with progressive experience in Linux system, security, and Hadoop cluster administration and technical support within large-scale IT portfolios and service projects.
  • Nine years of extensive IT experience, including application development, data warehousing, systems administration, monitoring, and troubleshooting on UNIX, Linux, Hadoop, and Teradata environments.
  • Around 3 years of experience in Big Data environments, including 2+ years in Hadoop cluster administration.
  • Skilled in Hadoop cluster installation, administration, and maintenance, and in Shell/Bash scripting automation.
  • Excellent troubleshooting skills in Hardware, Software, Application and Network.
  • In-depth understanding of Hadoop architecture and its components, such as HDFS, JobTracker, TaskTracker, NameNode, DataNode, and MapReduce concepts.
  • Extensively worked on commissioning and decommissioning of cluster nodes, replacing failed disks, file system integrity checks and maintaining cluster data replication.
  • Installation and administration of Hadoop distributions such as Cloudera and Hortonworks.
  • Hadoop integration with business intelligence tools (Tableau).
  • NameNode and JobTracker High Availability.
  • General system performance monitoring for machines running a Hadoop cluster with Nagios (for parameters such as disk space and disk partitions); managing Kerberos authentication for Hadoop.
  • Management tools: Ambari, Cloudera Manager.
  • Resource Allocation in Hadoop Cluster
  • Participate in the research, design, and implementation of new technologies for scaling our large and growing data sets, for performance improvement, and for analyst workload reduction.
  • End-to-end performance tuning of Hadoop clusters
  • Monitor Hadoop cluster job performance and capacity planning
  • HDFS support and maintenance
  • Grew and shrank Hadoop clusters by adding and removing nodes and tuning servers for optimal cluster performance; used the NameNode and JobTracker UIs and rebalanced load in the cluster.
  • Adding and removing nodes, data rebalancing, and maintaining NameNode backups.
  • Hadoop Upgrades

TECHNICAL SKILLS

Hadoop/BIG Data: HDFS, HBase, Hive, Sqoop, Oozie, Flume, Pig and MapReduce.

ETL tools: Ab Initio.

NoSQL Databases: HBase, MongoDB.

Database: Oracle 9i/10g/11g, DB2, SQL Server, MySQL, Teradata.

Operating Systems: HP-UX, Red Hat Linux, Ubuntu Linux, and Windows XP/Vista/7/8.

Scheduling tools: Autosys, Tivoli, Control-M, CA7.

Languages: SQL, C, C++, Core Java, AWK, Shell Scripting.

Host: COBOL, JCL, VSAM, Flat Files.

PROFESSIONAL EXPERIENCE

Confidential, Hartford, CT

Hadoop Administrator

Responsibilities:

  • Responsible for setting up 24/7 Support on Big Data Clusters.
  • Managed log files: Hadoop logs older than 7 days were removed from the log folder, loaded into HDFS, and retained for 2 years for audit purposes.
  • Responsible for cluster availability and for onboarding projects onto Big Data clusters.
  • Automated common maintenance and BAU (business-as-usual) activities.
  • Collaborate with cross-functional teams to ensure that applications are properly tested, configured, and deployed.
  • Cluster maintenance, including adding and removing cluster nodes; cluster monitoring and troubleshooting.
  • Extensively worked on cluster configuration files such as hadoop-env.sh, core-site.xml, hdfs-site.xml, and mapred-site.xml.
  • Strong knowledge of open source system monitoring and event handling tools like Nagios and Ganglia.
  • Experience troubleshooting cluster problems by analyzing logs and setting log levels, and fixing misconfiguration and resource exhaustion problems.
  • Worked on troubleshooting performance issues and tuning Hadoop cluster.
  • Managing the cluster and troubleshooting issues using Cloudera Manager.

Environment: Hadoop, MapReduce, HDFS, Hive, Pig, Sqoop, Oozie, Flume, Java (JDK 1.6), Cloudera CDH 4.4, RHEL, UNIX Shell Scripting.

Confidential, Detroit

Ab Initio (ETL) and Hadoop Developer

Responsibilities:

  • Gathering business requirements from the Business Partners and Subject Matter Experts.
  • Importing data from DB2 into Hive and HDFS using Sqoop for both one-time and daily loads.
  • Experience with UNIX commands and shell scripting.
  • Worked in a multi-node Big Data Hadoop environment.
  • Worked on integrating Hive for online transaction processing support.
  • Providing solutions to business requirements (Systems Design Document - SDR) for areas such as Authorizations, Product-to-Product and Account Transfer, and Customer Service.
  • Providing estimates for preparing Program Specification Document (PSD) and Build & Unit Testing.
  • Preparing the Impact Analysis Document and providing the Program Specification Document (PSD).
  • Mentoring relatively new resources in the project to understand the system and deliver the assigned tasks.
  • Reviewing Test Cases prepared for Unit testing, System Testing, Regression Testing and End-to-End Testing.
  • Preparing test data for System Testing and Regression Testing.

Environment: Hadoop, MapReduce, HDFS, Hive, Ab Initio 1.15 GDE and 3.0 GDE, 2.15 and 3.0 Co-Operating System, SQL Server, Linux, Autosys, UNIX Shell Scripting

Confidential

ETL Developer

Responsibilities:

  • Understanding the business data model and customer requirements.
  • Preparing low level designs of the system along with build strategy.
  • Building components in DB2, Oracle, SQL, UNIX, and Ab Initio.
  • Testing the changed and impacted components.
  • Creating the test cases for the various modules that have been handed over to my team for enhancement.
  • Communicating timely status updates to the client on the progress of the enhancement.

Environment: Ab Initio 1.15 GDE and 3.0 GDE, 2.15 and 3.0 Co-Operating System, SQL Server, Linux, Autosys, UNIX Shell Scripting

Confidential

Application Developer (Mainframe)

Responsibilities:

  • Providing 24x7 support to business users.
  • Monitoring batch jobs.
  • Solving production job failures and batch abends.
  • Communicating with the front office and business users to resolve business queries.
  • Investigating complex errors and production abends and making changes on the production server (break-fix).

Environment: COBOL, JCL, DB2, FILE-AID, TSO/ISPF, CHANGEMAN, SPUFI, NDM, EASYTRIEVE and IBM UTILITIES.
