We provide IT Staff Augmentation Services!

Hadoop Developer/administrator Resume

4.00/5 (Submit Your Rating)

TECHNICAL SKILLS

  • C++, Java, PHP, Scala, HTML
  • Hadoop, MapReduce, HDFS, Spark, Sqoop, Hive, Pig, HBase, Oozie MySQL, PostgreSQL, Oracle Database - 11g
  • Eclipse, Cloudera Manager, Ambari, Splunk, Weka, R Studio, Tableau, Google BigQuery Python (Pandas, Numpy, scikitlearn, Matplotlib, NLTK), Shell script, R, JavaScript Windows, Macintosh, Linux

PROFESSIONAL EXPERIENCE

Confidential

Hadoop Developer/Administrator

Responsibilities:

  • Optimized existing MapReduce jobs by converting them into Spark programs to decrease the time taken by long running MapReduce jobs by 20%.
  • Installed and configured components like MapReduce, Pig, Hive, Oozie, Zookeeper on 5 different Hadoop clusters containing 500 nodes each by using Cloudera Manager.
  • Led a team of 4 members in the migration of multiple Hadoop clusters from version CDH3 to CDH4.
  • Set up High Availability and Disaster Recovery environments in all the Hadoop clusters.
  • Created reports, dashboards, cluster monitoring and alerting tools using Splunk.
  • Enabled automated scheduling of MapReduce/Pig jobs and data loading into HDFS by using Oozie.

Confidential

Hadoop Developer - Big Data Edge

Responsibilities:

  • Developed multiple MapReduce programs in Java for data cleaning, preprocessing and to convert data formats like AVRO, JSON, HTML, XML into standard text format.
  • Developed Pig scripts to process terabytes of data stored in HDFS and find critical trends and key performance indicators.
  • Involved in the development of Sqoop scripts to move data from RDBMS into HDFS and vice versa.
  • Implemented automated Unix/Shell scripts to handle the deploying, scheduling and monitoring of various Hadoop jobs in the cluster.
  • Wrote custom Hive and Pig UDF s in Java to implement custom business logic.

We'd love your feedback!