Hadoop Developer/Administrator Resume
TECHNICAL SKILLS
- C++, Java, PHP, Scala, HTML
- Hadoop, MapReduce, HDFS, Spark, Sqoop, Hive, Pig, HBase, Oozie
- MySQL, PostgreSQL, Oracle Database 11g
- Eclipse, Cloudera Manager, Ambari, Splunk, Weka, RStudio, Tableau, Google BigQuery
- Python (pandas, NumPy, scikit-learn, Matplotlib, NLTK), shell scripting, R, JavaScript
- Windows, Macintosh, Linux
PROFESSIONAL EXPERIENCE
Confidential
Hadoop Developer/Administrator
Responsibilities:
- Converted long-running MapReduce jobs into Spark programs, reducing their run time by 20%.
- Installed and configured MapReduce, Pig, Hive, Oozie, and ZooKeeper on 5 Hadoop clusters of 500 nodes each using Cloudera Manager.
- Led a team of 4 members in the migration of multiple Hadoop clusters from version CDH3 to CDH4.
- Set up high-availability and disaster-recovery environments across all Hadoop clusters.
- Created reports, dashboards, cluster monitoring and alerting tools using Splunk.
- Automated the scheduling of MapReduce/Pig jobs and data loading into HDFS using Oozie.
Confidential
Hadoop Developer - Big Data Edge
Responsibilities:
- Developed multiple MapReduce programs in Java for data cleaning, preprocessing, and conversion of formats such as Avro, JSON, HTML, and XML into a standard text format.
- Developed Pig scripts to process terabytes of data stored in HDFS and identify critical trends and key performance indicators.
- Helped develop Sqoop scripts to move data between RDBMS and HDFS in both directions.
- Implemented Unix shell scripts to automate the deployment, scheduling, and monitoring of Hadoop jobs in the cluster.
- Wrote custom Hive and Pig UDFs in Java to implement business logic.