Hadoop Developer Resume

San Francisco, CA

SUMMARY

  • Seasoned, multifaceted professional with a rich mix of transferable skills and extensive experience in Big Data on Apache Hadoop and in SQL Server.
  • High-energy and dedicated, with a proven track record. Adept at administrative organization and at making good use of talent and resources. Excellent interpersonal and communication skills; a quick learner able to work in a fast-paced environment.
  • Now looking toward a new horizon in a creative, forward-thinking environment where I can apply my unique blend of skills. Known for passion and commitment to the highest levels of service.

PROFESSIONAL EXPERIENCE

Hadoop Developer

Confidential - San Francisco, CA

Responsibilities:

  • Analyzed, designed, and developed the system to meet the requirements of business users.
  • Participated in the design review of the system, performing object analysis and providing the best possible solutions for the application.
  • Imported and exported terabytes of data between HDFS and relational database systems using Sqoop.
  • Developed MapReduce jobs using Hive and Pig.
  • Collected and aggregated large amounts of log data using Apache Flume and staged the data in HDFS for further analysis.
  • Installed and configured Hadoop MapReduce and HDFS; developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
  • Developed MapReduce (YARN) jobs for accessing and validating the data.
  • Involved in managing and reviewing Hadoop log files.
  • Responsible for managing data coming from different sources; involved in loading data from the Linux file system into HDFS.
  • Installed and configured Hive and wrote HiveQL scripts.
  • Involved in creating Hive tables, loading them with data, and writing Hive queries, which run internally as MapReduce jobs.
  • Implemented partitioning, dynamic partitions, and buckets in Hive.
  • Monitored system health and logs and responded to any warning or failure conditions.

Environment: MapReduce, HDFS, Hive, Hadoop distributions from Hortonworks and Cloudera

Big Data Hadoop Consultant

Confidential - Seattle, Washington

Responsibilities:

  • Used Sqoop to extract data from Oracle, SQL Server, and MySQL databases into HDFS
  • Developed workflows in Oozie for business requirements to extract the data using Sqoop
  • Developed MapReduce (YARN) jobs for cleaning, accessing and validating the data
  • Wrote MapReduce jobs using Pig Latin
  • Wrote Hive scripts in HiveQL to denormalize and aggregate the data
  • Optimized the existing Hive and Pig scripts
  • Wrote Hive queries to meet the business requirements
  • Designed workflows by scheduling Hive processes for log file data, which was streamed into HDFS using Flume
  • Streamed data in real time using Spark with Kafka.
  • Implemented Spark jobs using PySpark and Spark SQL for faster testing and processing of data.
  • Actively participated in weekly meetings with the technical teams to review the code
  • Implemented test scripts to support test driven development and continuous integration
  • Responsible for managing data coming from different sources
  • Deep and thorough understanding of ETL tools and how they can be applied in a Big Data environment
  • Involved in moving all log files generated from various sources into HDFS through Flume for further processing

Environment: Hadoop, MapReduce, HDFS, Hive, Pig, NoSQL, Sqoop, Apache Kafka, Apache Spark, ETL, Hortonworks, Unix/Linux

Big Data Hadoop Consultant

Confidential - Houston, TX

Responsibilities:

  • Installed and configured Hadoop MapReduce and HDFS.
  • Installed virtual machines on Windows and Mac using Oracle VirtualBox and VMware.
  • Experience in installing, configuring and using Hadoop ecosystem components.
  • Experience in administering, installing, upgrading, and managing CDH3, Pig, Hive, and HBase.
  • Imported and exported data to and from HDFS and Hive using Sqoop and Flume.
  • Knowledge in performance troubleshooting and tuning Hadoop clusters.
  • Experienced in managing and reviewing Hadoop log files.

Environment: Apache Hadoop, HDFS, Hive, MapReduce, Pig, Sqoop, Flume, Cloudera CDH3, Oozie, MySQL.

BI Developer

Confidential

Responsibilities:

  • Performed data design, modeling, and mapping to load data based on business requirements.
  • Created and maintained SSIS packages to build high-performance ETL processes for the data warehouse.
  • Deployed SSIS packages and scheduled jobs in SQL Server Agent to run the packages automatically.
  • Troubleshot ETL issues, validated result sets, and recommended and implemented process improvements.
  • Modified the existing dimension and fact tables, and their relationships, in the data management system.
  • Generated snapshot, drill-down, subreport, cross-tab, and parameterized reports using SSRS that were scheduled to refresh.
  • Created key stored procedures and functions to support dataset manipulation for reporting.
  • Captured business requirements and translated them into design documents that led to the delivery of reports and dashboards.