Hadoop Developer Resume San Francisco, CA - Hire IT People

SUMMARY

Seasoned and multifaceted professional with a rich mix of transferable skills and expertise in Big Data on Apache Hadoop and SQL Server, backed by extensive experience and knowledge.
High-energy and dedicated, with a proven track record. Adept at administrative organization, utilizing talent and resources. Excellent interpersonal and communication skills; a quick learner able to work in a fast-paced environment.
Now looking toward a new horizon in a creative and forward-thinking environment where I can utilize my unique blend of skills. Known for passion and commitment to the highest levels of service.

PROFESSIONAL EXPERIENCE

Hadoop Developer

Confidential - San Francisco, CA

Responsibilities:

Analyzed, designed, and developed the system to meet the requirements of business users.
Participated in the design review of the system to perform Object Analysis and provide best possible solutions for the application.
Imported and exported terabytes of data between HDFS and relational database systems using Sqoop.
Developed MapReduce jobs using Hive and Pig.
Collected and aggregated large amounts of log data using Apache Flume and staged the data in HDFS for further analysis.
Installed and configured Hadoop MapReduce and HDFS; developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
Developed MapReduce (YARN) jobs for accessing and validating the data.
Involved in managing and reviewing Hadoop log files.
Responsible for managing data coming from different sources; involved in loading data from the Linux file system to HDFS.
Installed and configured Hive and wrote HiveQL scripts.
Involved in creating Hive tables, loading them with data, and writing Hive queries that run internally as MapReduce jobs.
Implemented partitioning, dynamic partitions, and buckets in Hive.
Monitored system health and logs and responded to any warning or failure conditions.

Environment: MapReduce, HDFS, Hive, Hadoop distributions from Hortonworks and Cloudera

Big Data Hadoop Consultant

Confidential - Seattle, Washington

Responsibilities:

Used Sqoop to extract data from Oracle, SQL Server, and MySQL databases to HDFS
Developed workflows in Oozie for business requirements to extract the data using Sqoop
Developed MapReduce (YARN) jobs for cleaning, accessing and validating the data
Wrote MapReduce jobs using Pig Latin
Wrote Hive scripts in HiveQL to denormalize and aggregate the data
Optimized the existing Hive and Pig scripts
Wrote Hive queries to meet the business requirements
Designed workflows by scheduling Hive processes for log file data, which is streamed into HDFS using Flume
Streamed data in real time using Spark with Kafka.
Implemented Spark using PySpark and Spark SQL for faster testing and processing of data.
Actively participated in weekly meetings with the technical teams to review the code
Implemented test scripts to support test-driven development and continuous integration
Responsible for managing data coming from different sources
Have deep and thorough understanding of ETL tools and how they can be applied in a Big Data environment
Involved in moving all log files generated from various sources to HDFS for further processing through Flume

Environment: Hadoop, MapReduce, HDFS, Hive, Pig, NoSQL, Sqoop, open source technologies (Apache Kafka, Apache Spark), ETL, Hortonworks, Unix/Linux

Big Data Hadoop Consultant

Confidential - Houston, TX

Responsibilities:

Installed and configured Hadoop MapReduce, HDFS
Installed virtual machines on Windows and Mac using Oracle VirtualBox and VMware.
Experience in installing, configuring and using Hadoop ecosystem components.
Experience in administration, installing, upgrading and managing CDH3, Pig, Hive & HBase
Imported and exported data into HDFS and Hive using Sqoop and Flume.
Knowledge in performance troubleshooting and tuning Hadoop clusters.
Experienced in managing and reviewing Hadoop log files.

Environment: Apache Hadoop, HDFS, Hive, MapReduce, Pig, Sqoop, Flume, Cloudera CDH3, Oozie, MySQL.

BI Developer

Confidential

Responsibilities:

Performed data designing, modeling and mapping process to load data based on business requirements.
Created and maintained SSIS packages to construct high performance ETL process for data warehouse.
Deployed SSIS packages and scheduled jobs in SQL Server Agent to run the packages automatically.
Applied troubleshooting for ETL issues, validated result sets, recommended and implemented process improvements.
Modified the existing dimension and fact tables, and their relationships in the data management system.
Generated Snapshot, Drill Down, Sub, Cross Tab, and parameter reports using SSRS that were scheduled to refresh.
Created crucial stored procedures and functions to support reporting dataset manipulation.
Captured business requirements and translated them into design documents that led to the delivery of reports and dashboards.
