Hadoop Developer Resume
4.00/5 (Submit Your Rating)
San Francisco, CA
SUMMARY
- Seasoned and multifaceted professional, possessing a rich mix of transferable skills and expertise in areas of Big Data on apache hadoop, and SQL server with an outstanding experience and vast knowledge.
- High - energy and dedicated with a proven track record. Adopt at administrative organization utilizing talent and resources. Excellent interpersonal and communication skills also a quick learner possessing the ability to work in fast pace environment.
- Now looking to a new horizon in a creative and forward thinking, where I can utilize my unique blend of skills. Known for passion and commitment to highest levels of services.
PROFESSIONAL EXPERIENCE
Hadoop Developer
Confidential — San Francisco, CA
Responsibilities:
- Analyzed, Designed and developed the system to meet the requirements of business users.
- Participated in the design review of the system to perform Object Analysis and provide best possible solutions for the application.
- Imported and exported terabytes of data using Sqoop from HDFS to Relational Database Systems.
- Developed MapReduce Jobs using Hive and Pig.
- Collected and aggregated large amounts of log data using Apache Flume and staging data in HDFS for further analysis.
- Installed and configured Hadoop Map Reduce, HDFS, developed multiple Map Reduce jobs in Java for data cleaning and preprocessing.
- Developed Map Reduce (YARN) jobs for accessing and validating the data.
- Involved in managing and reviewing Hadoop log files.
- Responsible to manage data coming from different sources Involved in loading data from LINUX file system to HDFS.
- Installed and configured Hive and written Hive QL scripts.
- Involved in creating Hive tables, loading with data and writing hive queries which run internally in map reduce way.
- Implemented Partitioning, Dynamic Partitions, Buckets in HIVE.
- Monitor System health and logs and respond accordingly to any warning or failure conditions.
Environment: Map Reduce, HDFS, Hive, Hadoop distribution of Hortonworks, Cloudera
Big Data Hadoop Consultant
Confidential - Seattle, Washington
Responsibilities:
- Used Sqoop to extract data from Oracle SQL server and MySQL databases to HDFS
- Developed workflows in Oozie for business requirements to extract the data using Sqoop
- Developed Map Reduce(YARN) jobs for cleaning, accessing and validating the data
- Wrote MapReduce jobs using Pig Latin
- Hive scripts were written in HiveQL to de-normalize and aggregate the data
- Optimized the existing Hive and Pig Scripts
- Hive queries for data were written to meet the business requirements
- Designed workflows by scheduling Hive processes for Log file data, which is streamed into HDFS using Flume
- Real time streaming the data using Spark with Kafka.
- Implemented Spark using pySpark and SparkSQL for faster testing and processing of data.
- Actively participated in weekly meetings with the technical teams to review the code
- Implemented test scripts to support test driven development and continuous integration
- Responsible to manage data coming from different sources
- Have deep and thorough understanding of ETL tools and how they can be applied in a Big Data environment
- Involved in moving all log files generated from various sources to HDFS for further processing through Flume
Environment: Hadoop, Linux, MapReduce, HDFS, Hive, Pig, NoSQL, Sqoop, Open source technologies Apache Kafka, Apache Spark, ETL, Hortonworks, Unix/Linux
Big Data Hadoop Consultant
Confidential - Houston, TX
Responsibilities:
- Installed and configured Hadoop MapReduce, HDFS
- Installed virtual machines on Windows and Mac using Oracle Virtual Box and VMware.
- Experience in installing, configuring and using Hadoop ecosystem components.
- Experience in administration, installing, upgrading and managing CDH3, Pig, Hive & HBase
- Importing and exporting data into HDFS and Hive using Sqoop and Flume.
- Knowledge in performance troubleshooting and tuning Hadoop clusters.
- Experienced in managing and reviewing Hadoop log files.
Environment: Apache Hadoop, HDFS, Hive, MapReduce, Hive, Pig, Sqoop, Flume, Cloudera CDH3, Oozie, MySQL.
Confidential
BI Developer
Responsibilities:
- Performed data designing, modeling and mapping process to load data based on business requirements.
- Created and maintained SSIS packages to construct high performance ETL process for data warehouse.
- Deployed SSIS packages and scheduled job in SQL Server Agent to run the packages automatically.
- Applied troubleshooting for ETL issues, validated result sets, recommended and implemented process improvements.
- Modified the existing dimension and fact tables, and their relationships in the data management system.
- Generated Snapshot, Drill Down, Sub, Cross Tab, and parameter reports using SSRS that were scheduled to refresh.
- Created crucial store procedures and functions to support reporting Dataset manipulation.
- Captured business requirements and translated them into design documents that lead to the delivery of reports and dashboards.
