We provide IT Staff Augmentation Services!

Hadoop Developer Resume

3.00/5 (Submit Your Rating)

Milpitas, CA

PROFESSIONAL SUMMARY:

  • 6+ years of design and development experience in IT industry, which includes 3.5+, years of Hadoop echo system development and design experience.
  • 3.5+ years of experience with Big Data Hadoop Ecosystem tools like Map Reduce, YARN, HDFS, HBase, Sqoop, Hive, Pig, Oozie, Apache Spark for ingestion, storage, querying, processing and analysis of data.
  • Used Pig as ETL tool to do transformations, Joins, filter and developed pig UDF’s when needed.
  • DevelopedUDFs in Java as and when necessary to use in HIVE queries.
  • Solved performance issuesin Hive andunderstand how does it translate to MapReduce jobs.
  • Good knowledge of Hadoop environment like Cloudera CDH3 and CDH4.S
  • Very good Exposure onSpark along with Scala Programming.
  • Exploring with theSparkimproving the performance and optimization of the existing functionality in Hadoop usingSparkContext, Data Frame, Pair RDDs,Spark,and YARN.
  • Proficient in using RDMS concepts with Oracle, SQL Server and MySQL.
  • Good Knowledge of analyzing data in NOSQL databases like Hbase.
  • Acquired knowledge on Amazon AWS concepts like EMR & EC2 web services which provides fast and efficient processing of Big Data.
  • Excellent project experience in various technologies like JAVA, HTML, XML, Scala.
  • Strong experience in using Integrated Development Environments (IDE’s) like Eclipse, Scala - Eclipse, Net-beans etc.
  • Having Experience on UNIX commands and Deployment of Applications in Server.
  • Experience in software configuration management using GIT and SVN.
  • Proficient in methodologies such as Agile Scrum and UML.
  • Knows popular software applications such as Word, Excel, and PowerPoint.
  • A highly-motivated, productive and customer-focused professional with advance communication skills; time management, analytical and problem solving skills.
  • Reliable, dedicated skills to meet deadlines and adapt to new challenges.

SKILLS:

Hadoop Components: HDFS, MapReduce, PIG, Hive, Hbase, Sqoop, Zookeeper, Flume, Kafka, Yarn, Cloudera Manager.

Spark Components: Apache Spark,Data Frames, Scala, YARN, Pair RDDs

Web Technologies / Other components: XML, Servlet & JSP.

Server SideScripting: UNIX Shell Scripting.

Databases: Oracle 10g, SQL Server, MySQL, HBase

Programming Languages: Java, C, C++, Scala.

Web Servers: Apache Tomcat

IDE: Eclipse, Scala-Eclipse, Net-beans

OS/Platforms: Windows 8/10, Linux (Red-Hat, Ubuntu), Unix.

NoSQL Databases: Hbase

Methodologies: Agile, UML.

WORK EXPERIENCE:

Confidential, Milpitas, CA

Hadoop Developer

Responsibilities:

  • Involvement in coding (configuration and backend business logic), deployment and Unit testing.
  • Created hive queries to analyze the data
  • Requirement gathering, Technical specification creation, Design, Estimation and planning.
  • Mentoring and assisting team members for quick completion of task.
  • Designing and creating Hive external tables, Views using shared meta-store with Partitioning, Dynamic Partitioning and buckets.
  • Developed UDF and UDAF in the HIVE queries based on the business requirement
  • Worked extensively with Text, Avro and csv Hadoop files format.
  • Developing a Sqoop job to load incremental data at regular intervals.
  • Move data incrementally using Sqoop to design overall ETL load process and transform into new data model.
  • Troubleshoot problems, monitor Hadoopcluster, file system management and monitoring.
  • Involved in converting Hive/SQL queries into Spark transformations using Spark RDDs, Scala and have a good experience in using Spark-Shell and Spark Streaming
  • Have worked on Big Data Hadoop cluster with 200 data nodes, 30 Cores per node. 500+ Oozie jobs On Cluster Set up standards and processes for Hadoopbased app- Location design and implementation.
  • Developed Oozie workflows by integrating all tasks relating to a project and schedule the jobs as per requirements.
  • Involved in loading data into HBase using HBase Shell, and Sqoop.
  • Responsible for preserving code and design integrity using Git.

Environment: Java, HDFS, Hadoop, Hive, Hive UDF, Oracle, TD, GIT, Eclipse,Sqoop, Oozie, MAPR distribution, SVN, Cloudera, Ubuntu, UNIX Shell Scripting.

Confidential

Hadoop Developer

Responsibilities:

  • 1+years of understanding on big data technologies in Hadoop using PIG, HIVE and understanding of Sqoop.
  • Hands on experience in writing core java level programming in order to perform cleaning, pre-processing and data validation.
  • Excellent understanding /knowledge of HadoopArchitect and various components such as HDFS, job Tracker, Task Tracker, Name mode, Data node, and map reduce Programming.
  • Developed multiple MapReduce Jobs in java for data cleaning and pre-processing.
  • Automation of data pulls from SQL Server to Hadoopeco System via scoop.
  • Developed workflow in Oozie to automate the tasks of loading the data into HDFS and pre-processing with PIG.
  • Developed the UDF's in PIG and HIVE using Java.
  • Worked on managing and reviewing Hadoop log files.
  • Created multiple Hive tables, implemented Partitioning, Dynamic Partitioning and Buckets in Hive for efficient data access.
  • Developed Oozie workflows by integrating all tasks relating to a project and schedule the jobs as per requirements.
  • Exported analyzed data to relational databases using Sqoop for visualization to generate reports for the BI Team.
  • Involved in loading data into HBase using HBase Shell, and Sqoop.
  • Implemented Agile methodology for improved data performance within the team cooperation.

Environment: Hadoop, HDFS, MapReduce, Sqoop, Hive, Flume, Oozie, Zoo keeper, MySQL, Eclipse.

Confidential

Software Engineer Trainee/Java Developer

Responsibilities:

  • Involvement in development, customization and design of the product.
  • Coding, Unit testing and error handling.
  • Requirement gathering, technical specification creation, design, estimation and planning.
  • Working in nonfunctional modules, which include analyzing business specifications and system design documents.

Environment: Java 1.6, JSP, JBOSS Server, Oracle, Eclipse, Maven

We'd love your feedback!