Hadoop Developer Resume
Milpitas, CA
PROFESSIONAL SUMMARY:
- 6+ years of design and development experience in the IT industry, including 3.5+ years of Hadoop ecosystem design and development.
- 3.5+ years of experience with Big Data Hadoop ecosystem tools such as MapReduce, YARN, HDFS, HBase, Sqoop, Hive, Pig, Oozie, and Apache Spark for ingestion, storage, querying, processing, and analysis of data.
- Used Pig as an ETL tool for transformations, joins, and filters, and developed Pig UDFs when needed.
- Developed UDFs in Java as needed for use in Hive queries.
- Solved performance issues in Hive, with an understanding of how Hive queries translate to MapReduce jobs.
- Good knowledge of Hadoop environments such as Cloudera CDH3 and CDH4.
- Very good exposure to Spark along with Scala programming.
- Exploring Spark to improve the performance and optimization of existing Hadoop functionality using SparkContext, DataFrames, pair RDDs, and YARN.
- Proficient in RDBMS concepts with Oracle, SQL Server, and MySQL.
- Good knowledge of analyzing data in NoSQL databases such as HBase.
- Acquired knowledge of Amazon AWS concepts such as the EMR and EC2 web services, which provide fast and efficient processing of Big Data.
- Excellent project experience in various technologies such as Java, HTML, XML, and Scala.
- Strong experience with Integrated Development Environments (IDEs) such as Eclipse, Scala-Eclipse, and NetBeans.
- Experience with UNIX commands and deploying applications on servers.
- Experience in software configuration management using Git and SVN.
- Proficient in Agile Scrum methodology and UML.
- Familiar with popular software applications such as Word, Excel, and PowerPoint.
- A highly motivated, productive, and customer-focused professional with advanced communication, time management, analytical, and problem-solving skills.
- Reliable and dedicated, able to meet deadlines and adapt to new challenges.
SKILLS:
Hadoop Components: HDFS, MapReduce, Pig, Hive, HBase, Sqoop, ZooKeeper, Flume, Kafka, YARN, Cloudera Manager.
Spark Components: Apache Spark, DataFrames, Scala, YARN, Pair RDDs
Web Technologies / Other components: XML, Servlet & JSP.
Server-Side Scripting: UNIX Shell Scripting.
Databases: Oracle 10g, SQL Server, MySQL, HBase
Programming Languages: Java, C, C++, Scala.
Web Servers: Apache Tomcat
IDE: Eclipse, Scala-Eclipse, NetBeans
OS/Platforms: Windows 8/10, Linux (Red Hat, Ubuntu), Unix.
NoSQL Databases: HBase
Methodologies: Agile, UML.
WORK EXPERIENCE:
Confidential, Milpitas, CA
Hadoop Developer
Responsibilities:
- Involved in coding (configuration and backend business logic), deployment, and unit testing.
- Created Hive queries to analyze the data.
- Handled requirement gathering, technical specification creation, design, estimation, and planning.
- Mentored and assisted team members for quick completion of tasks.
- Designed and created Hive external tables and views using a shared metastore, with partitioning, dynamic partitioning, and buckets.
- Developed UDFs and UDAFs for Hive queries based on business requirements.
- Worked extensively with Text, Avro, and CSV file formats in Hadoop.
- Developed a Sqoop job to load incremental data at regular intervals.
- Moved data incrementally using Sqoop as part of the overall ETL load process and transformed it into a new data model.
- Troubleshot problems, monitored the Hadoop cluster, and managed and monitored the file system.
- Converted Hive/SQL queries into Spark transformations using Spark RDDs and Scala; good experience with Spark-Shell and Spark Streaming.
- Worked on a Big Data Hadoop cluster with 200 data nodes and 30 cores per node, running 500+ Oozie jobs.
- Set up standards and processes for Hadoop-based application design and implementation.
- Developed Oozie workflows by integrating all tasks relating to a project and scheduled the jobs as per requirements.
- Loaded data into HBase using the HBase shell and Sqoop.
- Responsible for preserving code and design integrity using Git.
Environment: Java, HDFS, Hadoop, Hive, Hive UDF, Oracle, TD, Git, Eclipse, Sqoop, Oozie, MapR distribution, SVN, Cloudera, Ubuntu, UNIX Shell Scripting.
Confidential
Hadoop Developer
Responsibilities:
- 1+ years of working knowledge of Big Data technologies in Hadoop using Pig and Hive, along with an understanding of Sqoop.
- Hands-on experience writing core Java programs for data cleaning, pre-processing, and validation.
- Excellent understanding of Hadoop architecture and its components, such as HDFS, JobTracker, TaskTracker, NameNode, DataNode, and MapReduce programming.
- Developed multiple MapReduce jobs in Java for data cleaning and pre-processing.
- Automated data pulls from SQL Server into the Hadoop ecosystem via Sqoop.
- Developed a workflow in Oozie to automate loading data into HDFS and pre-processing it with Pig.
- Developed UDFs in Pig and Hive using Java.
- Managed and reviewed Hadoop log files.
- Created multiple Hive tables and implemented partitioning, dynamic partitioning, and buckets in Hive for efficient data access.
- Developed Oozie workflows by integrating all tasks relating to a project and scheduled the jobs as per requirements.
- Exported analyzed data to relational databases using Sqoop for visualization and report generation for the BI team.
- Loaded data into HBase using the HBase shell and Sqoop.
- Applied Agile methodology to improve team cooperation and performance.
Environment: Hadoop, HDFS, MapReduce, Sqoop, Hive, Flume, Oozie, ZooKeeper, MySQL, Eclipse.
Confidential
Software Engineer Trainee/Java Developer
Responsibilities:
- Involved in the development, customization, and design of the product.
- Performed coding, unit testing, and error handling.
- Handled requirement gathering, technical specification creation, design, estimation, and planning.
- Worked on non-functional modules, including analysis of business specifications and system design documents.
Environment: Java 1.6, JSP, JBoss Server, Oracle, Eclipse, Maven
