Data Engineer Resume

Burlington, MA

SUMMARY

  • Cloudera Certified Data Engineer with 6 years of IT experience in designing and developing Hadoop/Big Data technologies.
  • Strong collaboration, team-building, interpersonal, and communication skills, with proficiency at grasping new technical concepts quickly and applying them productively.
  • Working knowledge of major Hadoop ecosystem components: Hive, MapReduce, and YARN.
  • Experience writing MapReduce jobs to analyze large volumes of data.
  • Adept in Scrum methodology. Also familiar with the SDLC, from requirement analysis through system study, design, testing, debugging, documentation, and implementation.

TECHNICAL SKILLS

Programming Skills: Core Java

Big Data Platform: Hortonworks HDP, Cloudera Hadoop CDH3/4, Java MapReduce (MRv1, MRv2/YARN), Hive, Oozie, AWS

Database/Tools: DB2, Oracle, NoSQL, Cassandra

Other Tools: Maven, JUnit, Eclipse IDE

Scripting/Deployment tools: Shell Script, Docker, Python

Reporting Tool: Eclipse BIRT

PROFESSIONAL EXPERIENCE

Confidential, Burlington MA

Data Engineer

Responsibilities:

  • Gather and process raw data at scale (including writing scripts, web crawling, calling APIs, writing HQL/SQL queries, etc.)
  • Design and develop code, scripts, and data pipelines that leverage structured and unstructured data integrated from multiple sources (e.g. stream, batch, etc.) via CSV/XLS files
  • Perform software installation and configuration
  • Participate in requirements and design workshops for enterprise dashboard creation
  • Build a data platform capable of supporting data visualization using Eclipse BIRT
  • Process unstructured data, particularly log files, into a form suitable for analysis, and then perform the analysis
  • Develop analytics-ready data sets for collaboration with Data Analysts
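
As an illustration of turning log files into an analysis-ready form, here is a minimal Python sketch. The log line layout, the regular expression, and the names `LOG_PATTERN`, `parse_line`, `parse_lines`, and `count_by_level` are assumptions made for this example, not details of the actual project.

```python
import re
from collections import Counter

# Assumed (hypothetical) log layout: "2015-03-01 12:00:05 ERROR Disk quota exceeded"
LOG_PATTERN = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2}) (?P<time>\d{2}:\d{2}:\d{2}) "
    r"(?P<level>[A-Z]+) (?P<message>.*)$"
)


def parse_line(line):
    """Return a dict of named fields for a well-formed line, or None otherwise."""
    match = LOG_PATTERN.match(line.strip())
    return match.groupdict() if match else None


def parse_lines(lines):
    """Parse an iterable of raw lines, skipping any that do not match the layout."""
    records = []
    for line in lines:
        record = parse_line(line)
        if record is not None:
            records.append(record)
    return records


def count_by_level(records):
    """Simple analysis step: count parsed records per log level."""
    return Counter(record["level"] for record in records)


if __name__ == "__main__":
    sample = [
        "2015-03-01 12:00:05 ERROR Disk quota exceeded",
        "2015-03-01 12:00:07 INFO Job started",
        "this line does not follow the layout",
    ]
    print(count_by_level(parse_lines(sample)))
```

Lines that do not match the assumed layout are dropped here; a real pipeline might route them to a separate file for inspection instead.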

Confidential

Hadoop Developer

Responsibilities:

  • Created the low-level design for the Data Ingestion module and enhanced a Hadoop MapReduce job that joins the incoming slices of data and picks only the fields needed for further processing.
  • Developed a Hadoop MapReduce job that makes JNI calls to a compiler: wrote the (SAS-like) grammar for the compiler, populated the values the compiler needs for calculation, and retrieved the results through native JNI calls.
  • All of this runs in a distributed environment.
  • Developed several advanced MapReduce programs to process incoming data files.
  • The processing uses LSH (Locality-Sensitive Hashing) to evaluate the patterns defined in the grammar and returns the customers who match the patterns defined in the native language.
  • A standalone Java program picks up the customers who have matched patterns and writes the entries into a MySQL database.
  • Developed Java programs to process huge JSON files received from the marketing team and convert them into the application's standardized format.

Environment: MapReduce, Hive, Java, MySQL
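
The join-and-project job described above was written in Java MapReduce. As a rough illustration of the pattern only, the following in-memory Python sketch mimics the map, shuffle, and reduce steps. The function names, the slice names, and the field names are all hypothetical, and it assumes at most one row per key in each slice.

```python
from collections import defaultdict


def map_slice(slice_name, rows, key_field):
    """Map step: emit (join key, (slice name, row)) for every row in a slice."""
    for row in rows:
        yield row[key_field], (slice_name, row)


def shuffle(pairs):
    """Group emitted values by key, standing in for the framework's shuffle."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_join(tagged_rows, wanted_fields):
    """Reduce step: merge rows sharing a key, then keep only the wanted fields.

    Assumes one row per key per slice; later rows overwrite earlier fields.
    """
    merged = {}
    for _slice_name, row in tagged_rows:
        merged.update(row)
    return {field: merged[field] for field in wanted_fields if field in merged}


def join_slices(slices, key_field, wanted_fields):
    """Run map, shuffle, and reduce over a dict of {slice name: list of rows}."""
    pairs = []
    for slice_name, rows in slices.items():
        pairs.extend(map_slice(slice_name, rows, key_field))
    grouped = shuffle(pairs)
    return [reduce_join(grouped[key], wanted_fields) for key in sorted(grouped)]


if __name__ == "__main__":
    slices = {
        "profile": [{"customer_id": 1, "name": "A", "city": "X"}],
        "orders": [{"customer_id": 1, "amount": 30}],
    }
    print(join_slices(slices, "customer_id", ["customer_id", "name", "amount"]))
```

Projecting fields in the reduce step keeps the output small; in a real job the projection could also be pushed into the mappers to reduce shuffle volume.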

Confidential

Java Hadoop Developer

Responsibilities:

  • Designed the data model in Cassandra
  • Created a custom utility to load data into Cassandra
  • Used Hector APIs to insert and update data in Cassandra
  • Created the low-level design for data ingestion with MapReduce programming on Cassandra
  • Bulk loaded data into Cassandra using sstableloader
  • Measured the performance of data ingestion into Cassandra using various options
  • Installed and configured Apache Hadoop 1.0.1 to test the maintenance of log files in a Hadoop cluster
  • Developed Java MapReduce programs to analyze sample log files stored in the cluster
  • Migrated ETL processes from MySQL to Hive to test the ease of data manipulation
  • Developed Hive queries to process the data for visualization

Environment: MapReduce, Cassandra, Hive, Core Java

Confidential

Hadoop Developer

Responsibilities:

  • Generated data on the most frequently used airlines across cities, based on the requirements, using Core Java
  • Generated reports using Cassandra and Hive
  • Integrated Hive with Cassandra
  • Created reports using Hector APIs

Environment: Cassandra, Hive, MapReduce, Core Java

Confidential

Smalltalk and Java Developer

Responsibilities:

  • Understand requirements provided by customers who ask to customize the LoanIQ product for their own needs
  • Write unit test cases in Java based on the customization and modify the code
  • Modify modules using Smalltalk and Java
  • Perform manual unit testing
  • After deployment, document the life cycle of the enhancement

Environment: Smalltalk, Core Java, Oracle

Confidential

Core Java Developer

Responsibilities:

  • Design the front page of the assigned module using Adobe Flex and Core Java
  • Write unit test cases in Core Java
  • After deployment, document the life cycle of the enhancement

Environment: Adobe Flex, Core Java, Ant, MySQL

Confidential

Core Java Developer

Responsibilities:

  • Gather requirements and design the module using Adobe Flex and Core Java
  • Write build scripts in Ant
  • Write unit test cases in Core Java
  • Write documentation for the full life cycle of module development

Environment: Flex, Java, Ant, MySQL
