Sr. Big Data LeadDeveloper Resume Brenham, TX - Hire IT People

PROFESSIONAL SUMMARY:

A Hadoop Professional wif over 8 Plus years of IT experience includes 3 Plus years of experience in Big Data, Hadoop Eco System related technologies wif domain experience in Financial, Banking, Insurance, Retail and Non - profit Organizations in Software Development and support of applications.
Excellent understanding/noledge ofHadoopEcosystem including HDFS, MapReduce, Hive, Pig, Spark, Kafka, YARN, HBase, Oozie, ZooKeeper, Flume and Sqoop based Big Data Platforms.
Expertise in design and implementation of Big Data solutions in Banking, Retail and E-commerce domains.
Experienced wif NoSQL databases like Hbase and Cassandra.
Comprehensive experience in building Web-based applications using J2EE Frame works like EJB, Struts and JMS.
Excellent ability to use analytical tools to mine data and evaluate teh underlying patterns.
Assisted in Cluster maintenance, Cluster Monitoring, Managing and Reviewing data backups and log files.
Hands-on experience in developing MapReduce programs using ApacheHadoopfor analyzing teh Big Data.
Expertise in optimizing traffic across network using Combiners, joining multiple schema datasets using Joins and organizing data using Partitioners and Buckets.
Experienced in writing complex MapReduce programs dat work wif different file formats like Text, Sequence, Xml and Avro.
Expertise in composing MapReduce Pipelines wif many user-defined functions using Apache PIG.
Implemented business logic by writing Pig Latin UDFs in Java and used various UDFs from Piggybanks and other sourcesHIVE.
Expertise in Hive Query Language (HiveQL), Hive Security and debugging Hive issues.
Responsible for performing extensive data validation using HIVE Dynamic Partitioning and Bucketing.
Experience in developing custom UDFs for Pig and Hive to incorporate methods and functionality of Java into Pig Latin and HQL (Hive QL).
Worked on different set of tables like External Tables and Managed Tables.
Experience in scheduling MapReduce/Hive jobs using Oozie.
Experience in ingesting large volumes of data into Hadoop using Sqoop.
Expertise in creatingdatabases, users, tables, triggers,macros, views, stored procedures, functions, Packages and joins in Oracle database.
Experience in writing real time query processing using Cloudera Impala.
Acted as SME and Module Lead for teh major projects undertaken.
Expert database engineer, NoSQL and relational data modeling.
Responsible for building scalable distributed data solutions using Hbase, Cassandra.
Performed various CRUD operations in Cassandra database by loading time series data into Cassandra database.
Worked wif Apache Spark for quick analytics on object relationships.
Hands-on experience in writing Scala code to perform quick analytics in Spark.
Hands-on noledge about Spark RDD’s, DataFrames and to perform transformations and actions on RDD’s.
Experience in building clusters on AWS using Amazon EC2 services and Cloudera manager.
Experience in Big Data platforms like Hortonworks, Cloudera, Amazon AWS and Apache.
Complete domain and development Life Cycle noledge of Data Warehousing & Client Server Concepts and noledge of basicdata modeling.
Good noledge onAgile Methodologyand teh scrum process.
Very good in Fast learning, Analytical thinking, decision making and problem solving skills.

TECHNICAL SKILLS:

Web Technologies: JSP, REST API, HTML5, CSS, JavaScript

JEE Technologies: Servlets, Web Services, SOAP, WebLogic, Apache Jakarta-Tomcat Languages and Hadoop Components Java, Hadoop, COBOL, CICS, C, C++, SQL, PL / SQL, Sqoop, Flume, Hive, Pig, MapReduce, Scala, YARN, Oozie, Spark, Impala, Hue

SQL and NoSQL Databases: Cassandra, Hbase, Oracle, DB2, MySQL, SQLite, MS SQL Server 2008 / 2012, MS Access.

Operating Systems: Windows 98/NT/XP/Vista/7, Windows CE, Linux, UNIX, IOS, MAC.

Methodologies: Agile, Rapid Application Development, Waterfall Model, Iterative Model

Big data Platforms: Hortonworks, Cloudera, Amazon AWS, Apache

Frameworks: Spring, Hibernate, EJB, Struts

PROFESSIONAL EXPERIENCE:

Confidential, Brenham, TX

Sr. Big Data LeadDeveloper

RESPONSIBILITIES:

Acted as a SME and Module Lead for two major projects undertaken wif a team size of 4 people.
Involved in deploying Hadoop clusters on AWS using Amazon EC2 services and Cloudera Manager.
Worked on Big data platform Cloudera. Used Kafka as a messaging system to get data from different sources.
Creating various Hive and Pig Latin scripts for performing ETL transformations on teh transactional and application specific data sources.
Wrote and executed PIG scripts using Grunt shell.
Big data analysis using Pig and User defined functions (UDF).
Performed joins, group by and other operations in Hive and PIG.
Processed and formatted teh output from PIG, Hive before sending to theHadoopoutput file.
Used HIVE definition to map teh output file to tables.
Used Oozie to schedule map reduce and Hive jobs to generate weekly and monthly reports.
Reviewed teh HDFS usage and system design for future scalability and fault-tolerance.
Importing data from relational data stores toHadoopusing Sqoop.
Incremental data movement using Sqoop and Oozie jobs.
Used Impala for real time query processing in Cloudera.
Worked wif Apache Spark for quick analytics on object relationships by writing code in Scala.
Involved in Spark Streaming architecture and nodes setup meeting to perform real time processing of data dat is ingested from Kafka.
Developed scalable distributed solutions using Cassandra database and performed various CRUD operations.
Created UDF’s to encrypt teh customer sensitive data and stored into HDFS and performed analysis using PIG.
Effective working wif teh team in performing teh big data tasks and delivering teh projects in time.
Involved in cluster setup meetings wif teh administration team.

Environment: Apache Hadoop 2.2.0, Cloudera, MapReduce, Hive, Hbase, HDFS, Cassandra, PIG, Sqoop, Impala, Oozie, Kafka, Java 1.7, Python, UNIX, Shell Scripting, XML.

Confidential, Atlanta, GA

Sr. Hadoop Developer

RESPONSIBILITIES:

Worked on Hortonworks platform. Developed data pipeline using Flume and Sqoop to ingest customer behavioral data and financial histories from traditional databases into HDFS for analysis.
Ingested large volumes of data from Teradata to Hadoop using Sqoop.
Involved in writing Map Reduce jobs.
Involved in Sqoop, HDFS Put or Copy from Local to ingest data.
Used Pig to do transformations, event joins, filter boot traffic and some pre-aggregations before storing teh data onto HDFS.
Involved in developing Pig UDFs for teh needed functionality dat is not available from Apache Pig.
Used Hive to analyze teh partitioned and bucketed data and compute various metrics for reporting.
Involved in developing Hive DDLs to create, alter and drop Hive tables.
Involved in developing Hive UDFs for teh needed functionality dat is not available from Apache Hive.
Computed various metrics using Java Map Reduce to calculate metrics dat define user experience, revenue etc.
Responsible for developing data pipeline using flume, Sqoop and pig to extract teh data from weblogs and store in HDFS.
Performed various CRUD operations on Cassandra Clusters.
Involved in using SQOOP for importing and exporting data into HDFS.
Involved in processing ingested raw data using Map Reduce, Apache Pig and Hive.
Involved in developing Pig Scripts for change data capture and delta record processing between newly arrived data and already existing data in HDFS.
Involved in emitting processed data fromHadoopto relational databases or external file systems using SQOOP, HDFS GET or CopyToLocal.
Involved in developing Shell scripts to orchestrate execution of all other scripts (Pig, Hive, and Map Reduce) and move teh data files wifin and outside of HDFS.

Environment: Hadoop 2.2.0, Map Reduce, Cassandra, Kafka, Yarn, Hive, Pig, HBase, Oozie, Sqoop, Flume, Oracle 11g, Core Java, Hortonworks, HDFS, Eclipse.

Confidential, Jacksonville, FL

Hadoop Developer

RESPONSIBILITIES:

Part of team for developing and writing PIG scripts.
Loaded teh data from RDBMS SERVER to Hive using Sqoop.
Created Hive tables to store teh processed results in a tabular format.
Developed teh Sqoop scripts in order to make teh interaction between Hive and MySQL Database.
Developed Java Mapper and Reducer programs for complex business requirements.
Developed Java custom record reader, partitioner and serialization techniques.
Used different data formats (Text format and Avro format) while loading teh data into HDFS.
Created Managed tables and External tables in Hive and loaded data from HDFS.
Performed complex HiveQL queries on Hive tables.
Optimized teh Hive tables using optimization techniques like partitions and bucketing to provide better performance wif HiveQL queries.
Created partitioned tables and loaded data using both static partition and dynamic partition method.
Created custom user defined functions in Hive.
Performed SQOOP import from Oracle to load teh data in HDFS and directly into Hive tables.
Performed incremental data movement to Hadoop using Sqoop.
Developed Pig Scripts to store unstructured data in HDFS.
Scheduled map reduce jobs in production environment using Oozie scheduler.
Analyzed teh Hadoop logs using PIG scripts to oversee teh errors caused by teh team.
Experience in gathering requirements from teh client, giving estimates for developing projects and delivering teh projects in time.

Environment: HDFS, Map Reduce, Hive, Sqoop, Pig, Flume, HBase, Oozie Scheduler, Java, Oracle, Shell Scripts.

Confidential, Phoenix, AZ

Java and HadoopDeveloper

RESPONSIBILITIES:

Extensively implemented various QA methodologies, testing strategies, and test plans in all stages of SDLC by followed Agile SCRUM methodology.
Developed Pig Scripts for validating and cleansing teh data.
Developed MapReduce programs to phrase teh raw data, and stored teh refined data in Cognition DB.
Created HIVE queries for moving data from Cornerstone (Data Lake) to HDFS locations.
Provided design recommendations and thought leadership to sponsors/stakeholders dat improved review processes and resolved technical problems.
Managed and reviewedHadooplog files.
Shared responsibility for administration ofHadoop, Hive and Pig.
Involved in teh process of load, transform and analyze Transactions data from various providers into Hadoopon an on-going basis.
Extensively worked on PIG scripts.
WroteTeradata Macrosand used various Teradataanalytic functions.
Involved in migration projects tomigrate datafrom data warehouses on Oracle/DB2 and migrated those toTeradata.
Performance tuned and optimized various complex SQL queries.
SQL queries and back end testing, Tableau report testing, deployment into UAT and Production.
Participated and conducted Issue Log weekly status meetings, Report status meetings and Project status meetings to discuss issues and workarounds
Communicated wifdevelopersthroughout all teh phases of testing to eliminate Roadblocks
Generated daily progress report and represented in daily Agile Scrum meetings.

Environment: Hadoop, HDFS, Hive, Map Reduce, Core Java, Teradata, Oracle, UNIX, Tableau.

Confidential

Software Developer

RESPONSIBILITIES:

Designed and developed UI screens wif JSF to provide interactive screens to display data for Provider module.
Developed and implemented client side and server side validations.
Developed teh business layer logic and implemented EJBs Session beans.
Writing teh test plans and test cases for teh developed screens.
Worked on bug fixing and enhancements on change requests.
Designed and developed UI screens wif Struts to provide interactive screens to display data.
Performed bug verification, release testing and provided support for Oracle based applications.
Designed and developed presentation layers as well as business layer for teh entire application.
Database access was done using JDBC. Accessed stored procedures using JDBC.
Extensively involved in creatingPL/SQL objects me.e. Procedures, Functions, and Packages.
Extensively involved indebuggingthe existing PL/SQL objects.
Involved inperformance tuningthe existing objects.
Worked on bug fixing and enhancements on change requests.
Executing test cases and fixing teh bugs through unit testing.

Environment: Java/J2EE, Servlets, JSP, Apache Tomcat, Websphere Application server 6.0.1, EJB, Struts, Oracle, XML, HTML, MY SQL, MS-SQL server

We provide IT Staff Augmentation Services!

Sr. Big Data Leaddeveloper Resume

Brenham, TX

We'd love your feedback!

Resume Categories

Client Services

Job Seekers

Visa Sponsorship