We provide IT Staff Augmentation Services!

Sr. Big Data Developer Resume

3.00/5 (Submit Your Rating)

Greenwich, CT

PROFESSIONAL SUMMARY:

  • Around 9 years of IT Experience in RDBMS, SQL and PL/SQL Programming and Big Data technologies.
  • Around 5+ years of experience as a Designer & quality reviewer with cross platform integration experience using Hadoop, Java and J2EE
  • Qualified Hadoop, Java Developer with good experience in Analytics, Design and Development and maintaining applications for various Business enterprises.
  • Experience using Cloudera and Horton Works platform and their eco systems. Hands on experience in installing, configuring and using ecosystem components like Hadoop MapReduce, HDFS, Pig, Hive, Sqoop and Flume .
  • Extensive experience in data ingestion, big data storage planning, complex transformations, data integration, analysis for Logistics, Healthcare and Retail sectors.
  • Experienced in Integrating Hadoop with Apache Storm and Kafka . Expertise in uploading Click stream data from Kafka to Hdfs, Hbase and Hive by integrating with Storm.
  • Experience in Apache Spark, Spark Streaming, Spark SQL and No SQL databases like Cassandra and Hbase.
  • Experience in Hadoop distributions like Amazon, Cloudera and Hortonworks and Cloud ecosystems components of Amazon like Redshift, Dynamo DB
  • Hands - on experience in advanced Big-Data technologies like Spark Ecosystem (Spark SQL, SparkR and Spark Streaming, Yarn) .
  • Very good understanding of Hadoop ecosystems like Sqoop2, Spark and YARN.
  • Extensive knowledge on Tableau reporting.
  • Experienced in providing technical solutions to the business on applications developed on Hadoop and its eco systems.
  • Thrives on challenge and works well under pressure, with technical expertise to learn new environments quickly, locate inefficiencies in code, and provide quick solutions.
  • Expertise in developing web applications using JAVA, Spring MVC, vignette portal . Good experience to provide technical oversight for large complex projects and achieve desired customer satisfaction from inception to deployment.
  • Experience in Data Analysis, Data Validation, Data Verification, Data Cleansing, Data Completeness and identifying data mismatch.
  • Experience in working with MR, PIG scripts &HIVE query language and writing UDF’s.
  • Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems and vice-versa.
  • Extensive experience developing and deploying applications using Web Logic, Apache Tomcat and JBOSS.
  • Experienced with Upgrades, Migration and Backup, Disaster Recovery, Performance Monitoring and Fine-tuning of systems running various Linux platforms.
  • Installed, monitored, and supported web and application servers in Linux environments.
  • Experienced analyzing data using Hive QL, Pig Latin, and custom MapReduce programs in Java.
  • Knowledge of job workflow scheduling and monitoring tools like Oozie and Zookeeper
  • Have advanced analytical, problem solving, negotiation and organizational skills with demonstrated ability to multi-task, organize, prioritize and meet deadline. Ability to multitask and work multiple projects concurrently. Ability to work independently and as part of a team.
  • Experience using waterfall project execution methodologies in all phases of an application development starting from Planning all the way to the delivery of the Product
  • Experience in implementing Spark using Scala and SparkSQL for faster analyzing and processing of data.
  • Proficient in working with Various IDE tools including Eclipse Galileo and IntelliJ IDE.
  • Experience in implementing Spark using Scala and SparkSQL for faster analyzing and processing of data.
  • Java & J2EE Technologies, Core Java.
  • Knowledge of working SQL BI and Talend.

TECHNICAL SKILLS:

Big data: Hadoop, Map Reduce, HDFS, Hive, HBase, Pig, Sqoop, Flume, Oozie, Zookeeper, Netezza, Mahout, YARN, Storm, Spark, Kafka (0.8.2x, 0.9.0), Mongo DB , Cassandra, Tableau Reporting.

Hadoop Distributions Cloudera, Hortonworks, MapR.

Core Skills Core Java (OOPs and collections). J2EE Framework, JSP, Servlets, Oracle ADF, JSF, Linux Shell Script, JDBC, Scala.

Databases Oracle, SQL Server.

Design Patterns Singleton, Factory, MVC.

Build Tools ANT, Maven.

Browser Scripting Java Script, HTML OM, DHTML, AJAX, AngularJS.

IDE Eclipse/My Eclipse, JDeveloper.

Operating Systems Red-hat Linux, Windows, Unix.

PROFESSIONAL EXPERIENCE:

Confidential - Greenwich, CT

Sr. Big Data Developer

Roles and Responsibilities:

  • Experience with Cloudera distribution of Hadoop.
  • Installed/Configured/Maintained Apache Hadoop clusters for application development and Hadoop tools like Hive, Pig, HBase, Zookeeper and Sqoop.
  • Deployed Hadoop Cluster in the following nodes, Managing and scheduling jobs on Hadoop cluster.
  • Involved in analyzing system failures, identifying root causes and recommended course of actions.
  • Worked on Hive for exposing data for further analysis and for generating transforming files from different analytical formats to text files.
  • Handled importing of data from various data sources, performed transformations using Hive, Pig and Spark and loaded data into HDFS.
  • Wrote queries to create, alter, insert and delete elements from lists, sets and maps in Datastax Cassandra.
  • Involved in NoSQL (Datastax Cassandra) database design, integration and implementation.
  • Developed multiple MapReduce jobs in java for data cleaning and accessing.
  • Importing and exporting data into HDFS and Hive using Sqoop.
  • Created indices for conditioned search in Datastax Cassandra.
  • Worked on the core and Spark SQL modules of Spark extensively.
  • Implemented Name Node backup using NFS. This was done for High availability.
  • Worked on importing and exporting data from Oracle and DB2 into HDFS using Sqoop.
  • Developed PIG Latin scripts to extract the data from the web server output files to load into HDFS.
  • Worked on custom Pig Loaders and Storage classes to work with a variety of data formats such as JSON, Compressed CSV, etc.
  • Monitored workload, job performance and capacity planning using Cloudera Manager.
  • Created Hive External tables and loaded the data in to tables and query data using HQL.
  • Wrote shell scripts to automate document indexing to Solr Cloud in production.
  • Created Talend Mappings to populate the data into dimensions and fact tables
  • Developed jobs to move inbound files to vendor server location based on monthly, weekly and daily frequency in Talend.
  • Created Hbase tables to store various data formats of PII data coming from different portfolios.
  • Cluster co-ordination services through Zookeeper.
  • Used Flume to collect, aggregate, and store the web log data from different sources like web servers, mobile and network devices and pushed to HDFS.
  • Worked for Amazon Elastic Cloud project using Agile methodology
  • Analyzed the web log data using the HiveQL to extract number of unique visitors per day, page-views, visit duration, most purchased product on website.
  • Converting the Oracle table components to Teradata Table Components in Abilities Graphs.
  • Used Ambary to manage, provision and monitor Hadoop cluster.
  • Implemented Fair schedulers on the Job tracker to share the resources of the Cluster for the Map Reduce jobs given by the users.

Environment: Hadoop, MapReduce, HDFS, Hive, Java, SQL, Cloudera Manager, Spark, AWS, Cassandra, Pig, Sqoop, Oozie, Zookeeper, Ambari, Storm, Teradata, Oracle, NoSQL, Elastic Search, Oozie, Hbase, Talend.

Confidential - Atlanta, GA

Sr. Java/Hadoop Developer

Roles and Responsibilities:

  • Worked on importing and exporting data from Oracle and DB2 into HDFS using Sqoop.
  • Developed PIG scripts to extract the data from the webserver output files to load into HDFS .
  • Worked on custom Pig Loaders and Storage classes to work with a variety of data formats such as JSON, Compressed CSV etc.
  • Monitored workload, job performance and capacity planning using Cloudera Manager.
  • Worked with Cloudera distribution of Hadoop.
  • Deployed Hadoop Cluster worked on Managing and scheduling jobs on Hadoop cluster.
  • Involved in analyzing system failures, identifying root causes and recommended course of actions.
  • Adopted J2EE design patterns like Session Facade and Business Facade .
  • Configuration of application using spring 2.6, Struts 1.3, Hibernate, DAO’s, Actions Classes , Java Server Pages.
  • Configuring Hibernate Struts and Tiles related XML files .
  • Developed the application using Struts Framework that uses Model View, Controller (MVC) architecture with JSP as the view.
  • Developed presentation layer using JSF, JSP, HTML and CSS, JQuery .
  • Extensively used Spring IOC for Dependency Injection and worked on Custom
  • MVC Frameworks loosely based on Struts .
  • Worked on Hive for exposing data for further analysis and for generating transforming files from different analytical formats to text files.
  • Handled importing of data from various data sources, performed transformations using Hive, Pig and Spark and loaded data into HDFS.
  • Involved in NoSQL (Datastax Cassandra) database design, integration and implementation.
  • Wrote queries to create, alter, insert and delete elements from lists, sets and maps in Datastax Cassandra.
  • Created indices for conditioned search in Datastax Cassandra .
  • Developed multiple MapReduc e jobs in java for data cleaning and accessing.
  • Importing and exporting data into HDFS and Hive using Sqoop .
  • Worked on the core and SparkSQL modules of Spark extensively.
  • Implemented name node backup using NFS for High availability.
  • Created Hive External tables and loaded the data in to tables and query data using HQL .
  • Wrote shell scripts to automate document indexing to Cloud in production.
  • Created Talend Mappings to populate the data into dimensions and fact tables
  • Developed jobs to move inbound files to vendor server location based on monthly, weekly and daily frequency in Talend .
  • Created Hbase tables to store various data formats of PII data coming from different portfolios.
  • Cluster co-ordination services through Zookeeper.
  • Used Flume to collect, aggregate, and store the web log data from different sources like web servers, mobile and network devices and pushed to HDFS.
  • Worked for Amazon Elastic Cloud project using Agile methodology.
  • Created reports for the business requirements using Tableau .
  • Analyzed the web log data using the HiveQL to extract number of unique visitors per day, page-views, visit duration, most purchased product on website.
  • Converting the Oracle table components to Teradata Table Components in Abilities Graphs .
  • Used Ambary to manage provision and monitor Hadoop cluster.
  • Implemented Fair schedulers on the Job tracker to share the resources of the Cluster for the Map Reduce jobs given by the users.

Environment: Hadoop, MapReduce, HDFS, Hive, Java, SQL, Cloudera Manager, Spark, AWS, Cassandra, Pig, Sqoop, Oozie, Zookeeper, Ambari, Storm, Teradata, Oracle, NoSQL, Elastic Search, Oozie, Hbase, Talend, Tableau.

Confidential

Java/Hadoop Developer

Roles and Responsibilities:

  • Developer Hadoop ecosystem: Hadoop, MapReduce, Hbase, Sqoop, Amazon Elastic Map Reduce (EMR)
  • Developed a scalable, cost effective, and fault tolerant data ware house system on Amazon EC2 Cloud.
  • Developed MapReduce/EMR jobs to analyze the data and provide heuristics and reports. The heuristics were used for improving campaign targeting and efficiency.
  • Response time for web services built on typical LAMP (php) stack was too slow developed a high performant / high volume / highly scalable platform for bidding in real-time understand from the client the extraction process and decide on the load strategy i.e. whether they want historical data or the current view.
  • Involved in multi-tiered J2EE design utilizing spring framework and JDBC.
  • Worked on building a system using Model-View-Controller (MVC) architecture.
  • Designed the front end using HTML, CSS, Java Script, JSP, jQuery.
  • Designed and implemented the application using Spring MVC, JDBC, MYSQL.
  • Written complex HSQL’s to generate data required in the final reports and pass these HSQL’s to the Ruby programs to convert these HSQL’s to map Reduce programs
  • Importing, exporting data into HDFS and HIVE using Sqoop
  • Responsible for loading unstructured data into Hadoop file system (HDFS)
  • Created and scheduled jobs for maintenance
  • Configured Database Mail
  • Monitored File Growth
  • Maintained Operators, Categories, Alerts, Notifications, Jobs and Schedules.
  • Maintained database response times, proactively generated performance reports.
  • Automated most of the DBA Tasks and Monitoring stats
  • Developed complex stored procedures, views, clustered/non-clustered indexes, triggers (DDL, DML, LOGON) and user defined functions
  • Created a mirrored database using Database Mirroring with High Performance Mode
  • Created database snapshots and stored procedures to load data from the snapshot database to the report database
  • Restore Development and Staging databases from production as per the requirement
  • Involved in resolving Dead lock issues and Performance issues
  • Query Optimization and Performance Tuning for long running queries and created new indexes on tables for faster I/O.

Environment: MS SQL Server 2005/2000, Windows 2000/2003 Server, DTS, Web Logic, Redhat Enterprise MS Access, XML, Hadoop, MapReduce, Hbase, Sqoop, Amazon Elastic Map Reduce CDH, Cassandra, NOSQL, Teradata.

Confidential

Java/J2EE Developer

Roles and Responsibilities:

  • Involved in multi-tiered J2EE design utilizing spring framework and JDBC.
  • Worked on building a system using Model-View-Controller (MVC) architecture.
  • Designed the front end using HTML, CSS, Java Script, JSP, jQuery.
  • Designed and implemented the application using Spring MVC, JDBC, MYSQL.
  • Used SVN version control tool.
  • Automated the build process by writing Maven build scripts.
  • Wrote SQL queries, stored procedures, modifications to existing database structure as required for addition of new features using MySQL database.
  • Involved in installing and configuring Eclipse for development.
  • Configured and customized logs using Log4J and unit testing using Junit.
  • Developed JUnit Test cases and performed application testing for QC team.
  • Used JavaScript for client-side validations.
  • Participated in weekly project meetings, updates and Provided Estimates for the assigned Task.

Environment: Java, J2EE, JavaScript, JDBC, Spring, ASP.NET, VB.NET, AGILE - SCRUM, JSP, Servlet, XML, Design Patterns, Log4J, JUnit, SVN, MySQL, Eclipse.

We'd love your feedback!