Spark Developer Resume
Cherry Hill, NJ
SUMMARY:
- Experienced IT professional with 3 years as a Hadoop/Spark Developer in Big Data technologies development and 3 years as a Java developer.
- Experience working with Cloudera & Hortonworks Distribution System of Hadoop.
- Expertise in Hadoop ecosystem include HDFS, Hive, Pig, Sqoop, Kafka, Zookeeper, MapReduce and Spark.
- Experience in analyzing the different types of data that flow from data lakes to Hadoop Clusters.
- Hands on experience with Big Data Hadoop core and Eco - System components (HDFS, Yarn, Hive, Sqoop, Oozie, Hbase, Zookeeper and Pig).
- Experience in manipulating the streaming data to clusters through Kafka and Spark- Streaming.
- Experience in partitioning Data according to the business requirements using Hive.
- Working with data importing and exporting in Sqoop.
- Working with data extraction, transformation and load in Hive, Pig and HBase.
- Working with data transformation from HDFS, HIVE, PIG, HBase, and MySQL.
- Experience in creating UDF's, UDAF's for Hive.
- Acumen in choosing an efficient ecosystem in Hadoop and providing the best solutions to Big Data problems.
- Good Knowledge in Spark and Scala.
- Proficient in working with NoSQL database like MongoDB, Cassandra and HBase.
- Good Knowledge in Design Pattern.
- Prolific in generating the splendid and informative dashboards for Business Intelligence teams.
- Experience in using Front-end technologies like JavaScript, HTML, CSS, HTML5 and CSS3.
- Experience in Test such as JUnit
- Expertise in relational databases like Oracle, SQL, My SQL and SQL Server.
- Experience in Agile methodologies.
- Comfortable in Linux, UNIX environments. And Expertise in Unix-Shell
- Proficient communication skills with an ability to lead a team & keep them motivated.
- Extensive experience with Front-end and Back-end IDE's like Eclipse, Visual Studio.
- Highly motivated and versatile team player with the ability to work independently & adapt quickly to new emerging technologies.
TECHNICAL SKILLS:
Hadoop Ecosystem \Framework: MapReduce, Hive, Pig, HBase, Sqoop, Oozie, \ Hadoop, Spark, Spring. MVC, Node.JS, Spark, HDFS, YARN, Kafka, Zookeeper
Databases\Methodologies: Oracle, MySQL, SQL Server, Hbase, Cassandra, Agile Scrum MongoDB
Languages\Web Technologies: Java, Scala, C++, SQL, HiveQL, Pig Latin, \HTML 5, JQuery, JSON, XML, CSS 3
JavaScript, Shell: Scripting, Angular.JS
Systems\Other: Windows, Linux, UNIX Eclipse, Maven, JUnit, Avro, Git
PROFESSIONAL EXPERIENCE:
Confidential, Cherry Hill, NJ
Spark Developer
Responsibilities:
- Worked towards designing, development, and deployment of Hadoop cluster using Spark, Spark-Streaming, Apache Kafka and Cassandra.
- Designed data importing method from the data sources to Kafka clusters.
- Process and structure data utilizes Spark-Core, Spark-streaming Frameworks.
- Developed Scala for Data Streaming, Processing, Modeling and Testing Purposes.
- Migrating data from Spark-RDD into Cassandra.
- Realized read incoming data in Real Time and preprocess them within few seconds.
- Accomplished different goals of the management team by implement algorithms and business logics of data process in Spark related technologies.
- Contributed towards developing a Data Pipeline to load data from sources to Apache Kafka cluster.
- Expertised in improving the performance of application by analytical skills.
- Involve in retrieving data from Cassandra clusters.
- Participated in Data modeling, create storage procedure and index using Cassandra.
- Strong problem-solving experience on test, identify, address, debug and then resolve technical issues that affect the integrity of the application.
- Worked in the agile Environments, effectively communicated at all levels of an organization in Management and Technical roles.
Environment: Hadoop, Spark, Apache Kafka, Apache Avro, Cassandra, Scala, Java, JSON, Linux, Hortonworks
Confidential, NYC, NY
Hadoop/Spark Developer
Responsibilities:
- Developed a data pipeline using Kafka, Spark, Scala, MapReduce, Hive and Sqoop to ingest, transform and analyzing customer behavioral data.
- Imported the data from RDBMS into HDFS using Sqoop.
- Design dynamically storage logic of data and improved performance with HiveQL in Hive.
- Design and Developed MapReduce in Java to process large volume of data.
- Used Kafka to collect streaming data from customers' consumption records. Configured Kafka to read and write messages from external data sources.
- Implemented Spark using Scala and utilizing Spark Core, Spark Streaming and Spark SQL API for faster processing of data instead of MapReduce in Java
- Realize process of real time streaming data using Spark Streaming and Kafka, storage data into RDBMS
- Exploring with the Spark improving the performance and optimization of the existing algorithms in Hadoop MapReduce and Spark.
- Exported the analyzed data from HDFS to the relational databases using Sqoop to further visualize and generate reports for the BI team.
- Used to monitor and manage the Hadoop cluster using Cloudera Manager.
Environment: Kafka, Spark, Scala, Hadoop, HDFS, Hive, Pig, Sqoop, HBase, Java, MySQL, Oracle, Linux
Confidential, NY
Hadoop Developer
Responsibilities:
- Adept in Hadoop related technology experience in Storage, Querying, Processing and Data Analysis.
- Handled importing and exporting data using Sqoop from HDFS to RDBMS and vice-versa.
- Developed Map Reduce programs in Java to process data.
- Used Apache Avro to transform data between compact binary data format and JSON format.
- Implemented Proof of concepts on Hadoop Stack and different big data analytic tools, migration from different data sources like HDFS/Hbase into Hadoop?
- Implemented Pig scripts to clean and accomplish data preprocessing.
- To improve performance, utilize Hive scripts to reorganize data.
- Wrote various Queries on data according to business requirement.
- Effectively communicated at all levels of an organization’s Management and Technicians.
- Working closely with BI team.
Environment: Hadoop, Hive, Pig, Sqoop, HBase, Apache Avro, Java, JSON, MySQL, Oracle, Linux
Confidential
Java Developer
Responsibilities:
- Designed and developed of application using Spring. MVC framework with Agile methodology
- Developed JSP and HTML5 pages using CSS3 and JavaScript as part of the presentation layer
- Hibernate framework is used in persistence layer for mapping an object-oriented domain model to database.
- Developed database schema and SQL queries for querying, inserting and managing database
- Implemented various design patterns in the project such as Data Transfer Object, Data Access Object and Singleton
- Used Maven scripts to fetch, build, and deploy application to development environment
- Used JUnit for functional and unit testing code.
Environment: Spring. MVC, Hibernate, JSP, JavaScript, MySQL, JUnit, Java6, Servlets, Apache Tomcat, CSS 3, HTML 5, Eclipse, JIRA, JDBC, Jenkins
Confidential
Jr.Java Developer
Responsibilities:
- Developed of application using Spring. MVC to handle the data communication from User interface to Server then to Database and vice-versa.
- Deployed and configure Apache Tomcat as web server.
- Utilize JDBC technologies in Java make connection between Apache Tomcat and MS SQL Server, also User interface and Apache Tomcat.
- Accomplished exchange of data between the Client, Server and Database in Hibernate.
- Implement functions like log in, inquire, reservation, change, cancellation and Payment Method.
- Involve in design user interface in HTML, CSS, JavaScript, XML, JQuery and Angular.JS.
- Involve in designed and developed database with SQL and Oracle, participate in physic and logic data modeling, create storage procedure and index for the web application.
Environment: Spring. MVC, Hibernate, JSP, JavaScript, SQL Server, JUnit, Java6, Servlets, Apache Tomcat, CSS 3, HTML 5, SQL, Oracle, Eclipse, JDBC
