We provide IT Staff Augmentation Services!

Hadoop/big Data Developer Resume

2.00/5 (Submit Your Rating)

Columbia, MD

SUMMARY

  • Over 9 +years of experience in software development includes Analysis, Design and Development of quality software for Standalone Applications and Web - based applications using JAVA/J2EE Technologies using Software Development Methodologies / Frameworks like SDLC, OOAD and AGILE.
  • Developed web applications based on different Design Patterns such as Model-View-Controller (MVC), Data Access Object (DAO), Singleton Pattern, Front Controller, Business Delegate, Service Locator, Transfer Objects etc.
  • Experienced in using Java tools like Intelli J, Eclipse.
  • Good knowledge of Hadoop Architecture and various components such as HDFS, Job Tracker, Task Tracker, Name Node, Data Node, MapReduce concepts responsible for writing MapReduce programsand setting up standards and processes for Hadoop-based application design and implementation.
  • Performance benchmarking & optimization of H-scale implemented Big data Components.
  • Involved in the process of data acquisition, data pre-processing and data exploration of telecommunication project in Scala.
  • Expertise with different tools in Hadoop Environment including Pig, Hive, HDFS, MapReduce, Spark, Kafka, Yarn, and Zookeeper.
  • Extensively used Scalafor functional application programming for creating GUI and charts and data analytics.
  • Used Different Spark Modules like Spark core, Spark RDD's, Spark Data frame, Spark SQL.

TECHNICAL SKILLS

  • Hadoop Technologies: HBase, HIVE, Sqoop, Flume, HDFS, Oozie, Zookeeper, YARN, Spark, Kafka, Sentry, Falcon, Pig
  • J2EE Technologies: Servlets, JSP, EJB, JDBC, Web Services (WSDL, SOAP), Spring and
  • Web Services/ Application Servers: Apache tomcat Server, IBM WebSphere server, JBoss
  • Web Tools and Languages: HTML, XML, CSS, DHTML, Java Script
  • Databases (SQL): IBM DB2, Oracle8i/9i/10g, MS SQL Server 2005/2008, MySQL
  • DataBases (NO - SQL): PIG, HIVE, Cassandra, MongoDB, HBASE
  • Languages: Scala, Python, Java / J2EE, HTML, SQL
  • OS: Windows 2003/2008/XP/Vista, Unix, Linux (Various Versions)
  • Tools: MS-Office 2003/2007/2010, Eclipse3.3/3.4, Eclipse, Net Beans
  • Version Control: IBM RTC
  • Bug Reporting Tools: Bugzilla, IBM Rational Clearcase
  • Others: ASP.NET, VB.NET and C#
  • IDEs: Eclipse, NetBeans, JDeveloper, MyEclipse

PROFESSIONAL EXPERIENCE

Confidential - Columbia, MD

Hadoop/Big Data Developer

Responsibilities:

  • Worked with the advanced analytics team to design fraud detection algorithms, and retrieving real-time streaming datasets and then developed MapReduce programs to efficiently run the algorithm on the huge datasets.
  • Ran data formatting scripts in python and created terabyte csv files to be consumed by Hadoop MapReduce jobs.
  • Performed data analysis, feature selection, feature extraction using Apache Spark Machine Learning streaming libraries in Python.
  • Developed functional programs in SCALA for connecting the streaming data application and gathering webdatausing JSON and XML and passing it to FLUME.
  • Configured Kafka to read and write messages from external programs.
  • Configured Kafka to handle real time data.
  • Extensively used SCALA for connecting and retrieving data from NO-SQL databases such as MongoDB, PIG, HIVE, Cassandra, and HBASE
  • Involved in administration, installing, upgrading and managing CDH3, Pig, Hive&HBase.

Confidential - Harrisburg, PA

Hadoop/Big Data Developer

Responsibilities:

  • Developed data pipeline using Flume, Sqoop, Pig and Java map reduce to ingest customer behavioral data and financial histories into HDFS for statistical data analysis.
  • Worked on statistical regression and modelling, and Language processingand analysis using Python and Scala in HDFS
  • Involved in writing Map Reduce jobs.
  • Developed Spark code using Scala and Spark-SQL for faster testing and data processing.
  • Involved in Sqoop, HDFS Put or Copy from Local to ingest data.
  • Used Pig to do transformations, event joins, filter boot traffic and some pre-aggregations before storing the data onto HDFS.
  • Developed functional programs in SCALA for connecting the streaming data application in FLUME
  • Extensively used SCALA for connecting and retrieving data from NO-SQL databases such as MongoDB, PIG, andHIVE.
  • Involved in developing Pig UDFs for the needed functionality that is not out of the box available from Apache Pig.
  • Used Hive to analyze the partitioned and bucketed data and compute various metrics for reporting.

Environment: MapReduce, HDFS, Hive, Pig, Hue, Oozie, Core Java, Perl/Shell scripts, Eclipse, Hbase, Flume, Spark, Kafka, Cloudera Manager, Cassandra, REST API, Python, Greenplum DB, IDMS, VSAM, SQL*PLUS, Toad, Putty, Windows NT, UNIX Shell Scripting, Pentaho, Talend, Bigdata, YARN.

Confidential - Atlanta, GA

Java Developer

Responsibilities:

  • Worked with business analyst in understanding business requirements, design and development of the project.
  • Implemented the JSP frame work with MVC architecture.
  • Created new JSP's for the front end using HTML, Java Script, Jquery, and Ajax.
  • Developed the presentation layer using JSP, HTML, CSS and client side validations using JavaScript.
  • Involved in creating Restful web services using JAX RS and JERSEY tool.
  • Involved in designing, creating, reviewing Technical Design Documents.
  • Developed DAOs (Data Access Object) using Hibernate as ORM to interact with DBMS - Oracle.

Environment: Java, JSP, JavaScript, Servlets, Hibernate, REST, EJB, JSF, JSP, Ant, Tomcat, Eclipse, SQL, Oracle.

Confidential - Oak Brook, IL

System Lead

Responsibilities:

  • Wireless and Cable network management design, database management, and web application development
  • Cable systems performance analytics design, database management and web application development for huge customer base of cable industry.
  • Cable industry financial analytics development based on current performances based on logs of each cable consumer.
  • Developed five web design related projects and three database related application development.
  • Developed web applications using MVC 3/4, with front end using CSHTML and CSS
  • Implemented DATA TABLES in web applications using JQUERY DATATABLES and MVC WEBGRID methods.
  • Developed regression and analytics based applications and graphics displays using Google APIs.
  • Developed applications for XML data parsing and loading relevant data in WEB Applications and databases.

Environment: Visual C++, Visual C#, MVC 3/4/5, Google APIs, Servlets, Java Server Pages, HTML, and CSS

We'd love your feedback!