We provide IT Staff Augmentation Services!

Hadoop Development Engineer Resume

2.00/5 (Submit Your Rating)

Phoenix, AZ

SUMMARY

  • Around 7+ years of overall IT experience in various domains, which includes hands on experience of 3 years in Big Data technologies and extensive experience of 4+ years in Java/J2EE.
  • Experience in installation, configuration, management and deployment of Big Data solutions and the underlying infrastructure of Hadoop Cluster.
  • Real streaming the data using Spark with Kafka.
  • Hands on experience running Hive and Pig on Tez using HDP.
  • Worked on Hcatalog to share the schema across the distributed applications.
  • Hands on experience in installing, configuring, and using Hadoop ecosystem components like Hadoop Map Reduce, HDFS, Hive, Sqoop and Pig.
  • Worked on creating indexes and working with Indexes using SOLR on HDP.
  • Experienced in developing MapReduce programs using Apache Hadoop for working with Big Data.
  • Created Hive tables to store data into HDFS and processed data using Hive QL.
  • Extending HIVE and PIG core functionality by writing custom UDF’s
  • Worked with Importing and exporting the data using Sqoop from HDFS to Relational Database systems/mainframe and vice - versa.
  • Worked on Incremental Import by creating Sqoop Metastore Jobs.
  • Experience in using Apache Flume for collecting, aggregating and moving large amounts of data from application servers.
  • Can perform Production operations like administration, configuring Hadoop, Debugging and Performance analysis.
  • Experience in development and configuration of distributed computing system
  • Experience with data warehouse life cycles and methodologies
  • Experience in software configuration management using CVS, SVN Hands on experience with build tools like ANT, Maven and logging tools like Log4J
  • Expertise in testing with JUnit using tools like JMeter
  • Experience in Agile software development process, Test Driven Development and Test First Methodology
  • Strong knowledge of TCP/IP, HTTP, JavaScript, Web Services (e.g. HADOOP, MongoDB, Cassandra), big data, caching mechanism, Load Balancing, multi-data center architecture
  • Experience and knowledge with messaging application (SMS, MMS, RCS) and Android development is a plus.
  • Experience with cloud technologies. (Open Stacks, KVM, VMware, AWS, MS AZURE, Google Cloud)
  • Excellent written and verbal communication skills, Analytical, Problem Solving skills, strict attention to detail and ability to work independently, lead/work within a team environment.

TECHNICAL SKILLS

Hadoop/Big Data: HDFS, MapReduce, Pig, Hive, Impala, HBase, Cassandra Sqoop, Oozie, Zookeeper, Flume, Spark, Yarn

Java & J2EE Technologies: Core Java, Servlets, JSP, Hibernate, JDBC, JNDI and Java Beans

IDE’s: Eclipse, RAD

Programming languages: Java, JQuery, HTML, CSS, XML, JavaScript

DB Languages: SQL, PL/SQL

Databases: Oracle, MySQL

Operating Systems: Linux, Ubuntu, Windows, MS DOS

PROFESSIONAL EXPERIENCE

Confidential, Phoenix, AZ

Hadoop Development Engineer

Responsibilities:

  • Responsible for building scalable distributed data solutions using Hadoop.
  • Developed job processing scripts using Oozie workflow
  • Configured different topologies for Storm cluster and deployed them on regular basis.
  • Consumed the data from Kafka queue using Storm
  • Implemented ELK stack to collect and analyze the logs produced by the Storm cluster.
  • Spark streaming collects the data from Kafka in near real time and performs necessary transformations and aggregations on the fly to build the common learner data model and persists the data in Cassandra
  • Installed and configured Hive, Pig, Sqoop, Flume and Oozie on the Hadoop cluster.
  • Developed Simple to complex Map/reduce Jobs using Hive and Pig.
  • Optimized Map/Reduce Jobs to use HDFS efficiently by using various compression mechanisms.
  • Involved in Hadoop cluster task like commissioning & decommissioning Nodes without any effect to running jobs and data.
  • Involved in running Hadoop streaming jobs to process terabytes of text data.
  • Analyzed large data sets by running Hive queries and Pig scripts.
  • Helped the team to increase the Cluster size from 22 to 30 Nodes.
  • Job management using Fair scheduler.
  • Worked extensively with Sqoop for importing metadata from Oracle.
  • Involved in creating Hive tables, and loading and analyzing data using hive queries.
  • Designed, developed and did maintenance of data integration programs in a Hadoop and RDBMS environment with both traditional and non-traditional source systems as we as RDBMS and NoSQL data stores for data access and analysis. Experienced in running Hadoop streaming jobs to process terabytes of xml format data.
  • Load and transform large sets of structured, semi structured and unstructured data.
  • Assisted in exporting analyzed data to relational databases using Sqoop.
  • Responsible to manage data coming from different sources.
  • Wrote Hive Queries and UDF’s.
  • Developed Hive queries to process the data and generate the data cubes for visualizing.
  • Created Pig Latin scripts to sort, group, join and filter the enterprise wise data.
  • Implemented Partitioning, Dynamic Partitions, Buckets in HIVE.
  • Gained experience in managing and reviewing Hadoop log files.

Environment: Hadoop, MapReduce, Sqoop, HDFS, Hive, Pig, Oozie, Spark, Kafka, Cassandra, Elastic Search, Java, Oracle, MySQL, Ubuntu.

Confidential, Plano, TX

Hadoop Development Engineer

Responsibilities:

  • Installed and configuredHadoop. Developed multipleMapReducejobs in Java for data cleaning and preprocessing.
  • Installed and configuredPigforETLjobs.
  • Troubleshooting the cluster by reviewingHadoopLOGfiles.
  • Imported data usingSqoopfromTera datausing Tera data connector.
  • UsedOozieto orchestrate the workflow.
  • CreatingHive tablesand working on them for data analysis in order to meet the business requirements.
  • Got good experience withNoSQLdatabase.
  • Designed and implementedMapReduce-based large-scale parallel relation-learning system.
  • Installed and benchmarkedHadoop/HBaseclusters for internal use.
  • WrittenHBASEClient program in Java and web services.
  • Model, serialize, and manipulate data in multiple forms (xml).
  • Supported postproduction enhancements.
  • Experience with data model concepts-star schema dimensional modeling Relational design (ER).
  • Created tables, stored procedures inSQLfor data manipulation and retrieval, Database Modification using SQL, PL/SQL, Stored procedures, triggers, Views in Oracle, Created User Interface usingJSF.
  • Involved inintegration testingthe BusinessLogic layerandData Access layer.
  • IntegratedJSFwithJSPand usedJSF Custom Tag Librariesto display the value of variables defined in configuration files.
  • Used technologies likeJSP, JSTL, JavaScript and Tiles for Presentation tier
  • Involved inJUnit testingof the application usingJUnit framework

Environment: Hadoop, MapReduce, HDFS, Hive, Java, Cloud era, Pig, HBase, Linux, XML,MySQL Workbench, Java, Eclipse, Oracle, PL/SQL, SQL*PLUS.

Confidential, San Diego, CA

JAVA /J2EE Developer

Responsibilities:

  • Involved in Requirement Analysis, Design, Development and Testing of the risk workflow system.
  • Involved in the implementation of design using vital phases of the Software development life cycle (SDLC) that includes Development, Testing, Implementation and Maintenance Support.
  • Applied OOAD principle for the analysis and design of the system.
  • Implemented XML Schema as part of XQuery query language
  • Applied J2EE design patterns like Singleton, Business Delegate, Service Locator, Data Transfer Object (DTO), Data Access Objects (DAO) and Adapter during the development of components.
  • Used RAD for the Development, Testing and Debugging of the application.
  • Used WebSphere Application Server to deploy the build.
  • Developed front-end screens using Struts, JSP, HTML, AJAX, JQuery, JavaScript, JSON and CSS.
  • Used J2EE for the development of business layer services.
  • Developed POJO based programming model using spring framework
  • Used IOC (Inversion of Control) Pattern and Dependency Injection of Spring framework for wiring and managing business objects
  • SOAP has been used as a protocol to send request and response in the form of XML messages.
  • JDBC framework has been used to connect the application with the Database.
  • Used Eclipse for the Development, Testing and Debugging of the application.
  • Used SAX Parser to parse the xml files.
  • Log4j framework has been used for logging debug, info & error data
  • Used Hibernate framework for Entity Relational Mapping
  • Used Oracle 10g database for data persistence
  • SQL Developer was used as a database client
  • Extensively worked on Windows and UNIX operating systems
  • Used SecureCRT to transfer file from local system to Unix system
  • Performed Test Driven Development (TDD) using JUnit.
  • Used Ant script for build automation.
  • PVCS version control system has been used to check-in and checkout the developed artifacts. The version control system has been integrated with Eclipse IDE.
  • Used Rational Clearquest for defect logging and issue tracking.

Environment: J2EE, Struts, Spring, Hibernate, Web Services, Design Patterns, WebSphere, Ant, (Servlet, JSP), HTML, AJAX, JavaScript, CSS, jQuery, JSON,, SOAP, WSDL, XML, Eclipse, Oracle, WinSCP, Log4J, JUnit

Confidential, Salt Lake City, UT

JAVA Developer

Responsibilities:

  • Involved in various phases of Software Development Life Cycle (SDLC) of the application like Requirement gathering, Design, Analysis and Code development
  • Developed a prototype of the application and demonstrated to business users to verify the application functionality
  • Developed and implemented the MVC Architectural Pattern using Struts Framework including JSP, Servlets, Form Beans and Action classes
  • Implemented server side tasks using Servlets and XML
  • Helped developed page templates using Struts Tiles framework
  • Implemented Struts Validation Framework for Server side validation
  • Developed JSP's with Custom Tag Libraries for control of the business processes in the middle-tier and was involved in their integration
  • Implemented Struts Action classes using Struts controller component
  • Developed Web services (SOAP) through WSDL in Apache Axis to interact with other components
  • Integrated Spring DAO for data access using Hibernate used HQL and SQL for querying databases
  • Used parsers like SAX and XSD for parsing xml documents and used XML transformations using XSLT
  • Written stored procedures, triggers, and cursors using Oracle PL/SQL
  • Created and deployed web pages using HTML, JSP, JavaScript and CSS
  • Written JUnit Test cases for performing unit testing
  • Used Rational Clear Case as Version control

Environment: Java, JSP, JDBC, Spring Core, Struts, Hibernate, XML, Oracle, WebLogic, Apache Axis, MAVEN, SVN, JUnit, UML, Web services, SOAP, XSLT, Jira.

Confidential

Jr Java Developer

Responsibilities:

  • Technical responsibilities included high level architecture and rapid development
  • Design architecture following J2EE MVC framework
  • Developed interfaces using HTML,JSP pages and Struts -Presentation View
  • Developed and implemented Servlets running under JBoss
  • Used J2EE design patterns and Data Access Object (DAO) for the business tier and integration Tier layer of the project
  • Implemented CMP entity beans for persistence of business logic implementation
  • Development of database interaction code to JDBC API making extensive use of SQL Query Statements and advanced prepared statement
  • Inspection/Review of quality deliverables such as Design Documents
  • Wrote SQL Scripts,Stored procedures and SQL Loader to load reference data

Environment: J2EE (Java Servlets, JSP, Struts), MVC Framework, Apache Tomcat, JBoss, MySQL, HTML, JDBC, EJB

We'd love your feedback!