Big Data and ETL Developer Resume Minnetonka, MN - Hire IT People

SUMMARY:

8+ years of overall experience in building and developing Hadoop MapReduce solutions.
Strong experience with Big Data and Hadoop technologies with excellent knowledge of Hadoop ecosystem: Hive, Spark, Sqoop, Impala, Pig, HBase, Kafka, Flume, Storm, Zookeeper, Oozie.
Experienced in transferring data from different data sources into HDFS systems using Kafka producers, consumers and Kafka brokers.
Experience in code migration, data migration and Extraction/Transformation/Loading with Teradata and HDFS files on UNIX and Windows.
Expert in creating and designing data ingest pipelines using technologies such as Apache Storm and Kafka.
Good knowledge of creating data processing pipelines using Kafka and Spark Streaming.
Experienced in data ingestion using Sqoop, Storm, Kafka and Apache Flume.
Experience with Apache Flume for efficiently collecting, aggregating, and moving large amounts of log data.
Knowledge of streaming data to HDFS using Flume.
Hands-on experience with Flume for real-time log processing for attribution reports.
Experience in importing and exporting data using Sqoop from HDFS to relational database systems/mainframe and vice versa.
Experience with the Apache Spark ecosystem using Spark SQL, DataFrames and RDDs, and knowledge of Spark MLlib.
Experience in ingestion, storage, querying, processing and analysis of Big Data, with hands-on experience in Apache Spark, Spark SQL and Spark Streaming.
Worked with the Spark engine to process large-scale data; experienced in creating Spark RDDs.
Knowledge of developing Spark Streaming jobs using RDDs and leveraging Spark-Shell.
Expertise in the Talend Big Data tool, with involvement in the architectural design and development of ingestion and extraction jobs for Big Data and Spark Streaming.
Experience with RDD architecture, implementing Spark operations on RDDs, and optimizing transformations and actions in Spark.
Hands-on experience with Apache Spark jobs using Scala in a test environment for faster data processing; used Spark SQL for querying.
Knowledge of Spark code and Spark SQL for testing and processing data using Scala.
Knowledge of Amazon Web Services (AWS) cloud services.
Good at analyzing data using HiveQL, Pig Latin and custom MapReduce programs in Java.
Expertise in MapReduce programs in Hive and Pig to validate and cleanse data in HDFS, obtained from heterogeneous data sources, to make it suitable for analysis.
Good at writing Hive and Impala queries to load and process data in the Hadoop Distributed File System (HDFS).
Good understanding of NoSQL databases and hands-on work experience in writing applications on NoSQL databases like Cassandra and MongoDB.
Hands-on experience with a data lake cluster with Hortonworks Ambari on AWS using EC2 and S3.
Strong knowledge of MongoDB concepts, including CRUD operations, the aggregation framework and document schema design.
Experience in maintenance/bug-fixing of web-based applications on various platforms.
Experience in managing the life cycle of MongoDB, including sizing, automation, monitoring and tuning.
Implemented Proofs of Concept on the Hadoop stack and different big data analytic tools, and migration from different databases (i.e. Teradata, Oracle, MySQL) to Hadoop.
Experience in storing and processing unstructured data using NoSQL databases like HBase.
Good at developing web services using REST and the HBase Native API Client to query data from HBase.
Knowledge of ETL methods for data extraction, transformation and loading in corporate-wide ETL Solutions and Data Warehouse tools for reporting and data analysis.
Experienced in job workflow scheduling and monitoring tools like Oozie and Zookeeper.
Experience with Oozie Workflow Engine in running workflow jobs with actions that run Java MapReduce and Pig jobs.
Good experience with both JobTracker (MapReduce 1) and YARN (MapReduce 2).
Experience in managing and reviewing Hadoop log files generated through YARN.
Experience in using Apache Solr for search applications.
Experienced in Java, Spring Boot, Apache Tomcat, Maven, Gradle, Hibernate and open source frameworks/software.
Preparation of Dashboards using Tableau.
Experience with all stages of the SDLC and the Agile development model, from requirements gathering to deployment and production support.
Proficient in test-driven development (TDD) and Agile to produce high-quality deliverables.
Hands-on experience with Agile (Scrum) and Waterfall models, along with automation and enterprise tools like Jenkins, Chef, JIRA and Confluence to develop projects, and Git for version control.

TECHNICAL SKILLS:

Big Data: Hadoop HDFS, MapReduce, HIVE, PIG, HBase, Zookeeper, Sqoop, Oozie, Cassandra, Spark, Scala, Storm, Flume, Kafka and Avro.

Web Technologies: Core Java, J2EE, Servlets, JSP, JDBC, JBOSS, JSF, XML, AJAX, SOAP, WSDL.

Methodologies: Agile, SDLC, Waterfall model, UML, Design Patterns, Scrum.

Frameworks: ASP.NET, Java EE, MVC, Struts 2/1, Hibernate 3, Spring 3/2.5/2.

Programming Languages: Core Java, C, C++, SQL, Python, Scala, XML, Unix Shell scripting, HTML, CSS, JavaScript.

Databases: Oracle 11g/10g, IBM DB2, MongoDB, Microsoft SQL Server, MySQL, MS-Access.

Apache Tomcat, JSON-RPC, WebLogic, WebSphere.

NOSQL: Cassandra, MongoDB, HBase.

Monitoring, Reporting tools: Ganglia, Nagios, Custom Shell Scripts.

PROFESSIONAL EXPERIENCE:

Confidential, Minnetonka, MN

Big Data and ETL Developer