- Software developer with 8+ years of professional experience spanning analysis, design, development, integration, deployment, and maintenance of quality software applications using Java/J2EE and big data Hadoop technologies.
- Over 5 years of experience in data analysis and processing using the big data stack.
- Proficient in Java, Hadoop MapReduce, Pig, Hive, Oozie, Sqoop, Flume, HBase, Scala, Python, Kafka, Impala, NoSQL databases, and AWS.
- Strong exposure to big data technologies and the Hadoop ecosystem, with an in-depth understanding of MapReduce and the Hadoop infrastructure.
- Excellent knowledge of Hadoop architecture and ecosystem components under MRv1 and MRv2, including HDFS, JobTracker, TaskTracker, NameNode, DataNode, YARN, and the MapReduce programming paradigm.
- Good exposure to NoSQL databases: column-oriented HBase, Cassandra, and document-based MongoDB.
- Extensive experience writing custom MapReduce programs for data processing and UDFs for both Hive and Pig in Java.
- Strong experience analyzing large data sets by writing Pig scripts and Hive queries.
- Experience importing and exporting data between HDFS and relational databases using Sqoop.
- Experienced with job workflow scheduling and monitoring tools such as Oozie and NiFi.
- Experience with Apache Flume for collecting, aggregating, and moving large volumes of data from sources such as web servers and telnet sources.
- Hands-on experience with ZooKeeper.
- Experienced in implementing unified data platforms using Kafka producers/consumers and implementing pre-processing with Storm.
- Experience developing data pipelines using Kafka, Spark, and Hive to ingest, transform, and analyze data.
- Used Kafka for log aggregation: gathering physical log files from servers and placing them in a central location such as HDFS for processing.
- Excellent understanding of PySpark and its benefits in big data analytics.
- Worked with data in multiple file formats including ORC, text/CSV, Avro, and Parquet; developed Spark code using Scala and Spark SQL for faster data processing.
- Experience in data modeling, connecting to Cassandra from Spark, and saving summarized DataFrames to Cassandra.
- Hands-on experience with stream processing frameworks such as Storm and Spark Streaming.
- Experience scheduling, distributing, and monitoring jobs using Spark Core.
- Explored Spark to improve the performance and optimization of existing Hadoop algorithms using SparkContext, Spark SQL, DataFrames, pair RDDs, and Spark on YARN.
- Experienced with Spark Core, Spark RDDs, pair RDDs, and Spark deployment architectures.
- Developed applications using Scala, Spark SQL, and the MLlib libraries along with Kafka and other tools as required, then deployed them on the YARN cluster.
- Experience using big data with ETL tools (Talend).
- Experience using various Hadoop distributions (Cloudera, Hortonworks, MapR, etc.) to fully implement and leverage new Hadoop features. Worked on custom Pig loaders and storage classes to handle a variety of data formats such as JSON and compressed CSV.
- Good knowledge of Amazon AWS compute services such as EC2 web services, which provide fast and efficient processing of big data.
- Experience working with EMR for data visualization.
- Experience configuring Kerberos for the cluster.
- Experience visualizing streaming data using Apache NiFi.
- Experience using S3 for storage.
- Experienced in working with scripting technologies such as Python and UNIX shell scripts.
- Experience with source control repositories such as SVN, CVS, and Git.
- Adequate knowledge of and working experience in Agile and Waterfall methodologies.
- Experience writing database objects such as stored procedures, functions, triggers, PL/SQL packages, and cursors for Oracle, SQL Server, and MySQL databases.
- Expertise in the design and development of web and enterprise applications using technologies such as JSP, Servlets, Struts, Hibernate, Spring MVC, JDBC, Spring Boot, JMS, JSF, XML, AJAX, SOAP, and RESTful web services.
- Experience working with Apache Solr for indexing and querying.
- Developed and maintained web applications using the Tomcat web server and IBM WebSphere.
- Excellent problem-solving and analytical skills.
- Ability to quickly master new concepts and applications.
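The MapReduce programming paradigm referenced throughout the summary can be illustrated with a minimal, framework-free sketch in Python; this is a hypothetical word-count example for illustration only, not code from any of the projects listed here:

```python
from itertools import groupby
from operator import itemgetter

def mapper(line):
    """Map phase: emit a (word, 1) pair for every word in a line."""
    for word in line.lower().split():
        yield (word, 1)

def reducer(word, counts):
    """Reduce phase: sum all counts emitted for one key."""
    return (word, sum(counts))

def map_reduce(lines):
    """Simulate Hadoop's shuffle-and-sort between the map and reduce phases."""
    mapped = [pair for line in lines for pair in mapper(line)]
    mapped.sort(key=itemgetter(0))  # shuffle/sort: group identical keys together
    return dict(
        reducer(word, (count for _, count in group))
        for word, group in groupby(mapped, key=itemgetter(0))
    )

print(map_reduce(["big data big wins", "data wins"]))
```

In a real Hadoop job the mapper and reducer run on different nodes and the framework performs the shuffle; the `sort`/`groupby` pair above stands in for that step.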
Hadoop technologies: HDFS, MapReduce, Sqoop, Flume, Pig, Hive, Oozie, Impala, Apache NiFi, ZooKeeper
NoSQL Databases: MongoDB, HBase, Cassandra
Real time/Stream processing: Apache Spark
Distributed message broker: Apache Kafka
Monitoring and Reporting: Tableau, Zeppelin Notebook
Hadoop Distribution: Cloudera, Hortonworks, AWS (EMR)
Build Tools: Maven, SBT
Cloud Technologies: AWS Glacier, S3
Programming & Scripting: Java, C, SQL, Shell Scripting, Python
Java Technologies: Servlets, JavaBeans, JDBC, Spring, Hibernate, SOAP/Restful services
Databases: Oracle, MySQL, MS SQL Server, Teradata
Tools & Utilities: Eclipse, NetBeans, SVN, CVS, SOAP UI, MQ Explorer, RFHUtil, JMX Explorer, SSRS, Aqua Data Studio, XML Spy, ETL (Talend, Pentaho), IntelliJ (Scala)
Confidential - Lockport, IL
Sr. Hadoop/Spark Developer
Roles and Responsibilities:
- Involved in the complete SDLC implementation, specializing in writing custom Spark and Hive programs.
- Used Spark with YARN and tuned resource usage relative to MapReduce.
- Exported the analyzed data to relational databases using Sqoop.
- Experience developing customized UDFs in Java to extend Hive and Pig Latin functionality.
- Extensively used HiveQL queries to search for particular strings in Hive tables stored in HDFS.
- Extensively used Scala for implementing required Spark APIs.
- Involved in the pilot of a Hadoop cluster hosted on Amazon Web Services (AWS).
- Expert in creating and designing data ingestion pipelines using technologies such as Apache Spark and Kafka.
- Kafka collects data from various sources in real time and performs the necessary transformations and aggregations on the fly to build the common learner data model, then persists the data in S3 buckets.
- Architected ETL pipelines on the AWS cloud using Spark on EMR.
- Extensively created EC2 instances for configuring Spark.
- Used Kafka extensively in gathering and moving log data files from application to a central location in AWS S3.
- Implemented RDDs and DataFrames using Spark Core in Scala.
- Wrote Spark SQL for processing incoming structured data.
- Used NiFi to design workflows graphically; created DAGs with NiFi and used them for further debugging.
- Developed workflows to cleanse and transform raw data into useful information to load into HDFS and NOSQL database.
- NiFi is implemented using Java.
- Extensively worked with Scala/Spark SQL for data cleansing and generating DataFrames, transforming them into row DataFrames to populate the aggregate tables in Cassandra.
- Pulled data from Cassandra into S3 buckets.
- Experience in setting S3 life cycle rules.
- Pipelining incoming data from Kafka brokers to S3 directly and running Spark on it.
- Implemented test scripts to support test driven development and continuous integration.
- Created data pipelines for different events to ingest, aggregate, and load consumer response data from an AWS S3 bucket into Hive external tables at an HDFS location.
- Created S3 buckets and managed their policies; utilized S3 and AWS Glacier for storage and backup.
- Developed UNIX shell scripts for creating reports from Hive data.
- Experienced in loading and transforming large sets of structured, semi-structured, and unstructured data.
- Stored the final result tables in a SQL database and forwarded them to the BI team.
- Configured Kerberos for secure communication.
- Analyzed large amounts of data sets to determine optimal way to aggregate and report on it.
Environment: Hadoop, MapReduce, HDFS, Hue, Hive, Sqoop, Apache Kafka, Oozie, SQL, Flume, Spark, Cassandra, Scala, Java, AWS (EMR, S3), GitHub.
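The aggregation step in the pipeline above (rolling raw event rows up into per-key summaries before they land in Cassandra's aggregate tables) can be sketched in plain Python, independent of Spark. The field names (`user_id`, `amount`) are hypothetical stand-ins, since the actual schema is not stated:

```python
from collections import defaultdict

def summarize_events(events):
    """Roll raw event rows up into per-user aggregates, mirroring the
    groupBy/agg step a Spark DataFrame job would perform before the write."""
    totals = defaultdict(lambda: {"count": 0, "amount": 0.0})
    for event in events:
        agg = totals[event["user_id"]]   # hypothetical field names
        agg["count"] += 1
        agg["amount"] += event["amount"]
    return dict(totals)

rows = [
    {"user_id": "u1", "amount": 10.0},
    {"user_id": "u2", "amount": 5.0},
    {"user_id": "u1", "amount": 2.5},
]
print(summarize_events(rows))
```

In the actual job, each summary row would then be written to a Cassandra aggregate table keyed by the same grouping column.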
Confidential - Reston, MD
Sr. Hadoop/Spark developer
Roles and Responsibilities:
- Responsible for building scalable distributed data solutions using Hadoop components.
- Solid Understanding of Hadoop HDFS, Map-Reduce and other Eco-System Projects.
- Involved in converting Hive/SQL queries into Spark transformations using Spark RDDs, Python, and Scala.
- Knowledge of the architecture and functionality of NoSQL databases like HBase.
- Used S3 for data storage; responsible for handling huge amounts of data.
- Used EMR for data pre-analysis by creating EC2 instances.
- Used Kafka to obtain near-real-time data.
- Good experience writing data ingestion jobs with tools like Sqoop.
- Experience in designing and developing applications in Spark using Scala to compare the performance of Spark with Hive and SQL/Oracle.
- Implemented Spark using Scala, utilizing the DataFrame and Spark SQL APIs and pair RDDs for faster data processing; created RDDs, DataFrames, and Datasets.
- Performed batch processing using Spark implemented in Scala.
- Performed extensive data validation using Hive and wrote Hive UDFs.
- Involved in scheduling the Oozie workflow engine to run multiple Hive and Pig jobs.
- Used Cloudera for Hadoop deployment of some modules.
- Involved in creating Hive tables, loading them with data, and writing Hive queries that run internally as MapReduce; wrote extensive scripting (Python and shell) to provision and spin up virtualized Hadoop clusters.
- Developed and configured Kafka brokers to pipeline server log data into Spark Streaming.
- Installed and configured other open source software such as Pig, Hive, HBase, Flume, and Sqoop.
- Collected log data from web servers and integrated it into HDFS using Flume.
- Created external tables pointing to HBase to access tables with a huge number of columns.
- Involved in loading the created HFiles into HBase for faster access to a large customer base without taking a performance hit.
- Configured the Talend ETL tool for some data filtering.
- Processed the data in HBase using Apache Crunch pipelines, a MapReduce programming model that is efficient for processing Avro data formats.
- Involved in collecting, aggregating, and moving data from servers to HDFS using Apache Flume.
- Used Tableau for data visualization and generating reports.
- Configured Kerberos for the clusters.
- Used Apache SOLR for indexing in HDFS.
- Integration with RDBMS using Sqoop and JDBC Connectors.
- Used different file formats such as text files, sequence files, Avro, and CSV.
- Extensive experience writing UNIX shell scripts and automating ETL processes with UNIX shell scripting.
Environment: UNIX, Linux, Java, Apache HDFS, MapReduce, Spark, Pig, Hive, HBase, Flume, Sqoop, NoSQL, AWS (S3 buckets), EMR cluster, SOLR.
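Converting a Hive/SQL query into Spark transformations, as in the role above, essentially maps the WHERE clause to a filter and the SELECT projection to a map. A framework-free Python sketch of that translation, using a hypothetical employee schema rather than any actual project data:

```python
# Hypothetical SQL being translated:
#   SELECT name, salary * 1.1 AS salary FROM employees WHERE dept = 'eng'
employees = [
    {"name": "ann", "dept": "eng", "salary": 100.0},
    {"name": "bob", "dept": "ops", "salary": 90.0},
    {"name": "cho", "dept": "eng", "salary": 80.0},
]

# WHERE clause -> filter(); SELECT projection -> map()
# (in Spark this would be rdd.filter(...).map(...) or df.where(...).select(...))
result = [
    {"name": e["name"], "salary": e["salary"] * 1.1}
    for e in employees
    if e["dept"] == "eng"
]
print(result)
```

The same shape carries over directly to RDD `filter`/`map` chains or DataFrame `where`/`select` calls; only the execution engine changes.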
Confidential - West Des Moines, IA
Roles and Responsibilities:
- Developed parser and loader MapReduce applications to retrieve data from HDFS and store it in HBase and Hive.
- Imported data from MySQL into HDFS using Sqoop.
- Imported unstructured data into HDFS using Flume. Used Oozie to orchestrate the MapReduce jobs that extract the data in a timely manner.
- Wrote MapReduce Java programs to analyze log data for large-scale data sets.
- Involved in using the HBase Java API in a Java application.
- Automated all the jobs for extracting data from different data sources such as MySQL and pushing the result set data to the Hadoop Distributed File System.
- Customized the parser loader application for data migration to HBase.
- Developed Pig Latin scripts to extract the data from the output files to load into HDFS.
- Developed custom UDFs and implemented Pig scripts.
- Implemented MapReduce jobs using the Java API as well as Pig Latin and HiveQL.
- Participated in the setup and deployment of the Hadoop cluster.
- Hands-on design and development of an application using Hive (UDFs).
- Responsible for writing Hive queries to analyze data in the Hive warehouse using the Hive Query Language.
- Imported and exported data from MySQL/Oracle to Hive using Sqoop.
- Used Pig and Hive on top of HCatalog tables to analyze the data and created the schema for the HBase table in Hive. Configured an HA cluster for both manual and automatic failover.
- Designed and built many applications to deal with vast amounts of data flowing through multiple Hadoop clusters, using Pig Latin and Java-based MapReduce.
- Performed performance tuning and troubleshooting of MapReduce jobs by analyzing and reviewing log files.
- Created a SOLR schema from the indexer settings.
- Experience writing SOLR queries for various search documents.
- Responsible for defining the data flow within the Hadoop ecosystem and directing the team in developing Hive queries for the analysts.
- Exported the analyzed data to the relational databases using Sqoop for visualization and to generate reports for the BI team.
Environment: Hortonworks, Hadoop, HDFS, Oozie, Pig, Hive, MapReduce, Sqoop, SOLR, HBase and Linux
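The log-analysis MapReduce jobs above reduce to parsing each log line and counting by a key such as the status code. A minimal Python sketch of that parsing logic; the log-line format here is hypothetical, since the actual format is not stated:

```python
import re
from collections import Counter

# Hypothetical access-log line format: "GET /index.html 200"
LOG_PATTERN = re.compile(r"^(?P<method>\S+) (?P<path>\S+) (?P<status>\d{3})$")

def status_counts(lines):
    """Count requests per HTTP status code, skipping malformed lines
    (the same skip-bad-records behavior a production mapper would need)."""
    counts = Counter()
    for line in lines:
        match = LOG_PATTERN.match(line.strip())
        if match:
            counts[match.group("status")] += 1
    return counts

logs = ["GET /index.html 200", "GET /missing 404", "POST /api 200", "garbage"]
print(status_counts(logs))
```

At scale, the same parse-and-count would run as the map phase, with the framework summing the per-status counts in the reduce phase.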
Confidential - Springfield, IL
Big Data/Hadoop Developer
Roles and Responsibilities:
- Involved in creating Hive tables, loading them with data, and analyzing the data using Hive queries.
- Developed simple to complex MapReduce jobs using Hive and Pig.
- Used the Cloudera QuickStart VM for deploying the cluster.
- Developed workflow in Oozie to automate the tasks of loading the data into HDFS and pre-processing with Pig.
- Mentored the analyst and test teams in writing Hive queries.
- Analyzed the data by performing Hive queries and running Pig scripts to validate data.
- Generated the datasets and loaded them into the Hadoop ecosystem.
- Assisted in exporting analyzed data to relational databases using Sqoop.
- Created and maintained technical documentation for launching Hadoop clusters and for executing Hive queries, Pig scripts, and Sqoop jobs.
- Developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
- Worked with Linux systems and RDBMS databases on a regular basis in order to ingest data using Sqoop.
- Used Sqoop, Pig, and Hive as ETL tools for pulling and transforming data.
- Experience developing customized UDFs in Java to extend Hive and Pig Latin functionality.
- Worked with MongoDB for developing and implementing programs in the Hadoop environment.
- Experienced in running Hadoop streaming jobs to process terabytes of XML-format data.
- Load and transform large sets of structured, semi-structured and unstructured data.
- Developed and maintained complex outbound notification applications that run on custom architectures, using diverse technologies including Core Java, J2EE, SOAP, XML, JMS, and JBoss.
Environment: Cloudera, Hadoop, HDFS, Spark, Oozie, Pig, Hive, MapReduce, Sqoop, MongoDB, Linux, Core Java, SOAP, XML, JMS, JBoss.
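Hadoop streaming jobs like the ones mentioned above are typically small Python scripts that read lines and emit tab-separated key/value pairs, with Hadoop handling the shuffle between them. A hedged sketch of such a mapper/reducer pair, simplified to functions over line iterables rather than stdin/stdout so it is self-contained:

```python
def mapper(lines):
    """Streaming mapper: emit 'word<TAB>1' for every word.
    A real streaming mapper would read sys.stdin and print each pair."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"

def reducer(sorted_lines):
    """Streaming reducer: sum counts for consecutive identical keys.
    Hadoop delivers reducer input sorted by key, which this relies on."""
    current, total = None, 0
    for line in sorted_lines:
        key, value = line.rsplit("\t", 1)
        if key != current:
            if current is not None:
                yield f"{current}\t{total}"
            current, total = key, 0
        total += int(value)
    if current is not None:
        yield f"{current}\t{total}"

# sorted() stands in for Hadoop's shuffle-and-sort between the two phases:
print(list(reducer(sorted(mapper(["to be or not to be"])))))
```

On a cluster, the two functions would live in separate scripts passed via `-mapper` and `-reducer` to the hadoop-streaming jar, and XML records would be parsed inside the mapper.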
Roles and Responsibilities:
- Developed user interfaces using JSP, Struts custom tags, and HTML.
- Implemented the Model View Controller (MVC) architecture using the Struts framework.
- Used the Struts framework for dependency injection and integrated it with Hibernate.
- Used XML parser APIs such as JAXB in the web service's request/response data for marshalling and unmarshalling.
- Developed helper classes to validate data against a set of business rules.
- Implemented the data persistence functionality of the application by using JPA to persist Java objects with the Oracle database.
- Implemented Messaging using JMS and Message Driven Beans.
- Used SOAP-based XML web services to obtain the credit-based insurance score from the information contained in the credit report obtained from an authentic credit bureau.
- Extensively used Eclipse for writing code.
- Used Log4j for logging and debugging, and used JUnit extensively for testing.
- Used CVS for version control.
- Used WebLogic Application Server for deploying various components of application.
Environment: Java, Struts, Hibernate, JSP, SOAP, CVS, WebLogic, Oracle, Maven, Log4j, Sql Developer, Jira, Eclipse.
Roles and Responsibilities:
- Analyzed, Designed and developed the system to meet the requirements of business users.
- Participated in the design review of the system to perform Object Analysis and provide best possible solutions for the application
- Implemented the presentation tier using HTML, JSP, Servlets, and AJAX frameworks.
- Used AJAX to implement part of the functionality for the Customer Registration and View Customer Information modules.
- Implemented the Struts MVC framework for developing a J2EE-based web application.
- Used JDBC to connect to and access the database.
- Used IBM WebSphere to deploy J2EE application components.
- The database tier used SQL Server.
- Developed JUnit test cases.