Scala Developer Resume PA - Hire IT People

SUMMARY:

Pprofessional IT experience which includes experience in Bigdata, Hadoop ecosystem related technologies in Banking, Retail, Insurance and Communication sectors.
Experience in developing Map Reduce Programs using ApacheHadoop for analyzing the big data as per the requirement.
Worked on majorHadoop ecosystems PIG, HIVE, HBASE and Cloudera Manager.
Experience in developing PIG Latin Scripts and using Hive Query Language.
In - depth understanding of Data Structure and Algorithms
Experience working on NoSQL databases including Hbase, Cassandra and MongoDB.
Experience using Sqoop to import data into HDFS from RDBMS and vice-versa.
Hands on experience on major components in Hadoop Ecosystem like Hadoop Map Reduce, HDFS, HIVE, PIG, HBase, Zookeeper, Sqoop, Oozie, Flume and Avro.
Performed data analysis using Hive and Pig.
Loading log data into HDFS using Flume.
Agile Methodology and Iterative development.
Functional knowledge in Content Management and Enterprise Search (Fast ESP, ApacheSolr, Oracle Endeca).
Expertise in Managing, Monitoring and Administration of ClouderaHadoop.
Good knowledge on Oozie, a workflow scheduler system to manage apache Hadoop jobs.
Proficiency in enterprise search with span experience inSolr3.4 to 4.10.2, Oracle Endeca, FAST ESP.
Experience on Upgrade of ClouderaHadoopversions in production size clusters.
Good involvement in upgrading MapReduce calculations utilizing Mappers, Reducers, Combiners and Practitioners to convey best results for the extensive dataset
Hands on experience in installing, configuring and using ecosystem components like Hadoop Map Reduce, HDFS, Hbase, AVRO, Zoo Keeper, Oozie, Hive, HDP, Cassandra, Sqoop, PIG, Flume.
Worked on Multi Clustered environment and setting up Cloudera Hadoop echo System.
Experience working with Cloudera & Horton works Distribution of Hadoop.
Extensive knowledge of NoSQL databases such as Hbase.
Good experience in working with cloud environment like Amazon Web Services () EC2 and S3.
Hands on experience on working with Amazon EMR framework transferring data to EC2 server.
Experience in performance tuning theHadoopcluster by gathering and analyzing the existing infrastructure.
Experience in automating theHadoopInstallation, configuration and maintaining the cluster by using the tools like puppet.
Profound experience in creating real time data streaming solutions using Apache Spark/Spark Streaming, Kafka.
Good knowledge on Spark Ecosystem and Spark Architecture.
An astounding cooperative person and self-starter with great relational abilities and demonstrated capacities to complete undertakings before target due dates.
Working knowledge of RDBMS such as MySQL, Oracle 8i/9i/10g, Microsoft SQL Server.
Understanding of Data warehouse and ETL tools.

TECHNICAL SKILLS:

Big Data: Hadoop, HDFS, Pig Hive Hbase, Sqoop, Cloudera, Oozie, Zookeeper, Cassandra, MongoDB

Language: Java, SQL, PL/SQL, HTML, XML, JavaScript, C#

Java/J2EE: Java, J2EE, Servlets, Portlets (JSR 168, JSR 286), JSP, JSF, Java Beans, JDBC, EJB

Open source framework and web development: Struts, Spring, Hibernate, JavaScript, AJAX, Dojo, fJQuery, Ehcache, Log4j, Ant, JBoss, Web services, SOA, SOAP, REST, WSDL and UDDI

Portals/Application servers: Weblogic, WebSphere Application server, WebSphere Portal server, JBOSS

Operating system: Windows, AIX, UNIX, Linux

Configuration Mgmt: CMVC, Clearcase, Clearquest, PVCS, CVS

Development Tools: Eclipse, Visual Studio, Net Beans, Rational Application Developer, WSAD, JUnit

Databases: NoSQL, Hbase, Cassandra, MongoDB, Oracle, SQL Server, DB2, MySQL, Toad, SQL

Software Engineering: UML 2.0, Rational Rose, Design Patterns (MVC, DAO etc)

PROFESSIONAL EXPERIENCE:

Confidential, PA

Scala Developer

Responsibilities:

Involved in complete SDLC - Requirement Analysis, Development, System Integration Testing and Performance Testing.
Involved in architecture and design of distributed time-series database platform using NoSQL technologies like Hadoop/HBase, Zookeeper.
Responsible for configuring deployment environment to handle the application using Jetty server and Web Logic 10 and Postgres database Confidential the back-end.
Involved in the implementation of Spring MVC Pattern and developed persistence layer using Hibernate framework.
Utilized Java, JSP,Solrand Spring Framework.
DevelopedScalascripts using both Data frames/SQL/Data sets and RDD/MapReduce in Spark for Data Aggregation, queries and writing data back into OLTP system through Sqoop
Followed Scrum methodology for the application development.
Involved in developing Hive DDLs to create, alter and drop Hive tables and storm, &Kafka
Developed various helper classes needed following Core Java multi-threaded programming and Collection classes.
Document unique id generation throughSolrconfiguration.
Experienced in transferring data from different data sources into HDFS systems usingKafka producers, consumers andKafkabrokers.
Further used pig to do transformations, event joins, elephant bird API and pre -aggregations performed before loading JSON files format onto HDFS.
Involved inSolrinstallation, configuration, indexing etc
Performing importing and exporting data into HDFS and Hive using Sqoop.
POC Implementation on Impala and Spark.
Good understanding of Partitions, Bucketing concepts in Hive and designed both Managed and External tables in Hive to optimize performance.
Worked with application teams to install operating system,Hadoopupdates, patches, version upgrades as required.
Exported the analysed data to the relational databases using Sqoop for visualization and to generate reports for the BI team.
Supported in settling up QA environment and updating configuration for implementing scripts with Pig and Scoop.
Developed Spark scripts by usingScalashell commands as per the requirement.
Used Akka as a framework to create reactive, distributed, parallel and resilient concurrent applications inScala.

Confidential, ND

Scala Developer

Responsibilities:

Worked on Performance Enhancement.
Worked with Distributed n-tier architecture and Client/Server architecture.
Worked in implementation & maintenance of applications in Web-based environment.
Proficient in using OOPs Concepts (Polymorphism, Inheritance, Encapsulation) etc.
Used Design Patterns like MVC (Model-View-Controller) and Singleton, Factory etc.
Involved inSolrinstallation, configuration, indexing etc.
Worked on setting up pig, Hive and Hbase on multiple nodes and developed using Pig, Hive and Hbase, MapReduce.
DevelopedSolrupdate service for real-time indexing which includes real time add, update and delete.
Implemented data access using Hibernate persistence framework.
Developed the configuration files and the class’s specific to the spring and hibernate.
Developed Middle layer search service to retrieve results fromSolr.
Expertise in server-side and J2EE technologies including Java, J2SE, JSP, Servlets, XML, Hibernate, Struts, Struts2, JDBC, and JavaScript development.
Excellent working experience in J2EE Architecture, MVC Architecture, Design Patterns.
Design of GUI using Model View Architecture (STRUTS Frame Work).
Integrated Spring DAO for data access using Hibernate.
Experience in Installing and monitoring standalone multi-node Clusters ofKafkaand Storm.
Involved in the Development of Spring Framework Controllers.
Performed unit testing for all the components using JUnit.
Designed and developed the XSD for WSDL
Developed user interface using JSP, JSP Tag libraries JSTL, HTML, CSS, JavaScript to simplify the complexities of the application
Involved in developing the Pig scripts.
Responsible for implementing Cloudera Recommended Improvements across all clusters.
Used slick to query and storing in database in aScalafashion using the powerfulScalacollection framework.
Capacity planning for new clusters.
Coordinate with UNIX teams with kernel patching and Vulnerability remediation onHadoop servers.
Involved in developing the Hive Reports.
Developed the scoop scripts in order to make the interaction between Pig and MySQL Database.

Confidential, TX

Hadoop Developer

Responsibilities:

Moving log data periodically into HDFS using Flume. Building multi-hop flows, fan-out flows, and failover mechanism.
Wrote MapReduce jobs to read data files and scrub the data.
Transferring data between MySQL and HDFS using Sqoop with connectors.
Creating and populating Hive tables and writing Hive queries for data analysis to meet the business requirements.
Developed Pig Latin scripts for data processing.
Migrating data from MySQL database to HBase. Running MapReduce jobs to access HBase data from application using Java Client APIs.
Automating the jobs using Oozie.
Installed and configured Hive, Pig, Sqoop, Flume and Oozie on the Hadoop cluster.
Used SVN for version control.
Actively participated in software development lifecycle (design, implement, deploy, test), including design and code reviews, test development, test automation.
Involved in solution-driven agile development methodology and actively participated in daily scrum meetings.
Coding, testing and deploying the web application using RAD 7.0 and Websphere Application Server 6.0.
Used JavaScript's for validating client side data. Wrote unit tests for the implemented bean code using JUnit.
Extensively worked on UNIX Environment.
Data is exchanged in XML format, which helps in interoperability with other software applications.
MonitoringHadoopcluster using tools like Nagios, Ganglia and Cloudera Manager.
Automation script to monitor HDFS and HBase through Cron jobs.
Develop high-performance cache, making the site stable and improving its performance
Create a complete processing engine, based on Cloudera's distribution, enhanced to performance

We provide IT Staff Augmentation Services!

Scala Developer Resume

PA

We'd love your feedback!

Resume Categories

Client Services

Job Seekers

Visa Sponsorship