Scala Developer Resume
PA
SUMMARY:
- Professional IT experience, including Big Data and Hadoop ecosystem technologies, in the Banking, Retail, Insurance and Communication sectors.
- Experience in developing MapReduce programs using Apache Hadoop to analyze big data as per requirements.
- Worked on major Hadoop ecosystem components: Pig, Hive, HBase and Cloudera Manager.
- Experience in developing Pig Latin scripts and using Hive Query Language.
- In-depth understanding of data structures and algorithms.
- Experience working on NoSQL databases including HBase, Cassandra and MongoDB.
- Experience using Sqoop to import data into HDFS from RDBMS and vice-versa.
- Hands-on experience with major components of the Hadoop ecosystem such as MapReduce, HDFS, Hive, Pig, HBase, ZooKeeper, Sqoop, Oozie, Flume and Avro.
- Performed data analysis using Hive and Pig.
- Experience loading log data into HDFS using Flume.
- Experience with Agile methodology and iterative development.
- Functional knowledge of Content Management and Enterprise Search (FAST ESP, Apache Solr, Oracle Endeca).
- Expertise in managing, monitoring and administering Cloudera Hadoop.
- Good knowledge of Oozie, a workflow scheduler system for managing Apache Hadoop jobs.
- Proficiency in enterprise search, with experience spanning Solr 3.4 to 4.10.2, Oracle Endeca and FAST ESP.
- Experience upgrading Cloudera Hadoop versions on production-size clusters.
- Good experience in optimizing MapReduce algorithms using Mappers, Reducers, Combiners and Partitioners to deliver the best results for large datasets.
- Hands-on experience in installing, configuring and using ecosystem components such as Hadoop MapReduce, HDFS, HBase, Avro, ZooKeeper, Oozie, Hive, HDP, Cassandra, Sqoop, Pig and Flume.
- Worked in multi-cluster environments and set up the Cloudera Hadoop ecosystem.
- Experience working with the Cloudera and Hortonworks distributions of Hadoop.
- Extensive knowledge of NoSQL databases such as HBase.
- Good experience working with cloud environments such as Amazon Web Services (AWS) EC2 and S3.
- Hands-on experience working with the Amazon EMR framework and transferring data to an EC2 server.
- Experience in performance tuning the Hadoop cluster by gathering and analyzing data on the existing infrastructure.
- Experience in automating Hadoop installation, configuration and cluster maintenance using tools like Puppet.
- Extensive experience in creating real-time data streaming solutions using Apache Spark/Spark Streaming and Kafka.
- Good knowledge of the Spark ecosystem and Spark architecture.
- An excellent team player and self-starter with strong interpersonal skills and a proven ability to complete tasks ahead of deadlines.
- Working knowledge of RDBMS such as MySQL, Oracle 8i/9i/10g, Microsoft SQL Server.
- Understanding of Data warehouse and ETL tools.
TECHNICAL SKILLS:
Big Data: Hadoop, HDFS, Pig, Hive, HBase, Sqoop, Cloudera, Oozie, ZooKeeper, Cassandra, MongoDB
Language: Java, SQL, PL/SQL, HTML, XML, JavaScript, C#
Java/J2EE: Java, J2EE, Servlets, Portlets (JSR 168, JSR 286), JSP, JSF, Java Beans, JDBC, EJB
Open source frameworks and web development: Struts, Spring, Hibernate, JavaScript, AJAX, Dojo, jQuery, Ehcache, Log4j, Ant, JBoss, Web services, SOA, SOAP, REST, WSDL and UDDI
Portals/Application servers: WebLogic, WebSphere Application Server, WebSphere Portal Server, JBoss
Operating system: Windows, AIX, UNIX, Linux
Configuration Mgmt: CMVC, ClearCase, ClearQuest, PVCS, CVS
Development Tools: Eclipse, Visual Studio, NetBeans, Rational Application Developer, WSAD, JUnit
Databases: NoSQL, HBase, Cassandra, MongoDB, Oracle, SQL Server, DB2, MySQL, Toad, SQL
Software Engineering: UML 2.0, Rational Rose, Design Patterns (MVC, DAO, etc.)
PROFESSIONAL EXPERIENCE:
Confidential, PA
Scala Developer
Responsibilities:
- Involved in complete SDLC - Requirement Analysis, Development, System Integration Testing and Performance Testing.
- Involved in the architecture and design of a distributed time-series database platform using NoSQL technologies such as Hadoop/HBase and ZooKeeper.
- Responsible for configuring the deployment environment for the application using Jetty server and WebLogic 10, with a Postgres database at the back end.
- Involved in the implementation of Spring MVC Pattern and developed persistence layer using Hibernate framework.
- Utilized Java, JSP, Solr and Spring Framework.
- Developed Scala scripts using both DataFrames/SQL/Datasets and RDD/MapReduce in Spark for data aggregation, queries and writing data back into the OLTP system through Sqoop.
- Followed Scrum methodology for the application development.
- Involved in developing Hive DDLs to create, alter and drop Hive tables, and worked with Storm and Kafka.
- Developed various helper classes as needed, using Core Java multi-threaded programming and Collection classes.
- Implemented unique document ID generation through Solr configuration.
- Experienced in transferring data from different data sources into HDFS using Kafka producers, consumers and brokers.
- Used Pig for transformations, event joins and pre-aggregations, together with the Elephant Bird API, before loading JSON files onto HDFS.
- Involved in Solr installation, configuration, indexing, etc.
- Imported and exported data into HDFS and Hive using Sqoop.
- POC Implementation on Impala and Spark.
- Good understanding of partitioning and bucketing concepts in Hive; designed both managed and external tables in Hive to optimize performance.
- Worked with application teams to install operating system and Hadoop updates, patches and version upgrades as required.
- Exported the analysed data to the relational databases using Sqoop for visualization and to generate reports for the BI team.
- Supported setting up the QA environment and updating configurations for implementing scripts with Pig and Sqoop.
- Developed Spark scripts using Scala shell commands as per requirements.
- Used Akka as a framework to create reactive, distributed, parallel and resilient concurrent applications in Scala.
Confidential, ND
Scala Developer
Responsibilities:
- Worked on Performance Enhancement.
- Worked with Distributed n-tier architecture and Client/Server architecture.
- Worked in implementation & maintenance of applications in Web-based environment.
- Proficient in using OOP concepts (polymorphism, inheritance, encapsulation, etc.).
- Used design patterns such as MVC (Model-View-Controller), Singleton and Factory.
- Involved in Solr installation, configuration, indexing, etc.
- Set up Pig, Hive and HBase on multiple nodes and developed using Pig, Hive, HBase and MapReduce.
- Developed a Solr update service for real-time indexing, including real-time add, update and delete.
- Implemented data access using Hibernate persistence framework.
- Developed the configuration files and classes specific to Spring and Hibernate.
- Developed a middle-layer search service to retrieve results from Solr.
- Expertise in server-side and J2EE technologies including Java, J2SE, JSP, Servlets, XML, Hibernate, Struts, Struts2, JDBC, and JavaScript development.
- Excellent working experience in J2EE Architecture, MVC Architecture, Design Patterns.
- Designed the GUI using Model-View architecture (Struts framework).
- Integrated Spring DAO for data access using Hibernate.
- Experience in installing and monitoring standalone multi-node clusters of Kafka and Storm.
- Involved in the Development of Spring Framework Controllers.
- Performed unit testing for all the components using JUnit.
- Designed and developed the XSD for WSDL.
- Developed the user interface using JSP, JSP tag libraries (JSTL), HTML, CSS and JavaScript to simplify the complexities of the application.
- Involved in developing the Pig scripts.
- Responsible for implementing Cloudera Recommended Improvements across all clusters.
- Used Slick to query and store data in the database in idiomatic Scala fashion, using the Scala collections framework.
- Capacity planning for new clusters.
- Coordinated with UNIX teams on kernel patching and vulnerability remediation on Hadoop servers.
- Involved in developing the Hive Reports.
- Developed Sqoop scripts to enable interaction between Pig and the MySQL database.
Confidential, TX
Hadoop Developer
Responsibilities:
- Moved log data periodically into HDFS using Flume; built multi-hop flows, fan-out flows and failover mechanisms.
- Wrote MapReduce jobs to read data files and scrub the data.
- Transferred data between MySQL and HDFS using Sqoop with connectors.
- Created and populated Hive tables and wrote Hive queries for data analysis to meet business requirements.
- Developed Pig Latin scripts for data processing.
- Migrated data from the MySQL database to HBase; ran MapReduce jobs to access HBase data from the application using Java client APIs.
- Automated jobs using Oozie.
- Installed and configured Hive, Pig, Sqoop, Flume and Oozie on the Hadoop cluster.
- Used SVN for version control.
- Actively participated in software development lifecycle (design, implement, deploy, test), including design and code reviews, test development, test automation.
- Involved in solution-driven agile development methodology and actively participated in daily scrum meetings.
- Coded, tested and deployed the web application using RAD 7.0 and WebSphere Application Server 6.0.
- Used JavaScript for validating client-side data. Wrote unit tests for the implemented bean code using JUnit.
- Extensively worked on UNIX Environment.
- Exchanged data in XML format to support interoperability with other software applications.
- Monitored the Hadoop cluster using tools like Nagios, Ganglia and Cloudera Manager.
- Wrote automation scripts to monitor HDFS and HBase through cron jobs.
- Developed a high-performance cache, making the site more stable and improving its performance.
- Created a complete processing engine based on Cloudera's distribution, tuned for performance.
