We provide IT Staff Augmentation Services!

Big Data Developer, Consultant Resume

Middletown, NJ

SUMMARY:

  • Outstanding Java/Scala/Python Developer and talented Software Engineer with proven expertise in Analysis, Design, Development, Testing and Maintenance of enterprise web applications using Java/J2EEand big data technologies (around 6+ years IT, 2+ in big data)
  • Deep understanding/knowledge of Hadoop Architecture and major components such as HDFS, YARN architecture and good understanding of scalability, workload management, schedulers, and distributed platform architectures
  • Technical expertise in Hadoop Map Reduce, Amazon EMR, HDFS, Hive, Cloudera Manager,Google Cloud Platform, Presto, Oracle SQLand also Linux/UNIX Shell Scripting.
  • Experience in developing Spark applications using Scala/Python for Spark Streaming and SparkSQL
  • Extensive experience in importing and exporting data using Sqoop from HDFS to Relational Database Management Systems and vice versa.
  • Experience on Lucene and openNLPfor nature language processing
  • Hands on experience with MLib, machine learning and underlying algorithms
  • Familiar withKubernetesand Dockers
  • Strong understanding of core Java, data structure, algorithm design, Object - Oriented Design (OOD) and Object-Oriented Programming (OOP)
  • Hands on experience in HTML, CSS, JSP, Bootstrap
  • Experience on Apache Tomcat and Java Servlets
  • Experienced in Tableau Desktop for data visualization and analysis
  • Experience on Java unit tests (JUnit) and integration tests
  • Experience with source code management tools such as Git orSVN
  • Experience with project tracking software such as Jira/Rally
  • Experience in Agile Development environments

WORK EXPERIENCE:

Confidential

Big Data Developer, Consultant, Middletown, NJ

Responsibilities:

  • Developed and optimized Spark Scala programs/HiveQL to perform transportation data enrichment, transformation and wrangling
  • Implemented and supported in big data ETL procedure using Scala/Spark Streaming, Python and Big Data tools(AWS S3, Zeppelin, EC2 and etc…)
  • Usedmicroservice architecture, with a Spring-based service interacting through a combination of REST and RDBMS/NoSQL datasources
  • Build and maintain SQL scripts and complex queries for Hive data extraction and analysis
  • ImplementedAIML interpreter services and pattern recognition services to createCMLP platform intelligent Q&Aassistant using Java, Lucene and OpenNLP
  • Responsible for creating model management microservice using Spring Boot, Maven, MongoDB, Jgit, Postman and Swagger to manage PMML files and model features.
  • Createdkeyword parser, SentenceDetector, Tagger and TextFeatureExtrator in common-NLP library
  • Experience on configure yaml, docker and Kubernetes fordeployment
  • Wrote JUnit tests and Integration test cases for microservices
  • Onboarding training for new joiners like environment installation, access, service running, Postman, Jenkins job, deployment and git turtorial
  • Worked with itrack/jira and git bash/pull request for project tracking
  • Troubleshooting and optimization of performance, such as debug or performance tuning

Confidential

Analytic Software Developer, Consultant, Middletown, NJ

Responsibilities:

  • Implemented and supported in big data ETL procedure using Python, Sqoop, Pig and Big Data APIs
  • Enhanced data validation by using Python, Beeline and Oracle ODBC
  • Experience onHiveQL and Beeline to run analytical functions, including UDF and optimization.
  • Wrote script for auto data extraction and validation by using Python script
  • Wrote Python script for sampling and sampling test
  • Wrote script to process raw data such as cleansing, normalization,transformation,standardizationand merging
  • Modified data modeling by using MLlib for machine learning purpose
  • Hands on experience on Pyspark/Spark Scala for data processing and performance tuning
  • Utilized PyUnit, nose and pytest for python tests
  • Performed analysis and presented results usingSpark SQL/Zeppelin
  • Experience on Apache Tomcat and Java Servlets
  • Involved in design and development of presentation layer using HTML and JSP
  • Worked with Git Bash for version control and Rally for project tracking
  • Troubleshooting and optimization of performance

Confidential

Developer, Piscataway, NJ

Responsibilities:

  • Wrote Sqoop scripts to make the interaction between Hive and other datasources
  • Experience on creating Hive tables to store the processed results
  • Optimized Hive tables using optimization techniques to provide better performance with HiveQL
  • Experience on real-time analytics using Kafka, Spark Streaming and Pyton
  • Used Apache Spark/Scala/Python for data clean-up, transformation and merging
  • Experience on setting up and configuration of Amazon EMR on AWS
  • Hands on experience setting up and configuration Cloudera and Google Cloud platform
  • Optimized Spark and solve performance tuning issues
  • Wrote JUnit test to check the methods and fix the defects
  • Tableau experience on graphing results for reporting

Confidential

Software Engineer, Santa Clara, CA

Responsibilities:

  • Implemented personalizedRecommendation app based on user visit history with ApacheTomcat, REST API, HTML5, CSS3, JavaScript, JavaandMaven
  • Parsing user and store information from Yelp open dataset and Yelp API (JSON)
  • Implemented RDBMS/NoSQL database (MySQL/MongoDB) to store the user visiting history data
  • Wrote Scalatoprocessing real-time loganalysis withFlume, Kafka and Spark Streaming
  • Optimized real-time processing performance
  • Deployed the server onAWS and test with JUnit test, A/B test and regression test.
  • Worked under agile development

Confidential

Software Engineer

Responsibilities:

  • This project is to maintain virtual lab testbedand databaseonstudent’smanagement
  • Maintained virtual lab website and network Testbed with HTML5/CSS3, JavaScript,LAMP and Linux
  • Implementationed student management application tomanage student profile and usage information with
  • JDBC, SpringMVCand Hibernate

Confidential

Developer

Responsibilities:

  • Designed UML diagrams using Magic Draw to generate java code
  • Writing POJO classes for Spring injection
  • Involved in designing and developing back end java beans using OOPs
  • Created XML parser using SAX and DOM XML parser
  • Created new front end with JSP, Struts, JSTL and JavaBeans
  • Taken care ofSQL queries for the backend RDBMS
  • Experience in using the Object Relational Mapping tool Hibernate
  • Developed java Bean Tags
  • Developed Exception Handling Framework for whole system
  • Taken care of Java multi threading in java classes / library
  • Deployed application on development and production server with Apache Tomcat
  • Wrote common java script related functions servers
  • Wrote unit tests using JUnit, JMock and acceptance tests using STIQ
  • Supported and fixed production defects

Confidential

Developer, Consultant

Responsibilities:

  • UsedStruts, JSP, JSTLto implement the user landing page
  • Experience onMySQL data query and JDBC for database connectivity
  • Designed RESTful web services with JSON representation for handling AJAX requests
  • Develop payment switchfor integrating with multiple Payment gatewayssuchas credit car, debit card or internet banking
  • Created alerts templates Confidential DB end for multiple channels like sms and email
  • Designed and developed Java Multi-Threadingin java classes
  • Implementation information authentication withcryptographic algorithm
  • Experience on Apache Tomcat and Java Servlets
  • Experience on unit tests (JUnit),integration tests and regression tests
  • Used Log4j for External Configuration Files and debugging
  • Working with QA team for finalizing Test Plans
  • Worked in Agile team

Confidential

Java Developer

Responsibilities:

  • Developed web application with Spring injectionand SpringMVC
  • Involved in design and development of back end java bean components
  • Developed common library using iBATIS/Mybatis against MySQL database,
  • Strong SQL queries for the backend RDBMS
  • Involved in writing configuration files XML
  • Developed user interface with HTML/CSS3, JavaScriptandJSP
  • Experience on Apache Tomcat and Servlets
  • Used Java-J2EE patterns like Data Transfer Objects, Data Access Objects, Singleton and factory patterns
  • Design and developed Maven scripts to build and deploy the application in JBoss
  • Experience on JUnit test and A/B test
  • Experience on Agile(Scrum) team and Git tool for version control

Hire Now