We provide IT Staff Augmentation Services!

Lead Data Engineer, Data Sciences Resume

4.00/5 (Submit Your Rating)

SUMMARY:

  • Experienced in designing and developing complex BIG data solutions
  • Designed and implemented high velocity high performance data ingestion pipelines to read from a variety of sources and persist into big data ecosystem
  • Effectively lead the product teams to deliver quality solutions while managing scope and schedule.
  • Knowledgeable in using relational data modeling and dimensional data modeling techniques to come up with the right model for the use cases.
  • Partner with Product Owner to develop the product roadmap, design and technology strategy
  • Proven ability to interact with business and converting the logical specs and requirements into data solutions
  • Mined/analyzed huge data sets to identify the signals and create reports.
  • Executed POCs to test public cloud data solutions vs private cloud and developed good understanding of Cloud fundamentals
  • 4 years of experience in implementation of GUI Centric Projects using Java Swing, JSP (Java Server Pages).
  • Developed ETL jobs using JAVA, CAMEL to fetch data from a variety of sources/APIs and load it into a varying set of platforms including databases and data lakes.
  • Nine plus years of experience in Object Oriented Software Analysis, Architecture & Design, Development & Implementation of enterprise systems using Java and J2EE Technologies.
  • Expertise in the development of mission critical web based applications utilizing Java, J2EE, STRUTS, EJB, JSP, Java Script, XML, UNIX, Ajax, JDBC, Java Swing and Java Beans.
  • Extensively worked with Servlets based and STRUTS based multi - tier applications.
  • Experienced in preparing documents as a part of Functional Specs, Detailed Design for Change Requests, Unit Test Plans and also documented the application.
  • Experience in analysis & design using UML (Visio), Lucid Chart.
  • Specialized in using IDEs like Eclipse, BEA Workshop and Intellij
  • Expertise in HTML, DHTML and JavaScript.
  • Technically proficient in module development and upgrades based on needs.
  • Experience with reviews (codes, design, use cases, test cases) and production support.
  • Excellent communication skills, creative problem solver, positive listener and team player.

SKILL:

Operating Systems: Windows NT/2000, UNIX

Technologies: HDFS, Spark, Storm, Nifi, Kafka, Oozie, Elastic Search, Logstash, Google Cloud Platform (Dataproc, Dataflow, BigQuery, GCS),Java, J2EE (JSP, JDBC, EJB2.0, STRUTS1.1, JNDI), HTML, XML, Java Script, AJAX, SQL, PL/SQL and CSS

Servers: Apache Tomcat and Web logic

Databases: Oracle 8i, 9i, 11g, DB2, MS Access, Mongo DB, Postgres

Middleware: BEA Web Logic8.1/7.0, Apache Tomcat 4.0/5.0

GUI: Java Server Pages (JSP) and Java Swing

IDE: Eclipse3.0/2.3, BEA Web Logic version 8.1, Rational Application Architect, Intellij

Tools: TFS, VSS, CVS, GIT, TOAD, Visio, IBM Rational Clear Case & Clear Quest, Advanced Query Tool, CAMEL, HIVE, Miscrostratgey, Lucid Chart

PROFESSIONAL EXPERIENCE:

Confidential

Lead Data Engineer, Data Sciences

Responsibilities:

  • Develop product (RedEx) that empowers and enables big data platform users to configure data pipelines to read from high velocity Kafka streams (multiple formats) and ingest into bigdata ecosystem with good performance and monitoring capabilities
  • Enable reading from kafka in Avro and persist the data in ORC in hive
  • Schema evolution
  • Configurable deduplication logic
  • Enable merging of Avro Kafka messages from multiple topics based on an uber schema and write to hive while supporting schema evolution
  • Provide data movement scheduling capability in REDEX
  • Make changes to Apache Spark open source code to read from non-consecutive offsets/compacted topics
  • Deliver the required capabilities for a RedEx UI,
  • RedEx- yarn application
  • Design the schema to persist user configurations
  • Deliver Nifi secure cluster and Nifi flows for streaming clickstream data from Confidential .com into Elastic Search
  • Enable RedEx for PII ingestion (SecureRed)
  • Deliver the egress and ingress capabilities into a restricted environment for storing PII
  • Deliver the capability to create alerts/monitor RedEx jobs,
  • Capture the metadata and publish the execution metrics onto a kafka topic for user consumption

Environment: Spark, Nifi, Storm, Oozie, HDFS, Hive, Elastic Search, Logstash, Kafka

Sr. Data Engineer

Confidential

Responsibilities:

  • Interview business team members to get details for the identified story points and develop reporting capability roadmap and manage scope/schedule.
  • Develop detail technical design for multiple products (reports/dashboards, etc.) simultaneously.
  • Work with Google, Facebook, Pininterest and Twitter teams to determine the appropriate APIs and pull in the required data to support Guest Identification.
  • Architect and develop technical solutions that deliver relevant information via identification and reporting
  • Define the architecture and detailed design as per the EA approved design patterns.
  • Pull Activity, clicks, and impressions log files from Google storage using DCM/DFA and trafficking APIs and load the data into the Hadoop (BIG RED - Private Cloud).
  • Explore, mine and analyze the data to understand the logs data and identify guests for Audience Targeting in Paid Search
  • Associate digital actions of guests on both external and internal digital platforms to identify guests and missed opportunities, relate sales back to Confidential .
  • Work with the business to determine relevant key words.
  • Responsible for end to end of the SDLC including analysis, design, development, testing and prod activities.
  • Design and develop “Reviews Landing Page”, site map generators, etc.
  • Work on modifications and enhancements to the external landing page.
  • Create detail design document and develop the required code.
  • Work with the other partner teams to make sure they understand the tasks to be done.

Environment: Intelli J, GIT, Hadoop, HIVE, OOZIE, SPARK, SOLR, Tableau, DOMO, Java, Groovy, Spring Boot, Junit, Spock, Oracle 11g, Teradata, Google Cloud Platform (Dataproc, Dataflow, BigQuery, GCS)

Sr. Data Engineer

Confidential

Responsibilities:

  • Interview business team members to identify business needs and develop application/capability roadmap and manage scope/schedule.
  • Understand the business analytics strategy for Confidential .com and determine the KPIs and metrics that need to be reported based on the marketing campaign strategy.
  • Design, develop and implement the processes, solution to scrape inventory and pricing statistics from competitor websites.
  • Analyze the gathered requirements and document the technical requirements.
  • Work with the project manager and put together a plan for the delivery on the execution side of the roadmap.
  • Define the application’s architecture and detailed design as per the approved design patterns.
  • Create detail design document and develop the required code.
  • Co-ordinate, delegate and supervise the offshore development team.
  • Design and implement the whole analytics data solution for the gathered data from various web analytics platforms and scrapping data stores catered to produce the required reports/dashboards every two hours.
  • Participate in the system, integrated and user acceptance testing activities.
  • Demo/showcase the products to the partner teams and act as a SME to make sure they understand the tasks to consume and utilize the data/products.

Environment: Rational Application Architect, Java (J2EE), MS Access, Microsoft Visio, DB2, Oracle 11g,Teradata, Advanced Query Tool, Benthic Software, Micro strategy, Team Foundation Server, GIT, Adobe Site Catalyst, Hadoop, Tableau

Confidential

Senior Developer/Tech Lead

Responsibilities:

  • Participate and lead the meetings with various teams to gather requirements.
  • Analyze the gathered requirements and create functional specification document.
  • Define the application’s architecture and detailed design.
  • Review and document current Web analytics data collection documentation.
  • Test, implement, document and support existing and newly developed web analytics requirements and procedures.
  • Create detail design document and write the Java J2EE code required.
  • Develop, standardize and maintain existing technical documentation.
  • Lead the development team to deliver the code to match existing analytics code base.
  • Coordinate with the other partner teams to make sure they understand the tasks to be done.
  • Collaborate with development team to implement changes to code base.
  • Collaborate with internal clients to gather requirements for tracking improvements.
  • Implement best practices and industry standards respective to programming.
  • Responsible for all appropriate documentation associated with web analytics.

Environment: Eclipse Indigo, Java (J2EE), MS Access, Microsoft Visio, Adobe Site Catalyst, Charles, Web storm 3.0, Java Script

Confidential

Java Developer

Responsibilities:

  • Develop and implemented complex internet and intranet applications on one or more platforms.
  • Act as an escalation point for production troubleshooting guidance; provide and mentoring to less experienced developers.
  • Perform website and portal monitoring.
  • Use knowledge of web technologies to lead projects resulting in functional enhancements of web-based applications.
  • Evaluate systems specifications for client web site requirements, lead design presentations, influence direction and implement the most efficient and cost-effective software/package solution.
  • Prepare Functional Specifications and detail design documentation.
  • Perform data exploration/analysis, data modeling, data load and create reports.
  • Prepare Unit Test Plans and worked with the testing resources.
  • Coordinate with systems partners outside the group and oversees contractors on projects.
  • Perform web-enabled database development and module design.
  • Use programming tools to write scripts.
  • Perform web security activities and participate in security planning.

Environment: IBM WebSphere, ORACLE 10g, IBM Rational Clearcase BEA Web, Java (J2EE), Tableau

Confidential

Senior Java Developer

Responsibilities:

  • Lead the JAD sessions involving business users and SME for requirements gatherings.
  • Involvement in requirement analysis, design & development and production phases of the project right from the beginning.
  • Provide inputs into application’s architecture and detailed design.
  • Resolve conflicts, manage time lines, deliverables and expectations of business users in the context of physical implementation.
  • Re-design and develop the application integrations using web Services and XML.
  • Work with Middleware teams to setup MQ queues for the application.
  • Design, develop and implement Change Requests based on --“Model- View-Control (MVC-2) " - architecture using Struts framework.
  • Design, develop and implement change requests based on Flash technology, Flex Builder (Adobe Live Cycle Enterprise Suite).
  • Develop dynamic content based UI Screens( Servlets, JSP, AJAX, Java Script)
  • Develop and tune necessary query / stored procedure to retrieve the data from the Database.
  • Determine and implement enhancements to the various functionalities (Backend, UI).
  • Test, debug, fix the bugs and document the system design.
  • Analyze the daily defects of the production support and find the root cause to resolve all outstanding issues and ensure high availability.
  • Determine effective alternate solutions to the issues faced by the user.
  • Plan and manage KT activities during transition of the project

Environment: Solaris 2.8 (Sparc 5.8), ORACLE 9i, BEA Web Logic Portal 8.1, Java (J2EE), Oracle, JDBC, JSP, Servlets, Java script, HTML, EJB 2.0

We'd love your feedback!