Big Data - Solution Architect / Sr Developer Resume
New Hampshire, CA
SUMMARY:
- 16+ years of experience implementing end-to-end IT projects in Java/J2EE, Big Data, and cloud technologies.
- Working as a Big Data Solution Architect / Sr. Big Data Developer and Scala & Java Technology Lead Developer in the Retail, ERP, Confidential, Banking, Telecom, and Manufacturing industries.
- Effective team player with excellent interpersonal, communication, and listening skills, committed to delivering projects on schedule.
TECHNICAL EXPERTISE:
- Big Data - MapR 4.0.1, Hortonworks 2.3, Talend 5.3 ETL, MapR-DB, Hadoop 2.5.1, HBase 0.98, Hive 1.2.0, IBM MDM 9.0.2, IBM CDC, IBM MQ 8.0, IBM Quality Stage, Splunk 6.2, ZooKeeper 3.4.6, Parquet, ORC, Avro, MapReduce, Oozie Workflow, NoSQL, Cassandra 2.1, Solr 3.6, Couchbase 2.5, Kafka 0.8.1, Storm 0.9, AWS Cloud (EC2, S3, Beanstalk, and IAM).
- Spark 1.6.1, Scala 2.10.1, Spark SQL, Spark MLlib 1.4.1
- J2EE 1.7 (Java, Swing, Servlets, JSF, Web Services (SOAP & RESTful), EJBs, and JMS)
- JSF 2.2, PrimeFaces 3.5, Apache Struts 2, Spring 3.0 & Hibernate 3.3
- HTML 5, CSS 3, Ajax, JavaScript, jQuery, XML, SAX, DOM, XSLT, JSON, JAXB
- IBM WebSphere Process Server 7.0
- IBM WebSphere Portal Server 7.0 (IBM Java Developer Certified)
- IBM Chordiant 7.0, IBM ILOG JRules
- IBM WAS 8.5 & IBM MQ, JMS, Apache Camel 2.10, WebLogic 7.0, JBoss 7.1.1, Tomcat 7.0.22
- Oracle 10g, ERWIN 6, MySQL 5.1, SQL Server 2007
- IBM JBuilder, RAD 7.0, RSA 7.5, BPD.
- UML, Rational Rose 9.0, Visio, Peregrine, Microsoft Project Plan 2007
PROJECT EXPERIENCE:
Confidential, New Hampshire, CA
Big Data - Solution Architect / Sr Developer
Tools: Hortonworks 2.3, Spark 1.5.1, Scala 2.10.1, Spark SQL, Java 1.8, Python 3.6, Apache Cassandra 2.1, Kony Mobility platform 3.0, Kafka 0.8.1, WildFly 8.0, QlikView 10, Oracle 10g, IBM DB2 9.7, SQL Server, AWS EC2, S3, IAM, Jenkins 2.0.
Responsibilities:
- Involved in the design, architecture, development, integration testing, productionizing, and post-production support of the following modules:
- Employee Details
- Employee Performance
- Throughput Alerts
- Employee Praise Coach
- Employee Attendance without Picks
- Unplanned Employees
- Shipping File Streaming
- Lift File Streaming
- Involved in the functional analysis, design, and development of the following technical frameworks, which consume various source data and transform it into Cassandra presentation tables as per business requirements:
- WMS Data source ingestion
- Batch processing - ETL transformation
- Rest API development
- Cassandra Data modeling
- Spark jobs scheduling
- Spark job management through a control table to improve job performance and handle multiple warehouses.
- Performed Cassandra data modeling and designed tables as per business requirements for source data ingestion, ETL transformation, and presentation.
- Designed and developed a data monitoring framework that tracks data arrivals and raises alerts when expected data does not arrive or specific data fields are not populated, based on the metadata configuration.
- Performed integration testing, production deployment, post-production support, and change management activities.
- Performed AWS Cloud activities involving AWS EC2, S3, Elastic Beanstalk, and Load Balancing, along with continuous integration and cloud application deployment using Jenkins.
- Generated daily and weekly warehouse analytical reports on employee, shipping, load, and lift performance using Spark jobs and QlikView.
Confidential, New Jersey, CA
Big Data - Solution Architect / Lead Developer
Tools: Hortonworks 2.1, Spark 1.5.1, Scala 2.10.1, Java 1.8, Spark MLlib 1.4.1, Apache Cassandra 2.1, Hadoop 2.5.1, ZooKeeper 3.4.6, Kafka 0.8.1, Python 3.6, Java 1.7, AWS S3, EC2, AWS SDK 1.10, Tableau 9.0.2
Responsibilities:
- Designed and implemented the following modules, including solution architecture, coding, and unit testing:
- Data Source ingestion
- Customer model ETL processing
- Cluster generation and User Affinity calculation
- Recommendation system API
- Data Visualization using Tableau
- Involved in developing a Spark Streaming job that acquires data through the existing data source access patterns (web services, S3, Kafka, Cassandra, and HDFS) and validates and transforms the data while ingesting it into HDFS, based on control parameters captured through the Control UI, for the following sources:
- Event Log
- MDDB
- Gup
- BI
- DynaTrace
- Involved in Cassandra data modeling and in the design and development of ETL loads into Cassandra for the various data stages (L1, L2, metrics calculation, cluster, and affinity score data) needed for recommendation processing, using Spark Streaming jobs, Spark SQL, and DataFrames.
- Performed exploratory data analysis to determine which attributes to extract from the various data sources to build the customer cluster model, and built the necessary transformation logic.
- Involved in cohort analysis and in the development of behavioral metric calculations, implicit ratings, weight and decay calculations on the metrics, cluster model generation, and affinity score calculations, using collaborative filtering and machine learning algorithms on the Spark platform in Scala and Java.
- Developed Recommended for You (RFY) service analytics based on insights derived from the customer model and user affinities, using CF & ML with item-based similarity and the user’s recent history.
- Involved in the development of the Control UI, which captures the control parameters used to drive data transformation, weight and decay calculations, cluster model generation, and the recommendation system.
Confidential, Minneapolis, CA
Big Data - Solution Architect / Lead Developer
Tools: MapR 4.0.1, Talend 5.3 ETL tool, MapR-DB, Hadoop 2.5.1, HBase 0.98, Hive 1.2.0 (Hive Server 2, HCat), Spark 1.4.1, Sqoop 1.4.2, IBM MDM 9.0.2, IBM CDC, IBM MQ 8.0, IBM Quality Stage, Splunk 6.2, ZooKeeper 3.4.6, Kafka 0.8.1 & Java RESTful Web Services, Parquet, ORC, Avro schema, MapR CLI, Hive CLI, MapReduce, Java 1.7, Java Threading.
Responsibilities:
- Common Data Acquisition
- Generic Data Ingestion Framework
- Centralized Storage/Archive Mechanism (Data Lake)
- Common Data Integration and Enrichment Capabilities
- Standard Data Provisioning
- Involved in the solution design of various use cases for the metadata-driven common data acquisition framework, including schema evolution, IMS, MDM, Real Data feed, data compression, and ETL, and in the decision-making process.
- Involved in the design and development of the following use cases for the ETL ingestion and validation framework, using Talend, MapReduce, HBase, Hive, Spark, Sqoop, IBM MDM, and IBM MQ:
- Source data ingestion (Historical, NRT, and RT)
- Data Validation
- Data Quality Rules
- Data Enrichment
- Schema Evolution
- Error handling and HPSM notification
- Restartability
- Current Snapshot
- Involved in the design and development of the Splunk Dashboard use case, which generates data fabric ingestion reports (ingestion time, failure scenarios, restarted jobs, memory capacity, enrichment, etc.) from metadata captured during the core data fabric processes.
- Performed code review, defect analysis, solution design documentation, and code deployment activities as part of deliverables.
Confidential, TX
Big Data - Solution Architect / Sr. Developer
Tools: Cassandra 2.1, CQL, Storm 0.9, ZooKeeper 3.4.5, Kafka 0.8.1 & Java RESTful Web Services, Hadoop 0.21, Hortonworks 2.1, YARN 2.0, Spark 1.3, Hive 0.13, Pig 0.11, Oozie Workflow 3.1, Avro schema, AWS Private Cloud, EC2, S3, Java Threading, Cassandra Bulk Loader, Git, Jira, and Stash
Responsibilities:
- Studied the existing systems (POS & CMOS) and provided the solution architecture for processing real-time and historical sales data based on the business requirements.
- Designed and was involved in the development of the Cassandra column family design, RESTful web services, MapReduce, Pig, Hive, Storm, Kafka, and ZooKeeper integration, and ETL processing from the EDW; successfully deployed the cexp services into production, where they are currently used by store managers and associates through NM’s applications.
- Real-time and historical data were captured in Cassandra and HDFS using Oozie workflows. MapReduce, Pig, and Spark code was written for data aggregation and analysis. Avro schemas were used to handle new types and complex schemas.
- iPhone mobile applications were developed in Xcode for reporting and visualizing sales data (Associate Facing App) for the various business use cases, through RESTful web services that retrieve big data from Cassandra and HDFS using CQL and HiveQL.
- Solr and CouchDB were used for customer profile and product catalog data with search capabilities.
- Historical data was extracted from the EDW and migrated into Cassandra using the Cassandra bulk loader with multithreaded Java code.
- Wrote Unix and JCL scripts to schedule the daily load into the cluster database and the daily data validation.
- Performed change management, build, implementation, code review, defects management and deployment plan activities along with business stakeholders.
- Analyzed data growth and proposed system improvements and a capacity management plan for both real-time and historical data.
- Ensured process compliance and standards using Agile scrum methodologies through Stash and Jira tools and achieved production deployment on time.
Confidential
Solution Architect
Tools: TOGAF 6.1, Hadoop 0.19, Hortonworks 2.0, MapReduce, Hive 0.13, Pig 0.11, Oozie Workflow 3.1, JSF - PrimeFaces 3.5, Spring 3.0.2, Hibernate 3.3, XML (SAX & DOM) Parsers, JavaScript, Drools 5.1, RESTful Web Services, JMS, Apache Camel 2.10, Oracle 10g, MongoDB 3.0, IBM WAS 8.5, JBoss 7.1.1, Tomcat 7.1, UML.
Responsibilities:
- Involved in re-architecting the product, originally developed as an Oracle Forms application, into a Java-based web application, including the migration from RichFaces (JSF 1.1) to PrimeFaces (JSF 2.1). Designed and developed an SOA-based web services framework with MQ, JMS, Drools, and Apache Camel to convert back-end logic into web services, enhancing workflow management and easing external system integration.
- Developed a PoC for a Big Data implementation of the Finance and Auto dealership contract and management module using Hadoop, MapReduce, Hive, HCatalog, Oozie workflows, and Pig scripts.
- Developed an approach and utility tools to improve product performance by migrating back-end PL/SQL code into Java components for ERP modules such as Purchase, Sales, Inventory, Finance, Auto dealership management, Supply chain management, Human resources, Payroll, Manufacturing, and Contracting.
- Developed PoCs for complex use cases and was involved in preparing unit, integration, and regression test cases for the application across platforms (Oracle, MongoDB, JBoss, Tomcat, GlassFish, WebSphere).
- Developed Architecture, Deployment diagrams, High Level and Low Level designs, Visual Models, Object Models using Rational Rose for the respective use cases.
- Involved in improving the performance and high availability of the application, including capacity planning, WebSphere clustering, code analysis, multithreading, and optimization activities.
- Developed using Agile scrum and Test Driven Development methodologies.
Confidential
Technology Lead
Tools: Java, JSF, Spring 3.0.2, Hibernate 3.3.2, XML (SAX & DOM) Parsers, Ajax, JSON, JavaScript, Eclipse, IBM WebSphere Portal Server 7.0, IBM Process Server 7.0, WebSphere 6.1, Tomcat 7.0, IBM ILOG JRules, IBM MQ, Web Services, MySQL 5.1, IBM RSA, RSM, Redwood explorer, Jasper reports, rhutil.
Responsibilities:
- Involved in the design and development of the Commercial application, in which provisioned banks can purchase or modify their product selection, based on which vendors receive the credit reports. The application included product provisioning, payment generation, and authentication modules.
- Involved in the design and development of the PPE modules, largely a nightly batch process that generates reports and sends them to the provisioned banks based on the products they opted for. Components included batch processing, PDF report generation, email templates, and exception handling. Also involved in product support for these components.
- Developed using Agile scrum and Test Driven Development methodologies.
Confidential
Technology Lead
Tools: Java, JSF, Struts 2, Spring 3.0.2, Hibernate 3.3.2, XML (SAX & DOM) Parsers, Ajax, JSON, JavaScript, jQuery, Web Services, Tomcat 7.1, JBoss 6.0.1, WebSphere 6.1, Oracle 9i, DB2, Eclipse, IBM RSM.
Responsibilities:
- Involved in the design and development of the Confidential modules, which include Image Bank, Carry Over, Parts management, Category management, Customizable Catalog, Promotions, and Order management.
- Involved in the design and development of modules such as RDR Processing, SIMS Contract Processing, TFS Payment, Accrual processing, True-up process, and Jasper Reports using JSF, Spring, Hibernate, and the DB2 database.
- Produced High Level and Low Level designs, Visual Models, and Object Models using Rational Rose for the respective use cases, prepared unit, integration, and regression test cases, and coordinated deliverables.
- Developed using Agile scrum and Test Driven Development methodologies.
Confidential
Solution Designer /Onsite Coordinator
Tools: Java, IBM Chordiant 6.3, WebSphere 7.0, IBM ILOG JRules, JSP, XML (SAX & DOM) Parsers, JavaScript, Ajax, JSON, JMS MQ, Web Services, ERWIN 6, Oracle 10g, DB2, Microsoft Project 2007, Eclipse, IBM RSA 7.5, RSM, BPF, and JRule Designer.
Responsibilities:
- Developed 5 Lloyds processes and wrapped 35 FDI processes, integrated these processes into Foundation Desktop, and coordinated the work onsite.
- Confidential processes were developed to interact with the OCIS, Vision Plus, and CAPD servers for back-end data processing through the secure IBM DataPower system.
- Developed an end-to-end design covering the DTO/BO model using RSM; the PDM and LDM using the ERwin tool; workflow, domain services, and UI development using the Chordiant framework; defect fixing; and an encryption solution.
