Big Data Evangelist Resume
PROFILE:
- Eighteen years of IT industry experience encompassing a wide range of skill sets.
- Worked extensively with the Knowledge Management (KM) framework to implement Data Warehouse/BI solutions for the Telecom, Banking, Staffing, Education, Healthcare and Financial Services industries and the State Government.
- Handled EDW projects over 18 years in the capacity of Big Data Evangelist and Principal Architect.
- Expertise in architecting, solutioning, implementing and supporting enterprise-wide, mission-critical data warehouse solutions with very large data sizes.
- Successfully implemented star schemas using the Ralph Kimball methodology.
- Turned around large, troubled Big Data projects and made them profitable through domain knowledge and technical acumen.
- Started Mumbai’s first Spark meetup, which is also ranked among Spark meetups worldwide.
BIG DATA & NON BIG DATA TOOLS & TECHNIQUES:
Right Time ETL Tools: Apache Spark, InfoSphere Streams, IBM DataStage v11.3, IBM Change Data Capture
Data Visualization/Real Time: R, Data Explorer/Vivisimo, Cognos Real Time (RTM), Cognos NOW, Application Builder
Appliances: Pure Data for Analytics (PDA, PDOA), Netezza
Big Data Applications: BigInsights
Big Data Accelerators: Telecom Events Data (TEDA), Sentiment Data (SDA), Machine Data (MDA)
Data Modeling: Entity Modeling, ERWIN
Data Mining: IBM i-Miner, SPSS
Data Profiling: IBM Information Analyzer, Business Glossary, Metadata Workbench
Project Management: MS Project Server, MS Visio, Kintan, Remedy
Databases: DB2, Oracle, Teradata, SQL Server
Automation: Tivoli Storage Automation (TSA) and Tivoli Storage Management (TSM)
PROFESSIONAL EXPERIENCE:
Confidential
Big Data Evangelist
Responsibilities:
- Customer-to-Customer Relationships (Kith and Kin):
- Customers with significant overlaps in personally identifiable information (PII), flagged as two CIFs for the same customer
- Customers with same last name and address
- Customers who are joint holders or signatories on each other’s accounts
- Customers who have standing instructions to pay one account from another
- Customers who have guarantee/nominee/appointee/trustee/power-of-attorney relationships
- The existing EDW is 5 years old and sources data from 57 structured sources such as Core Banking, Internet, ATM and CMP.
- Augmented the EDW with IBM BigInsights to handle new data sources such as Facebook, Twitter, LinkedIn, YouTube, Google+, Instagram and Pinterest.
- Worked closely with Confidential on two sizing exercises, where I introduced an ETL MPP architecture using Grid topology, PDOA and BigInsights.
- Additionally, created a propagation engine through which the data warehouse provides data to the business in real time, on a T+0 basis, every day.
- Solutioned and implemented a near-real-time enterprise data warehouse for Confidential. Also set a new world record in the IBM world for processing daily logs of 3.6 TB using IBM InfoSphere Data Replication.
- The solution was architecturally complex, as it involved the “Confidential Remote Topology”, which constitutes only 1% of the overall Confidential install base.
- Customized this Confidential solution by creating an external log registration program to handle log shipping via Oracle Data Guard, as well as manual shipping after end-of-day (EOD) processing by the Core Banking source system.
- Solutioned and implemented India’s first High Performance Computing (HPC) DataStage implementation using a Grid topology that spans 64 AIX cores on P8-type servers.
Environment: Cassandra, Titan, BigInsights, Streams, Spark, SPSS, Confidential, Pure Data for Operational Analytics (PDOA), DB2, DB2 BLU, Oracle, UNIX
Confidential
Principal Architect
Responsibilities:
- Evaluate RFPs and respond with a BigInsights-based Big Data solution that includes Business, Software and Hardware components
- Design, architect and build a data platform over Big Data Technologies
- Design, develop and demonstrate Big Data PoCs
- Mentor clients on Big Data in consulting capacity
- Expert in Hadoop projects and associated distributions such as IBM BigInsights and Cloudera
- Interact with functional and technical teams to identify the right solution mix for clients
- Solution and implement real time Big Data applications
- Architect and size Big Data applications
- Build pilots and prototypes for Big Data applications
- Fulfill Big Data staffing requirements
- Project Planning for Big Data implementations
- Support existing and new applications
- Audit/review existing and new Big Data applications
Confidential
Data Warehouse Team Lead
Responsibilities:
- Successfully migrated from Teradata FSLDM into Confidential FSLDM
- Solutioned and implemented India’s largest banking data warehouse solution, an award-winning implementation
- Built India’s first propagation engine (BEDA) for real-time transaction availability for a 270 TB Confidential data warehouse
- Built an enterprise-wide data warehouse for Confidential, handling transactional logs of over 3.6 TB a day and setting a new world record in the Confidential world for largest daily volume
- Solutioned and implemented India’s first DataStage Grid implementation, using GPFS for load sharing and distribution
- Built an enterprise-wide data warehouse for Foreign Offices to handle transactional logs from over 18 Confidential install bases, setting a new world record for handling one of the most complex Confidential installations
- In a technical leadership role, mentored onsite and offsite teams to architect and implement a Near Real Time (NRT) Propagation Engine for data movement using ETL between the transactional system and the Global Data Warehouse
- Developed and implemented a Data Warehouse and an Operational Data Store for a credit card company to create and support cross-selling models
- Built technical solutions and performed tuning for Advanced Risk Sciences and Analytics Reporting applications as part of the IMD RISK team
- Architected a Data Warehouse for Medical claims processing to handle one of the largest volumes in medical claims processed
- Successfully designed, developed and implemented ETL to load data for one of the largest dental providers. Developed the Logical Data Model as well as the Physical Data Model
- Requirements gathering, Analysis Document, Technical Architecture, Design Document, design of Logical and Physical Data Models, Star Schema/dimensional modeling
- Web-enabled Data Warehouse with multiple Data Marts to track criminal activities for 5 western states of the U.S.A.
- Implemented a data warehouse to create a fraud detection system that tracks early warnings of fraud/theft in the postal system for a major postal company
- Architected Business Intelligence functionality into an Enterprise Knowledge Portal
