Sr. Principal Architect, Data And Analytics Resume
PROFESSIONAL SUMMARY:
- 20+ years of total IT experience, including 5 years of working expertise in Confidential
- Sr. Principal Technology Architect for Data & Analytics unit at Confidential and Confidential
- Extensive experience in DW/BI and Big Data implementations across diverse industries including Financial, Retail, Telecom, Card/Insurance and Manufacturing
- Manage the Center of Excellence for emerging technologies such as Big Data, Advanced Analytics and DW appliances at the unit level
- Extensive expertise in open source Big Data technologies, including the advanced Hadoop stack, and in Advanced Analytics such as Data Science, Machine Learning and Visualization
- Hands-on with diverse DW technologies, most of the Hadoop technology stack, and data science
- Passionate about learning new technologies, Mentoring and Team Building
SPECIALTIES:
IT Strategy and Planning
DW/Big Data strategy and Architecture
Big Data Solution Architecture and Project Delivery
Big Data/ Analytics Product development
Advanced Analytics with Machine Learning and AI
Program & Project Management
People Management and Team Building
Vendor and Alliance Management
TECHNOLOGY:
DW Platforms: Hadoop, Oracle, Teradata, DB2, Netezza and Vertica
Data Management: Informatica, DataStage, Talend and Datameer
Visualization: SAP Business Objects, Cognos, MicroStrategy, Tableau, and open source visualization tools such as D3.js, Chart.js, jQuery, AngularJS and Apache Hue
Big Data: IBM BigInsights, Hadoop (Cloudera/MapR), NoSQL (HBase/Cassandra/MongoDB), Apache Spark, Apache Storm, Apache Kafka, Graph database (Neo4j), Machine Learning using R and Python
Languages: Java, Pig, HiveQL, R, UNIX Shell, Scala and Python
Data Science: R, Python, Scala, Spark MLlib, Recommendation and Classification Engines, Machine Learning Algorithms (Supervised and Unsupervised), NLP and AI
PROFESSIONAL EXPERIENCE:
Sr. Principal Architect, Data and Analytics
Confidential
Responsibilities:
- As Solution Architect at Clarity, performed a technology assessment and recommended a solution architecture and roadmap for the digital platform at Humana
Sr. Principal Architect, Data and Analytics
Confidential
Responsibilities:
- Conducted a platform assessment and recommended a future-state architecture and roadmap for a Marketing Platform (GMDR), covering both Big Data and non-Big Data options
- Participated in and led key technology strategy initiatives within the American Express IM space across diverse business portfolios (Customer/Merchant Marketing, Digital Campaigning and BIDW shared managed service)
- Conducted in-depth questionnaire sessions with business and technology stakeholders to understand current pain points and define a Point of Arrival (POA) architecture and a tactical/strategic roadmap for technology adoption
- Articulated Confidential value proposition and technology adoption roadmap for emerging technologies
- Supported and offered thought leadership in most emerging BI/Big Data tools, technologies and solutions
- DW/Big Data Solution Architecture and Points of View
- Tools and software evaluation and proposed roadmap for Big Data technology adoption
- Design Patterns for Data Storage and ADW (Augmented DW) solutions
- Data Visualization / Analytics and API based data publishing
- Open source technology adoption in Big Data and DW space
Confidential
Chief Architect
Technology: AWS, Apache Spark, PySpark, Redis, Apache Livy, Spark ML, Scikit-Learn, Python and Scala, CoreNLP, Word2Vec, NLTK
Responsibilities:
- Designed and architected data ingestion, data wrangling and automated AI and NLP modules for the Predactica product suite
- As part of its NLP platform, designed the architecture for text normalization, sentiment analysis, topic modeling and text classification
- Architected the underlying API integration platform involving Node.js, Flask (API management), Apache Livy and PySpark for an interactive user experience
- Managed agile product development and supported several POCs with Fortune 500 companies
- Assessed Azure, AWS and Bluemix cloud services and recommended the right product
- Oracle BDA (Cloudera based) platform assessment and product configuration
- BDA Security framework setup (Kerberos/AD integration)
- Piloted analytics use cases using Oracle Big Data Discovery tools and Apache Spark analytics frameworks
- Data Federation using Spark SQL, Presto and Big Data SQL from Oracle
- Recommended optimal configurations for Dev/Test and Prod environments
- Executed Cognitive/Speech Analytics use cases using Python, Kaldi and Spark MLlib (NLP) libraries
- Confidential TMS Connected Car Pilot Program - As Solution Architect, led a team of data scientists and Big Data developers to pilot telematics analytics use cases related to the larger Confidential Connected Car initiative
- Established a data lake on MS Azure and HDInsight
- Built frameworks for telematics data acquisition and ingestion
- Built a data pipeline for data curation and consumption leveraging Apache Spark on HDInsight and Power BI for visualization
- Performed data preparation and statistical modeling in RStudio to build both predictive and descriptive analytics from telematics, customer and vehicle data
- Showcased Driving destination prediction and descriptive Repair analytics based on historical telematics data
- Big Data Lake engagement at United Health - Led a team of Big Data Developers/Architects implementing a large-scale Data Lake program at United Health. The program is a multi-phase Big Data program involving 8 scrum teams executing in agile mode. The implementation involves building several framework components (Ingestion, Enrichment and Provisioning) and ingesting about 40 data sources into the Big Data Lake built on top of the MapR distribution
- External-facing Merchant Application (a.k.a. My Business Trends) exposing 3 years of historical Card member spend data (35 TB of transaction data) on the MapR Hadoop platform and publishing data in charts and graphs using advanced visualization and charting tools. The technology stack entails MapReduce and Pig for data transformation and a Hive data store, along with Solr search and memcached for low-latency data access.
- Built a common metadata-driven Data Ingestion Framework for the American Express Big Data Platform (750-node MapR cluster) that enables data sourcing from different SoRs and platforms. The Ingestion Framework (a.k.a. Cornerstone) enables seamless data ingestion, data masking (PII data) and metadata capture for transactional data and master data in both real-time and batch modes. Pig and Python are used for data management, along with Hive and HiveQL for data access.
- Prospect Personalization Application leveraging historical Omniture clickstream data and real-time user data to render customized offers and advertisements. An HBase NoSQL database is used for data persistence and for storing historical clickstream data, and custom rule engines are used for generating real-time custom offers for registered users. Tableau and D3.js are used for analytical and real-time data visualization.
- Built an Enterprise Data Hub at Aetna that manages member, claims, provider and compliance data for its insurance business on a distributed MarkLogic NoSQL database platform. Real-time, event-based and batch-generated data are ingested and transformed in the MarkLogic data hub for user consumption. The MarkLogic data hub will eventually provide real-time access to its transactional data store for mobile devices and analytics use cases.
- Large-scale Hadoop Augmented DW solution at Apple Computers with petabyte-scale data migration from Teradata to Hadoop/Hive.
- Consulted with Travelers Insurance on its Big Data roadmap and technology adoption through POCs. This involved creating a Sqoop- and Flume-based data ingestion framework on the CDH platform and a POC on the MongoDB NoSQL database
- Played a key consulting role in the Allstate and MoneyGram Agent Pay data transformation programs
- Executed customer use case driven POCs with Apache Storm, Spark Streaming and Augmented DW solutions involving the Hadoop technology stack
- Created solution architecture and managed large-scale data migration programs leveraging IBM Netezza at Bank of America, TWC, Cell South and American Express
- Provided technical leadership and hands-on help in designing custom modeling and recommendation solutions at Amex. The solutions include real-time prospect personalization and real-time credit line decisioning using the KNN algorithm
- Strong track record of planning, architecting and implementing custom and packaged Data Warehouse and Business Intelligence solutions with high performance and extensibility in batch or real-time environments
- Principal Architect/Data Modeler for the Bank of America ECRIS DW/BI roadmap. Responsible for modeling the EDW and Distribution layer data marts involving a dozen subject areas and as many fact tables. Implemented best practices in both relational and dimensional modeling. Managed the Netezza NPS upgrade for 12 Netezza P12 servers. Coordinated an ELT development environment involving historical and incremental loads in the Netezza DW environment.
- Responsible for leading, architecting and implementing the ETL framework design, Exception and Error notification & reporting at Confidential Retailer. Designed and implemented half a dozen full life cycle DW projects involving data migration, enhancement and BI delivery roadmap involving Business Objects Product Suite, Adobe Flex and Netezza ELT framework. Also worked with Oracle, DB2 and Mainframe source systems.
- Designed and developed a home-grown performance management tool involving the Business Objects XI R3 SDK and Flex. Proactively improved batch and scheduled report performance supporting 2000+ users at Confidential.
- Designed the solution architecture for asynchronous data load in a Netezza EDW involving multiple source systems at Time Warner Cable. Cut batch processing time from 24 hours to 5 hours. This involved technology migration from Oracle/Informatica to Netezza and an ELT framework
- Managed the Big Data COE, which drives innovation through ideation, solution development and building industry-leading accelerators
- Involved in Solution Architecture development for Big Data use cases for diverse clients involving open source Big Data Technology stack and traditional BI stack
- Directly involved in POCs and product ideation around numerous Big Data use cases, namely real-time data ingestion, Fraud Analytics and Customer 360 view
Senior Solution Architect
Confidential, Dallas
Responsibilities:
- Served as lead BI Solution Architect at leading specialty retailer Confidential for more than 7 years, delivering multiple data warehouse projects supporting Sales and Inventory applications for its retail store line. Also delivered analytical reporting solutions involving SAP Business Objects and rich dashboard visualization
- Delivered solution architecture for a multi-phased end-to-end DW development project involving data model, infrastructure alignment, POC, build and testing phases.
- The solution was delivered on the Oracle 10g platform with Business Objects as the reporting tool.
- Led a data migration effort from Oracle to Netezza involving application reengineering and rebranding of the reporting platform, with major Business Objects product upgrades and new report development