Hadoop Architect Resume
SUMMARY
- Extensive data architecture and modeling experience for large and complex projects spread across multiple organizations
- Big Data: Exposure to the IBM BigInsights Hadoop ecosystem hosted on IBM SoftLayer, with HDFS, Big SQL and Hive.
- Expertise in designing intuitive executive dashboards and score card reporting on real time / near real time EDW data.
- Expertise in MPP (IBM Netezza, AWS Redshift, Microsoft PDW) database systems for faster data processing.
- POC experience with in-memory (IBM DB2 BLU) database technologies to improve the user experience of BI dashboards.
- Expertise in different cloud environments: AWS (Amazon Web Services), IBM SoftLayer and Microsoft Azure
- Expertise in multi-terabyte Data warehouse environments, with multiple ETL and Reporting technologies.
- As a SQL expert, excelled in data analysis, data profiling and data masking, and supported data users as a data steward.
- As an Informatica expert, developed and enhanced key ETL processes and environments (DEV, TEST and PROD).
- As an Oracle expert, developed stored procedures and SQL queries for an intranet portal and data extracts.
- As a Linux expert, created scripts to automate data flow and add test-driven error handling to the ETL process.
- Designed and developed programs to identify process optimizations for efficient data load, retrieval and analysis.
- Created and maintained conceptual, logical and physical data models using industry best practices.
- Created HLD, LLD and STT (Source to Target), ERD, Data Dictionary, DFD (Data Flow Diagrams) & Test case documents.
- Experienced working with business leaders, analysts, program managers, and business, application and infrastructure architects.
- Proficient with database architecture tools and models (unstructured, relational, STAR, Snowflake)
- Prioritized, estimated and executed projects and tasks in line with business stakeholders' expectations
- Executed projects and programs using conventional, Six Sigma and Agile (Scrum) methodologies.
- Implemented Change Control Board (CCB) within ITIL framework to minimize the impact of new deployments
- Experience in production support, crisis management and leading ‘virtual war room’-type activities
- Expertise in data architecture, master data management (MDM), data quality and data integration with multiple sources
- Worked collaboratively with stakeholders to provide technical analysis and design for data-level solution
- Provided data architecture and solutions that meet data integration, interoperability and privacy requirements
- Technical Leader in multiple business functions with demonstrated success in Digital Data Management.
- Bring value to the workplace by adapting quickly to organizational demands and work culture.
- Managed challenging projects (such as Group bookings and the AWS implementation) and delivered them successfully within budget.
- Participated in building IT teams in Revenue Management, Rewards, Sales and Catering portfolios.
- Created project plans, resource loaded the plans and forecasted costs for the senior leadership budget committees.
- Won multiple business/client appreciation awards as Technical Leader.
- Implemented new Group booking & High Performance Pricing systems using Oracle, Informatica and Linux.
- Implemented fraud detection system that saved the company millions and elevated business leadership confidence.
- Implemented a new eAgent payout and Sales Goals system that saved the company 2M+ per year.
- Implemented Amazon Web Services (AWS) Redshift based solution for Worldwide demand, pricing & reservation analysis
TECHNICAL SKILLS
Cloud: AWS (Amazon Web Services) and IBM SoftLayer
Methodologies: Agile (Scrum), Six Sigma, ITIL, Waterfall.
Database Tools: Redshift, Hive, Netezza, PDW, Oracle, DB2, MS SQL Server.
ETL Tools: Informatica, NZLoad, SQL*Loader, DTS
OLAP Reports: ASP.NET, MS Excel, Crystal Analysis (Business Objects), Cognos
Modeling Tools: ERwin, Oracle Designer 2000, Visio 2000
Applications: Oracle Apps 11i (ERP), PeopleSoft 8 (ERP), Clarify (CRM), HP OpenView ServiceCenter (CRM), Siebel (CRM)
Environment: Cloud (AWS, IBM SoftLayer and Azure), Unix, Linux, Motif, AS/400, Windows NT
Quality: Six Sigma, CMM Level V, ISO
Standards: Data Masking, HIPAA, PCI, PII, SOX.
PROFESSIONAL EXPERIENCE
Confidential
Hadoop Architect
Responsibilities:
- Used Hortonworks Hadoop platform, MongoDB, Netezza, Tableau, Linux and ArcGIS technologies.
- Data scope: approximately 6 million policies with $3.5 billion in yearly premiums.
- As part of the data team, ingested mainframe FIMA data into Phoenix application data stores.
- Performed ETL and reporting on ArcGIS data, Direct Servicing Agent (DSA), WYO insurance companies and open source data.
- Created reusable code components that enabled faster data loads (ETL) and data analysis.
Confidential
Technical Lead
Responsibilities:
- Implemented new Group booking, High Performance Pricing and fraud detection systems, and demand segment analytics.
- Led integration of Marriott.com data with clickstream, campaign, customer loyalty, omni-channel, OR (statistical), Nielsen account, sales, rewards, travel, demand, payment, guest, travel agent and search engine data into the EDW.
- Led implementation of PCI (Payment Card Industry) and PII (Personally Identifiable Information) requirements in the database.
- Led critical projects including web analytics, clickstream, campaign, Omni, eAgent, eCom, eMerge, Rewards and customer loyalty.
- As part of the CCB team, managed infrastructure demand, software upgrades and vendor consolidations.
Confidential
Data Architect / Project Lead
Responsibilities:
- Provided Informatica (ETL) development and technical leadership to deliver a Business Analytics data warehouse that generates estimated/actual spread and key business success drivers/factors (mainly predictive analysis). Several business status reports are generated from the DW data. Used Informatica, Linux, Oracle and Cognos technologies.
- Integrated different data streams (Siebel, thingamajob.com, PeopleSoft employee and TeamTrak applicant data) with the EDW.
- As an expert in Informatica and Oracle, took the initiative to improve TeamTrak process performance; tuned the TeamTrak fast index process to cut its run time from 24 hours to 2 hours.
Confidential
Sr. Data Architect
Responsibilities:
- Implemented an open source database model (GUS) and loaded global DNA and RNA data for researchers' match analytics.
- Implemented data architecture to collaborate with other NIH-funded healthcare projects across the globe
- Created data models and trained analysts, UI development teams and user groups on them
