Azure Big Data Engineer/Architect Resume
SUMMARY
- Experienced Cloud/Big Data/ETL professional specializing in administration, DevOps, solution architecture, and design of cloud and on-premise solutions, including Networking/VPC, Big Data, ETL, Containers, ML, Catalog/Governance, Data Quality, and Security.
- Currently working as a Cloud/Big Data/ETL engineer/architect across various industries, including Financials and Pharmaceuticals, in PA, DC, NC, and NJ.
- Key strengths: a sound architectural foundation, detailed understanding of advanced technologies, knowledge of the full software project life cycle, and extensive Cloud, Big Data, ETL, and SQL experience.
PROFESSIONAL EXPERIENCE
Confidential
Azure Big Data Engineer/Architect
Responsibilities:
- Designed new VPCs for secure-access POCs; architected and implemented multiple AWS Glue/Spark pipelines using hundreds of DPUs.
- Implemented Data Lake solutions with AWS Athena and Redshift, GCP BigQuery, and Azure Analytics for multi-cloud POCs.
- Conducted assessments and built POCs to evaluate several Data Lake technologies: CDH (Cloudera) on a 100-host cluster vs. an AWS data lake on Glue, Fargate, and Redshift vs. a GCP BigQuery/Bigtable/GCK solution.
- Led cloud ETL development for a financial company: developed Dashboard and Financial DM reporting for five separate business lines. Created complex concurrent workflows with dozens of parallel executions, scaling out for maximum performance in real time.
Confidential
Data Engineer/Architect, Data Modeler
Responsibilities:
- Designed and implemented Informatica/Tomcat HA for Informatica Domains in 4 (2x2, 2x3) environments, Grid (2x3).
- Defined, designed, and implemented technical architecture to support business requirements, including 2 High Availability (HA) implementations on SUSE Enterprise with Isilon, VCS, and Oracle Exadata.
- Produced technical documentation supporting major architecture decisions: Informatica vendor tools comparison and technical characteristics, Runtime and Development Architecture documents, including the Installation and Security Guide for the Informatica 10 Server on Linux, and Oracle Database requirements for Informatica and applications development.
- Designed the solution implementation for Informatica integration with AD, SFDC, MDM (IDC), Oracle 12c, Exadata, and SUSE 12.
- Designed and implemented Informatica DevOps for PowerCenter and IDQ using Git and the pmrep and infacmd utilities.
- Designed and implemented multiple Dev, Test, and higher Informatica environments.
- Led Informatica security design and best practices for ETL development.
- Created the Informatica DevOps solution with 2 ETL test and data validation tools, Informatica Data Validation Option (DVO) and FitNesse DbFit, integrating the latter into the Informatica ETL server for both PowerCenter and Developer IDE command-line executions.
- Provided technical expertise to set the technical direction and manage issues, architecture, technical integration, and technical service levels for a group of technologies. Participated in detailed design and code reviews, reviewed system performance and consumption issues, reviewed test plans, and provided technical guidance and support to others. As Oracle and Informatica SME, oversaw knowledge transfer to the client team and the on-site consulting team in the Mumbai/Bangalore delivery centers.
Confidential
Informatica and Oracle Data Integration Engineer/SME, Technical Lead
Responsibilities:
- Led development and conversion of the Sybase codebase and database objects, as well as the ETL/Informatica codebase, to MSSQL: technical project management, initiating the project plan, assigning deliverables in MS Project 2010/2013, and managing offshore/onshore teams in a dynamic and complex financial environment.
- Performed data analysis using SQL queries, statistical functions, shell scripting, and other tools including Python and Hadoop Hive/Pig; gathered application data requirements and dependencies.
- Designed, developed, and implemented a complex statistical model system with large business impact, using Informatica and PL/SQL, reusable mapplets and worklets, R, Python, and Hadoop Hive and Pig.
- Developed a full-fledged statistical data processing and reporting solution with several phases of ETL and data processing, run on demand and via UC4. Wrote nontrivial queries in an Oracle database, such as heavily nested queries, complex joins, OVER clauses, and inline views.
- Oracle and MSSQL database tuning and administration, security groups, logs, and data modeling (data models and design patterns). Strong ability to author mappings and workflows in PowerCenter 9.5, often without complete STM documents, and to document as-built mappings in standard STM format, as well as design, installation, and migration documentation.
- Developed with Oracle PL/SQL, Informatica, Crystal, Web Reports, and BOBJ Universe Designer on Callidus ICM. Produced functional and technical designs and migration requests. Tested and triaged defects, identified their causes, and corrected the code.
- Developed Master Data Management and Data Governance strategy; performed Data Quality profiling for selected SAP master domains (Material/Customer) using Talend MDM and Open Studio for Data Quality.
Confidential, Hatboro, PA
Informatica Administrator/Technical Lead
Responsibilities:
- Informatica and DAC installation and administration; Oracle 10g and 11g tuning and application DBA work. Organized large (2-4 day) migrations and system restores for an integrated Oracle Applications system involving multiple teams (Oracle DBAs, Unix admins, system admins, applications admins), coordinating with offshore and on-site development.
- Designed and developed complex logic, including OPM Cost Methods, with Informatica and PL/SQL, OBIEE 11.6, and Oracle DAC.
- Designed a complex MDM architecture with Informatica/Siperian MDM Multi-domain, data stewardship rules with IDD/Siperian Business Director, and data cleansing and standardization with MDM and IDQ.
- Delivered the full MDM cycle, from loading landing tables through loading base objects and Xref tables.
- Designed, developed, and implemented complex ETL processes with Informatica 9.1 and Oracle PL/SQL.