Data Architect Resume
Wilmington, DE
SUMMARY:
- 15 years of experience in Information Technology with expertise in Data Modeling, Data Migration, Data Warehouse, Hadoop and Data Lake solutions for large programs.
- 2+ years of experience in the Big Data space implementing end-to-end Hadoop solutions.
- Experience in implementing Hadoop ecosystem components such as HDFS, MapReduce, Hive, Impala, Sqoop, Kafka and Hue.
- Hands-on experience in importing and exporting data from different sources such as mainframe and Oracle into Hadoop using data ingestion tools such as Talend ETL, Sqoop and Kafka.
- Expertise in working with Hive/Impala (Parquet) tables, distributing data through partitioning and bucketing, and writing and optimizing HiveQL queries.
- Expertise in Conceptual/Logical/Physical Data Modelling, Enterprise Data Warehouse Design, Datamart Design, Metadata, Data Quality, Master Data Management.
- Strong knowledge of Data Warehousing, Data Modeling, Data Migration, Oracle PL/SQL, Informatica (ETL) and Teradata utilities (BTEQ, FASTLOAD, FASTEXPORT, MULTILOAD, SQL Assistant, PMon).
- In-depth knowledge of development and design with RDBMS (OLTP) and dimensional modelling, using the data modelling tool ERWIN and Informatica/Talend ETL.
- Extensive experience designing workflows and mappings with the ETL tool Informatica Power Center 9.0/8.6.1/x.
- Domain knowledge in Banking, Insurance, Telecom Transformation Project, ERP and Reference Data/Capital Market.
- Currently working with Hadoop, Oracle 9i/10g (SQL, PL/SQL), Talend and Erwin.
- Expertise in Database Performance Tuning, Performance Monitoring and Optimization using Oracle hints and explain plans.
- An energetic, self-motivated team leader with hands-on experience in programming, client-server infrastructure, requirements gathering, application integration and customization.
- Collaborate with client business leaders, executives, SMEs, information technology teams and business users on full life cycle engagements.
- Hands-on experience across all stages of Software Development Life Cycle (SDLC) including business requirement analysis, data mapping, build, unit testing, systems integration and user acceptance testing.
- Good exposure to maintenance and production support environments.
TECHNICAL SKILLS:
Hadoop/Big Data: MapReduce, HDFS, Hive, Impala, Kafka, Sqoop, Hue
Operating System: Unix, Linux, Win XP/2000, Win NT/5.0/4.0, Win 9x
RDBMS: Oracle 10g/9x/8x, Teradata
ETL: Informatica, Talend
Data Modeling: Relational and Dimensional Modeling - Erwin r7.1/7.2
Reporting: Crystal Report, BO, Cognos
Languages: Oracle SQL, PL/SQL
Tools/Utilities/Features: TOAD, SQL Developer, SQL*Loader, Microsoft VSS, CVS, PVCS
Functional: BANKING, INSURANCE, TELECOM, ERP and Reference Data/Capital Market
PROFESSIONAL EXPERIENCE:
Confidential, Wilmington, DE
Data Architect
Environment: Cloudera Hadoop, Oracle 10g (PL/SQL), Talend, Data Modelling (Erwin), Cognos
Responsibilities:
- Ingested data into Hadoop/HDFS from different data sources using Talend ETL and Sqoop, and Kafka for real-time switch data.
- Created a data lake that holds a large amount of raw data for use by other applications.
- Designed generic Talend ETL mappings to load data into Hive/Impala (Parquet) tables and partitioned tables.
- Worked with Talend Kafka to ingest real-time data streams and push the data to the appropriate HDFS location.
- Created a logging and reconciliation process to track load failures and successes.
- Designed the Conceptual, Logical and Physical models (FACT and Dimensional model) for the Analytical system in the Hadoop ecosystem.
- Developed a framework to ingest one-time and incremental loads of historical data from Oracle and store them in Hive.
Confidential, New Jersey
Data Architect, Onsite-Coordinator
Environment: Oracle 10g (PL/SQL), Informatica 8.6, Data Modelling (Erwin), Cloudera Hadoop
Responsibilities:
- Created conceptual, logical and physical data models; deployed, tested and moved them to Production for Confidential.
- Designed ETL mappings from business requirements, translating all business rules into mapping rules to extract data from source systems, and prepared the Detailed Solution Design document.
- Performed data analysis of the source system and designed the staging area and model area to load the source data.
- Performed ETL Solution Design and Architecture, ensuring a fit with the overall ETL/Informatica architecture and conformance with Velocity standards and best practices.
- Co-ordination between the offshore and the onsite team.
- Performed data analysis of the source system, designed the landing area for migration and built ETL mappings.
- Developed Oracle SQL queries according to the transformation rules and tuned their performance.
- Implemented the Hadoop ecosystem to handle huge volumes of data for one of the major applications (Revenue master), using Sqoop to load the data from Oracle to HDFS.
- Used Impala (connector) as the query tool to connect to HDFS from the reporting tool (Tableau).
Confidential
Data Architect, Onsite Coordinator
Environment: Oracle 9i (PL/SQL), Informatica 9.1, CRM and AMDOCS, Erwin
Responsibilities:
- Co-ordination between the offshore and the onsite team.
- Prepared the data mapping sheet (S2T, Source to Target) to extract data from the legacy system, and the Detailed Solution Design document.
- Worked on cross-functional integration between CRM, AMDOCS and IPT (Integrated Product Transformation).
- Designed the system to transform/migrate customer assets and billing services to the new product definition.
- Worked on the Product Rationalization and Transformation framework to migrate product data into the new system.
- Performed data analysis of the source system, designed the landing area for migration and built ETL mappings.
- Developed Oracle SQL queries according to the transformation rules and tuned their performance.
- Involved in unit testing, data validation and verification of the code and code reviews.
Confidential
Technical Lead
Environment: Oracle 9i (PL/SQL), Informatica 9.0.1, Teradata and UNIX
Responsibilities:
- Requirement gathering and coordination between onsite and offshore teams.
- Performed data analysis of the source system, designed the staging area (Oracle) for migration and built ETL mappings.
- Involved in preparation of Detailed Design Solutions and of the data mapping sheet to extract data from the source (Teradata).
- Developed Teradata SQL queries according to the transformation rules and tuned their performance.
- Developed Informatica ETL jobs to extract data from Teradata to flat files and to load the extracted data from flat files to Oracle.
- Involved in vendor testing, data validation, verification of the code and code reviews.
Confidential
Technical Lead
Environment: Teradata, UNIX, QC
Responsibilities:
- Involved in understanding the functionality and preparing Solution Design documents.
- Created, optimized, reviewed, and executed Teradata SQL test queries to validate transformation rules.
- Analysis and design of complex enhancements requested by the customer. Design, development and enhancements using Teradata macros.
- Developed complex SQL queries using various joins and developed various dynamic SQL statements in Teradata SQL Assistant throughout the projects.
- Involved in performance tuning of complex queries.
- Optimized the database to improve response time and overall system performance.
- Test case/data preparation, execution and verification of the test results.
Confidential
Technical Lead
Environment: Oracle 9i (PL/SQL), SQL*Loader, TOAD, WebLogic, Clarify, UNIX
Responsibilities:
- Involved in understanding the functionality and preparing Level of Estimation (LOE) and Low Level Design documents for RFCs (Requests for Change), covering change management, impact analysis, fine tuning, additional functionality, etc.
- The major part of my team's work included constant analysis, interface design (using Oracle enqueueing and dequeueing functionality), responding to customer queries and to critical/severe business problems (such as performance issues and service outages), and escalating appropriately based on criticality and feasibility.
- Developed complex SQL queries using various joins and developed various dynamic SQL statements.
- Involved in performance tuning of complex queries.
- Test case/data preparation, execution and verification of the test results.
Confidential
Technical Consultant
Environment: Oracle 9i (PL/SQL), Oracle Designer, JDK1.4, TOAD, UNIX
