Sr. Data Architect/Modeler Resume
Newark, NJ
SUMMARY:
- 9+ years of experience as Data Architect/Modeler and Data Analyst with high proficiency in requirement gathering and data modeling including design and support of various applications in OLTP, Data Warehousing, OLAP and ETL Environment.
- Skillful in Data Analysis using SQL on Oracle, MS SQL Server, DB2 & Teradata.
- Proficient in System Analysis, ER/Dimensional Data Modeling, Database design and implementing RDBMS specific features.
- Practical understanding of data modeling (Dimensional & Relational) concepts like Star Schema Modeling, Snowflake Schema Modeling, Fact and Dimension tables.
- Experienced in Technical consulting and end-to-end delivery with architecture, data modeling, data governance, and design, development and implementation of solutions.
- Heavy use of Access queries, V-Lookup, formulas, Pivot Tables, etc. Working knowledge of CRM Automation Salesforce.com, SAP.
- Extensive experience in using ETL & Reporting tools like SQL Server Integration Services (SSIS) and SQL Server Reporting Services (SSRS).
- Experienced in integration of various relational and non-relational sources such as DB2, Teradata, Oracle, Netezza, SQL Server, NoSQL, COBOL, XML and Flat Files, to Netezza database.
- Responsible for detailed architectural design, data wrangling and data profiling to ensure data quality of vendor data, and source-to-target mapping.
- Experience in SQL and good knowledge of PL/SQL programming, including developing Stored Procedures and Triggers; also worked with DataStage, DB2, UNIX, Cognos, MDM, Hadoop and Pig.
- Logical and physical database design (Tables, Constraints, Indexes, etc.) using Erwin, ER Studio, TOAD Modeler and SQL Modeler.
- Experience in Big Data, NoSQL Database like Cassandra and technical expertise in Hadoop.
- Data Warehousing: Full life-cycle project leadership, business-driven requirements, capacity planning, gathering, feasibility analysis, enterprise and solution architecture, design, construction, data quality, profiling and cleansing, source-target mapping, gap analysis, data integration/ETL, SOA, ODA, data marts, Inmon/Kimball methodology, Data Modeling for OLTP, canonical modeling, Dimensional Modeling for data warehouse star/snowflake design.
- Good understanding of AWS, big data concepts and Hadoop ecosystem.
- Extensive ETL testing experience using Informatica 9x/8x, Talend, Pentaho.
- Worked on background processes in Oracle architecture. Also drilled down to the lowest levels of systems design and construction.
- Experience in designing Architecture for Modeling a Data warehouse by using tools like Erwin r9.6/r9.5, Sybase Power Designer and E-R Studio.
- Experience in BI/DW solutions (ETL, OLAP, Data mart), Informatica, and BI Reporting tools like Tableau and QlikView; also experienced in leading teams of application, ETL and BI developers and testers.
- Experience in working with the MicroStrategy Security model including Users, Groups, Security Roles and Security filters.
- Experience in Big Data Hadoop Ecosystem in ingestion, storage, querying, processing and analysis of big data
- Experience in Dimensional Data Modeling, Star/Snowflake schema, FACT & Dimension tables.
- Good experience in SFDC related technologies such as Apex, Visualforce, Apex triggers
- Expertise on Relational Data modeling (3NF) and Dimensional data modeling.
- Worked on Informatica Power Center tools-Designer, Repository Manager, Workflow Manager.
- Business Intelligence: Requirements analysis, Key Performance Indicators (KPI), metrics development, sourcing and gap analysis, OLAP concepts and methods, aggregates / materialized views and performance, rapid prototyping, tool selection, semantic layers.
- Excellent experience in writing SQL queries to validate data movement between different layers in data warehouse environment.
TECHNICAL SKILLS:
Database Tools: Microsoft SQL Server 12.0, Teradata 15.0, Oracle 12c/11g/9i and MS Access
Version Tool: VSS, SVN, GIT
BI Tools: Tableau 7.0/8.2, Tableau Server 8.2, Tableau Reader 8.1, SAP Business Objects, Crystal Reports
Packages: Microsoft Office 2010, Microsoft Project 2010, SAP and Microsoft Visio, SharePoint Portal Server
Tools: OBIEE 10g/11g/12c, SAP ECC6 EHP5, GoToMeeting, DocuSign, Insidesales.com, SharePoint, MATLAB
ETL/Data warehouse Tools: Informatica 9.6/9.1/8.6.1/8.1, SAP Business Objects XIR3.1/XIR2, Web Intelligence, Talend, Tableau 8.2
Quality Assurance Tools: WinRunner, LoadRunner, TestDirector, QuickTest Pro, Quality Center, Rational Functional Tester.
Testing and defect tracking Tools: HP/Mercury (Quality Center), QuickTest Professional, Performance Center, Requisite, MS Visio.
AWS: AWS, EC2, S3
Project Execution Methodologies: Agile, Ralph Kimball and Bill Inmon data warehousing methodologies, Rational Unified Process (RUP), Rapid Application Development (RAD), Joint Application Development (JAD)
Operating System: Windows, Unix, Sun Solaris
Data Modeling Tools: Erwin, Rational System Architect, IBM Infosphere Data Architect, ER Studio and Oracle Designer.
PROFESSIONAL EXPERIENCE:
Confidential, Newark, NJ
Sr. Data Architect/Modeler
- Designed and implemented a Data Lake to consolidate data from multiple sources, using Hadoop stack technologies like SQOOP, HIVE/HQL.
- Performed Talend administrative tasks such as upgrades, creating and managing user profiles and projects, managing access, monitoring, and setting up TAC notifications.
- Analyzed metrics surrounding the activities associated with the acquisition, documentation, review, cleaning and processing of data.
- Specified the overall Data Architecture for all areas and domains of the enterprise, including Data Acquisition, ODS, MDM, Data Warehouse, Data Provisioning, ETL, and BI.
- Reverse engineered some of the databases using Erwin.
- Proficient in SQL across a number of dialects (MySQL, PostgreSQL, Redshift, SQL Server and Oracle).
- Worked with MicroStrategy Object Manager to move objects among the Development, Test, and Prod environments.
- Specialized in technology platform migrations for Windows Server, Windows Desktop, Hyper-V, Office 365 and Azure.
- Developed Data Mapping, Data Governance, Transformation, Acquisition and cleansing rules for the Master Data Management Architecture involving OLTP, ODS.
- Worked as a Data Modeler/Architect to generate data models using Erwin and developed relational database systems.
- Served in the Data Architect role, reviewing business requirements and composing source-to-target data mapping documents.
- Researched, evaluated, architected, and deployed new tools, frameworks, and patterns to build sustainable Big Data platforms for clients.
- Designed and developed architecture for data services ecosystem spanning Relational, NoSQL, and Big Data technologies.
- Collected large amounts of log data using Apache Flume and aggregated it using Pig/Hive in HDFS for further analysis.
- Designed the Logical Data Model using Erwin 9.64 with the entities and attributes for each subject area.
- Designed Common Information Model (CIM) using IBM Infosphere Data Architect data modeling tool
- Worked extensively on creating MicroStrategy Application Objects such as Compound Metrics, Level Metrics, complex Filters, Custom Groups, Consolidations, and nested Prompts.
- Worked on multiple SFDC projects as Administrator and Developer.
- Worked with Spark to improve the performance of and optimize existing algorithms in Hadoop using SparkContext, Spark SQL, Spark MLlib, DataFrames, pair RDDs and Spark on YARN.
- Developed U-SQL Scripts for schematizing the data in Azure Data Lake Analytics.
- Experienced in using Talend Data Fabric tools (Talend DI, Talend MDM, Talend DQ, Talend Data Preparation, ESB, TAC)
- Installed Hortonworks Hadoop clusters and supporting packages.
- Gathered and analyzed existing physical data models for in scope applications and proposed the changes to the data models according to the requirements.
- Advised on and enforced data governance to improve the quality/integrity of data and oversight of the collection and management of operational data.
- Designed Physical Data Model (PDM) using IBM Infosphere Data Architect data modeling tool and ORACLE PL/SQL
- Used the DataStax Spark Cassandra Connector to load data to and from Cassandra.
- Designed architecture for API development and deployment as microservices, including Python code in Docker containers and Azure.
- Designed reference data and data quality rules using IDQ and cleaned data in an Informatica Data Quality 9.1 environment.
- Integrated Crystal Reports using Erwin Data Modeler.
- Used Erwin to support Teradata v15 and SSL.
- Designed and developed Referential Integrity, Technical and Business Data Quality rules using IDQ and cleaned data using IDQ in Informatica Data Quality.
- Performed data modeling; designed, implemented, and deployed high-performance custom applications at scale on Hadoop/Spark.
- Loaded data into Hive Tables from Hadoop Distributed File System (HDFS) to provide SQL-like access on Hadoop data.
- Developed and implemented data cleansing, data security, data profiling and data monitoring processes.
- Applied data analysis, data mining and data engineering to present data clearly.
- Developed a long-term data warehouse roadmap and architecture, and designed and built the data warehouse framework per the roadmap.
- Involved in designing Logical and Physical data models for different database applications using Erwin.
- Migrated data from MS Excel / CSV files to SFDC using Data Loader.
- Worked with Cloudera, AWS, and Hortonworks
- Experience with AWS ecosystem (EC2, S3, RDS, Redshift).
- Wrote Python scripts and mappers to run on the Hadoop Distributed File System (HDFS).
- Advised on and led projects involving ETL-related activities and the migration or conversion of data between enterprise data systems. Coordinated interactions between central IT, business units, and data stewards to achieve desired organizational outcomes.
Environment: Oracle 12c, MS-Office, SQL Architect, TOAD Benchmark Factory, Teradata v15, SQL Loader, SFDC, Big Data, SharePoint, Erwin r9.64, MicroStrategy, DB2, SQL Server 2008/2012, AWS.
Confidential, Bronx, NY
Sr. Data Analyst/Modeler
- Worked on importing and cleansing high-volume data from various sources like Teradata, Oracle, flat files and MS SQL Server.
- Designed Logical & Physical Data Model /Metadata/ data dictionary using Erwin for both OLTP and OLAP based systems.
- Reverse Engineered DB2 databases and then forward engineered them to Teradata using ER Studio.
- Was part of the team conducting logical data analysis and data modeling JAD sessions; communicated data-related standards.
- Involved in meetings with SMEs (subject matter experts) for analyzing the multiple sources.
- Developed the logical data models and physical data models that capture current state/future state data elements and data flows using ER Studio.
- Delivered dimensional data models using ER/Studio to bring the Employee and Facilities domain data into the Oracle data warehouse.
- Troubleshot, fixed and deployed many Python bug fixes for the two main applications that were a primary source of data for both customers and the internal customer service team.
- Worked on a Linux system (Red Hat) to deploy the Talend code.
- Wrote and executed SQL queries to verify that data has been moved from transactional system to DSS, Data warehouse, data mart reporting system in accordance with requirements.
- Worked on importing and cleansing high-volume data from various sources like Teradata, Oracle, flat files and SQL Server 2005.
- Worked extensively on ER Studio for multiple Operations across Atlas Copco in both OLAP and OLTP applications.
- Generated comprehensive analytical reports by running SQL queries against current databases to conduct data analysis.
- Produced PL/SQL statements and stored procedures in DB2 for extracting as well as writing data.
- Coordinated all teams to centralize metadata management updates and follow the naming and attribute standards for data and ETL jobs.
- Finalized the naming standards for Data Elements and ETL Jobs and created a Data Dictionary for Metadata Management.
- Developed the design and process flow to ensure that the process is repeatable.
- Performed analysis of the existing source systems (Transaction database)
- Involved in maintaining and updating Metadata Repository with details on the nature and use of applications/data transformations to facilitate impact analysis.
- Created DDL scripts using ER Studio and source to target mappings to bring the data from source to the warehouse.
- Designed the ER diagrams, logical model (relationships, cardinality, attributes, and candidate keys) and physical database (capacity planning, object creation and aggregation strategies) for Oracle and Teradata.
- Involved in writing and optimizing SQL queries in Teradata.
- Identified, assessed and communicated potential risks associated with testing scope, product quality and schedule.
Environment: ER Studio, Business Objects XI, Rational Rose, DataStage, MS Office, MS Visio, SQL, SQL Server 2000/2005, Crystal Reports 9, SQL Server 2008, SQL Server Analysis Services, SSIS, Oracle 10g
Confidential, Miami Lakes, Florida
Data Analyst/Modeler
- Worked on the data mapping process from source system to target system. Created a dimensional model for the reporting system by identifying required facts and dimensions using Erwin.
- Developed enhancements to MongoDB architecture to improve performance and scalability.
- Forward engineered data models, reverse engineered existing data models and updated the data models.
- Performed data cleaning and data manipulation activities using NZSQL utility.
- Analyzed the business requirements by dividing them into subject areas and understood the data flow within the organization
- Generated a separate MRM document with each assignment and shared it on SharePoint along with the PDF of updated data models.
- Created a Data Mapping document after each assignment and wrote the transformation rules for each field as applicable
- Worked on Unit Testing for three reports and created SQL Test Scripts for each report as required
- Extensively used Erwin as the main tool for modeling along with Visio
- Established and maintained comprehensive data model documentation including detailed descriptions of business entities, attributes, and data relationships.
- Worked on the Metadata Repository (MRM) to keep the definitions and mapping rules up to the mark.
- Developed data marts for the base data in Star Schema and Snowflake Schema, and was involved in developing the data warehouse for the database.
- Designed Logical Data Models and Physical Data Models using Erwin.
- Developed the Conceptual Data Models and Logical Data Models and transformed them into schemas using Erwin.
- Created a list of domains in Erwin and worked on building up the data dictionary for the company
- Created DDL scripts for implementing Data Modeling changes. Created Erwin reports in HTML and RTF formats depending upon the requirement, published data models in Model Mart, created naming convention files, and coordinated with DBAs to apply the data model changes.
- Analyzed the physical data model to understand the relationship between existing tables. Cleansed the unwanted tables and columns as per the requirements as part of Data Analyst duties.
- Worked very closely with the Data Architects and DBA team to implement data model changes in the database in all environments.
Environment: Oracle Data Modeler, Teradata 12, SSIS, Business Objects, Erwin r8.2, Oracle SQL Developer, SQL Server 2008, ER/Studio, Windows XP, MS Excel.
Confidential
Data Analyst/Modeler
- Designed Star and Snowflake Data Models for Enterprise Data Warehouse using ERWIN
- Created and maintained the Logical Data Model (LDM) for the project, including documentation of all entities, attributes, data relationships, primary and foreign key structures, allowed values, codes, business rules, glossary terms, etc.
- Improved SQL query performance using explain plans, hints and indexes for tuning; created DDL scripts for the database. Created PL/SQL Procedures and Triggers.
- Validated and updated the appropriate LDMs to process mappings, screen designs, use cases, business object model, and system object model as they evolve and change.
- Worked with Business users during requirements gathering and prepared Conceptual, Logical and Physical Data Models.
- Created conceptual, logical and physical data models using best practices and company standards to ensure high data quality and reduced redundancy.
- Wrote PL/SQL statements, stored procedures and Triggers in DB2 for extracting as well as writing data.
- Attended and participated in information and requirements gathering sessions
- Translated business requirements into working logical and physical data models for Data warehouse, Data marts and OLAP applications.
- Performed extensive Data Analysis and Data Validation on Teradata.
- Responsible for the development and maintenance of Logical and Physical data models, along with corresponding metadata, to support Applications.
- Created business requirement documents and integrated the requirements and underlying platform functionality.
- Excellent knowledge and experience in Technical Design and Documentation.
- Used forward engineering to create a physical data model with DDL that best suits the requirements from the Logical Data Model.
- Worked with the DBA to convert logical Data models to physical Data models for implementation.
- Involved in preparing the design flow for the DataStage objects to pull data from various upstream applications, apply the required transformations and load the data into various downstream applications.
- Performed logical data modeling, physical data modeling (including reverse engineering) using the Erwin Data Modeling tool.
- Experience in developing dashboards and client specific tools in Microsoft Excel and Power Point.
Environment: ER/Studio 6.0/6.5, Toad 8.6, Informatica 8.0, IBM OS 390 (V6.0), DB2 V7.1, Oracle 9i, PL/SQL, Solaris 9/10, Windows Server 2003 & 2008, NZSQL