Sr. Data Architect/ Data Modeler Resume
Dorchester, MA
SUMMARY
- Over 7 years of Industry experienced in IT with solid understanding of Data Architecture, Data Modeling, Data Analysis, Evaluating Data Sources and strong understanding of Data Warehouse/Data Mart Design, BI, OLAP, OLTP, Client/Server applications.
- Experience as Architect UML models and leverage the advanced executable code generators to target different domains.
- Strong Knowledge of Data Warehouse Architecture and Star Schema, Snow flake Schema, FACT and Dimensional Tables.
- Strong Experience in Big Data Hadoop Ecosystem in ingestion, storage, querying, processing and analysis of big data.
- Experience in Dimensional Data Modeling, Star/Snowflake schema, FACT & Dimension tables.
- Experience with emerging technologies such Big Data, Hadoop, and NoSQL.
- Strong experience in analyzing/ Data Transformation of large amounts of data sets writing Pig scripts and Hive, AWS EMR, AWS RDS. Extensive knowledge in Hadoop stack components viz. Apache Hive, Pig Scripting, etc.
- Experience in analyzing data using Hadoop Ecosystem including HDFS, Hive, HBase, Zookeeper, PIG, Sqoop, Flume.
- Hands on experience in Normalization (1NF, 2NF, 3NF and BCNF) Denormalization techniques for TEMPeffective and optimum performance in OLTP and OLAP environments.
- Experience in cloud development architecture on Amazon AWS, EC2, EC3, Elastic Search, Redshift and Basic on Azure.
- Experience in BI/DW solution (ETL, OLAP, Data mart), Informatica, BI Reporting tool like Tableau and QlikView and also experienced leading the team of application, ETL, BI developers, Testing team.
- Good experience in working with different ETL tool environments like SSIS, Informatica and reporting tool environments like SQL Server Reporting Services (SSRS), Cognos and Business Objects.
- Proficient in UML Modeling like Use Case Diagrams, Activity Diagrams, and Sequence Diagrams with Rational Rose and MS Visio.
- Experienced in Technical consulting and end - to-end delivery with architecture, data modeling, data governance and design - development - implementation of solutions
- Experience in writing expressions in SSRS and Expert in fine tuning the reports. Created many Drill through and Drill Down reports using SSRS.
- Solid knowledge of Data Marts, Operational Data Store (ODS), OLAP, Dimensional Data Modeling with Ralph Kimball Methodology (Star Schema Modeling, Snow-Flake Modeling for FACT and Dimensions Tables) using Analysis Services.
- rganizing data as per the business requirements using Erwin, ER Studio in both OLTP and OLAP applications.
- Expertise lies in Data Modeling, Database design and implementation of Oracle, AWS Redshift databases and Administration, Performance tuning etc.
- Excellent experience in troubleshooting SQL queries, ETL jobs, data warehouse/data mart/data store models. Practical understanding of the Data modeling (Dimensional & Relational) concepts like Star-Schema Modeling, Snowflake Schema Modeling, Fact and Dimension tables. Worked on data modeling using ERWIN tool to build logical and physical models.
- Skillful in Data Analysis using SQL on Oracle, MS SQL Server, DB2 & Teradata.
- Extensive experience in development of T-SQL, Oracle PL/SQL Scripts, Stored Procedures and Triggers for business logic implementation.
- Strong experience with architecting highly per formant databases using PostgreSQL, PostGIS, MySQL and Cassandra.
- Decode the Teradata and SQL queries to find all the data attributes involved and document for the purpose of development.
- Excellent understanding of Hub Architecture Style for MDM hubs the registry, repository and hybrid approach.
- Mapping the Risk Data elements to the Authoritative Data Source and documenting the Schema, Database, Table details for data modelling purpose.
- Good exposure on usage of NoSQL database.
- Experienced in understanding the ETL framework metadata to understand the current state ETL implementation.
TECHNICAL SKILLS
Data Modeling Tools: IBM Info sphere Data Architect, ER Studio, Oracle Designer, Erwin R6/R9, Rational System Architect.
Reporting Tools: SSRS, Power BI, SSAS, MS-Excel, BI Platform.
Big Data: HBase, PIG, Hive, Hadoop
Cloud Platforms: Azure, AWS EMR, AWS RDS, EC2, S3.
Database Tools: Oracle 12c/11G/10g/9i, Microsoft SQL Server12.0, Teradata 15.0, and MS Access
BI Tools: SAP Business Objects, Tableau 7.0/8.2, Tableau server 8.2, Tableau Reader 8.1, Crystal Reports.
Tools & Utilities: TOAD 9.6, Microsoft Visio 2010.
Methodologies: RAD, JAD, RUP, UML, System Development Life Cycle (SDLC), Waterfall Model.
Packages: SAP and Microsoft Visio, Share point, Microsoft Office 2010, Microsoft Project 2010.
Operating Systems: Windows, Centos, Sun Solaris, UNIX, Ubuntu Linux
Version Tool: VSS, SVN, CVS, SAP BO 4.1
PROFESSIONAL EXPERIENCE
Confidential, Dorchester, MA
Sr. Data Architect/ Data Modeler
Responsibilities
- Massively involved in Data Architect role to review business requirement and compose source to target data mapping documents.
- Owned and managed all changes to the data models. Created data models, solution designs and data architecture documentation for complex information systems.
- Working as a Sr. Data Architect/Modeler to generate Data Models using Erwin r9.64 and developed relational database system.
- Interacted with ETL Team to understand Ingestion of data from ETL to Azure Data Lake to develop Predictive analytics
- As a Architect implement MDM hub to provide clean, consistent data for a SOA implementation
- Used SSAS to create OLAP solutions by creating cubes from various data sources.
- Generated DDL (Data Definition Language) scripts using Erwin and supported the DBA in Physical Implementation of data Models.
- Translated the business requirements into workable functional and non-functional requirements at detailed production level using Workflow Diagrams, Sequence Diagrams, Activity Diagrams and Use Case Modelling
- Created Azure Blob storage accounts via poweshell or azure storage explorer for storing the different types of data like Blob storage, Table storage and File storage.
- Designed the Logical Data Model using ERWIN 9.64 with the entities and attributes for each subject areas.
- Used Tableau for BI Reporting and Data Analysis.
- Developed Data Mapping, Data Governance, and Transformation and cleansing rules for the Master Data Management Architecture involving OLTP, ODS.
- Used SSRS to create reports, customized Reports, on-demand reports, ad-hoc reports and involved in analyzing multi-dimensional reports in SSRS.
- Create data integration and technical solutions for Azure Data Lake for providing analytics and reports for improving marketing strategies.
- Used data vault modeling method which was adaptable to the needs of dis project.
- Design and developed architecture for data services ecosystem spanning Relational, NoSQL, and Big Data technologies.
- Used SSDT to model using SSAS.
- Develop and maintain data architecture, including master data and data quality, using Toad Data Modeler and Microsoft Master Data Manager (MDS) as well as Oracle Data Integrator.
- Configured Hunk to read customer transaction data from Hadoop Ecosystems such as HDFS and Hive
- Used Flume extensively in gathering and moving log data files from Application Servers to a central location in Hadoop Distributed File System (HDFS) for data science.
- Involved in Normalization / Denormalization techniques for optimum performance in relational and dimensional database environments.
- Working with project management, business teams and departments to assess and refine requirements to design/develop BI solutions using Azure.
- Designed new reports and wrote technical documentation. Analyzed and visualized the data by power BI, developed and built SSRS, power BI reports and dashboard.
- Translate business and data requirements into Logical data models in support of Enterprise Data Models, ODS, OLAP, OLTP, Operational Data Structures and Analytical systems.
- Designed both 3NF data models for ODS, OLTP systems and dimensional data models using Star and Snow flake Schemas.
- Designed and developed architecture for data services ecosystem spanning Relational, NoSQL, and Big Data technologies.
- Collected large amounts of log data using Apache and aggregating using PIG/HIVE in HDFS for further analysis.
- Created Logical and Physical Data Model using IBM Data Architect tool.
- Specifies overall Data Architecture for all areas and domains of the enterprise, including Data Acquisition, ODS, MDM, Data Warehouse, Data Provisioning and BI.
- Loaded data into Hive Tables from Hadoop Distributed File System (HDFS) to provide SQL-like access on Hadoop data.
- Designing Star Denormalized tables on Azure.
- Participated in OLAP model based on Dimension and FACTS for efficient loads of data based on Star Schema structure on levels of reports using multi-dimensional models such as Star Schemas and Snowflake Schema.
- Utilize U-SQL for data analytics/data ingestion of raw data in Azure and Blob storage
- Developed and implemented data cleansing, data security, data profiling and data monitoring processes.
- Responsible for Dimensional Data Modeling and Modeling Diagrams using ERWIN.
- Applied data analysis, data mining and data engineering to present data clearly.
- Demonstrated expertise utilizing SQL Server Integration Services (SSIS), Data Transformation Services (DTS), and Data Stage and ETL package design, and RDBM systems like SQL Servers, Oracle, and DB2.
- Review and Patch of Netezza and Oracle environments including DB2, OS and Server firmware.
- Extensively used Crystal Reports SAP SE 14.2 for Data Reporting.
- Used Teradata Administrator and Teradata Manager Tools for monitoring and control the system.
- Developed and configured on Informatica MDM hub supports the Master Data Management (MDM), Business Intelligence (BI) and Data Warehousing platforms to meet business needs.
- Developed PL/SQL scripts to validate and load data into interface tables
- Participated in maintaining data integrity between Oracle and SQL databases.
Environment: Oracle 12c, MS-Office, SQL Architect, SSIS, Teradata v15, Hadoop, SQL Loader, ERwin r 9.64, DB2, MS-Office, SQL Server 2008/2012, SSRS, Azure, Azure Data Lake, Azure Blob, HBase, Hive.
Confidential, Bronx, NY
Sr. Data Analyst / Data Modeler
Responsibilities
- Designed and build relational database models and defines data requirements to meet the business requirements.
- Involved in developing Database Design Document including Data Model Conceptual, Logical and Physical Models using ER studio.
- Developing the Conceptual Data Models, Logical data models and transformed them to creating schema using ER Studio.
- Designing normalized and star schema data architectures using ER Studio and forward engineering these structures into Teradata.
- Working on Amazon Redshift and AWS and architecting a solution to load data, create data models and run BI on it.
- Actively involved in the Design and development of the Star schema data model.
- Implemented slowly changing and rapidly changing dimension methodologies; created aggregate fact tables for the creation of ad-hoc reports.
- Enforced referential integrity in the OLTPdatamodel for consistent relationship between tables and efficient database design.
- Used Star Schema and Snowflake Schema methodologies in building and designing the LogicalDataModel into Dimensional Models.
- Created and maintained surrogate keys on the master tables to handle SCD type 2 changes TEMPeffectively.
- Analyzed existing SSIS package, make changes to improve its performances, add standard logging and configuration system.
- Designed and implemented a Data Lake to consolidate data from multiple sources, using Hadoop stack technologies like SQOOP, HIVE/HQL.
- Created complex stored procedures to generate reports using SSRS and Extract Data.
- Written complex SQL queries for validating the data against different kinds of reports generated by Business Objects XIR2.
- Developed and deployed Data Warehouse infrastructure using SSIS for Data integration and SSRS to automate reports generations
- Implemented well-documented, well-architected, high-quality code of all enterprise applications AWS S3 environment
- Designing Logical data models and Physical Data Models using ER Studio.
- Conducted numerous POCs to efficiently import large data sets into the database from AWS S3 Bucket.
- Designed semantic layer data model. Conducted performance optimization for BI infrastructure.
- Involved in the creation, maintenance of Data Warehouse and repositories containing Metadata.
- Performed Hive programming for applications dat were migrated to big data using Hadoop.
- As an Architect implement MDM hub to provide clean, consistent data for a SOA implementation.
- Installing and configuring the a 3-node Cluster in AWS Linux Servers.
- Designed different type of STAR schemas like detailed data marts and Plan data marts, Monthly Summary data marts using ER studio with various Dimensions Like Time, Services, Customers and various FACT Tables.
- Developed and maintained data dictionary to create metadata reports for technical and business purpose.
- Extensive Data validation by writing several complex SQL queries and Involved in back-end testing and worked with data quality issues.
- Data Profiling, Mapping and Integration from multiple sources to AWS S3.
- Worked on AWS Redshift and RDS for implementing models and data on RDS and Redshift.
- Identify source systems, their connectivity, related tables and fields and ensure data suitably for mapping.
- Worked with BTEQ to submit SQL statements, import and export data, and generate reports in Teradata.
- Designed and documented Use Cases, Activity Diagrams, Sequence Diagrams, OOD (Object Oriented Design) using UML and Visio.
- Performed data cleaning and data manipulation activities using NOSQL utility.
- Designed and Developed Oracle PL/SQL Procedures and UNIX Shell Scripts for Data Import/Export and Data Conversions.
Environment: ER Studio, Oracle 11g, MS-Office, SQL Architect, Hadoop, Hive, Pig, TOAD Benchmark Factory, Sqoop, SQL Loader, AWS S3, AWS RDS, PL/SQL, DB2, SharePoint, MS-Office, SQL Server 2014
Confidential - Chicago, IL
Sr. Data Analyst
Responsibilities
- Analyzed the business requirements by dividing them into subject areas and understood the data flow within the organization
- Attended and participates in information and requirements gathering sessions.
- Designed and developed stored procedures, queries and views necessary to support SSRS reports.
- Load data from MS Access database to SQL Server using SSIS (creating staging tables and then loading the data).
- Performed Data Analysis and Data validation by writing SQL queries and Regular expressions.
- Created reports using SQL Server Reporting Services (SSRS).
- Wrote standard SQL Queries to perform data validation and created excel summary reports (Pivot tables and Charts).
- Database Design (Conceptual, Logical and Physical) for OLTP and OLAP systems.
- Created several types of reports using SSRS like parameterized, drill down and drill through reports.
- Creating complex SQL queries and scripts to extract and aggregate Data to validate the accuracy of the data.
- Worked on developing the tool to extract the data from db2 database and conducted MetaData and Data analysis.
- Created documents for technical & business user requirements during requirements gathering sessions.
- Turned SQL queries to make use of data base indexes, and analyzed the data base objects.
- Created Logical and Physical EDW models and data marts.
- Developed SSIS Packages to transfer the data between SQL Server database and files.
- Experienced in data migration and cleansing rules for the integrated architecture (OLTP, ODS, DW).
- Created SQL-Loader scripts to load legacy data into Oracle staging tables and wrote SQL queries to perform Data Validation and Data Integrity testing.
- Performed drill down analysis reports using SQL Server Reporting Services.
- Managed all indexing, debugging and query optimization techniques for performance tuning using T-SQL.
- Developed the logical and physical model from the conceptual model developed using a tool Erwin by understanding and analyzing business requirements.
- Normalized the database up to 3NF to put them into the star schema of the data warehouse.
- Worked with BTEQ to submit SQL statements, import and export data, and generate reports in Teradata.
- Handled data loading operations from flat files to tables using NZLOAD utility.
- Experienced in data cleansing for accurate reporting. Thoroughly analyzed the data and integrated different data sources to process matching functions.
- Applied data naming standards, created the data dictionary and documented data model translation decisions and also maintained DW metadata.
- Created DDL scripts for implementing Data Modeling changes. Created ERWIN reports in HTML, RTF format depending upon the requirement, Published Data model in model mart, created naming convention files, co-coordinated with DBAs' to apply the data model changes.
- Extensively used Normalization techniques (up to 3NF).
- Writing complex queries using Teradata SQL.
- Worked with the ETL team to document the transformation rules for data migration from source to target systems.
Environment: MS Visio, Business Objects, Informatica, ERWIN r7.2, PL/SQL, SSRS, SSIS, MS SQL, Windows NT, Linux, Sybase Power Designer, Oracle 9i, SQL Server, Windows, MS Excel, MS Access.
Confidential
Data Analyst
Responsibilities
- Responsibilities included source system analysis, data transformation, loading, validation for data marts, operational data store and data warehouse.
- PerformedData Analysis, Data Migration and data profiling using complex SQL on various sources systems including Oracle andTeradata.
- Generated comprehensive analytical reports by running SQL queries against current databases to conductData Analysis.
- Developed database objects such as SSIS Packages, Tables, Triggers, and Indexes using T-SQL.
- Worked in Data Analysis, data profiling and data governance identifying Data Sets, Source Data, Source Meta Data, Data Definitions and Data Formats.
- Extracted and analyzed data to provide data-driven product recommendations.
- Designed and implemented business intelligence to support sales and operations functions to increase customer satisfaction
- Developed Data Mapping, Data Governance, Transformation and Cleansing rules for the Master Data Management Architecture involving OLTP, ODS and OLAP.
- Worked in importing and cleansing of data from various sources like Teradata, Oracle, flat files, SQL Server with high volume data.
- Built reports using SSRS. Developed efficient reporting solutions. Set up subscriptions as needed for SSRS.
- Used SQL queries for Data verification and validation
- Created Drill Down, Drill Through, Sub and Linked reports using the SSRS as well as managed the subscription and authentication of these reports.
- Involved with all the phases of Software Development Life Cycle (SDLC) methodologies throughout the project life cycle.
- Created logical data model from the conceptual model and it's conversion into the physical database design using Erwin.
- Analyzed the data and provide resolution by writing analytical/complex SQL in case of data discrepancies.
- Tuning and code optimization using different techniques like dynamic SQL, dynamic cursors, and tuning SQL queries, writing generic procedures, functions and packages.
- Responsible for Relational data modeling (OLTP) using MS Visio (Logical, Physical and Conceptual).
- Create and execute test scripts, cases, and scenarios dat will determine optimal system performance according to specifications.
- Reverse Engineered DB2 databases and then forward engineered them to Teradata using Erwin.
- Tested the database to check field size validation, check constraints, stored procedures and cross verifying the field size defined within the application with metadata.
- Extensively worked on development of mappings with BODS Transformations like Map Operation, Table Comparison, History Preserving, Key Generation, Pivot, Reverse Pivot etc.
- Responsible for design of logical and physical Data model for client's investment management ODS using dimensional modeling.
Environment: Oracle 8i, Developer 2000 with Forms 5.0 and Reports 3.0, O, SSIS, SSRS, PL/SQL, MS-Access, Sql Server, MS Office, MS Visio, Informatica Power center 5.1, Erwin, Teradata SQL Assistant.
