Sr. Data Analyst/data Architect Resume
Cincinnati, OH
SUMMARY
- Over 8+ years of extensive experience in Data Analysis, Data Modeling, System Analysis, Data Architecture and Development, Testing and Deployment of business applications
- Strong experience in using Excel and MS Access to dump the data and analyze based on business needs.
- Experience in Designing and implementing data structures and commonly used data business intelligence tools for data analysis.
- Strong experience in Data Migration, Data Cleansing, Transformation, Integration, Data Import, and Data Export.
- Experience and working with data modeling tools like Erwin, Power Designer and ER Studio.
- Profound knowledge of best practices for data architectures
- Experience in developing Map Reduce Programs using Apache Hadoop for analyzing the big data as per the requirement.
- Experience in designing star schema, Snowflake schema for Data Warehouse, ODS architecture.
- Good experience in Data Virtualization tool Denodo Development and implementations.
- Experience in using IBM Optim, Data Masking and Subsetting techniques
- Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems (RDBMS) and from RDBMS to HDFS.
- Experience in data analysis using Hive, Pig Latin, and Impala.
- Well versed in Normalization/De - normalization techniques for optimum performance in relational and dimensional database environments.
- Experience in various Teradata utilities like Fastload, Multiload, BTEQ, and Teradata SQL Assistant.
- Expert in writing SQL queries and optimizing the queries in Oracle, SQL Server and Teradata.
- Excellent Software Development Life Cycle (SDLC) with good working knowledge of testing methodologies, disciplines, tasks, resources and scheduling.
- Excellent knowledge in Data Analysis, Data Validation, Data Cleansing, Data Verification and identifying data mismatch.
- Efficient in enterprise data warehouses using Kimball data warehouse and Inman's methodologies.
- Experienced in generating and documenting Metadata while designing OLTP and OLAP systems environment.
- Knowledge on data storage, security, masking and governance is required.
- Worked on developing data models by breaking down the XML schemas.
- Experience working on creating models for Teradata master data management.
- Experience with DBA tasks involving performance tuning, creation of indexes, creating and modifying table spaces for optimization purposes.
- Experience in Physical modeling for various platforms like Teradata, Oracle, DB2, SQL server.
- Experience in Data Virtualization with Denodo.
- Experience in working with different ETL tool environments like SSIS, Informatica and reporting tool environments like SQL Server Reporting Services (SSRS), COGNOS and Business Objects.
- Proficiency in Normalization to 3NF and Denormalization techniques for optimum performance of database environments like OLTP and OLAP systems.
- Strong Experience in ER and Dimensional Data Modeling to deliver Normalized ER and Star/Snow Flake schemas using Erwin, ER Studio and Oracle designer.
- Working experience in both traditional Waterfall and sprinted Agile Methodologies.
- Good Knowledge on SQL queries, Dynamic-queries, Sub-queries and creating database objects like Stored Procedures, Triggers, Packages, Cursors and Functions using SQL and PL/SQL for implementing the business techniques.
- Good in system analysis, ER Dimensional Modeling, Database design and implementing RDBMS specific features.
- Having good experience with Normalization (1NF, 2NF and 3NF) and De-normalization techniques for improved database performance in OLTP, OLAP, Data Warehouse and Data Mart environments.
- Expert in user interface designing and creating screen-shots using wire frames and comps, and expert in using tools like MS Project and MS Visio.
TECHNICAL SKILLS
Big Data: Hadoop 3.0, HDFS, Hive 2.3, Pig, Hbase 1.2, Sqoop, Flume 1.8.
Data Modeling: Erwin 9.7, ER Studio V17, Sybase PowerDesigner
Databases: Netezza, MS SQL Server2016/2014, Oracle12c/11g, MS Access 2016, IBM DB2.
Languages: SQL, PL/SQL, T-SQL, HTML 5/4, XML, Basic Java and JavaScript
Reporting Tools: Crystal reports, Business Intelligence, SSRS, Business Objects, and Cognos.
Project Execution Methodologies: Ralph Kimball and Bill Inmon methodology, Rational Unified Process (RUP), Agile, Rapid Application Development (RAD), Joint Application Development (JAD)
Operating System: Windows 10/8.1, Linux, Unix
PROFESSIONAL EXPERIENCE
Confidential - Cincinnati, OH
Sr. Data Analyst/Data Architect
Responsibilities:
- Participated in many sessions with business stakeholders to identify Data Cleansing Rules and Transformation Rules to optimize statistical efficiency and Data Quality
- Worked in importing and cleansing of data from various sources like DB2, Oracle, flat files onto SQL Server with high volume data
- Worked on Software Development Life Cycle (SDLC) with good working knowledge of testing, Agile methodology, disciplines, tasks, resources and scheduling.
- Used Python to place data into JSON files for testing Django Websites.
- Worked on Amazon Redshift and AWS and architecting a solution to load data, create data models.
- Created data masking mappings to mask the sensitive data between production and test environment.
- Created custom Denodo views by joining tables from multiple data sources.
- Created scheduled jobs for data extracts and report reloads by Denodo Scheduler.
- Involved in source to target (MDM) Data mapping sessions with IBM as they master the target.
- Developed and implemented automation of the data masking processes
- Used Reverse Engineering to connect to existing database and create graphical representation (E-R diagram).
- Collaborated with Applications Development, Project Managers, QA/Test, Database Administrators, Data Modelers and Data Quality teams to develop data masking solutions.
- Assisted in development of data architecture policies and procedures.
- Created and maintained data model standards, including master data management (MDM) and Involved in extracting the data from various sources like Oracle, SQL, Teradata, and XML.
- Well versed with the use of graphs, charts, Excel and statistical tables to objectively present the analysis results for quick and easy
- Designed and developed high-quality integration solutions by using Denodo virtualization tool.
- Created documents with step by step instruction for data archiving and masking the data.
- Created a high-level industry standard, generalized data model to convert it into logical and physical model at later stages of the project using Erwin and Visio.
- Responsible for defining the naming standards for Data warehouse.
- Maintained and implemented Data Models for Enterprise Data Warehouse using ERWIN.
- Integrated MySQL Workbench and Oracle using JDBC to Denodo.
- Used forward engineering to create a Physical Data Model with DDL that best suits the requirements from the Logical Data Model.
- Worked with Business users for requirements gathering, business analysis and project coordination.
- Used Oracle Data Masking pack for masking Sensitive data across the database.
- Used Python scripts to update content in the database and manipulate files.
- Ensured that data architecture tasks were executed within deadlines.
- Installing and Configuring Virtual Data Port(VDP) Database setup in Denodo.
- Documented business rules and performed source to target mapping of all project data elements in ETL tool.
- Create and maintain Metadata, including table, column definitions.
- Identified the Facts & Dimensions Tables and established the Grain of Fact for Dimensional Models.
- Designed Star and Snowflake Data Models for Enterprise Data Warehouse using ERWIN.
- Created masking rules to be implemented across 8 databases.
- Worked very close with Data Architectures and DBA team to implement Data Model changes in database in all environments. Generated DDL scripts for Database Modifications, Views and set tables.
- Arranged various guiding sessions for Programmers, Engineers, System Analysts and others for clarification of performance requirements, interfaces project capabilities and limitations.
- Extensive experience in PL/SQL programming Stored Procedures, Functions, Packages and Triggers
Environment: DB2, Oracle 12c, SQL Server, MDM, E-R diagram, Teradata r15, XML, Visio, Metadata, Amazon Redshift 1.11, AWS, Erwin 9.7
Confidential - Newport Beach, CA
Sr. Data Analyst/Data Architect
Responsibilities:
- Participated in all phases including Analysis, Design, Coding, Testing and Documentation. Gathered Requirements and performed Business Analysis.
- Applied a dimensional model structure to archive an Agile data model.
- Part of team conducting logical data analysis and data modeling JAD sessions, communicated data-related standards.
- Designed ER diagrams, logical model (relationship, cardinality, attributes, and, candidate keys).
- Generated Python Django Forms to record data of online users.
- Proposed best architecture solutions to meet business requirements.
- Worked on development projects that were based on Data Virtualization concepts and creating web services in Denodo.
- And also designed physical database (capacity planning, object creation and aggregation strategies) as per business requirements.
- Created logical and physical models and ER diagrams using Power Designer modeling tool for the relational and dimensional data modeling.
- Created caching jobs in Denodo different databases like MySQL, Hadoop, Teradata, Excel.
- Conducted JAD Sessions with business analyst and application teams to analyze requirements for data storage for new applications within ESP.
- Developed and maintained data Dictionary to create Metadata Reports for technical and business purpose.
- Created dimensional model for the reporting system by identifying required facts and dimensions using Erwin.
- Worked with Team for designing, documenting and configuring Informatica Data Director for supporting management of MDM data.
- Designed and Developed ETL jobs to extract data from Sales force replica and load it in data mart in AWS Redshift.
- Interacted with users and business analysts to gather requirements.
- Worked Normalization and De-normalization concepts and design methodologies like Ralph Kimball and Bill Inmon approaches and implemented Slowly Changing Dimensions.
- Developed automated data pipelines from various external data sources (web pages, API etc) to internal data warehouse (SQL server).
- Experienced in writing complex SQL queries and optimizing the queries in DB2, Oracle, Netezza, and Teradata etc.
- Developed statistics and visual analysis for warranty data using MS Excel, MS Access.
- Managed database design and implemented a comprehensive Star-Schema with shared dimensions.
- Wrote a Python module to connect and view the status of an Apache Cassandra instance.
- Worked on designing a Star schema for the detailed data marts and plan data marts involving confirmed dimensions.
- Loaded and transformed large sets of structured, semi structured and unstructured data using Hadoop/Big Data concepts.
- Responsible for Big data initiatives and engagement including analysis, brain storming, POC, and architecture.
Environment: ER diagrams, Erwin, MDM, Informatica, Redshift, data pipelines, Oracle, Netezza, Teradata, MS Excel, MS Access, Hadoop, POC, AWS
Confidential - Dayton, OH
Sr. Data Analyst/Data Modeler
Responsibilities:
- Developed a Conceptual model using Erwin based on requirements analysis
- Developed normalized Logical and Physical database models to design OLTP system for insurance applications
- Created dimensional model for the reporting system by identifying required dimensions and facts using Erwin.
- Worked with Python OO Design code for manufacturing quality, monitoring, logging, and debugging code optimization.
- Used forward engineering to create a Physical Data Model with DDL that best suits the requirements from the Logical Data Model
- Designed and Developed Use Cases, Activity Diagrams, and Sequence Diagrams using Unified Modeling Language (UML)
- Involved in the analysis of the existing claims processing system, mapping phase according to functionality and data conversion procedure.
- Performed Normalization of the existing OLTP systems (3rd NF), to speed up the DML statements execution time.
- Data modeling in Erwin; design of target data models for enterprise data warehouse (Teradata)
- Developed the required data warehouse model using Star schema for the generalized model
- Experienced in Oracle installations, upgrades, migration, designing logical/physical architecture, Tuning, Capacity planning, database access and Security and auditing.
- Knowledge of OLAP, Dimensional Data Modeling, Operational Data Store (ODS) Snow-flake modeling for Dimensions Tables using Analysis services and FACT.
- Collaborated with ETL, BI and DBA teams to analyze and provide solutions to data issues and other challenges while implementing the OLAP model.
- Worked for cleansing and organizing various tables in a presentable manner to help with better understanding of already existing models.
- Involved in development and implementation of SSIS, SSRS and SSAS application solutions for various business units across the organization.
- Designed and Developed Oracle, PL/SQL, Procedures, LINUX and UNIX Shell Scripts for data Import/Export and data Conversions.
- Created new reports based on requirements. Responsible in Generating Weekly ad-hoc Reports
- Planned, coordinated, and monitored project levels of per and activities to ensure project completion in time.
- Automated and scheduled recurring reporting processes using UNIX shell scripting and Teradata utilities such as MLOAD, BTEQ and Fast Load and Experience with Perl
- Involved in defining the source to target Data mappings, business rules and Data definitions.
- Performed Data analysis and Data profiling using complex SQL on various sources systems including Oracle and Teradata.
Environment: Erwin, OLTP, Teradata, Oracle, SSIS, SSRS, SSAS, Oracle, PL/SQL, LINUX, UNIX, MLOAD, BTEQ, Fast Load
Confidential - St. Petersburg, FL
Sr. Data Analyst/Data Modeler
Responsibilities:
- Interacted with the end users to understand the business requirement and identified data sources. Involved in regular interactions with Business Analysts and participated in data modeling JAD sessions.
- Involved in Data Architecture, Data profiling, Data analysis, data mapping and Data architecture artifacts design.
- Worked closely with Business analysts, data architects and various teams to understand the requirements and to translate them into appropriate database designs.
- Data Modeler/Analyst in Data Architecture Team and responsible for Conceptual, Logical and Physical model for Supply Chain Project.
- Performed spot check on the data issues coming up while loading data and captured the log data into Excel sheet and share with the business.
- Extracted daily error messages in excel spreadsheets and share with the business
- Performed unit test for the data that has been migrated into different environments.
- Reviewed and generated business rules for data and documented the data flows.
- Used the GIT repository for up-to-date change in queries.
- Created SQL queries to migrate data from source to target.
- Involved in logical and Physical Database design & development, Normalization and Data modeling using Erwin and SQL Server Enterprise manager.
- Analyzed data results from use of business processing software and provides conceptual solutions to systems design work.
- Used Model Mart of Erwin for effective model management of sharing, dividing and reusing model information and design for productivity improvement.
- Created Data Mapping documents which capture the source of data, any business rule to be applied to meet the internal customer needs of data.
- Met with the data modeling team once every week and discuss the standards and other data modeling related issues.
- Extracted data from the databases (Oracle and SQL Server, DB2, FLAT FILES) using Informatica to load it into a single data warehouse repository.
Environment: Erwin 8.5, Excel sheet, excel spreadsheets, GIT, SQL queries, Oracle, DB2, Flat Files, and Informatica
Confidential
Data Analyst/Data Modeler
Responsibilities:
- Worked extensively along with business analysis team, scrum masters in gathering requirements and understanding the workflows of the organization
- Involved in analysis of Business requirement, Design and Development of High level and Low level designs, Unit and Integration testing
- Implemented Relational Model and Dimensional Model for Data Marts and generated DDL scripts Using Erwin tool and have implemented forward and Reverse Engineering.
- Responsible for Metadata Management, keeping up to date centralized metadata repositories using Erwin modeling tools.
- Involved in Data flow analysis, Data modeling, Physical database design, forms design and development, data conversion, performance analysis and tuning.
- Performed Data Analysis and Data Profiling using complex SQL queries on various sources systems including Oracle, Teradata.
- Designed ODS, and Data Vault with expertise in Loan and all types of Cards.
- Extensively used Star Schema methodology for the reporting system by identifying required dimensions and facts and cleansed unwanted tables/columns
- Performed Reverse Engineering of the current application using Erwin, and developed Logical and Physical data models for Central Model consolidation.
- Used SQL for Querying the database in UNIX environment
- Analyzed OLTP source systems and Operational Data Store and research the tables/entities required for the project. Designing the measures, dimensions and facts matrix document for the ease while designing.
- Tuning all database via indexing of tables, MS SQL Server 2005 configuration parameters and stored procedures SQL code optimization.
- Worked in multiple issues raised by different users/ consumers of Data Warehouse and aided them in analyzing and modifying user queries to pull the reports
- Part of team conducting logical data analysis and data modeling JAD sessions, communicated data-related standards.
- Created logical/physical models and conceptual models using Erwin and VISIO
- Collaborate the data mapping document from source to target and the data quality assessments for the source data.
Environment: DDL scripts, Erwin 8.0, Metadata, SQL queries, Oracle 10g, MS SQL Server 2005, ODS, Data Vault, UNIX, OLTP, MS SQL, Visio
