Sr. Data Architect/ Data Modeler Resume
Eden Prairie, MN
PROFESSIONAL SUMMARY:
- 9+ years of total IT experience and expertise in data modeling for data warehouse/data mart development, Data Architecture, Data Analysis and Business Intelligence (BI) applications.
- Experience in developing MapReduce programs using Apache Hadoop to analyze big data as per requirements.
- Extensive experience with various data processing platforms and languages including Apache Spark (Scala), Apache Drill, Python, Oracle PL/SQL, and PostgreSQL PL/pgSQL.
- Experience in data analysis using Hive, Pig Latin, Impala.
- Extensive experience in shell scripting and in scripting languages such as Python, Perl and Ruby.
- Strong experience with architecting highly performant databases using PostgreSQL, PostGIS, MySQL and Cassandra.
- Well versed in Normalization/Denormalization techniques for optimum performance in relational and dimensional database environments.
- Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems (RDBMS) and from RDBMS to HDFS.
- Good experience in machine learning algorithms for Kaggle data sets.
- Experience with JAD sessions for requirements gathering, creating Data Mapping documents, and writing functional specifications and queries.
- Experience in working with Business Intelligence and Enterprise Data Warehouse (EDW).
- Experience in various Teradata utilities like Fastload, Multiload, BTEQ, and Teradata SQL Assistant.
- Good experience in Dimensional and Relational Data Modeling using Star and Snowflake Schemas, Fact and Dimension tables, and Conceptual, Logical and Physical data modeling using Erwin.
- Proficient in building enterprise data warehouses using the Kimball and Inmon data warehouse methodologies.
- Experience working with data modeling tools like Erwin, Power Designer and ER Studio.
- Expertise in Normalization (1NF, 2NF, 3NF and BCNF)/Denormalization techniques for effective and optimum performance in OLTP and OLAP environments.
- Experience in Database Creation and maintenance of physical data models with Oracle, Teradata, Netezza, DB2 and SQL Server databases.
- Excellent experience in writing and executing unit, system, integration and UAT scripts in data warehouse projects.
- Expertise in SQL Server Analysis Services (SSAS) and SQL Server Reporting Services (SSRS).
- Excellent experience working on Netezza/Teradata and writing heavy SQL queries, and expertise in various Teradata utilities like Fastload, Multiload, BTEQ and Teradata SQL Assistant.
- Experienced in working on both Agile methodology and Waterfall methodology.
- Experienced in generating and documenting Metadata while designing OLTP and OLAP systems environment.
- Experienced in Extract, Transform and Load (ETL) of data from spreadsheets, database tables and other sources using Informatica, and well-versed in writing SQL queries to perform end-to-end ETL validations and support ad-hoc business requests.
- Strong understanding of principles of data warehousing, fact tables, dimension tables, Slowly Changing Dimensions (SCD) Type I and Type II.
TECHNICAL SKILLS:
Analysis and Modeling Tools: Erwin 9.6/9.5, Sybase Power Designer, Oracle Designer, ER/Studio 9.7, Star-Schema, Snowflake-Schema Modeling, FACT and dimension tables, Pivot Tables.
Languages: PL/SQL, SQL, UNIX Shell Scripting, C++, HTML, TSQL, OOP, Data Structure, Algorithms
Cloud Technology: Amazon Web Services (AWS), EC2, S3, Elasticsearch, Microsoft Azure.
Big Data & NOSQL: Hadoop, Yarn, Sqoop, Flume, Kafka, Hive, Pig, Oozie, Storm, Cassandra, HBase, MapReduce
Database Tools: Microsoft SQL Server 2016/2014, Teradata 15/14, Oracle 12c/11g, MS Access, Greenplum, PostgreSQL, Netezza.
Reporting Tools: Cognos 8/10, Oracle Reports, MS BI, Crystal Reports 11, Tableau, Business Objects and IDQ.
OLAP Tools: Tableau 9.3, SAP BO, SSAS, Business Objects.
Operating Systems: Microsoft Windows 8/7, Linux, and UNIX
Other tools: TOAD, SQL*Plus, BTEQ, SQL*Loader, MS Project, MS Visio and MS Office; also worked with C++, UNIX and PL/SQL.
WORK EXPERIENCE:
Confidential - Eden Prairie, MN
Sr. Data Architect/ Data Modeler
Responsibilities:
- Developed full life cycle software in Agile environment including defining requirements, prototyping, designing, coding, testing and maintaining software.
- Defined the strategy for a new data platform, focusing on its architecture and a strong business case for leveraging big data technologies for more sophisticated consumer analytics.
- Worked closely with the business and IT partners to deliver the data systems to meet their needs and understand the future direction of business.
- Loaded and transformed large sets of structured, semi-structured and unstructured data using Hadoop/Big Data concepts.
- Extracted large volumes of data from Amazon Redshift, AWS, and the Elasticsearch engine using SQL queries to create reports.
- Used Model Manager Option in ER Studio to synchronize the data models in Model Mart approach.
- Worked on Master Data Management (MDM) Hub and interacted with multiple stakeholders.
- Worked on NoSQL databases including HBase, MongoDB, and Cassandra. Implemented a multi-data-center, multi-rack Cassandra cluster.
- Installed and configured other open-source software such as Pig, Hive, HBase, Flume and Sqoop.
- Collaboratively worked with the Data modeling architects and other data modelers in the team to design the Enterprise Level Standard Data model.
- Designed both 3NF data models for ODS and OLTP systems and dimensional data models using star and snowflake schemas.
- Reverse engineered data models from database instances and scripts.
- Created entity relationship diagrams and multidimensional data models, reports and diagrams based on the requirements.
- Worked closely with developers and database administrators to guide the development of the physical data model and database design.
- Completed enhancements for MDM (Master Data Management) and suggested an implementation approach for hybrid MDM.
- Analyzed multiple sources of structured and unstructured data to propose and design data architecture solutions for scalability, high availability, fault tolerance, and elasticity.
- Designed and developed SQL, PL/SQL and shell scripts for data import/export, data conversion and data cleansing.
- Implemented dimensional modeling, identified facts and dimensions, and performed logical and physical data modeling using ER Studio.
- Led technical design for new products, focusing on data assets, data flow, and data analytics.
- Forward engineered data models, reverse engineered existing data models, and updated the data models.
- Handled importing data from various data sources, performed transformations using Hive and MapReduce, and loaded data into HDFS.
- Worked on processes for establishing data governance, data quality and data lineage.
- Involved with data profiling for multiple sources and answered complex business questions by providing data to business users.
- Worked with data investigation, discovery and mapping tools to scan every single data record from many sources.
- Directed and oversaw data quality tests, including providing input to quality assurance team members.
- Deployed SSRS reports to Report Manager and created linked reports, snapshots, and subscriptions for the reports and worked on scheduling of the reports.
- Generated parameterized queries for generating tabular reports using global variables, expressions, functions, and stored procedures using SSRS.
- Recommended solutions regarding data systems, storage and lifecycle management.
- Worked with DBAs and the security coordinators to obtain access for team members.
- Used SSRS to create customized, on-demand and ad-hoc reports, and was involved in analyzing multi-dimensional reports in SSRS.
Environment: ER Studio 9.0, NoSQL, T-SQL, MS Excel, Flat Files, Hadoop, Hive, Map Reduce, MDM, PL/SQL, OLAP, OLTP, DB2, ODS, SSRS
Confidential - Northbrook, IL
Sr. Data Modeler/ Data Architect
Responsibilities:
- Researched, evaluated, architected, and deployed new tools, frameworks and patterns to build sustainable Big Data platforms for the clients.
- As an Architect, implemented an MDM hub to provide clean, consistent data for an SOA implementation.
- Responsible for Big data initiatives and engagement including analysis, brainstorming, POC, and architecture.
- Involved in unit and system testing to validate OLAP report functionality and the data displayed in the reports.
- Worked on a Business Intelligence solution using Redshift and Tableau.
- Involved in data profiling and data analysis to understand the business process and implement the functionality in EDW through dimensional modeling.
- Built logical and physical data models and performed dimensional data modeling using Erwin.
- Responsible for technical data governance, enterprise-wide data modeling and database design.
- Designed and implemented complex highly scalable statistical models and solutions that comply with security requirements.
- Managed technical documentation and artifacts covering architecture, logical and physical data models, technical standards and project support.
- Built the logical data model from scratch using XML as the data source.
- Developed data mapping, data governance, transformation and cleansing rules for the Master Data Management architecture involving OLTP and ODS.
- Built data models to convert data from one application to another in a way that suits the needs of the target database.
- Redefined many attributes and relationships in the reverse engineered model and cleansed unwanted tables/columns.
- Produced 3NF data models for OLTP designs using data modeling best practices and modeling skills.
- Designed data models for mission-critical, high-volume data management and for real-time and distributed data processing, aligned with the business requirements.
- Conducted and participated in JAD sessions with the users, modelers, and developers for resolving issues.
- Enforced Referential integrity in the OLTP data model for consistent relationship between tables and efficient database design.
- Involved in the creation and maintenance of the Data Warehouse and of repositories containing Metadata.
- Created Source to Target Mapping Documents to help guide the data model design from the Data source to the data model.
- Involved in writing T-SQL, working on SSIS, SSRS, SSAS, Data Cleansing, Data Scrubbing and Data Migration.
- Worked extensively on Data Quality (running Data Profiling and examining profile outcomes) and Metadata management.
- Applied data naming standards, created the data dictionary and documented data model translation decisions and also maintained DW metadata.
- Created data masking mappings to mask sensitive data between production and test environments.
- Participated in Performance Tuning using Explain Plan and TKPROF.
Environment: Erwin r9.6, Oracle 12c, Teradata 15, MDM, Pig, HBase, Sqoop, ODS, SQL Assistant, MS Visio, Spark, AWS Redshift, Agile, Windows 8, Tableau 9.3, ETL, PL/SQL, Metadata, SSAS, OLAP, OLTP
Confidential - Atlanta, GA
Sr. Data Analyst/Data Modeler
Responsibilities:
- Documented Technical & Business User Requirements during requirements gathering sessions.
- Gathered business requirements through interviews and surveys with users and business analysts.
- Involved in preparing logical data models and conducted controlled brain-storming sessions with project focus groups.
- Extensively used star schema methodologies in building and designing the logical data model into dimensional models.
- Worked on snowflaking the dimensions to remove redundancy.
- Designed and developed Use Cases, Activity Diagrams, Sequence Diagrams and OOD (Object-Oriented Design) using UML and Visio.
- Involved in business process modeling using UML through Rational Rose.
- Prepared data dictionaries and business glossaries and integrated the data dictionary into data models.
- Analyzed existing databases using SQL to understand the data flow and the business rules applied to different databases.
- Used Erwin Data Modeler for generating DDL, Stored Procedure and Trigger code for source to target databases.
- Conducted team meetings and JAD sessions.
- Developed logical data models using Erwin and created physical data models using forward engineering, generating DDL scripts and creating indexing strategies.
- De-normalized database tables to fit them into the star schema of the data warehouse.
- Worked extensively on Data Migration by using SSIS.
- Profiled source data using Oracle and Hue (Hadoop/Hive UI) and supported developers by testing the target on Teradata and Hadoop.
- Involved in the mapping of data sources and analytics, with the goal of ensuring data quality.
- Designed the Logical Model into Dimensional Model using Star Schema and Snowflake Schema.
- Enforced referential integrity in the OLTP data model for consistent relationship between tables and efficient database design.
- Extracted data from the databases using Informatica to load it into a single data warehouse repository.
- Analyzed the data consuming the most resources and made changes in the back-end code using PL/SQL stored procedures and triggers.
- Developed SQL Queries to fetch complex data from different tables in remote databases using joins, database links and bulk collects.
- Worked on multiple Data Marts in Enterprise Data Warehouse Project (EDW)
- Handled the functional and practical implementation of data governance and was responsible for designing common data governance frameworks.
- Worked closely with the ETL Developers to explain the complex Data Transformation Logic.
- Employed the naming standard editor to define new naming standards for the entities, attributes, domains, columns and tables, which were applied consistently across the enterprise.
- Implemented Slowly Changing Dimensions (Type 2 and Type 3) for accessing the history of reference data changes.
- Provided source to target mappings to the ETL team to perform initial, full, and Incremental loads into the target data mart.
- Worked closely with Business, DBA team, ETL and Reporting teams to define data requirements.
Environment: Erwin 9, Teradata 13/14, MS SQL Server 2008, Hadoop, Hive, OOD, PL/SQL, OLAP, OLTP, SSIS, Oracle 10g, Star Schema, Informatica, MS Office, MS Visio.
Confidential - Houston, TX
Sr. Data Analyst/Data Modeler
Responsibilities:
- Worked with Business users for requirements gathering, business analysis and project coordination.
- Interacted with users for verifying User Requirements, managing Change Control Process, updating existing Documentation.
- Created the logical data model from the conceptual model and converted it into the physical database design using Erwin.
- Created various Physical Data Models based on discussions with DBAs and ETL developers.
- Worked on data mapping process from source system to target system.
- Created dimensional model for the reporting system by identifying required facts and dimensions using Erwin.
- Extensively used Star and Snowflake Schema methodologies.
- Prepared data dictionaries and Source-Target Mapping documents to ease the ETL process and user's understanding of the data warehouse objects.
- Developed and maintained the data dictionary to create Metadata Reports for technical and business purposes.
- Worked on Performance Tuning of the database, including indexing and optimizing SQL statements.
- Used Erwin Model Mart for effective model management, including sharing, dividing and reusing model information and designs to improve productivity.
- Implemented Forward engineering to create tables, views, SQL scripts and mapping documents.
- Translated business concepts into XML vocabularies by designing XML Schemas with UML.
- Used Normalization (1NF, 2NF & 3NF) and de-normalization techniques for effective performance in OLTP and OLAP systems.
- Developed dimensional model for Data Warehouse/OLAP applications by identifying required facts and dimensions.
- Worked with the Architecture team to get the metadata approved for the new data elements added for this project.
- Developed Data Migration and Cleansing rules for the Integration Architecture (OLTP, ODS, DW).
- Designed STAR schema for the detailed data marts and Plan data marts involving shared (conformed) dimensions.
- Generated DDL (Data Definition Language) scripts using Erwin and supported the DBA in Physical Implementation of data models.
- Performed data analysis and data profiling using SQL queries on various source systems including Oracle and SQL Server 2008.
- Used Erwin reverse engineering to connect to the existing database and ODS and create graphical representations (E-R diagrams) of the entity relationships to elicit more information.
- Worked on Data Extraction/Transformation/Loading (ETL), Data Conversion and Data Migration by using SQL Server 2008 Integration Services (SSIS) and PL/SQL Scripts.
- Facilitated meetings with the business and technical team to gather necessary analytical data requirements.
- Worked on Oracle PL/SQL and Shell Scripts, Packages, Scheduling, Data Import/Export, Data Conversions and Data Cleansing.
- Developed database objects such as tables, views and materialized views using SQL.
- Documented the source to target mappings for both data integration and web services. Utilized Erwin's forward/reverse engineering tools and target database schema conversion process.
- Developed SQL Queries to fetch complex data from different tables in remote databases using joins, database links and Bulk collects.
- Assisted in designing test plans, test scenarios and test cases for integration, regression and user acceptance testing.
Environment: Erwin 9.0, OLAP, OLTP, SSIS, ODS, PL/SQL, Metadata, SQL Server 2008, Star Schema, Oracle 10g
Confidential
Data Analyst/Data Modeler
Responsibilities:
- Analyzed data sources, requirements and business rules to perform logical and physical Data Modeling.
- Created a logical design and physical design in ER/Studio.
- Communicated with users and business analysts to gather requirements. Involved in business process modeling using UML through Rational Rose.
- Reverse engineered the data models, identified the data elements in the source systems and added new data elements to the existing data models.
- Worked on data profiling and data validation to ensure the accuracy of the data between the warehouse and source systems.
- Created data trace maps and data quality mapping documents, and performed Data Profiling and Data Quality checks.
- Executed the UNIX shell scripts that invoked SQL*Loader to load data into tables.
- Developed Star Schema and Snowflake Schema in designing the Logical Model into Dimensional Model.
- Involved in data modeling and design of data marts using ER Studio.
- Extensively used ER Studio for developing data models using star schema methodologies.
- Participated in several JAD (Joint Application Design/Development) sessions to track end to end flow of attributes starting from source screens to all the downstream systems.
- Involved in Data Profiling and Data Cleansing to make sure the data is accurate and analyzed when it is transferred from OLTP to Data Marts and the Data Warehouse.
- Involved in data extraction, validation and analysis, and in storing the data in data marts.
- Performed Data Analysis and data profiling using complex SQL on various source systems including Oracle 9i/8i and Teradata, to ensure accuracy of the data between the warehouse and source systems.
- Created LDM and PDM in NF using the ER Studio tool and converted the logical models to the physical design.
- Analyzed existing data using SQL to understand the current patient data and the business rules applied to different databases.
- Involved in performance tuning by leveraging the Oracle explain utility and SQL tuning.
- Created PL/SQL packages and Database Triggers and developed user procedures and prepared user manuals for the new programs.
- Involved in creating Sessions, worklets and Workflows and scheduling workflows using Workflow Manager.
- Created Entity Relationship Diagrams, grouped and created the tables, validated the data, identified PKs for lookup tables.
Environment: ER Studio, Star Schema, Oracle 9i, Teradata, Oracle SQL Developer, PL/SQL, Business Objects, OLAP, OLTP, Workflow Manager.
