Sr. Big Data Architect/Data Modeler Resume

Juno Beach, FL

SUMMARY:

  • Over 10 years of IT experience in Data Analysis, Data Architecture, Data Modeling, and the implementation and testing of Enterprise Data Warehouses and Enterprise Databases.
  • Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems (RDBMS) and from RDBMS to HDFS.
  • Excellent experience in creating cloud-based solutions and architecture using Amazon Web Services (Amazon EC2, Amazon S3, Amazon RDS) and Microsoft Azure.
  • Experience in analyzing data using the Hadoop ecosystem, including HDFS, Hive, Spark, Spark Streaming, Elasticsearch, Kibana, Kafka, HBase, ZooKeeper, Pig, Sqoop and Flume.
  • Solid hands-on experience with administration of the data model repository and documentation in metadata portals, using tools such as Erwin, ER/Studio and PowerDesigner.
  • Experienced in using JIRA for planning, tracking, reporting and release management.
  • Experience in designing Enterprise Data warehouse, Data Marts, Reporting data stores (RDS) and Operational data stores (ODS).
  • Have considerable expertise in Metadata Management, Data Profiling & Quality, Data Governance and Master Data management (MDM).
  • Excellent understanding of an approach to MDM: creating a data dictionary and using Informatica or other tools to map sources to the target MDM data model.
  • Experience in Big Data Hadoop Ecosystem in ingestion, storage, querying, processing and analysis of big data.
  • Experience in working with Business Intelligence and Enterprise Data Warehouse (EDW) tools, including SSAS, Pentaho, Cognos, OBIEE and QlikView.
  • Experienced in various Teradata utilities such as FastLoad, MultiLoad, BTEQ and Teradata SQL Assistant.
  • Well versed in conducting Gap analysis, Joint Application Design (JAD) session, User Acceptance Testing (UAT), Cost benefit analysis and ROI analysis.
  • Good understanding of Normalization (1NF, 2NF, 3NF and BCNF) techniques for OLTP environments and Denormalization techniques for improved database performance in OLAP environments.
  • Experience in developing Map Reduce Programs using Apache Hadoop for analyzing the big data as per the requirement.
  • Practical understanding of the Data modelling (Dimensional & Relational) concepts like Star Schema Modelling, Snowflake Schema Modelling, Fact and Dimension tables.
  • Strong experience in Data Analysis, Data Migration, Data Cleansing, Transformation, Integration, Data Import and Data Export.
  • Excellent understanding of the Software Development Life Cycle (SDLC), with good working knowledge of testing methodologies, disciplines, tasks, resources and scheduling.
  • Extensive experience in SSIS Packages, SSRS reports and SSAS cubes on production server.
  • Experienced in Data loading using PL/SQL, SQL Server Integration Services packages (SSIS), Oracle Data Integrator (ODI) and developed dashboard reports using SQL Server reporting services (SSRS)
  • Proficient in data mart design and creation of cubes using dimensional data modelling - identifying Facts and Dimensions, Star Schema and Snowflake Schema.
  • Experience working with Agile and Waterfall data modelling methodologies, Ralph Kimball and Bill Inmon approaches.
  • Experience in NoSQL databases and tools like Apache HBase to handle massive data tables containing billions of rows and millions of columns.
  • Strong hands-on experience using Teradata utilities (SQL, BTEQ, FastLoad, MultiLoad, FastExport, TPump, Visual Explain and Queryman), Teradata parallel support and UNIX shell scripting.
  • Experience in integrating various relational and non-relational sources such as DB2, Teradata, Oracle, Netezza and SQL Server databases.
  • Good working experience with Informatica PowerCenter tools: Designer, Repository Manager and Workflow Manager.
  • Expert in BI reporting and Data Reporting tools like Pentaho and SAP BI.

TECHNICAL SKILLS:

Data Modelling Tools: Erwin 9.6/9.5, ER/Studio 9.7/9.0, Sybase PowerDesigner

Big Data Technologies: Hadoop, Hive, HDFS, HBase, Flume, Sqoop, Spark, Pig, Impala, MapReduce.

Programming Languages: SQL, PL/SQL, UNIX shell Scripting, PERL, AWK, SED

Databases: Oracle 12c/11g, Teradata R15/R14, MS SQL Server 2014/2016, DB2

Testing and defect tracking Tools: HP/Mercury (Quality Center, WinRunner, QuickTest Professional, Performance Center), Requisite, MS Visio & Visual SourceSafe

Operating System: Windows, UNIX, Linux.

ETL/Datawarehouse Tools: Informatica 9.6/9.1, SAP Business Objects XIR3.1/XIR2, Web Intelligence, Talend, Tableau, Pentaho

Project Execution Methodologies: Ralph Kimball and Bill Inmon data warehousing methodologies, Rational Unified Process (RUP), Rapid Application Development (RAD), Joint Application Development (JAD)

Other tools: TOAD, SQL*Plus, SQL*Loader, MS Project, MS Visio and MS Office; have also worked with C++, UNIX and PL/SQL.

Web technologies: HTML, DHTML, XML, JavaScript

Tools & Software: TOAD, MS Office, BTEQ, SQL Assistant.

PROFESSIONAL EXPERIENCE:

Confidential, Juno Beach, FL

Sr. Big Data Architect/Data Modeler

Responsibilities:

  • Responsible for the data architecture design delivery, data model development, review, approval and Data warehouse implementation.
  • Coordinated escalations on threats in the Azure environment, worked with the InfoSec team and analyzed logs.
  • Migrated applications to Azure and worked with engineering teams to complete testing and pilot migrations.
  • Configured internal load balancing and deployed Web Roles in Windows Azure.
  • Led the design and maintenance of logical and physical data models (relational & dimensional), the data dictionary and database volumetrics.
  • Conducted design reviews with business analysts, Enterprise data architect and solution lead to create proof of concept for the reports.
  • Conducted and participated in JAD sessions with Business Owners, Business Analysts and Application Development teams to understand and analyze business and reporting requirements.
  • Responsible for Big data initiatives and engagement including analysis, brainstorming, POC, and architecture.
  • Designed and developed architecture for data services ecosystem spanning Relational, NoSQL and Big Data technologies.
  • Implemented Agile Methodology for building Integrated Data Warehouse, involved in multiple sprints for various tracks throughout the project lifecycle.
  • Loaded and transformed large sets of structured, semi structured and unstructured data using Hadoop/Big Data concepts.
  • Migrated existing applications and developed new applications using AWS cloud services.
  • Involved in data model reviews as data architect with business analysts and business users with explanation of the data model to make sure it is in-line with business requirements.
  • Suggested implementing multitasking for the existing Hive architecture in Hadoop and also suggested UI customization in Hadoop.
  • Developed Map Reduce programs to cleanse the data in HDFS obtained from heterogeneous data sources.
  • Completed enhancements for Master Data Management (MDM) and suggested the implementation of hybrid MDM.
  • Created and deployed DDLs based on the physical data model in the Development database.
  • Heavily involved, in the Data Architect role, in reviewing business requirements and composing source-to-target data mapping documents.
  • Designed both 3NF data models for ODS and OLTP systems and dimensional data models using Star and Snowflake schemas.
  • Worked on the Metadata Repository (MRM) to keep definitions and mapping rules up to the mark.
  • Applied data naming standards, created the data dictionary and documented data model translation decisions and also maintained DW metadata.
  • Used SAS/Interface to Teradata to extract data from Teradata and also used SAS/SQL pass through facility.
  • Designed and Developed Real time Stream processing Application using Spark, Kafka, Scala and Hive to perform Streaming ETL and apply Machine Learning.
  • Created SSIS Packages for import and export of data between database and Flat Files.
  • Designed the ODS with core tables; currently enhancing this model for additional master data.
  • Developed and implemented data cleansing, data security, data profiling and data monitoring processes.
  • Created process flow diagrams by using MS Visio and maintained design document.
  • Specified the overall Data Architecture for all areas and domains of the enterprise, including Data Acquisition, ODS, MDM, Data Warehouse, Data Provisioning, ETL and BI.
  • Advised on and enforced data governance to improve the quality and integrity of data, and provided oversight of the collection and management of operational data.
  • Extracted data from IBM Cognos to create automated visualization reports and dashboards on Tableau.
  • Performed database performance tuning using the EXPLAIN PLAN and TKPROF utilities and debugged the SQL code.
  • Worked with data investigation, discovery and mapping tools to scan every single data record from many sources.
  • Designed and Developed Oracle PL/SQL and Shell Scripts, Data Import/Export, Data Conversions and Data Cleansing.

Environment: ER/Studio 9.7, Oracle 12c, Hive, Amazon Redshift, AWS, MapReduce, Hadoop, Cassandra, HBase, Teradata 15, Spark, MDM, Agile, NoSQL, PL/SQL, OLAP, OLTP, SQL, HDFS.

Confidential, Bothell, WA

Sr. Big Data Architect/Data Modeler

Responsibilities:

  • Designed, architected and helped maintain scalable solutions on the big data analytics platform for the enterprise module.
  • Designed Physical Data Model (PDM) using ER/Studio data modelling tool.
  • Implemented logical and physical relational databases and maintained database objects in the data model using ER/Studio 9.0.
  • Developed full life cycle software, including defining requirements, prototyping, designing, coding, testing and maintaining software.
  • Designed and Developed Oracle PL/SQL and Shell Scripts, Data Import/Export, Data Conversions and Data Cleansing.
  • Participated in Rapid Application Development and Agile processes to deliver new cloud platform services.
  • Connected to Amazon Redshift through Tableau to extract live data for real time analysis.
  • Created SSIS Packages for import and export of data between SQL Server database and Flat Files.
  • Extracted data from various source systems like Oracle, SQL Server and flat files as per the requirements.
  • Involved in writing Shell Scripts to accumulate the MTD source file.
  • Collaborated with Architects and Managers for review of solutions and data strategy.
  • Extensively involved in analyzing various data formats using industry standard tools and communicating them effectively to business users and SMEs.
  • Worked on AWS, provisioning EC2 infrastructure and deploying applications with Elastic Load Balancing.
  • Involved with data profiling for multiple sources and answered complex business questions by providing data to business users.
  • Identified security loopholes, established data quality assurance and addressed data governance.
  • Promoted the use of a shared infrastructure, application roadmap, and documentation of interfaces to improve information flow and reduce costs.
  • Designed both 3NF data models for ODS and OLTP systems and dimensional data models using Star and Snowflake schemas.
  • Loaded data into Hive tables from the Hadoop Distributed File System (HDFS) to provide SQL access to Hadoop data.
  • Developed MapReduce programs to parse the raw data, populate staging tables and store the refined data in partitioned tables in the EDW.
  • Worked on NOSQL databases such as MongoDB, HBase and Cassandra to enhance scalability and performance.
  • Integrated Hadoop frameworks such as Hive and HBase to further operational and analytical experience.
  • Performed data mapping from source systems to target systems and logical data modeling, created class diagrams and ER diagrams, and used SQL queries to filter data.
  • Analyzed the data from the sources, designed the data models and then generated scripts to create necessary tables and corresponding records for DBAs using Informatica.
  • Developed triggers, stored procedures, functions and packages using cursors and ref cursor concepts associated with the project using PL/SQL
  • Performed database performance tuning using the EXPLAIN PLAN and TKPROF utilities and debugged the code.

Environment: ER/Studio 9.0, SQL Server 2014, PL/SQL, ODS, OLAP, OLTP, SSIS, MDM, MongoDB, Tableau, Netezza, Cassandra, Flat Files, Hadoop, HDFS, Pig, Oracle 11g, UNIX, Teradata

Confidential, Charlotte, NC

Big Data Architect/Data Modeler

Responsibilities:

  • Played key role in defining all aspects of Data Governance - data architecture, data security, master data management, data archival, purging and metadata.
  • Developed a long-term data warehouse roadmap and architectures; designed and built the data warehouse framework per the roadmap.
  • Coordinated with Data Architects and Data Modelers to create new schemas and views in Netezza to improve report execution time; worked on creating optimized Data Mart reports.
  • Worked with the Azure Management API to get Azure Compute/Network/Resource Manager details.
  • Implemented Azure SQL Databases: chose the appropriate database tier and performance level; configured point-in-time recovery, geo-replication and data sync; imported and exported data and schema.
  • Involved in Normalization/Denormalization, Normal Forms and database design methodology. Expertise in using data modeling tools like MS Visio and Erwin for logical and physical design of databases.
  • Worked with Informatica utilities: Source Analyzer, Warehouse Designer, Mapping Designer, Mapplet Designer and Transformation Developer.
  • Designed and developed Use Cases, Activity Diagrams, Sequence Diagrams, OOD (Object oriented Design) using UML and Visio.
  • Designed a STAR schema for the detailed data marts and Plan data marts involving shared dimensions (Conformed).
  • Worked with data compliance teams, data governance team to maintain data models, Metadata, Data Dictionaries; define source fields and its definitions.
  • Used Erwin 9.1 for effective model management of sharing, dividing and reusing model information and design for productivity improvement.
  • Used forward engineering to create a physical data model with DDL that best suits the requirements from the Logical Data Model.
  • Responsible for migrating the data and data models from SQL server environment to Oracle 12c environment.
  • Worked with the ETL team to document the SSIS packages for data extraction to Warehouse environment for reporting purposes.
  • Collected large amounts of log data using Apache Flume and aggregated it using Pig/Hive in HDFS for further analysis.
  • Object-oriented programming, database design and agile methodologies.
  • Good experience with Django, a high-level Python Web framework.
  • Experience with object-oriented programming (OOP) concepts using Python and C++.
  • Experienced in WAMP (Windows, Apache, MySQL and Python/PHP) and MVC Struts.
  • Experienced in developing web-based applications using Python, Django, Java, HTML, DHTML, JavaScript and JQuery.
  • Good Knowledge of Python and Python Web Framework Django.
  • Worked with the Hadoop ecosystem, covering HDFS, HBase, YARN and MapReduce.
  • Excellent experience and knowledge of data warehouse concepts and dimensional data modeling using the Ralph Kimball methodology.
  • Worked with dimensional data modeling concepts like Star Join Schema Modeling, Snowflake Modeling, Fact and Dimension tables, Physical and Logical Data Modeling.
  • Involved in Dimensional modeling (Star Schema) of the Data warehouse and used Erwin to design the business process, dimensions and measured facts.
  • Extensively used Base SAS, SAS/Macro, SAS/SQL and Excel to develop code and generate various analytical reports.
  • Extensively used the advanced features of PL/SQL like Subtypes, Records, Tables, Object types and Dynamic SQL.
  • Worked with BTEQ to submit SQL statements, import and export data, and generate reports in Teradata.
  • Developed reusable objects such as PL/SQL program units and libraries, database procedures, functions and triggers for use by the team and to satisfy the business rules.
  • Extensively used SQL*Loader to load data from the legacy systems into Oracle databases using control files, and used the Oracle External Tables feature to read data from flat files into Oracle staging tables.

Environment: Erwin 9.1, SAS, Teradata, Oracle 10g, PL/SQL, Hadoop, HDFS, MS Visio, Flat Files, OLTP, OLAP, Netezza, MS Excel, SSIS

Confidential, Charlotte, NC

Sr. Data Analyst/Modeler

Responsibilities:

  • Responsible for designing data warehouse schemas using Erwin and involved in system analysis and design of the data warehouse.
  • Used Model Mart of Erwin for effective model management of sharing, dividing and reusing model information and design for productivity improvement.
  • Documented logical, physical, relational and dimensional data models. Designed the data marts in dimensional data modeling using star and snowflake schemas.
  • Involved in design, development and Modification of T-SQL stored procedures, functions, packages and triggers to implement business rules into the application.
  • Involved in migrating the Data Warehouse from the database environment to a Netezza appliance, which was the purpose of this project.
  • Created data mappings, technical designs and loading strategies for ETL to load newly created or existing tables.
  • Responsible for designing and implementing efficient Stored Procedures and SSIS Packages.
  • Designed high level ETL architecture for overall data transfer from the OLTP to OLAP with the help of SSIS.
  • Created and executed SSIS packages to populate data from the various data sources for different data loading operations for many applications.
  • Extensively worked with SSIS packages for Data Migration from source systems.
  • Worked with the Data Modeler to create a best-fit Physical Data Model from the Logical Data Model using Forward Engineering in Erwin.
  • Involved in translating business needs into long-term architecture solutions and reviewing object models, data models and metadata.
  • Conducted meetings with business and development teams for data validation and end-to-end data mapping.
  • Extensively developed Oracle 9i stored packages, procedures, functions and database triggers using PL/SQL for ETL process, data handling, logging, archiving and to perform Oracle back-end validations for batch processes.
  • Developed and maintained data dictionary to create metadata reports for technical and business purpose.
  • Migrated data from Oracle to Teradata data warehouse using Integration services (SSIS) and Informatica to generate the reports using SSRS for different SSAS Cubes.
  • Demonstrated generation of XML messages from Oracle database using Oracle XML DB functions.
  • Designs Data Architectures.
  • Designs and builds relational databases.
  • Develops strategies for data acquisitions, archive recovery, and implementation of a database.
  • Cleans and maintains the database by removing and deleting old data.
  • Designs and develops Databases, Data Warehouses and Multidimensional Databases.
  • Relies on experience and judgment to plan and accomplish goals.
  • May lead and direct the work of others.
  • Typically reports to a project leader or manager.
  • A wide degree of creativity and lateral thinking is expected.
  • Guided the developers in writing efficient and secure SQLs and the DBAs in resolving production performance problems.
  • Identified data cleansing rules, address validation rules, match-and-merge rules and survivorship rules for creating the golden record for implementation.
  • Created and maintained Metadata, including table and column definitions.
  • Worked in importing and cleansing of data from various sources like Teradata, flat files, Oracle with high volume data.
  • Worked on data profiling and data validation to ensure the accuracy of the data between the warehouse and source systems.
  • Conducted meetings with the business and technical team to gather necessary analytical data requirements in JAD sessions.
  • Worked very closely with Data Architects and the DBA team to implement data model changes in the database in all environments.

Environment: Erwin 8, Oracle 9i, Flat Files, XML, Informatica 7.0, OLAP, OLTP, Metadata, Teradata 13, PL/SQL, BTEQ, UNIX, T-SQL, SSAS, SSIS.

Confidential, Alpharetta, GA

Data Analyst/Data Modeler

Responsibilities:

  • Developed thorough knowledge by working on the full Software Development Life Cycle (SDLC), namely the analysis, design, coding and testing phases.
  • Interacted with business users to analyze the business process and requirements and transformed requirements
  • Designed ER diagrams (Physical and Logical) using Erwin, mapped the data into database objects and produced Logical/Physical Data Models.
  • Performed Data Analysis, Data Migration and data profiling using complex SQL on various source systems, including SQL Server and Teradata 12.
  • Completed a study of the in-house requirements for the data warehouse.
  • Analyzed the Data Warehouse project database requirements from the users in terms of the dimensions they want to measure and the facts for which the dimensions need to be analyzed.
  • Performed source system data (Oracle, SQL Server) profiling and worked with the business users to cleanse the data before loading into the ODS and DM.
  • Created data masking mappings to mask the sensitive data between production and test environment.
  • Worked with parameter files and also created parameter values across mappings.
  • Developed SQL Queries, Stored Procedures, Triggers, User Defined Functions (UDF), Views, Indexes.
  • Implemented best practices in structuring SQL queries and debugging unexpected SQL results.
  • Worked on the XML data type as well and created XSDs as XML objects in SQL Server.
  • Designed and documented Use Cases, Activity Diagrams, Sequence Diagrams, OOD (Object Oriented Design) using UML and Visio.
  • Participated in writing data mapping documents and performing gap analysis on the systems.
  • Involved in Dimensional modeling (Star Schema) of the Data warehouse and used Erwin to design the business process, dimensions and measured facts.
  • Developed the code as per the client's requirements using SQL, PL/SQL and data warehousing concepts.
  • Coordinated with the DBA in implementing the database changes and also updating Data Models with changes implemented in Development, QA and Production.
  • Worked on merging several flat files into one XML file.
  • Extensively worked on creating, altering and deleting tables in different Development environments and also in Production.
  • Prepared in-depth data analysis reports weekly, biweekly and monthly using MS Excel, SQL & UNIX.

Environment: Erwin 7.3, Teradata 12, T-SQL, XML, Flat Files, SQL Server 2005, UNIX, PL/SQL, OLAP, OLTP, ODS

Confidential, Northbrook, IL

Data Modeler/Data Analyst

Responsibilities:

  • Worked with project team representatives to ensure that logical and physical ER/Studio data models were developed in line with corporate standards and guidelines.
  • Evaluated data profiling, cleansing, integration and extraction tools (e.g. Informatica).
  • Implemented the Metadata Repository, Data Cleanup procedures, Transformations, Data Standards, Data Governance program, Scripts, Stored Procedures and triggers, and executed test plans.
  • Used ER/Studio to transform data requirements into data models.
  • Developed stored procedures and complex packages extensively using PL/SQL and shell programs.
  • Performed tuning of SQL Queries to improve the response time.
  • Performed physical data modeling for the proposed OLAP (ROLAP & MOLAP) using ER/Studio.
  • Used data profiling tools and techniques to ensure data quality for data requirements.
  • Involved in designing Context Flow Diagrams, Structure Charts and ER diagrams.
  • Extensively used SQL for performance tuning.
  • Involved in extensive Analysis on the Teradata and Oracle Systems.
  • Acted as a strong Data Analyst, analyzing the data from a low level.
  • Extensively used ER/Studio for developing data models using star schema methodologies.
  • Used existing UNIX shell scripts and modified them as needed to process SAS jobs, search strings, execute permissions over directories etc.
  • Converted data stored in flat files into Oracle tables.
  • Involved in extensive SQL Scripting.

Environment: SQL Server, Oracle 9i, MS Office, Teradata, Informatica, ER/Studio, XML, Business Objects
