We provide IT Staff Augmentation Services!

Big Data Architect Resume

2.00/5 (Submit Your Rating)

Austin, TX

PROFILE:

  • 15 Years Experienced Data Architect, Big Data Architect, Data Warehouse Architect, Data Science, Machine learning, Data Visualization, Data Integration Architect, Data Modeler
  • Meta Data Management, Master Data Management, Data Governance, Business Analyst, Business Architecture, and Business Modeling.Strong Subject Matter expertise in Healthcare, Insurance and Financial

TECHNICAL SKILLS:

HARDWARE: OS Linux, AWS S3, Windows, EC2

SOFTWARE: Big Data Hadoop HDFS, Hive, Pig, HBase, Yarn Apache Spark, Scala, Spark Streaming, RDDs, Mlid, SparkSQL Cassandra NoSQL, CSQL Talend Data Integration, Hive Integration Apache Kafka, Messaging, API, Spark Integration, Hadoop Integration

Cloud: AWS EC2, RDS, S3, VPC, EBS, IAM, Redshift, Glacier

DevOps: Git, Chef, Docker, Jenkins, Puppet, Ansible, Nagios

Database: AWS Redshift, Vertica, Netezza, Teradata, Oracle, Sybase, MySQL, Teradata, Mongo DB, Hbase, Hive, Impala

Data Modeling tools: Infosphere Data Architecture, ERWin, Sybase Power Designer, and Visio

ETL: Informatica, Datastage, SSIS, PL/SQL

BI: SAS, Microstrategy, Cognos, SSRS, SSAS,R, R Studio

Languages: Python, Java, PL/SQL and T - SQLRELEVANT

PROFESSIONAL EXPERIENCE:

Confidential, Austin, TX

Big Data Architect

Responsibilities:

  • Drive information strategy and analysis engagements for clients' business groups
  • Performed Big Data Architecture, Information and Business Intelligence strategy
  • Worked as Solution architect in translating business needs and vision into roadmap, project deliverables and organization strategy
  • Delivery Lead and lead developer for multiple big data projects. Worked as lead from a technology perspective on big data technologies such as Apache Hadoop, Hive, Hbase, Kafka and Spark.
  • Programming experience in Python, Java and Scala
  • Knowledge of data science, machine learning and statistical modeling techniques.
  • Worked closely with the model development group to understand and meet business needs through the appropriate design and implementation of the model(s)
  • Deep understanding of rich data visualizations to communicate complex ideas to business leaders
  • Experienced in big data architecture, data warehouse and application development in technical lead architect capacity
  • Have programming skills in Big data ecosystem such as: Apache Hadoop, Hive, Hbase, Kafka and Spark on Cloudera Platform
  • Experience in understanding data processing from Teradata/ Oracle or any SQL database environment
  • Understanding of analytical techniques including segmentation, cluster analysis, and regression
  • Data Model using ERWIN and ETL Using Talend.

Confidential, Wilmington, DE

Redshift Data Warehouse Architect

Environment: Redshift, Hadoop, Cloudera, Informatica and Microstrategy

Responsibilities:

  • Requirement Analysis, Logical and Physical Data Model, Data Mapping.
  • ETL Architecture, Informatica mapping, session and Workflow design.
  • Redshift Administration work.
  • Cluster Creation Considerations.
  • Redshift Security.
  • Redshift Encryption Support.
  • Connecting from Client Tools.
  • Demo: Connecting from Client Tools.
  • Redshift Tables Design. Best Practices (Distribution style, Sort Keys and Encryption)
  • App Data Warehouse Table Structure.
  • Creating Redshift Schema, Users, Tables, Keys.
  • Creating and Managing Snapshots.
  • Resizing Redshift Clusters.
  • Setting up CloudWatch Alarms.
  • VIewing Query and Performance Metrics.
  • Best Practices for Data Loading using Copyfrom S3.
  • Query Tuning, running explain plan.
  • Work Load Management.

Confidential, Harrisburg, PA

Solution Architect Data Warehouse

Environment: Oracle 11g, SQL Server, AWS, S3, Redshift, Hadoop, HDFS, Hue, Horton Works, Hbase, Hive, Impala,Vertica, Netezza, Hive, No SQL, Java RS, Python, Erwin 9.6, Informatica, SAS, SAS DI, Tableau, R and R Studio

Responsibilities:

  • Managed, Designed, Developed and implemented the Data Warehouse Solution for advance Claims and Clinical Analytics.
  • Partnered with business and technology stakeholders to facilitate and align the strategy.
  • Developed SaaS-based computing solutions build powerful business functionality across your enterprise.
  • Responsibilities included developing project tasks, timelines, and roll-out procedures for the conversion to a new system.
  • Designed and Developed several Business Process, Data flow Use cases and Data Analysis for Member, Group, Provider, Claims, Plan Benefit, Pharmacy, Case Management, Patient Visit (EMR), Diagnosis, Lab, Observation, Hedis, Member Incentive, ICD 9, ICD 10 Subject Areas.
  • Create conceptual, logical, and physical, data models using Erwin 9.2 for both ODS, EDW and Semantic Layer.
  • Developing Data mapping documents.
  • Installing Configuring, Managing 10 Node Vertica Cluster in AWS cloud.
  • Performance tuning, Projection creating, partitioning in Vertica Database
  • Installing, Configuring and tuning Hortonworks Hadoop cluster to house Hospital System Data
  • Securing Data using Hadoop Ranger
  • Landed all HL7 and Hospital System Data in Vertica Cluster
  • Design Claims and Clinical Data Integration strategy in AWS S3 and Vertica
  • Designed and Developed ETL Scripts using HiveQL and Impala Scripts
  • Designed and Developed Transaction Hub MDM for (Member, Provider and Patient) using Mirth Connect MDM tool.
  • Designed Rest API to consume the master data.
  • Designed and Developed Data Science Machine learning Algorithms for Risk and Coherts using R and R Studio
  • Familiar with Spark and Scala

Confidential, South Field, MI

Enterprise Data Architect/Data Warehouse BI Architect

Environment: Oracle 11g, Erwin 8.2, Big Data, Hadoop, MongoDB, Datastage and Cognos

Responsibilities:

  • Developed and implemented the enterprise data architecture strategy. Partnered with business and technology stakeholders to facilitate and align the strategy
  • Defined and evolved enterprise data architecture and design best practices in areas that include, but are not limited to, standards, principles, processes and methodologies, infrastructure, etc.
  • I was responsible for data architecture tasks on various projects that support the organization in achieving its strategic
  • Responsibilities included developing project tasks, timelines, and roll-out procedures for the conversion to a new system. This included the conversion of data within the previous system.
  • Hands-on experience working with MongoDB
  • Worked with large Hadoop deployments: provisioning, maintenance, monitoring, issue resolution;
  • Experience with deployment Architecture definition and documentation for a Hadoop based production environment that can scale to petabytes
  • Designed and Developed several Business Process and Data flow Use cases for Member, Group, Provider, Claims, Plan and Benefit, Pharmacy, Case Management.
  • Create conceptual, logical, and physical, data models using Erwin 8.2 for both ODS and EDW. Subject area primarily model is PARTY Registry MDM hub for (Member, Group, Sub Group, Provider), EDW (Member, Provider, Plan and Benefit, Claims, Pharmacy, Case Management)
  • Model and review all current operational data structures and recommend optimizations and reconfigurations to Data Architects for implementation.
  • Designed and develop Replication technique, Partition and Index strategy.
  • Worked with DBA to physicalize the model with storage parameters.
  • Developing Data Mapping documents.
  • Analyzing and developing the ETL transformation logic from Facet to Party Model and EDW.
  • Provide Data Model support to Business System Analyst and Developers.
  • Worked closely with developers to make them understand the model and load the model
  • Define best practices, policies and procedures regarding SQL Tuning, Performance tuning, Metadata Management, Data Profiling, Data quality, data storage and archiving, disaster recovery, security and data dictionary metadata standards.

Confidential, St. Louise, MO

Data Architect

Environment: Oracle 10g, SQL Server 2008, Erwin 7.3/8.0/8.3, TeraData, Oracle

Responsibilities:

  • Responsibilities included developing project tasks, timelines, and roll-out procedures for the conversion to a new system. This included the conversion of data within the previous system.
  • Identify the key business Metrics. Meet regularly with key business personnel, Business Sponsor’s, SME’s and IS analysts to determine business needs and to document requirements.
  • Designed and Developed several Business Process and Data flow Use cases for Claims, Claim Contact, Provider, Provider Contact, Claims document, Incident etc.
  • Create conceptual, logical, and physical, data models using Sybase Power Designer for both OLTP and Data Warehouse Applications.
  • Model and review all current operational data structures and recommend optimizations and reconfigurations to Data Architects for implementation.
  • Developing Data Mapping documents.
  • Analyzing and developing the ETL transformation logic.
  • Designed and developed Informatica workflows to transfer data from source to stage to target.
  • Worked extensively on Data Quality issues using. I have designed logic to tackle Data anomalies and data integrity issues.
  • Design and Developed MDM and started Data governance initiative.
  • Provide Data Model support to Business System Analyst and Developers.
  • Worked closely with developers to make them understand the model and load the model
  • Define best practices, policies and procedures regarding SQL Tunning, Performance tunning, Metadata Management, Data Profiling, Data quality, data storage and archiving, disaster recovery, security and data dictionary metadata standards.
  • Participate in the development and maintenance of, and adherence to, corporate data architecture, data management standards and conventions, data dictionaries and data element naming standards.
  • Provide leadership and guidance for database architecture design and strategy to ensure quality deliverables across the entire IS organization.
  • Document detailed functional and technical specifications based on agreed solutions.
  • Support development of the business solution as part of the technical team.
  • Work with DBA to support migration of applications from Development to Test to Production
  • Recommend and evaluate new tools and methodologies.
  • Work with management to identify issues and risks that may have an effect on quality or delivery from a technical, business and end-user perspective.
  • Evaluate and estimate the work effort required to meet a desired deliverable.
  • Provide status reporting on work assignments and alert IS management to deviations from plan.
  • Ensure completed work meets with all IS best practices and policies.
  • Perform administration, maintenance and configuration changes to existing applications where appropriate and be willing to support mission critical 24x7 applications.
  • Interact with account management, project management, and clients as appropriate both locally and globally.
  • Developed database standards for global deployment security and business recovery
  • Provided performance guidelines and analyzed database metrics for Web Applications

Confidential, Waltham, MA

Data Architect

Environment: Windows NT, Teradata SQL Server 2005, Sybase, Oracle 10g, Oracle RAC, DB2 UDB, DTS, Informatica 8.1, T-SQL, PL/SQL, XML, Java, ERWIN7.1 - 7.3, Sybase Power Designer, Cognos BI 8, Visio and Rational Rose.

Responsibilities:

  • Worked in Normalization/De-normalization techniques for optimum performance in relational and dimensional database environments.
  • 10+ years experience in designing star schema, identification of facts, measures and dimensions, Snowflake schema and ODS architecture for modeling a Data Warehouse used in relational, dimensional and multidimensional modeling
  • Excellent noledge in data analysis and modeling of Databases for business applications in Client/Server (OLTP) and Data warehouse environments (OLAP) systems.
  • Data Mart design and creation of Cubes using Dimensional data modeling and identifying Facts and Dimensions.
  • Very good experience in Ralph Kimbal and Bill Inmon Methodologies.
  • Very good experience in Object Oriented data modeling using tools like ER/Studio for both forward and reverse engineering Developing and standardizing the business Codebook.
  • SQL Tunning and Performance tunning.
  • Analyzing the existing reports, reporting system.
  • Designed and developed long-term Data architectural standards.
  • Developing logical and Physical Star Schema using ERWIN and Sybase Power Designer for Oracle environment.
  • Developing Data Mapping documents.
  • Analyzing and designing the ETL transformation logic.
  • Leading the Data Warehouse development effort team.
  • Analyzing the facts grains and slowly changing dimension attributes
  • Designed and understand BI KPI’s measures and scorecards
  • Lead Data Warehouse and BI developers.

Confidential, NY

Senior Consultant

Environment: Solaris 2.6, Windows NT, SQL Server 2000, Sybase, Oracle 9i, DB2 UDB, DTS, Cognos, PL/SQL, Transact-SQL, SQL, ERWIN, Visio, Rational Rose and Quantifacts.

Responsibilities:

  • Identify the key business Metrics.
  • Working closely with different departmental business users and understanding the gaps in their process.
  • Understanding the transactional process flow and drawing high-level BI execution plan.
  • Conducting user interviews, gathering requirements, analyzing the requirements.
  • Identifying the source system, analyzing the source system and tying different source system together for conformed dimensionality.
  • Developing and standardizing the business Codebook.
  • Analyzing the existing reports, reporting system.
  • Designed and developed long-term Data architectural standards.
  • Developing logical and Physical Star Schema using ERWIN for Oracle 9i environment.
  • Developing Data Mapping documents, Data Profiling, Data cleansing, Meta Data management, Data governance.
  • SQL Tunning and Performance tunning.
  • Analyzing and developing the ETL transformation logic.
  • Designed and developed Oracle PL/SQL Stored procedures to transfer data from source to stage to target.
  • Leading the Data Warehouse development effort team.
  • Analyzing the facts grains and slowly changing dimension attributes

Confidential, Greenwich, CT

Senior Consultant

Environment: Solaris 2.6, Windows NT, SQL Server 2000, Sybase, Oracle, DB2 UDB, Informatica, Cognos, PL/SQL, Transact-SQL, SQL, ERWIN, Visio, Rational Rose. Business Objects and Brio.

Responsibilities:

  • Identify the key business Metrics.
  • Working closely with different departmental business users and understanding the gaps in their process.
  • Understanding the transactional process flow and drawing high-level BI execution plan.
  • Conducting user interviews, gathering requirements, analyzing the requirements.
  • Identifying the source system, analyzing the source system and tying different source system together for conformed dimensionality.
  • Setting up the cleansing rules for the source system.
  • Developing and standardizing the business Codebook.
  • Analyzing the existing reports, reporting system.
  • Developing logical and Physical Star Schema.
  • Developing Data Mapping documents.
  • Analyzing and developing the ETL transformation logic.
  • Designed and developed DTS packages to transfer data from source to stage to target.
  • Designed MOLAP cubes for multidimensional analysis.
  • Leading the Data Warehouse development effort team.
  • Analyzing the facts grains and slowly changing dimension attributes

Confidential, Horsham, PA

Senior Consultant

Environment: Solaris 2.6, Windows NT, SQL Server 2000, Sybase, Oracle, DB2 UDB, Informatica, Cognos, Microstrategy, PL/SQL, Transact-SQL, SQL, Java, Perl, ERWIN, Visio, Rational Rose. .

Responsibilities:

  • Loan Origination, Loan Servicing, Customers, Borrowers, Investors, Parcels, Reserves and Escrows.
  • Currently the MAJOR MORTGAGE BANK queries the McCracken OLTP database to generate reports for viewing key performance measures. The queries that MAJOR MORTGAGE BANK is currently using to do this are complex and the users have needed in the past to write these queries.
  • Involved in Designing and implementation of OLAP based BI platform using
  • Informatica for ETL, Microstrategy for BI platform and SQL Server and Oracle as DB platform.
  • Financial Statement Productivity
  • Property Inspection
  • Loan Analysis
  • Performed requirement analysis, went through all the user cases and issue logs
  • Worked as a Business Analyst in the First Phase of the Project.
  • Analyzed the Business Logic and implemented the Conceptual model, Physical Data Model using ERWIN.
  • Worked with Source Qualifier, Mappings, Transformation, Mapplets, Session and Batches in Informatica
  • Worked with Expression, Aggregation, Lookup, Rank, Sequence generator and Store procedure to analysis and develop data loading techniques. Build cubes and Dimension for Data warehousing Databases.
  • Designed various session and batches for ETL using Informatica
  • Developed test scenarios and implemented test plan.
  • Adhoc Query with Microstrategy Query, multidimensional Analysis.
  • Define and implement approaches to load and extract data from the database using Extract-Transform-Load (ETL) tools. Metadata design and management on the ETL using Informatica and DTS and reporting side using Microstrategy.
  • Assist and implement the development methodology for ODS and DSS including development and implementation of batch data loading programs
  • For DW and DSS using Informatica, DTS, PL/SQL and T-SQL Extract Data, Cleansing, Loading and security strategy for DM and DSS.
  • Establish auditing procedures to ensure continued data integrity.
  • Assist in post implementation improvement efforts to have performance and increased functionality.

Confidential, NY

Senior Consultant

Environment: Solaris 2.6, Windows NT, Sybase, Oracle, DB2, Informatica, Cognos, Microstrategy, PL/SQL, Transact-SQL, SQL, Java, Perl, ERWIN, Visio, Rational Rose. Business Objects and Brio.

Responsibilities:

  • GS Online offers Sales of New Issues of securities in the United States. It is an ecommerce site for equity capital markets of Confidential . It deals with different kinds of Instruments from Equities, Registered, Unregistered 144A, Convertibles, fixed income and Derivatives. Asset management, Derivatives, Debt instrument.
  • Global Financial Data Model. Straight through Processing, Trade matching, Trade settlement. Security Masters Tables, Fund Masters Tables, Lots and Holdings and Cash Transactions
  • Worked on Equities (Common Stocks) and fixed income instruments.
  • Worked on trade settlement, worked on Convertibles, Sales Trader customizations, IPO, allocation, IOI modules.
  • Involved in GAP Analysis to determine and document a specific business and technical approach for implementing the requirements.
  • Performed requirement analysis, went through all the user cases and issue logs
  • Analyzed the Business Logic and implemented the Conceptual model, Physical Data Model using ERWIN.
  • Pulling Data from Sybase, Oracle and DB2 from multiple sources SEC, Thompson financial and Internal National Glodman offices
  • Worked with Source Qualifier, Mappings, Transformation, Mapplets, Session and Batches in Informatica
  • Worked with Expression, Aggregation, Lookup, Rank, Sequence generator and Store procedure to analysis and develop data loading techniques. Build cubes and Dimension for Data warehousing Databases.
  • Designed various session and batches for ETL using Informatica
  • Developed test scenarios and implemented test plan.
  • Adhoc Query with COGNOS Query, multidimensional Analysis with Cognos Powerplay. Data Mart Creation with Cognos Decision Stream.
  • Reports generated using Business Objects, Microstrategy and Brio
  • Designed OOD for GS Online using Rational Rose and UML
  • Designed Java Classes for pulling Data from Datamarts using JDBC.
  • Designed Object Relational Mapping.
  • Setting up the JDBC driver and all connectivity issue.

Confidential, NY

Data warehouse Architect/Data Architect/Senior DBA

Environment: Solaris 2.6, Windows NT, Sybase, Oracle, DB2, Informatica, Cognos, Microstrategy, PL/SQL, Transact-SQL, SQL, Java, Perl, ERWin, Visio, Rational Rose and Business Objects and Brio.

Responsibilities:

  • Involved in GAP analysis around the identification of business rules, business and system process flows, user administration, requirements and assumptions.
  • Performed requirement analysis, went through all the user cases and issue logs
  • Analyzed the Business Logic and implemented the Conceptual Data model
  • Designed and implemented complete Physical Data Model
  • The case studies demonstrate the importance of having a methodology for defining meta data requirements, capturing and integrating meta data, how to calculate ROI, form a team, and develop a project plan, advanced meta data architectures, pulse-of-the-market analysis of meta data integration tool vendors, methodology for defining an attainable project scope, and a detailed walk through of a detailed meta data model.
  • Provided Master-to-Master and Master to Snapshot Architecture
  • Scheduled Push Purge
  • Designed and developed both read-only and updateable snapshot Handled complex conflict resolutions

Confidential, NY

Senior Data Architect/Senior DBA

Environment: Solaris 2.6, Windows NT, Oracle 8, PL/SQL, Java, JDBC, Perl, Kshell, SQL*Loader, Pro*C, Developer 2000 and C++

Responsibilities:

  • Analyzed Workflow
  • Designed high-level Entities
  • Designed Conceptual Data Model and Physical Data Model
  • Translated business needs into long term architecture solutions
  • Assisted in efforts for continuous improvement in performance and functionality
  • Coordinated enhancements and maintenance of data warehouse including structural changes
  • Extracted data from different sources using ETL Tools

We'd love your feedback!