We provide IT Staff Augmentation Services!

Data Analyst Resume

4.00/5 (Submit Your Rating)

Jersey City, NJ

SUMMARY

  • Over 5+ years of IT professional experience in Data Analysis, Data Modeling, Designing, Developing, and implementing data models for enterprise - level applications and systems.
  • Experienced in integration of various relational and non-relational sources such as DB2, Teradata, Oracle, Netezza, SQL Server, NoSQL, COBOL, XML and Flat Files, to Netezza database.
  • Experience in providing solutions within the Hadoop environment using technologies such as HDFS, MapReduce, Pig, Hive, HBase, ZooKeeper, Storm, and other Big Data technologies.
  • Experienced in designing Star Schema, Snowflake schema for Data Warehouse, by using tools like Erwin data modeler, Power Designer and Embarcadero E-R Studio.
  • Experienced in big data analysis and developing data models using Hive, PIG, and MapReduce, SQL with strong data architecting skills designing data-centric solutions.
  • Experienced in Data modeling for Data Mart/Data Warehouse development including conceptual, logical and physical model design, developing Entity Relationship Diagram (ERD), reverse/forward engineer (ERD) with CA ERwin Data Modeler.
  • Experienced in Netezza tools and Utilities NzLoad, NzSql, NzPL/SQL, Sqltoolkits, Analytical functions, etc.
  • Extensive experience in Relational and Dimensional Data modeling for creating Logical and Physical Design of Database and ER Diagrams using multiple data modeling tools like Erwin, ER Studio.
  • Very good experience and knowledge on Amazon Web Services: AWSRedshift, AWS S3 and AWS EMR.
  • Experienced in importing and exporting the data using Sqoop from HDFS to Relational Database systems/mainframe and vice-versa.
  • Experienced in development and support knowledge on Oracle, SQL, PL/SQL,T-SQL queries
  • Experienced in Logical Data Model (LDM) and Physical Data Models (PDM) using Erwin, ER Studio and Power Designer data modeling tool.
  • Experienced in migration of Data from Excel, Flat file, Oracle to MS SQL Server by using SQL Server SSIS.
  • Strong experience in Normalization (1NF, 2NF, 3NF and BCNF) and De-normalization techniques for effective and optimum performance in OLTP and OLAP environments and experience with Kimball Methodology and Data Vault Modeling
  • Experienced in ETL design, development and maintenance using Oracle SQL, PL/SQL, TOAD SQL Loader, and Database Management System (RDBMS).
  • Experienced in Designed and developed Data models for Database (OLTP), the Operational Data Store (ODS), Data warehouse (OLAP), and federated databases to support client enterprise Information Management Strategy and excellent Knowledge of Ralph Kimball and BillInmon's approaches to Data Warehousing.
  • Expertise in SQL Server Analysis Services (SSAS), SSIS and SQL Server Reporting Services (SSRS)
  • Experienced in Transform, and Load data from heterogeneous data sources to SQL Server using SQL Server Integration Services (SSIS) Packages.
  • Good knowledge and experience in Developing Informatica Mappings, Mapplets, Sessionss, Workflows and Worklets for data loads from various sources such as Oracle, Flat Files, DB2, SQL Server etc.
  • Excellent understanding and working experience of industry standard methodologies like System Development Life Cycle (SDLC), as per Rational Unified Process (RUP), AGILE Methodologies.

TECHNICAL SKILLS

Programming Languages: C, Visual Basic, and C++, VB 6.0, SQL, Hadoop (Hive, Pig), Python, R.

ETL tools: Informatica Power center, SSIS, AB Initio

Data Modeling Tools: Erwin 9.7/9.6, Sybase Power Designer, Oracle Designer, ER/Studio V17

DataModeling: Sybase Power Designer / IBM Data Architect

BI & Reporting tools: Business Objects, Cognos, SSRS, Crystal reports, Business Intelligence, SSRS, and Cognos.

Project Execution Methodologies: Ralph Kimball & Bill Inmon data warehousing methodology, Rational Unified Process (RUP), Agile, Rapid Application Development (RAD), Joint Application Development (JAD)

MS-Office Package: Microsoft Office (Windows, Word, Excel, PowerPoint, Visio, Project).

ETL Tools / Tracking tool: Informatica, SSIS, SSAS, SSRS / JIRA.

Database Development: T-SQL and PL/SQL, Microsoft Hyper-V Servers

Databases: Teradata R12 R13 R14.10, Oracle, MS SQL Server, DB2, Netezza

Testing and defect tracking Tools: HP/Mercury (Quality Center, Win Runner, Quick Test Professional, Performance Center, Requisite, MS Visio & Visual Source Safe

Operating Systems: Windows, UNIX, Sun Solaris.

PROFESSIONAL EXPERIENCE

Confidential, Jersey City, NJ

Data Analyst

Responsibilities:

  • Responsible for Big data initiatives and engagement including analysis, brainstorming, POC, and architecture and worked with BigData and Big Data on Cloud, Master Data Management and Data Governance.
  • Designed the Logical Data Model using ERWIN 9.64 with the entities and attributes for each subject areas and Involved in designing Logical and Physical data models for different database applications using the Erwin.
  • Working on Cloud computing using Microsoft Azure with various BI Technologies and exploring NoSQL options for current back using Azure Cosmos DB (SQL API)
  • Designed and developed a DataLake using Hadoop for processing raw and processed claims via Hive and Informatica.
  • Developed and implemented different PigUDFs to write ad-hoc and scheduled reports as required by the Business team.
  • Design of Big Data platform technology architecture. The scope includes data intake, data staging, data warehousing, and high performance analytics environment.
  • Utilized Apache Spark with Python to develop and execute BigData Analytics and Machine learning applications, executed machine learning use cases under Spark ML and Mllib.
  • Used Polybase for ETL/ELT process with Azure Data Warehouse to keep data in Blob Storage with almost no limitation on data volume and created the template SSIS package that will replicate about 200 processes to load the data using Azure SQL.
  • Data modeling, Design, implement, and deploy high-performance, custom applications at scale on Hadoop /Spark and implemented Data Integrity and Data Quality checks in Hadoop using Hive and Linux scripts.
  • Involved in loading data from LINUX file system to HDFS Importing and exporting data into HDFS and Hive using Sqoop Implemented Partitioning, Dynamic Partitions, and Buckets in Hive.
  • Specifies overall Data Architecture for all areas and domains of the enterprise, including Data Acquisition, ODS, MDM, Data Warehouse, Data Provisioning, ETL, and BI.
  • Create the architectural artifacts for the Enterprise Data Warehouse and the Operational Dashboard, such as Entity Relationship Diagrams (ERD), the DDL scripts, the Conceptual Data Model, and technical as well as business documents.
  • Developed Data Mapping, Data Governance, and Transformation and cleansing rules for the Master Data Management Architecture involving OLTP, ODS.
  • Involved in Normalization / De normalization techniques for optimum performance in relational and dimensional database environments.
  • Implemented Spark solution to enable real time reports from Cassandra data and implemented strong referential integrity and auditing by the use of triggers and SQL Scripts.
  • Designed and developed T-SQL stored procedures to extract, aggregate, transform, and insert data and developed SQL Stored procedures to query dimension and fact tables in data warehouse.
  • Created and maintained SQL Server scheduled jobs, executing stored procedures for the purpose of extracting data from DB2 into SQL Server.
  • Performed migration and merging of RPD's in OBIEE and performed Hive programming for applications that were migrated to big data using Hadoop and focused on architecting NoSQL databases like Mongo, Cassandra and Cache database.
  • Involved in Dimensional modeling (Star Schema) of the Data warehouse and used Erwin to design the business process, dimensions and measured facts.
  • Generated parameterized queries for generating tabular reports using global variables, expressions, functions, and stored procedures using SSRS.
  • Created External and Managed tables in Hive and used them appropriately for different PIG scripts required for reporting.
  • Managed multiple ETL development teams for business intelligence and Master data management initiatives.
  • Point in time Backup and recovery in MongoDB using MMS and data modeling for data from RDBMS to and MongoDB for optimal reads and writes and perform routine management operations, including configuration and performance analysis for mongodb and diagnosing Performance Issues for MongoDB.
  • Reverse engineered some of the databases using Erwin and proficiency in SQL across a number of dialects (we commonly write MySQL, PostgreSQL, SQL Server, and Oracle).
  • Coordinating with Client and Business Analyst to understand and develop OBIEE reports.

Environment: DB2, CA Erwin 9.64, Oracle 12c, MS-Office, SQL Architect, TOAD Benchmark Factory, SQL Loader, PL/SQL, SharePoint, Informatica, SSRS, SSIS, Python, T-SQL, MongoDB, AWS S3, AWS Glue MS-Office, AWS Redshift, SQL Server 2016, Hive, Pig, Hadoop, Spark, Azure.

Confidential, Princeton, NJ

Data Analyst

Responsibilities:

  • Demonstrable expertise in core IT processes, utilizing ETL tools to query, validate, and analyze data.
  • Expert business and technical requirements documentation skills employing contemporary tools for data mapping, diagramming, Use Cases, and business rules to produce concise functional specifications.
  • Follow and assess the business process model defining metadata rules and critical data elements.
  • Conduct analysis, gather requirements, develop Use Cases, data mapping, and workflow diagrams.
  • Develop a global incident management reporting dashboard for the DQM over multiple IM platforms.
  • Investigate unused modules of the DQM and report viability and feasibility for implementation.
  • Comparative cost/benefit analysis between DQM modules and Inquiry Framework for DQ assessments.
  • Utilize multimedia office suite applications and conduct surveys for high-level dashboard reporting.
  • Performing daily integration and ETL tasks by extracting, transforming and loading data to and from different RDBMS.
  • Creating complex SQL queries and scripts to extract and aggregate data to validate the accuracy of the data.
  • Business requirement gathering and translating them into clear and concise specifications and queries.
  • Prepare high-level analysis reports with Excel and Tableau. Provides feedback on the quality of Data including identification of billing patterns and outliers.
  • Identify and document limitations in data quality that jeopardize the ability of internal and external data analysts.
  • Wrote standard SQL Queries to perform data validation and created excel summary reports (Pivot tables and Charts).
  • Gather analytical data to develop functional requirements using data modeling and ETL tools.
  • Systems Documentation change control/defect analysis and updates. Implementation testing.
  • Gathered data and documenting it for further reference and designed Database using Erwin DATA modeler.
  • Used Ref cursors and Collections with bulk bind and bulk collect for accessing complex Data resulted from joining of a large number of tables to extract data from the data warehouse.
  • Fine Tuned (performance tuning) SQL queries and PL/SQL blocks for the maximum efficiency and fast response using Oracle Hints, Explain plans.
  • Used Teradata as a Source and a Target for few mappings. Worked with Teradata loaders within the Workflow manager to configure Fast Load and Multi-Load sessions.
  • Load data from MS Access database to SQL Server 2005 using SSIS (creating staging tables and then loading the data).
  • Highly proficient in using T-SQL for developing complex Stored Procedures, Triggers, Tables, Views, User Functions, User profiles, Relational Database Models and Data Integrity, SQLjoins and Query Writing.
  • Migration of MS Access to SQL SERVER 2012.
  • Requirements gathering, analysis, Use Cases, data mapping, and workflow diagramming.
  • Data quality analysis and execution of the Data Quality Management (DQM) package.
  • Wrote SQL queries using analytical functions.
  • Created UML based diagrams such as Activity diagrams using MS Visio.
  • Perform data extrapolation and validation of reports for analysis and audits.
  • Created T/SQL statements (select, insert, update, delete) and stored procedures.
  • Utilize SSIS for ETL data modeling, data migration, and analysis.
  • Project manages analytics for deployment within the Development Life Cycle.
  • Developed SQL scripts involving complex joins for reporting purposes.
  • Developed various SQL scripts and anonymous blocks to load data SQL Server 2005.

Environment: Windows, MS Office (MS Word, MS Excel, MS PowerPoint, MS Access, MS SharePoint, MS Visio), SQL, SSIS, ETL, SSRS, Erwin, Tableau, SQL.

Confidential

Data Analyst

Responsibilities:

  • Work with users to identify the most appropriate source of record required to define the asset data for financing
  • Performed data profiling in Target DWH
  • Experience in using OLAP function like Count, SUM,and CSUM
  • Performed Data analysis and Data profiling using complex SQL on various sources systems including Oracle and Teradata.
  • Hands on Experience on Sqoop.
  • Developed normalized Logical and Physical database models for designing an OLTP application.
  • Developed new scripts for gathering network and storage inventory data and make Splunk ingest data.
  • Imported the customer data into Python using Pandas libraries and performed various data analysis - found patterns in data which helped in key decisions for the company
  • Created tables in Hive and loaded the structured (resulted from Map Reduce jobs) data
  • Using HiveQL developed many queries and extracted the required information.
  • Exported the data required information to RDBMS using Sqoop to make the data available to the claims processing team to assist in processing a claim based on the data.
  • Design and deploy rich Graphic visualizations with Drill Down and Drop-down menu option and Parameterized using Tableau.
  • Extracted data from the database using SAS/Access, SAS SQL procedures and create SAS data sets.
  • Created Teradata SQL scripts using OLAP functions like RANK () to improve the query performance while pulling the data from large tables.
  • Worked on MongoDB database concepts such as locking, transactions, indexes, Sharding, replication, schema design, etc.
  • Performed Data analysis using Python Pandas.
  • Good experience in Agile Methodologies, Scrum stories, and sprints experience in a Python-based environment, along with data analytics and Excel data extracts.
  • Created Hive queries that helped market analysts spot emerging trends by comparing fresh data with EDW reference tables and historical metrics.
  • Involved in defining the source to target data mappings, business rules, business and data definitions
  • Responsible for defining the key identifiers for each mapping/interface
  • Responsible for defining the functional requirement documents for each source to the target interface.
  • Hands on Experience on Pivot tables, Graphs in MS Excel
  • Using advanced Excel features like Pivot tables and Charts for generating Graphs.
  • Designed and developed weekly, monthly reports using MS Excel Techniques (Charts, Graphs, Pivot tables) and Powerpoint presentations.
  • Strong Excel skills, including pivots, VLOOKUP, conditional formatting, large record sets. Including data manipulation and cleaning.

Environment: SQL/Server, Oracle 9i, MS-Office, Teradata, Informatica, ER Studio, XML, Hive, HDFS, Flume, Sqoop, R connector, Python, R, Tableau 9.2.

Confidential

SQL Developer

Responsibilities:

  • Created new database objects like Procedures, Functions, Packages, Triggers, Indexes and Views using T-SQL in Development and Production environment for SQL Server 2000
  • Actively participated in gathering of Requirement and System Specification.
  • Developed SQL Queries to fetch complex data from different tables in remote databases using joins, database links and formatted the results into reports and kept logs
  • Worked on complex T-SQL statements, and implemented various codes and functions.
  • Installed, authored, and managed reports using SQL Server 2005Reporting Services
  • Wrote Transact SQL utilities to generate table insert and update statements.
  • Developed and optimized database structures, stored procedures, DDL triggers and user-defined functions.
  • Implemented new T-SQL features added in SQL Server 2005 that are Error handling through TRY-CATCH statement, Common Table Expression (CTE)
  • Created Stored Procedures to transform the data and worked extensively in T-SQL for various needs of the transformations while loading the data.

Environment: SQL Server 2008/2005, T-SQL, .Net, Microsoft Framework 2.0, Erwin, MS Visio.

We'd love your feedback!