We provide IT Staff Augmentation Services!

Senior Data Analyst/data Engineer Resume

3.00/5 (Submit Your Rating)

Malvern, PA

SUMMARY

  • Around 6 years of technical experience in the field of Finance, Retail, and Ecommerce performing Statistical Modelling, Data Extraction, Data cleaning, Data screening, Data Exploration and Data Visualization.
  • Extensive knowledge in python, Java, MySQL, oracle UNIX and LINUX.
  • Experience with Normalization (1NF,2NF and 3NF) and de - normalization techniques for improved database performance in OLTP,OLAP, data warehouse and DataMart environments.
  • Web development using python, Django.
  • Experience with web development, web services, python, and Django framework.
  • Worked on several python packages likeNumPy,SciPy,Pytables, pandas and scikit-learn.
  • Well versed with the concepts of Forward Engineering and Reverse Engineering for the existing databases for physical models using Erwin tool.
  • Hands on experience in SQL and PL/SQL and writing stored procedures.
  • Good experience with oracle 9i, 10g, 11g, MS SQL Server, MS Access
  • Hands on experience in data processing automation using python.
  • Experienced in all phases of software development life cycle SDLC. (Analysis, Requirements gathering, Designing) with expertise in documenting various requirements specifications, functional specifications, Test plans, Source to Target mappings, SQL joins.
  • Strong working Experience with Agile, Scrum, Kanban, and Waterfall methodologies.
  • Good experience in troubleshooting SQL queries, ETL queries, data warehouse/ data mart/ data store models.
  • Experience in building reports using SQL server reporting services and crystal reports.
  • Significant exposure to Talend ETL tool.
  • Experienced in designing Data Mart and creation of cubes.
  • Excellent analytical and communication skills with clear understanding of business process flow and SDLC life cycle.

TECHNICAL SKILLS

Data modelling: Erwin DM r9, Power Designer, Microsoft Visio 2016, ER Studio, Star-schema Modelling, snowflake- schema modelling, FACT, and dimension tables.

ETL Tools: Informatica, Microsoft SSIS

Databases: Oracle 9i, 10g, 11g, MS SQL Server, PostgreSQL, Oracle, MongoDB, Oracle SQL Teradata

Operating Systems: Linux, Unix, Windows Server

Programming Language: SQL, Python, R, (MySQL& SQL Server), C, UNIX Shell scripting.

Reporting Tools: Tableau, PowerBI, Matplotlib, Seaborn and crystal reports

Office Applications: MS Office (Word, Excel, Power point)

Methodologies: Scrum, Agile, Waterfall, SDLC

Cloud services: AWS S3, EC2, Azure, GIT

Packages and tools: Pandas, NumPy, SciPy, Scikit-Learn, ggplot2.

Machine learning: Linear Regression, logistics Regression, Decision tree, Random Forest, Gradient boosting, Support Vector Machines, Time series forecasting and Dimensionality Reduction.

PROFESSIONAL EXPERIENCE

Senior Data Analyst/Data Engineer

Confidential, Malvern, PA

Responsibilities:

  • Participated in requirement gathering session with business users and sponsors to understand and document the business requirements.
  • Designed and developed a horizontally scalable API’S using Python Flask.
  • Worked on logical and physical model using Erwin based on requirements.
  • Developed entire frontend and backend modules using python on Django web Framework.
  • Integrated high - level business rules (constraints, triggers, and indexes) with the code.
  • Assisted DBAs in the implementation of the data models.
  • Worked closely with ETL team in loading and mapping the data.
  • Created Source-to-target (S2T) mapping document as part of Data Analysis.
  • Involved in data migration from staging to integration.
  • Implemented the standard naming conventions.
  • Built Tabular model cubes using SSAS involving dimensions and fact tables
  • Involved in day-to- day maintenance and solved any issues related to reports.
  • Created and maintained Database Objects (Tables, Views, Indexes, Partitions, Database triggers)
  • Dealt with different data sources ranging from flat files, Excel, Oracle, and SQL Server.
  • Developed Tableau visualizations and dashboards using Tableau Desktop.
  • Experience in Project development and coordination with onshore-offshore ETL/BI developers & Business Analysts.
  • Generated ad-hoc SQL queries using joins, database connections, and transformation rules to profile datafrom Oracle and SQL Server database systems.
  • Strong Experience in conducting User Acceptance Testing (UAT), Unit Testing and documenting Test Cases and Test Scripts.
  • Communicating with the project team throughout all stages of design, managing time effectively, and work on project timelines simultaneously in demanding deadline driven environment.

Environment: Tableau 2020.2, Anaconda 2021.05, Jupyter Notebook Python 3.3, Excel 2013,, UNIX, MS Excel 2007, SQL server 8, HP Quality Center 10.

Data Analyst/Data Modeler

Confidential, Southfield, Michigan

Responsibilities:

  • Conducted JAD sessions, wrote meeting minutes and documented the requirements.
  • Attended and participated in information and requirements gathering sessions.
  • Ensured that Business Requirements can be translated intoDataRequirements.
  • Created Business Requirement documents (BRD's), such as SRS & FRS and integrated requirements and underlying platform functionality.
  • Experience in working on various python packages such as NumPy, SQL, and PyTables.
  • Designed the technical specifications document for Teradata ETL processing ofdatainto masterdatawarehouse and strategized the integration test plan and implementation.
  • Designed and Developed Use Cases, Activity Diagrams, and Sequence Diagrams using Unified Modeling Language (UML).
  • Performed Normalization of the existing OLTP systems (3rd NF), to speed up the DML statements execution time.
  • Experience in developing web applications by following Model View Control (MVC) architecture using server-side application Django
  • Data Modelling in Erwin design of target data model for enterprise data warehouse.
  • Created and Maintained LogicalDataModel (LDM) for the project.
  • Designed Star schema and Snowflake schemaDataModels for EnterpriseDataWarehouse using ER Studio.
  • Worked extensively on Transactional- grain, Periodic snapshot grain and Accumulating snapshot grain while designing dimensional models.
  • Validated and updated the appropriate LDM's to Process Mappings, Screen Designs, Use Cases, Business Object Model, and System Object Model as they evolve and change.
  • Designed the Database Tables & Created Table and Column Level Constraints using the suggested naming conventions for constraint keys.
  • Worked on optimizing and tuning the Teradata views and SQL’s to improve the performance batch and response time ofdatafor users.
  • Writing Procedure and Package using Dynamic PL/SQL.
  • MaintainedDataModel and synchronized it with the changes to the database.
  • Involved with all the phases of Software Development Life Cycle (SDLC) methodologies throughout the project life cycle.
  • Attendant architecture meeting anddatagovernance meeting to understand the project.
  • Identified and mapped variousdatasources and their targets successfully to create a fully functioningdatarepository.

Environment: SQL Server 2012/2014, PL/SQL, python, Django, TeraDataETL, Informatica, Toad, Erwin 9.6, Microsoft Visual Studio,DataObjects.

Data Modeler/Data Analyst

Confidential -Irving, TX

Responsibilities:

  • Interacted with business users to clarify on business logic required for thedatamodels.
  • Involved in gathering complete requirements by organizing and managing meetings with BusinessAnalysts,DataStewards, and subject matter experts on a regular basis.
  • Analyzed OLTP source systems and OperationalDataStore and research the tables/entities required for the project.
  • Analyzed the specifications and identified the sourcedatafrom disparatedatasources like Oracle, MS SQL Server and flat files that needs to be moved todatawarehouse.
  • Designed and maintained the Logical /Physical dimensionaldatamodels and generating the DDL statements and working with the database team in creating the tables, views, keys in the database.
  • PerformedDataprofiling and identified the risks involved withdataintegration to avoid time delays in the project.
  • PerformedDatascrubbing for removing incomplete, irrelevantdataand maintained consistency in the targetdatawarehouse by cleaning the dirtydata.
  • Performing data analytics capabilities including introduction of technical capabilities to facilitate data centric decision making.
  • Worked on specifications given by theDatagovernance team andDataquality team that required managing the masterdatafrom all the business units and ensuringdataquality across the enterprise.
  • Validate the models with the productiondataand developed the Source to Target mapping matrix, an expert solution to design and develop the mappings for the loading ofdata, to the ETL developers.
  • Conducted frequent meetings with my ETL coding and development team to co-ordinate the process and to efficiently organize and distribute the workflow among the team.

Environment: ER Studio 4.5, INFORMATICA Power Center5.0, Oracle 10g, SQL Server 2008, Flat files, SQL/PL SQL, WIN 2000/NT

SQL Developer

Confidential

Responsibilities:

  • Worked on the project that involved development and implementation of adatawarehouse.
  • Created Store Procedures, Functions, Triggers, Indexes and Views using T-SQL in all the environments for SQL Server 2000.
  • Developed DTS Packages to transfer thedatabetween SQL Server and other database and files.
  • Scheduled jobs & Re-Scheduled on Production Databases.
  • Created report form with Stored Procedure using T-SQL.
  • Wrote and optimized Triggers, Stored Procedures and Queries in T-SQL.
  • Created database objects like tables, views, and indexes.
  • Created and modified stored procedures, triggers, and cursors.
  • Designed, developed, and modified various Reports.
  • Handled and managed customer relationship management system.
  • Designing and modeling database according to Requirement and made changes according to it.
  • Designed, developed, and tested complex queries for Reports distribution.
  • Built and performed user acceptance testing on modified reports.
  • Skilled in running jobs in Enterprise Manager using job-scheduling tool.

Environment: MS SQL Server 2005, MY SQL, Lotus Access, MS Office Suite (word, power point, excel), SSRS, SQL Server Integration Services (SSIS).

ETL Developer / Data Modeler

Confidential

Responsibilities:

  • Participated in requirement gathering session with business users and sponsors to understand and document the business requirements.
  • Involved in analyzing the financial impact of health plan initiatives for large corporations.
  • Worked on logical and physical model using ERwin based on requirements.
  • Worked with DBA to create the physical model and database objects.
  • Identified the Primary Key, Foreign Key relationships across the entities and across subject areas.
  • Worked closely with ETL team in loading and mapping the data.
  • Created SSIS packages to automate the ETL processes included Meta data on record count, file size and run time.
  • Developed ETL process using Pentaho PDI to extract the data from Oracle Database.
  • Calculate and analyze claims data for provider incentive and supplemental benefit analysis using Microsoft Access and Oracle SQL.
  • Involved in Index Analysis and performance Tuning.
  • Worked on reconciliations of balances from different systems.
  • Analyzed requirements with developers, business analysts and provided my inputs.
  • Created Source-to-target (S2T) mapping document as part of Data Analysis.
  • Wrote and executed the test cases to perform System, Functional and Regression testing.
  • Designed SSIS Packages to extract, transfer, load (ETL) existing data into SQL Server from different environments for the SSAS cubes.
  • Provided analytic support to patient quality improvement teams working in different clinical.
  • Wrote and edit SQL queries for database testing and reports verification.
  • Worked on creating DDL, DML scripts for the data models.
  • Worked on stored procedures for processing business logic in the database.
  • Actively ensuring the production implementation of the new enhancements with smooth transition of the project by adding value to the team in filling the gap where required as an analyst, programmer, and QA tester.
  • Created and maintained Database Objects (Tables, Views, Indexes, Partitions, Database triggers etc.)
  • Dealt with different data sources ranging from flat files, Excel, Oracle, and SQL Server.
  • Communicating with the project team throughout all stages of design, managing time effectively, and work on project timelines simultaneously in demanding deadline driven environment.

Environment: CA Erwin, Oracle10g, MS Excel, SQL server, SSIS, Oracle.

We'd love your feedback!