Data Science Analyst Resume

SUMMARY

  • 4+ years of Data Integration/ETL development experience
  • 4+ years of experience in writing SQL queries and SQL development in SQL Server 2012/2014/2016 and Oracle 11g/12c databases.
  • 4+ years of ETL design using Microsoft SSIS and Confidential PowerCenter.
  • Ability to work closely with Business Analysts and SMEs to understand business requirements
  • Perform Data profiling and analysis of structured and unstructured data
  • Create source to target mappings and ETL design for integration of new/modified data streams into the data warehouse/data marts
  • Validate the ETL design and ensure that technical specifications are complete, consistent, concise, and achievable.
  • Create ETL Processes and reports using Microsoft SQL Server Suite (SSIS, SSRS, SSAS) and other data integration tools like Data Stage, Confidential, Confidential initio.
  • 2+ years of experience with cloud-based platforms like Snowflake, Redshift and BigQuery.
  • 2+ years of experience with Big Data technologies like Hadoop and PySpark.
  • Develop ETL code as per the technical specifications and business requirements according to the established designs
  • 4+ years of SQL database experience with SQL Server 2012 and 2016 and Oracle 11g/12c
  • Excellent SQL skills and ability to create stored procedures in SQL Server and Oracle.
  • Participate in ETL architecture design reviews with the BI team
  • Understanding of Enterprise Data warehouse data models and dimensional modeling concepts, source to target mapping and Data Integration architecture
  • Demonstrated ability to meet tight deadlines, follow development standards and effectively raise critical issues with the client
  • Experience in creating Source to Target Mappings
  • Experience in Data Profiling & data analysis in a DW environment on Oracle/MS SQL Server
  • Performance Tuning of ETL processes in large scale data warehouse environment
  • Knowledge of MS SQL Server Suite (SSIS, SSRS, SSAS) and other ETL tools like Confidential, Data Stage.
  • Knowledge of end-to-end SDLC from project inception to deployment.
  • Experience in different project management methodologies, i.e. Agile and Waterfall.
  • Experience in Text Analytics, developing Statistical Machine Learning and Data Mining solutions to various business problems, and generating data visualizations using Python and Tableau.
  • Experience in developing data science and machine-learning models such as Regression, Classification and Clustering algorithms using Python, NumPy, Pandas, Scikit-Learn.

PROFESSIONAL EXPERIENCE

Confidential, Data Science Analyst

Confidential, TN

  • Conducted comprehensive analysis and evaluation of business needs; provided analytical support for policy; evaluated financial, operational and reputational impacts to influence decisions for different models.
  • Retrieved, manipulated, analyzed, aggregated and performed ETL on billions of records of claim data from relational databases and a Hadoop cluster using SAS (Proc SQL), PL/SQL, Scala, Sqoop and Flume.
  • Used Matplotlib and Seaborn in Python to visualize the data and performed feature engineering such as detecting outliers, handling missing values and interpreting variables.
  • Worked on transformation and dimension reduction of the dataset using PCA and Factor Analysis.
  • Developed, validated and executed machine learning algorithms including Naive Bayes, Decision Trees, Regression models, SVM and XGBoost to identify different kinds of fraud, and built reporting tools that answer applied research and business questions for internal and external clients.
  • Implemented models like Linear Regression, Lasso Regression, Ridge Regression, Elastic Net, Random Forest and Neural Network to provide predictions that help reduce the rate of fraud.
  • Experienced in using Pandas, NumPy, SciPy, Scikit-learn to develop various machine-learning algorithms.
  • Used SAS, PySpark and MLlib to evaluate different models using metrics like F-Score, Precision and Recall, and A/B testing.
  • Fine-tuned the developed algorithms using a regularization term to avoid overfitting.
  • Analyzed real-time data using Spark Streaming and Spark Core with MLlib.
  • Used the final machine-learning model to detect fraud in real-time data.
  • Extensively involved in data visualization using D3.js and Tableau.

Confidential, Redwood City

Software Engineering Intern, Product Specialist Team 

  • Experience in writing, developing and deploying ETL packages using Confidential Power Center to move data from Oracle 11g to Oracle 11g and from Oracle to SQL Server 2016 databases.
  • Worked with database developers to develop SQL queries and stored procedures/functions/triggers in SQL Server and Oracle
  • Successfully deployed packages from Development to QA and assisted in deploying the packages to production
  • Worked with Data Architects and BI developers in creating cubes in SSAS using multi-dimensional data modeling
  • Developed data mappings between source systems and warehouse components using Mapping Designer
  • Worked extensively on different types of transformations like source qualifier, expression, filter, aggregator, rank, update strategy, lookup, stored procedure, sequence generator, joiner, XML.
  • Set up folders, groups, users, and permissions and performed Repository administration using Repository Manager.
  • Involved in the performance tuning of the Confidential mappings and stored procedures and the SQL queries inside the source qualifier.
  • Isolated and fixed Integration Service, Repository Service and Database issues in the product by leveraging the tools provided by the Operating System as well as developing SQL queries and ad hoc tools to solve unique customer issues.
  • Developed Linux and Python scripts using pattern detection and tagging techniques to identify failures and performance bottlenecks in Application Services and ETL pipeline logs.

Confidential, Database Analyst / ETL Analyst

Confidential, TN

  • Developed and deployed packages using SSIS ETL to load data from Flat Files, XML, and data from Oracle to Oracle and Oracle to SQL Server 2016
  • Experience in writing, developing and deploying ETL packages to move data from various sources like SQL Server 2012/2014 databases to a SQL Server 2016 data warehouse environment.
  • Successfully deployed packages from Development to QA and assisted in deploying the packages to production
  • Worked with Data Architects and BI developers in creating cubes in SSAS using multi-dimensional data modeling
  • Involved in designing the data model for the data warehouse
  • Involved in Requirement Gathering and Business Analysis

Confidential

Applications Development Consultant / Database Developer

  • Developed various procedures and packages using SQL in SQL Server 2012/Oracle 11g
  • Developed and deployed packages using SSIS ETL to load data from Flat Files, XML, and data from Oracle to Oracle and Oracle to SQL Server 2012
  • Developed data warehouse / data mart using Snowflake and Star schema techniques.
  • Worked with Data Architects and BI developers in creating cubes in SSAS using multi-dimensional data modeling
  • Involved in designing the data model for the data warehouse
  • Involved in Requirement Gathering and Business Analysis
  • Experience in writing, developing and deploying ETL packages to move data from various sources like SQL Server 2012 databases to a SQL Server 2012 data warehouse environment.
  • Successfully deployed packages from Development to QA and assisted in deploying the packages to production

Confidential

Business Intelligence Analyst

  • Collaborating with cross-functional teams to determine business requirements and priorities, define metrics & key performance indicators (KPIs), and develop business intelligence reports to support company objectives and goals
  • Utilizing SQL and Hadoop environments to leverage customer transactional, behavioral & demographic data and perform statistical & predictive modeling to drive product development and customer experience.
  • Designing, developing & implementing Customer Master Data Model & Dashboard to extract customer 360 data for important metrics & dimensions to provide insights about active customers. This model helps in decision making & is reusable by the analyst community
  • Performing quantitative analysis based on the voice of customer (VOC) data to learn customer behavior, product adoption, market growth opportunities, and forecasting; testing hypotheses and transforming them into recommendations; developing BI dashboards using Tableau.