Data Analyst Resume

NC

SUMMARY

  • 11 years of extensive experience in various domains of the IT industry like Banking, Travel, Logistics, and Healthcare as a Data Engineer.
  • Expertise in designing and implementing regression models to analyze healthcare data on a daily basis, using tools such as R and Python.
  • Skilled in conducting statistical analysis on healthcare data using regression models to identify trends and patterns and to control for confounding variables.
  • Proficient in evaluating the performance of regression models on healthcare data using metrics such as R-squared, MSE, and RMSE to ensure accurate and reliable results.
  • Experienced in communicating findings to stakeholders in the healthcare industry, delivering clear and concise reports and presentations to inform decision-making and improve patient outcomes.
  • Used statistical methods and machine learning techniques to analyze and interpret large and complex data sets.
  • Worked with a variety of programming languages and tools such as Python, R, SQL, and Tableau to manipulate and visualize data.
  • Hands-on experience in MS SQL Server with Business Intelligence in SQL Server Integration Services, SQL Server Analysis Services, SQL Server Reporting Services, Tableau Desktop, Tableau Server/Administrator, Power BI Desktop/Server, and in Azure Cloud Technologies including Azure Database, Azure SQL, Azure Data Warehouse, Azure Data Factory (ADF), Azure Data Lake (ADL), and Azure Databricks (ADB).
  • Expertise in major components of the Hadoop ecosystem, including HDFS, YARN, MapReduce, Hive, Sqoop, HBase, Spark, Spark SQL, Kafka, Spark Streaming, and Elasticsearch.
  • Built web apps, automation tools, and REST APIs in Python using the Django and Flask frameworks.
  • Created applications with Java, Spring, and Spring Boot REST APIs.
  • Work daily with a variety of technologies to extract data, including Hive, Hadoop, Sqoop, Spark, Kafka, and Elasticsearch.
  • Used several cloud computing platforms and services, such as Azure, AWS Lambda, OpenShift, and Kubernetes.
  • Code in Hive, Sqoop, Shell, Python, and PySpark to solve Big Data problems.
  • Significant practical experience working with different JavaScript frameworks, including Angular and React JS.
  • Classified data using Python libraries such as NumPy, pandas, and PySpark.
  • Developed web apps and REST APIs using the Flask and Django frameworks.
  • Significant expertise in archiving source databases into a big data environment using Sqoop.
  • Developed PySpark code to extract data from XML files.
  • Extensive experience using Python to create automation tools for large amounts of data.
  • Working knowledge of how to transmit reports to the business team using Python schedulers.
  • Wrote Hive views and Elasticsearch queries for data displayed in the front end.
  • Implemented applications from scratch using HTML5 and CSS.
  • Development experience in client-side and server-side JavaScript and JavaScript frameworks such as NodeJS, React JS, and Angular.
  • Experience writing HTTP interceptors to add headers to every call going to the server.
  • Experience in coding, testing, and implementation/maintenance support in object-oriented and MVC environments.
  • Magento development experience building e-commerce stores.
  • Experience working with Java, JSP, Servlets, Spring Boot, and JDBC.
  • Working experience with different types of databases like MySQL, PostgreSQL, MongoDB, and SQL.
  • Good experience writing CI/CD pipelines using Docker and Jenkins.
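
As a minimal illustration of the regression evaluation metrics named in the summary (R-squared, MSE, and RMSE), the sketch below computes them in plain Python. The function name `regression_metrics` and the sample values are hypothetical and are not taken from any project listed in this resume.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MSE, RMSE, and R-squared for paired observed/predicted values.

    Hypothetical helper for illustration only.
    """
    n = len(y_true)
    if n == 0 or n != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    # Residual sum of squares: squared gaps between observed and predicted values.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))

    # Total sum of squares: squared gaps between observed values and their mean.
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)

    mse = ss_res / n
    rmse = math.sqrt(mse)
    # R-squared is undefined when the observed values are all identical.
    r2 = 1 - ss_res / ss_tot if ss_tot else float("nan")
    return {"mse": mse, "rmse": rmse, "r2": r2}


# Example call with made-up numbers.
metrics = regression_metrics([3.0, 5.0, 7.0, 9.0], [2.5, 5.5, 6.5, 9.5])
print(metrics)
```

Lower MSE and RMSE indicate smaller prediction errors, while an R-squared closer to 1 indicates that the model explains more of the variance in the observed values.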

TECHNICAL SKILLS

Database: MySQL, PostgreSQL, MongoDB, and SQL

Data Modelling: Data Analytics, Statistics, Predictive Modeling, Machine Learning Models (Exploratory Data Analysis (EDA), Decision Trees & Random Forests, Linear & Logistic Regression, Clustering, etc.)

Programming Languages: Python (NumPy, Pandas, Matplotlib, NLTK, Scikit-Learn), R, Java

Frameworks: Spring Boot, Django, and Flask

Analytical Tools: Power BI Desktop/Server, Tableau Desktop, Snowflake

Big Data Tools: Hadoop Ecosystem, MapReduce, Spark, PySpark, HBase, Hive, Pig, Sqoop, Kafka, Snowflake, Databricks

Cloud: Azure, AWS, GCP, Azure SQL Database, Azure Data Studio, Azure SQL Data Warehouse, Azure Data Factory (ADF), Azure Databricks, OpenShift, Kubernetes, Docker, and Jenkins

UI: Angular, React

Source Control: GitHub, SVN

PROFESSIONAL EXPERIENCE

Confidential, NC

Data Analyst

Responsibilities:

  • Proficient in using supervised learning models such as linear regression, logistic regression, decision trees, and random forests for classification and prediction tasks.
  • Experienced in utilizing unsupervised learning models such as clustering, principal component analysis, and association rule learning for exploratory data analysis and pattern discovery.
  • Conducted data analysis and implemented machine learning models to enable data-driven decisions.
  • Collaborated with cross-functional teams including data scientists and product managers to define project requirements and deliverables.
  • Designed and developed scalable data processing pipelines using Apache Spark, Hadoop, and Hive.
  • Optimized data retrieval and storage using Elasticsearch.
  • Built REST APIs with Spring Boot and integrated them with cloud platforms such as Azure.
  • Created graphs with Tableau to illustrate the predictions and observations for the business unit.
  • Developed front-end interfaces using Angular and MERN-stack technologies such as NodeJS, MongoDB, and ReactJS.
  • Conducted code reviews and unit tests to ensure code quality.
  • Collaborated with product managers and stakeholders to define project requirements and deliverables.
  • Built and maintained data processing pipelines using Python and PySpark.
  • Worked effectively in cross-functional team environments, with excellent communication and interpersonal skills.
  • Evaluated Snowflake design considerations for any change in the application process.
  • Built the logical and physical data models for Snowflake as per the changes required.
  • Defined the roles and privileges required to access different database objects, and sized Snowflake virtual warehouses for different types of workloads.
  • Involved in the design and development of multiple Power BI Dashboards and reports.
  • Involved in creating notebooks for transforming data from the raw zone to the stage zone and then to the curated zone using Azure Databricks.
  • Managed data privacy and security in Power BI.
  • Extensively involved in designing and developing the Power BI Data model using multiple DAX expressions to build calculated columns and calculated measures.

Confidential, FL

Sr Data Engineer

Responsibilities:

  • Developed a framework for creating new snapshots and deleting old snapshots in Azure Blob Storage, and set up lifecycle policies to back up data from Delta Lake.
  • Expert in building Azure notebook functions using Python, Scala, and Spark.
  • Built and configured a virtual data center in the Azure cloud to support Enterprise Data Warehouse hosting including Virtual Private Cloud (VPC), Public and Private Subnets, Security Groups, and Route Tables.
  • Integrated both framework and CloudFormation to automate Azure environment creation along with the ability to deploy on Azure, using build scripts (Azure CLI) and automate solutions using Terraform.
  • Worked on transforming data on the Azure Databricks Spark platform into Parquet format for efficient data storage.
  • Worked on reading and writing multiple data formats like JSON, ORC, and Parquet on HDFS using PySpark.
  • Gathered business requirements and developed common solutions that meet them.
  • Worked on implementing secure views and row-level security on Snowflake tables.
  • Created external tables and copied data from external S3 storage to Snowflake.
  • Wrote fully parameterized Databricks code and ADF pipelines for efficient code management.
  • Developed Spark applications in Python (PySpark) in a distributed environment to load large numbers of CSV files with different schemas into Hive ORC tables.
  • Worked on SnowSQL, created Snowpipe for continuous data loading, and used COPY to bulk-load data into Snowflake.
  • Created data sharing between two Snowflake accounts.
  • Created internal and external stages and transformed data during the load.
  • Migrated objects from SQL Server/Teradata to Snowflake.
  • Used temporary and transient tables, procedures, and views in Snowflake to load dimension and fact tables in different databases.
  • Heavily involved in testing Snowflake to understand the best possible ways and best practices in optimizing cloud resources.

Confidential, FL

Data Engineer

Responsibilities:

  • Used Bash and Python (including Boto3) to supplement the automation provided by Ansible and Terraform for tasks such as encrypting EBS volumes backing AMIs.
  • Involved in using Terraform to migrate legacy and monolithic systems to Amazon Web Services.
  • Wrote Lambda function code and set a CloudWatch Event as the trigger using a cron expression.
  • Validated Sqoop jobs and shell scripts, and performed data validation to check that data was loaded correctly without any discrepancy. Performed migration and testing of static data and transaction data from one core system to another.
  • Worked on creating and running Docker images with multiple microservices and on Docker container orchestration using ECS and Lambda.
  • Developed Spark scripts by writing custom RDDs in Scala for data transformations and performing actions on RDDs.
  • Created metric tables and end-user views in Snowflake to feed data for Tableau refresh.
  • Generated Custom SQL to verify the dependency for the Daily, Weekly, and Monthly jobs.
  • Implemented Kafka producers with custom partitioning, configured brokers, and implemented high-level consumers for the data platform.
  • Developed best practices, processes, and standards for effectively carrying out data migration activities. Worked across multiple functional projects to understand data usage and implications for data migration.
  • Prepared data migration plans including migration risk, milestones, quality, and business sign-off details.
  • Performed advanced procedures like text analytics and processing, using the in-memory computing capabilities of Spark using Scala.
  • Worked on migrating MapReduce programs into Spark transformations using Scala.
  • Developed Spark code using Spark SQL and Spark Streaming for faster testing and processing of data.
  • Wrote Python modules to extract data from the MySQL source database.
  • Deployed the project on Amazon EMR with S3 connectivity for setting backup storage.
  • Created Jenkins jobs for CI/CD using Git, Maven, and Bash scripting.
  • Conducted ETL data integration, cleansing, and transformations using AWS Glue Spark scripts.

Confidential

Software Engineer

Responsibilities:

  • Proficient in designing and developing scalable RESTful APIs using Java Spring Boot framework with extensive experience in implementing CRUD operations, error handling, security, and authentication mechanisms.
  • Skilled in utilizing various Spring Boot components such as Spring Data JPA, Spring Security, and Spring MVC to build and deploy efficient and reliable RESTful web services, integrating with databases and third-party APIs.
  • Designed and developed custom web applications using PHP and CodeIgniter.
  • Developed and maintained e-commerce websites using Magento and WordPress.
  • Wrote complex SQL queries to retrieve and update data in MySQL databases.
  • Integrated payment gateways and shipping APIs to enable online transactions.
  • Developed responsive front-end interfaces using HTML, CSS, and JavaScript.
  • Collaborated with designers, developers, and project managers to meet project requirements.
  • Implemented Agile methodologies to ensure timely delivery of projects.
  • Created and maintained MySQL databases and optimized query performance.
