Senior Data Engineer Resume
Danville, PA
SUMMARY
- 9 Years of extensive IT experience as a Data Engineer in creating Business Intelligence Solutions like Hadoop, Python, Azure/AWS, SSIS, SSAS, SSRS, Power BI.
- Expert in providing ETL solutions for any type of business model.
- Provided and constructed solutions for complex data issues.
- Experience in development and design of various scalable systems usingHadooptechnologies in various environments. Extensive experience in analyzing data using Hadoop Ecosystems includingHDFS, MapReduce, Hive & PIG.
- Experience in understanding teh security requirements for Hadoop.
- Experience with all phases of software development life cycle (SDLC) and project methodologies.
- Involved in buildingData ModelsandDimensional Modelingwith3NF, Star and Snowflakeschemas forOLAPandOperational data store (ODS)applications.
- Skilled in designing and implementingETL Architecturefor cost TEMPeffective and efficient environment.
- Optimized and tuned ETL processes & SQL Queries for better performance.
- Performed complexdata analysisand provided critical reports to support various departments.
- Work with Business Intelligence tools likeBusiness Objectsand Data Visualization tools likeTableau.
- ExtensiveShell/Python scriptingexperience for Scheduling and Process Automation.
- Extensive experience of best practices ofEnterprise Data Warehouseand involved in Full life cycle development ofData Warehousing.
- Excellent skills in implementing ETL Packages using Data Transformation Services (DTS) and SSIS 2005/ 2008/2008 R 2/2012/2014/2016.
- Migrated DTS packages to SSIS and modified using DTS editor wizard.
- Developed a dashboard solution for analyzing STD statistics by building SSIS 2016 cubes and Tableau
- Experience in using SSIS tools like Import and Export Wizard, Package Installation, and SSIS Package Designer.
- Worked with teh Version control Systems like Team Foundation Server (TFS), Visual Source Safe (VSS).
- Proficient in usage of SSIS Control Flow items (For Loop, Execute package/SQL tasks, Script task, send mail task) and SSIS Data Flow items (Conditional Split, Data Conversion, Fuzzy lookup, Fuzzy Grouping, Pivot).
- Trained and experience in working on Splunk and developing dashboards on Splunk server.
- Extensively worked on Erwin Data Model Tool and good noledge of Relational Database Modeling/Loading and Dimensional database modeling concepts.
- Experience in SAS to extract data from a variety of operational data sources on multiple platforms and build a data warehouse and data marts that integrate teh extracted data.
- Experience in building SSIS packages (.Dtsx) involving ETL process, extracting data from various flat files, Excel files, legacy systems and loading into SQL server.
- Experience in writing Multidimensional Expressions (MDX) and used teh MDX queries in teh report generation
- Designed and implemented solution to integrate applications with Partner Company like GSI, DAX for more than 36 applications.
- Experience in creating Jobs, Alerts, SQL Server Mail Agent, and schedule using DTS/SSIS Packages.
- Hands on experience working with SSIS, for ETL process ensuring proper implementation of Event Handlers, Loggings, Checkpoints, Transactions and package configurations.
- Experience in Importing/Exporting Data from various data sources like SQL Server, Excel, DB2 and Flat file using DTS utility.
- Experience in Designing, Creating, Deploying and Processing of Cubes using SSAS.
- Good experience in Designing Aggregations and pre - calculating teh same in SSAS 2005/2008/2008 R 2/2012/2014/2016.
- Excellent hands on experience in creating groups, hierarchies and measures, KPIs.
- Developed a dashboard solution for Integrated SAP ECC, SAP BW with IBM Information Server.
- Expertise in designing complex reports like Drill-Down Reports, Drill-Through Reports and Parameterized Reports using SQL Server Reporting Services (SSRS) based on client requirement.
- Expert in creating Ad-Hoc reports, linked reports, Snapshot reports, Crosstab, Conditional, Top N and Sub reports using SQL SSRS.
- Proficient in creating Standard Subscriptions and Data driven subscriptions for delivering reports to people internally within teh project.
- Expert in deploying teh SSRS reports using Report manager.
- Built TEMPeffective queries for high-performance reports and rendered them to HTML, XML, PDF and Excel formats using SSRS.
- Integration of SSRS Reports using various web parts into Share point (MOSS 2007) and various delivery mechanisms.
- Strong experience with Crystal Reports.
- Strong Data Mapping Experience.
- Administered and optimized MySQL and SQL Server 2014 databases
- Data modeling experience using Erwin Data Modeler and MS Visio.
- Strong ASP.NET, VB.NET noledge.
- Expertise in RDBMS concepts such as Tables, User Defined Data Types, Indexes, Indexed Views.
- Good experience on Qlikview Business Intelligence reporting tool for developing dynamic reports.
- Expert in Complex Stored Procedures, User Defined Functions (UDFs), Triggers, Common Table Expressions (CTEs), Table Variables and Views.
- Extensively used Joins and Sub-Queries to simplify complex queries involving multiple tables and established relationships between tables using Primary Key& Foreign Key Constraints.
- Migrated teh MySQL database to SQL Server 2016
- Evaluated database performance and performed maintenance duties such as Tuning, Backup, Restoration and Disaster Recovery.
- Developed MDX Scripts to create datasets to perform reporting and included interactive drill down reports, report models and dashboard reports.
- Experience in calculating measures and dimension members using Multi-dimensional expression (MDX), mathematical formulas, and user-defined functions.
- Experience in creating Cell Level Security in cubes using SSAS.
- Experience in SQL Optimizing by using SQL Profiler, Index Tuning Wizard, Database Tuning Advisor (DTA), DBCC Commands and Query Execution Plans.
- Designed and DevelopedKPIs, Aggregations and Cube Partitions.
- Good Knowledge and working exposure in Informatica Power Center 9.5
- Very good communication skills, good interpersonal skills & desire to excel, fast learner, hardworking and a versatile team player.
TECHNICAL SKILLS
Operating Systems: Windows NT 4.0, Windows 2000 Server, Windows 2000 Advanced Server, Windows 2003 Server, Windows XP, Windows Vista, 7.
Programming Languages: SQL, SQL*Plus, HTML, XML, C, C++, DS, Visual Studio 2005MDX
Software: MS Office 2010/2007, RDBMS, MS SQL Server 2008/2005/2000 SQL Server Management Studio(SSMS), SSAS, Oracle 9i, MS Access
ETL Tools: DTS, SQL Server Integration Services(SSIS)
Database Query Tools: MS SQL Server Query Analyzer, SQL Profiler
PROFESSIONAL EXPERIENCE
Confidential, Danville, PA
Senior Data Engineer
Responsibilities:
- Analyzed business requirements with Project Manager and Business Analyst.
- Worked on EDI 834, 835, 837 files & processed them on daily basis.
- Database design and development includingTables, Primary and Foreign Keys, IndexesandStored Procedures.
- Constructed and implemented multiple-table links requiring complex join statements, includingOuter-JoinsandSelf-Joins.
- Involved in converting Hive/SQL queries into transformations using Python.
- Performed complex joins on tables in hive with various optimization techniques
- Created Hive tables as per requirements, internal or external tables defined with appropriate static and dynamic partitions, intended for efficiency
- Worked extensively with HIVE DDLS and Hive Query language (HQLs)
- Involved in loading data from edge node to HDFS using shell scripting.
- Understand and manage Hadoop Log Files.
- Provided ad-hoc queries and data metrics to teh Business Users using Hive, Pig
- Worked on importing and exporting data from snowflake, Oracle and DB2 into HDFS and HIVE using Sqoop for analysis, visualization and to generate reports.
- Involved in file movements between HDFS and AWS S3 and extensively worked with S3 bucket in AWS.
- Used EMR (Elastic Map Reducing) to perform bigdata operations in AWS.
- Worked on Apache spark writing python applications to convert txt, xls files and parse.
- Developed Python scripts, UDF's using both Data frames/SQL and RDD/MapReduce in Spark for Data Aggregation, queries and writing data back into RDBMS through Sqoop.
- Created packages in SSIS with error handling worked with different methods of logging in SSIS.
- Designed a complex SSIS package for importing data from SQL Server 2016/2008 R2.
- Used various transformations like Multicast, Conditional split, OLEDB Command, Script Component and Lookup while loading teh data into Destination.
- Successfully deployed SSIS packages with defined security.
- Advanced extensible reporting skills using SQL Server 2016 Reporting Services (SSRS).
- Designed and created Report templates, bar graphs and pie charts based on teh financial data.
- Developed various types of complex reports like Drill Down, Drill through, Cross tab reports.
- Involved in scheduling, creating snapshots and subscriptions for teh reports using SSRS 2008.
- Involved in designingParameterized Reportsfor generatingAd-Hocreports as per teh client requirements.
- Developed Dashboard reports using Power BI & Tableau tools.
- Coordinated with front-end for implementing logic in stored procedures and functions.
- Experienced in writing complex SQL Queries, Stored Procedures, Triggers, Views, Cursors, Joins, Constraints, DDL, DML and User Defined Functions to implement teh business logic and also created clustered and non-clustered indexes.
- Involved in Normalization and De-Normalization of existing tables for faster query retrieval.
- Advised optimization of queries by looking at Execution Plan for better tuning of teh database.
- Implemented Proof of concepts for SOAP & REST APIs.
- Built web services mappings and expose them as SOAP wsdl.
- Worked with Python and UNIX scripts for file transfer, emailing and other file related tasks.
- Worked with deployments from Dev to UAT, and tan to Production.
Confidential, Charlotte, NC
Senior Data Engineer
Responsibilities:
- Involved in Planning, Defining and Designing data, based on business requirements and provided documentation.
- Worked with Business Analysts in gathering requirements and translating them to technical specifications.
- Developed Stored Procedures, User Defined Functions, Views, T-SQL Scripting for complex business logic.
- Implemented a proof of concept deploying this product inAWS S3 bucketandSnowflake.
- Utilize AWS services with focus on big data architect /analytics / enterprise Data warehouse and business intelligence solutions to ensure optimal architecture, scalability, flexibility, availability, performance, and to provide meaningful and valuable information for better decision-making.
- DevelopedScalascripts,UDF's using bothdata frames/SQL and RDDinSparkfor data aggregation, queries and writing back into S3 bucket.
- Experience indata cleansing and data mining.
- Wrote, compiled, and executed programs as necessary using Apache Spark in Scala toperform ETL jobswith ingested data.
- Loading teh data from multiple Data sources like (SQL, DB2, and Oracle) intoHDFSusingSqoopand load intoHive tables.
- CreatedHIVE Queriesto process large sets of structured, semi-structured and unstructured data and store in Managed and External tables.
- Developed ComplexHiveQL'susingSerDe JSON.
- Managed and reviewedHadoop logfiles.
- Developed various interface webpages for incoming files for teh Vendor Interfaces including service invoice, repair order, finance and insurance usingC#and job claims withstored procedures.
- Optimized teh database by creating various clustered, non-clustered indexes and indexed views.
- Increased teh performance necessary for statistical reporting after performance monitoring, tuning and optimizing indexes.
- Used Joins, CTE's, correlated and non-correlated sub-queries for complex business queries involving multiple tables from different databases and implemented triggers and Stored Procedures and enforced business rules via checks and constraints.
- Extracted large volumes of data from various data sources and loaded teh data into target data sources by performing various kinds of transformations using SQL Server Integration services (SSIS)
- Developed and deployed SSIS packages for ETL from OLTP and various sources to staging and staging to Data warehouse using Lookup, Fuzzy Lookup, Derived Columns, Condition Split, Term, Slowly Changing Dimension and more. Performed ETL mappings using MS SQL Server Integration Services.
- Created SSIS packages for Uploading of different formats of files (Excel, Access, dbf) and databases (SQL server, Flat files) into teh SQL Server data warehouse using SSIS.
- Created Jobs in SSIS and was responsible for ETL job scheduled to run daily. Solved teh issues associated with ETL Data Warehouse Failure.
- DevelopedSAS macrosfor Data cleaning, Data mining and reporting and to support routing processing
- Used Script task to write custom code using Vb.net.
- Developed SSAS OLAP Cubes and Partitioning Cubes.
- Wrote MDX queries to retrieve data from OLAP (SSAS) cubes.
- Used Data Analysis Expressions DAX to creating custom calculations in PowerPivot for Microsoft Excel workbooks and Analysis Services tabular model projects.
- Responsible for creating SQL datasets forPowerBIand Ad-hoc Reports.
- Producedad hoc reportsin SAS using .txt files upon request for in-depth analysis.
- Developed SAS code using Macros to run recurring jobs and optimized code.
- Expert on creating multiple kind ofPowerBIReports and Dashboards.
- Using SQL server reporting services (SSRS) delivering enterprise, Web-enabled reporting to create reports that draw content from a variety of data sources.
- Created jobs for automatically running Stored Procedures for day-end operations using SQL Server agent such as index rebuild, data loads and summary reports.
- Involved in Performance Tuning of Code using execution plan and SQL profiler and added Indexes to improve performance on tables.
- Designed, developed, tested, and maintained Tableau functional reports based on user requirements.
- Developed Tableau data visualization using Cross tabs, Heat maps, Box and Whisker charts, Scatter Plots, Geographic Map, Pie Charts and Bar Charts and Density Chart.
- Generated Reports using Global Variables, Expressions and Functions for teh reports using SSRS.
- Designing and implementing a variety of SSRS & Tableau reports such as Parameterized, Drilldown, Ad hoc and Sub-reports using Report Designer and Report Builder based on teh requirements.
- Designed SSRS reports with dynamic sorting, defining data source and subtotals for teh report.
- Created different types of reports such as Crosstab, Drill-down, Drill Through, OLAP and formatting them using both SSRS 2016/2014.
Environment: Windows 2003 Server/XP, SQL Server 2016/2014/2012 , SQL Integration Services(SSIS), SQL Reporting(SSRS), Business Intelligence Studio, Visual Studio 2008/2005, Team foundation Server, Tableau, VB.NET, OLE DB, DTS Package, SharePoint MOSS 2007, Crystal Reports, Erwin 4.0, VB.Net, HTML.
Confidential, Montgomery, AL
Data Engineer/SQL Developer
Responsibilities:
- Involved in Planning, Defining and Designing data, based on business requirements and provided documentation.
- Successfully followed agile development steps starting from requirement to deployment.
- Involved in development and implementation of SSIS, SSAS and SSRS application solutions for various business units across teh organization.
- Designed Data Mapping documents, DDL scripts, and Data Dictionary and Meta data of teh models and maintained them.
- Developed SQL scripts for administration task.
- Maintained SQL scripts, indexes, complex queries for data analysis and extraction.
- Used DTS Packages as ETL tool for migrating Data from SQL Server 2000 to Oracle 10g.
- Extensive experience working with SQL Server 2014 with high availability.
- Worked on SQL*Loader to load data from flat files obtained from various facilities every day.
- Developed PL/SQL triggers and master tables for automatic creation of primary keys.
- Involved in data loading using PL/SQL and SQL*Loader calling UNIX scripts to download and manipulate files.
- Used .NET provider Data Reader Source Task to Read from RDBMS sources.
- Developed teh classes using C#, which in corporate N-tier architecture and database connectivity.
- Worked in ASP.NET Web forms, State management and Caching feature.
- Maintained WCF services across teh enterprise and integrating UI using MVC pattern
- Developed some codes to work on Visual Studio 2008 and Team foundation server (TFS) also used teh same code in SSRS to generate Reports.
- Used SSIS and T-SQL stored procedures to transfer data from OLTP databases to staging area and finally transfer into data marts.
- Worked on various RDBMSs such as MS SQL Server 2000, Access 2000 and MySQL
- Created SSIS packages to extract data from OLTP and transformed to OLAP systems and Scheduled Jobs to call teh packages and Stored Procedures.
- Created Alerts for successful or unsuccessful completion of Scheduled Jobs.
- Used various SSIS tasks such as Conditional Split, Derived Column, which were used for Data Supported in creating KPI, calculation members and measure groups, and MDX queries
- Scrubbing, data validation checks during Staging, before loading teh data into teh Data warehouse.
- Worked with teh Version control Systems like Team Foundation Server (TFS), Visual Source Safe (VSS).
- Developed a dashboard solution for Integrated SAP ECC, SAP BW with IBM Information Server.
- Experience in creating Jobs, Alerts, SQL Server Mail Agent, and schedule DTS/SSIS Packages.
- Developed a dashboard solution for analyzing STD statistics by building SSIS 2012 cubes and Tableau
- Experience in configuring and deploying SSRS reports onto Microsoft Office SharePoint Server.
- Involved in developing database objects such as procedures, triggers, constraints, indexes and views. Used T-SQL to build dynamic stored procedures.
- Developed SSIS packages for File Transfer from one location to teh other using FTP task
- Used ETL to implement teh Slowly Changing Transformation, to maintain Historical Data in Data warehouse.
- Used Team Foundation Server (TFS) for source version control on a shared team environment.
- Created Sub-Reports, Drilldown-Reports, Summary Reports, and Parameterized Reports in SSRS.
- Analyzed teh data from multiple sources such as Microsoft (MS) SQL Server, MS Access, Legacy systems, Mainframes, DB2, Sales Force Automation (SFA / SaaS), Micro strategy (MSTR), Tableau.
- Generated ad-hoc reports using MS Reporting services and Crystal reports.
- Responsible for creating SQL datasets forPowerBIand Ad-hoc Reports.
- Expert on creating multiple kind ofPowerBIReports and Dashboards.
- Gave and advancePowerBITraining to top stakeholders and team Members.
- Worked on Erwin Data Model Tool and good noledge of Relational Database Modeling/Loading and Dimensional database modeling concepts.
- Used SSRS to create reports, customized Reports, on-demand reports, ad-hoc reports and involved in analyzing multi-dimensional reports in SSRS. Provided Ad-hoc queries and Reports for Companywide users pulling data from Multi-terabyte data warehousing systems.
- Created reports using SSRS from OLTP and OLAP data sources and deployed on report server.
- Experienced in creating SSAS OLAP Cubes and Partitioning Cubes.
- Designed and implemented solution to integrate fanatics applications with partner company like GSI,DAX for more than 36 applications.
- DefinedCalculated members,named sets, and executed other script commandsusingDAXexpressions.
- Created Error and Performance reports on SSIS Packages, Jobs, Stored procedures and Triggers.
- Used MS Report Designer to provide a Report Data pane to organize data used in teh reports.
- Performed T-SQL tuning and optimization of queries for reports that took longer execution time using MS SQL Profiler, Index Tuning Wizard and SQL Query Generated reports using SSRS which were sent to different Clients, and managers
- Performed Documentation for all kinds of reports, DTS and SSIS packages.
- Created complex Stored Procedures, triggers, cursors, tables, and views and other SQL joins and statements for applications by using T-SQL.
- Maintained teh table performance by normalization, creating indexes and collected statistics by using query optimization, query execution plans, SQL Server Profiler and Database engine tuning Advisor.
- Evaluated database performance and performed maintenance duties such as Tuning, Backup, Restoration and Disaster Recovery.
- Created Linked Servers for data retrieval using OLE DB data sources and providers.
- Designed and developed and normalized tables for teh MySQL database
- Design and implementation of stored procedures and triggers for automating tasks.
- Interacting with teh users and troubleshooting teh problems involved with teh development of stored Procedures, triggers and problems related to teh privileges.
- Created Views to reduce database complexities for teh end users.
- Created provided assistance to development teams on Tuning Data, Indexes and Queries.
- Worked on troubleshooting and maintaining large (terabyte-size) clustered databases.
- Created Functions, Stored Procedures, Database Triggers
- Created checks and constraints to maintain data integrity.
- Involved in De-Normalization of existing tables for faster query retrieval.
- Involved in development, enhancement, testing and performance tuning of teh application.
- Developed Multi-dimensional Objects (Cubes, Dimensions) using MS Analysis services (SSAS).
- Ensured best practices are applied and integrity of data is maintained through security and documentation.
Environment: Windows 2003 Server/XP, SQL Server 2008/2005, SQL Integration Services(SSIS), SQL Reporting(SSRS), Business Intelligence Studio, Visual Studio 2008/2005, Team foundation Server, Tableau, VB.NET, OLE DB, DTS Package, SharePoint MOSS 2007, Crystal Reports, Erwin 4.0, VB.Net, HTML.
Confidential, New York City, NY
SQL Developer/MSBI Developer (SSIS, SSAS, SSRS)
Responsibilities:
- Conducted Data Migration processes in teh SQL Server environment using SQL Server Integration Services 2005.
- Conducted and automated teh ETL operations to Extract data from multiple data sources, transform inconsistent and missing data to consistent and reliable data, and finally load it into teh Multi-dimensional data warehouse.
- Extensively used T-SQL in constructing user functions, views, indexes, user profiles, relational database models, and data integrity.
- Performed SQL and PL/SQL tuning and Application tuning using various tools like EXPLAIN PLAN, SQL*TRACE, TKPROF and AUTOTRACE.
- Used different transformations like Conditional Split, Lookup, Merge Join, Aggregate, Fuzzy Grouping etc. for creating many SSIS packages.
- Created many Variables that were used to pass parameterized values into teh package which makes teh package Dynamic.
- Created logical and physical database design using Erwin.
- Integrated SAP ECC, SAP BW with IBM Information Server.
- Designed many Configuration files that were used to pass parameters from outside teh Business Intelligence Development Studio (BIDS) into teh packages using XML files.
- Used Microsoft Visual Source Safe (VSS) and Team Foundation Server (TFS) for integration, maintenance and Security of code.
- Designed Incremental Load Packages which were used to load teh updated and new data into teh Enterprise Data Warehouse without removing teh existing data.
- Created Complex SQL Queries using Views, Indexes, Triggers, Roles, Stored procedures and User Defined Functions Worked with different methods of logging in SSIS.
- Designed and developed and normalized tables for teh MySQL database
- Used SSIS and T-SQL stored procedures to transfer data from OLTP databases to staging area and finally transfer into data marts and performed action in XML.
- Used MDX to create calculated members in teh cube.
- Developed Complex Calculated Measures using DAX for Tabular projects.
- Worked on Event Handlers to do Event handling on error at teh package and task level on error, on warning and configured it using send email task and script task.
- Created packages in SSIS with error handling.
- Involved in performance tuning and monitoring of T-SQL blocks.
- Migrated all teh packages to Test Environment and finally deployed from Test Environment.
- Designed reports using SQL Server Reporting Services (SSRS).
- Implemented Software Configuration Management, setting up company Version policies utilizing Team Foundation Server (TFS).
- Adopted Agile Software Development Methodology.
Environment: SQL Server 2005, Windows 2000/2003, MS Access 2000/2003,T-SQL, Windows 2000 Advanced Server, SQL Server Integration Services(SSIS), SSRS, Crystal Reports, Tableau, VB.NET, SharePoint (MOSS) 2007, Erwin 4.0.