Data Quality Analys Resume
New Brunswick, NJ
SUMMARY:
- Hands on experience as a data analyst/data quality steward in enterprise and business level data environment. Roles include synthesize and analyze, validate and profile, identify gaps/errors/defects in data quality, map, cleanse, and scrub data from multiple sources/vendor. Exposure to both upstream and downstream data channels/sources. Proficient in querying and accessing enterprise databases (Oracle, SQL Server, MySQL, DB2, Hive, Cloudera/Impala and other big data sources) using intermediate to advance SQL queries for ad - hoc-analysis and reporting. Have also used Alteryx for data integration, cleansing and reporting purpose. Proficient in creating business facing visual reports/ dashboards/visualizations using particularly Tibco Spotfire, Tableau, Cognos. Proficient in using Informatica Developer and Informatica Analyst for data profiling. Have actively developed, enriched and refined meta data - data dictionaries, data lineage/data life cycle cross reference documentation, lookup tables etc. Hands on experience on Confidential and Data Governance. Have working knowledge of Alteryx tool for data preparation and reporting.
- Business Data Analyst cum project manager with Business Intelligence, Business Analytics and project management background in multiple domain including pharmaceuticals, chemicals, R&D laboratories, supply chain, logistics, health care, science and research, finance & digital communications. Work as a conduit/channel between business (product owners) and technical teams (data architects, system analysts, ETL teams) to elicit, supervise, manage, document and coordinate end to end resource boxed data management and application development activities and ensuring delivery in compliance with functional and non-functional standards incorporating conventional and contemporary project management methodologies
- Data Analysis/Data Quality Management/Data Stewardship
- Technical: upstream and downstream data quality in terms of data quality checks, authenticity, reliability, validity. Using SQL (on relational such as Oracle, SQL Server, MySQL, DB2), as well as Hive and similar query languages on Hadoop Distributed File System, Cloudera/Impala for data extraction and analysis.
- Proficient in data profiling using Informatica Developer, Informatica Analyst.
- Experience in Development and enrichment of meta data - data dictionaries, data life cycle, data lineage and related documentation.
- Have working knowledge of using Alteryx Designer 10.6 x64 as the key tool for Data Preparation, Data Integration, Data Analysis, Developing and running Alteryx jobs/workflows for efficient data management.
- Dashboard Development experience: Business facing Metric performance (Key Performance Indicators), Data Mining-metric driver identification (Trending, forecasting and modeling)
- Data Visualization
- Tibco Spotfire (+1 year experience), and MS Excel (+ 10 years)
- Tableau (+ 6 months), Informatica Developer, Informatica Analyst (for Confidential Data profiling + 1 year), IDQ
- Business Analysis
- Business Model, Requirement analysis, functional documentation, technical specifications.
- Data Mining/Modeling/Analytics
- (Bus. Intelligence)
- Trending, Forecasting, outlier identification - within Excel Data Analysis Pack Spotfire, Tableau. Training on Cognos.
- Project Management
- Certified Scrum Master
- Project Documentation (Charter), Scope identification, Estimation, budgeting
- SDLC: Methodology: Agile/Scrum, Waterfall, Prototyping, Project Management tools: Microsoft Project, Visio. Past training on Primavera
- Database Environment
- Oracle, SQL Server, MySQL, DB2, Big Data (Hive, Impala). Front end Hue and Impala
- Platform Environment, ERP and CRM experience
- Windows (7, 8, Vista, XP), JD Edwards (XE and 9.1), Mainframe ERP(AS400), Siebel Pharma CRM, Linux, Solaris
- Collaboration/Sharing/Communication
- SharePoint, Lync, WebEx, Communicator, Outlook, Teamviewer etc.
- Productivity tools
- MS Office, IE 8, Firefox, Chrome, Adobe Professional, Visio, MS Project
- ETL
- Amazon ETL tool for query scheduling.
PROFESSIONAL EXPERIENCE
Data Analyst
Confidential
Responsibilities:
- Extracted and analyzed data from Relational (Oracle) and Big Data (Hadoop/Hive) Environment and using SQL & Hive Query Language (HQL) respectively
- Used Informatica Developer (IDQ) and Informatica Analyst to profile data
- Worked with product owners, business teams for finalizing requirements.
- Developed Data Quality rules based on the column level profiling from IDQ and Analyst
- Scheduled jobs for metric result loads in target environment
- Created and developing test scripts.
- Conducted User Acceptance Testing.
- Using Rally tool, created user stories, tasks and dependencies for project tracking
- Regularly attending scrum meetings, and with business stakeholders for properly identifying data quality specs and threshold criteria.
Confidential, New Brunswick, NJ
Data Quality Analys
Responsibilities:
- Catering to ad-hoc data requests for procurement and travel spend data by extracting data from various Confidential and non- Confidential sources, managing, manipulation, cleansing, analyzing data. This involves combination of following skills and technologies
- Developing ad-hoc SQL scripts on both relational databases (SQL Server 2012) as well as Big Data sources (Impala).
- Creating temp tables, views for creating customized ad-hoc data sets which can then be used as data inputs for Alteryx jobs mentioned below.
- Developing and running Alteryx (Data Integration and Reporting tool) jobs/workflows (yxmd files) for faster data retrieval, data cleansing, integration and report generation in multiple formats (yxdb, tde, xls, csv etc.)
- Developed a Data Lineage dashboard mapping backward dashboard field names from Tableau (Dashboard layer) through to Data Lake (Big Data - Impala environment) to the Physical Tables and developed relationships visually (data lineage model) as well as SQL coding and finally presented that in a Tableau Dashboard. Also mapped business definitions for each field/column in the dashboard layer.
- Part of the DQ team for development of a Data Quality Dashboard for identifying data quality issues such as consistency, accuracy, conformity etc.
- Reporting utilizing skills and technologies such as Alteryx, SQL ad-hoc queries development, accessing Impala Big Data environment for query development and data extraction, utilizing SQL Server for performing data manipulation activities and finally using Tableau for some data analytics work including development of data lineage repository mapping dashboard layer fields to their source fields
Confidential, Somerset, NJ
Data Analyst/Data Steward (Master Data Management)
Responsibilities:
- Worked as the Data Steward cum Data Analyst in Master Data Management project reporting directly to the Data Governance Lead, Confidential .
- I was actively involved in successful conversion from JDE XE to JDE 9.1 for St. Petersburg, FL branch plant.
- My roles/day to day activities involved the following activities:
- Extracting, analyzing, cleaning, standardizing, profiling and updating Product and Customer Data residing in JD Edwards Enterprise Resource Planning (ERP) system using combination of SQL, Excel & Informatica Analyst. This involves the following activities
- Data extraction using intermediate to advance level SQL queries. (Multiple joins, REGEXP functions for pattern recognition and string parsing, CASE WHEN statements etc.)
- Duplicate identification and development of match and merge rules.
- Referencing product master, address book master data dictionaries, Lookup tables for extracting/converting user defined codes/values e.g.
- Finished Goods, Raw Materials, Work in Process etc.
- Converting & Decoding Standard Operating Procedure and related product documents from various sites/branch plants in business-friendly user reports
- Analyzing query results in Excel, creating pivot table/chart reports etc. and sharing them with Confidential team members
- Data profiling using Informatica Analyst (IDQ). Includes column profiling, pattern recognition, valid and invalid data identification based on SOP criteria.
- Actively worked on enriching and improving meta data sources - data dictionaries, cross reference documentation, Lookup tables etc.
- Regularly attended Confidential internal and external stakeholder meetings.
Confidential, NJ
Project Manager/Data Analyst
Responsibilities:
- Provided technical input by developing advance SQL code/solution for a complex reporting issue by developing a performance metric dashboard measuring customer retention rates monthly.
- Reviewed functional documentation, technical documentation (meta data, data dictionaries) related to migration from old Data Mart to newer Data Mart based on Star Schema architecture.
- Attended scrum/sprint sessions for updating regarding project update.
- Collaborated with business and technical teams for a marketing unit sponsored initiative for automating code generation to access routine data sets by marketing managers.
- Monitored and supervised overall project progress and ensured on-time delivery as per laid down standards.
Confidential, Weston, MA
Commercial Information Management Analyst
Responsibilities:
- Understood the business model, process diagrams, data flow diagrams
- Guided technical teams by in converting complex business rule criteria into SQL code/logic which ultimately automated a business process. This ultimately led to the merging of duplicated CMIDs per HCP in a SIEBEL based CRM Application (Axis).
- Worked with business stakeholders/partners in eliciting and capturing business requirements and documenting business rules for a data quality initiative/project.
- Liaised with technical teams in providing both user requirements (for a Business Objects solution) fixing and removing duplicate CMIDS peer HCP
- Ensured delivery in terms of re-usability of product in various applications, accessibility in existing Data warehouse environment.
- Data Quality Project (work stream 2): Data Stewardship (Data Quality Management) on Monthly Upload of Commercial Data in Production through Staging Environment
- Worked with business stakeholders and product owners in identifying time and cost saving opportunities in existing business processes.
- Improved existing data stewardship practices by documenting technical issues, translating them back to business and coordinating with technical (data warehouse teams) in cleaning upstream data sources.
- Provided hands on technical expertise in automating the task of identification of the current factoring being applied (rollup to monthly supplies) accessing the staging Database.
- Crafted, developed and automated through SQL script direct access to business (without IT support) to extract total volumes by Product, Vendor, National Drug Code (NDC Number), mapped them to the factors being applied, and determined if the current factoring was correct or needed to be changed.
- Data Mapping in Excel (VLOOKUPS, INDEX MATCH, SUMIF etc.) across multiple source/spreadsheets
- Analyzed and updated an Excel based Dashboard which highlighted comparative volumes for each product, and vendor for the current and previous month and compared them to the previous 6 months averages.
- Created, updated and maintained an Excel based DQM issue log which identified volumes across product and vendors.
- Updated a SharePoint based issue tracker and prioritization site which created new issue item, edited them, and closed them after resolution.
Confidential, Lawrenceville. NJ
Business Data Analyst
Responsibilities:
- Got orientation on the entire end to end clinical trial global logistical model
- Reviewed business documentation, process flows, data flows.
- Data collection from business stakeholders (business unit managers, external vendors (partnering labs, vendors etc.)
- Conceptualized and developed detailed business unit reporting, KPI reporting via dashboards/data visualizations in Spotfire and deployed them on SharePoint. This included
- Performance Dashboard against key logistics metrics.
- Dashboards developed using visualizations including Map Charts, Heat Maps, Line Charts, Combo charts, Pie Charts, Box Plots, etc.
- Advance use Spotfire’s Custom Expressions.
- Advance filtering (separate for each visualization)
- Dynamic linking to MySQL for running SQL queries to modify visualizations
- And other Spotfire related functions.
- Identification of business rules in interpreting and analyzing data. (Sitting with business unit managers and understanding reporting needs)
- Reviewed existing business requirements (documentation) for dashboard application development.
- Attended and participated in Spotfire working group and application development sessions for automating existing dashboards scripts.
Business Analyst
Confidential
- Requirement Analysis for data driven project
- Meeting with business stakeholders and product owners
- Facilitation of meetings and coordination
Confidential
Business Data Analyst SQL Support
Responsibilities:
- Provided technical expertise and support in data analysis and data reporting.
- Managing, updating samples, supply chain and logistics data pertaining to chemicals (Dry and Solvents) in terms of quality specification required, received, shipping, logistics, lead times, delivery times, prices, quantity ordered and compliance with quality control
- Created trending reports, Excel dashboards, charts/graphs, Pivot tables, VLOOKUP’s etc. and communicated major analytical findings and worked with functional and domain experts to identify recommended responses and propose solutions.
- Queried database using SQL and generated ad-hoc reports.
- Referenced meta data (data dictionaries) for cross referencing data elements and attributes for querying purpose.
- Worked with finance, marketing, IT and production on various supply chain and logistical issues pertaining to Fisher’s products.
- Extracted and reviewed large data sets, interpreted results and developed reports highlighting key business performance metrics.
- Developed reports on inflation, variances, prices etc. for various Fisher products (Dry and Solvents).
- Maintaining, updating data on AS400 mainframe ERP system (Product/Material Master, Vendor Master, Prices, Lead times, Quantities, Unit Conversions).
- Uploaded reports on company’s secure exchange servers
- Identified and fixed data quality issues stemming out of extracted reports through querying main frame servers and correcting the issue.
- Vendor management and supply chain analysis. Communicating with domestic and international suppliers, coordinating meetings and negotiating better pricing terms.
- Supported manager in sourcing decisions based on market data analysis using subscribed commodity & equity forums such as Chemdata, Propurchasor and other forums.
- Worked on procurement projects and identified internal and external trends that influenced international and domestic pricing affecting procurement decisions.
- Worked as a focal point between IT and Business teams for capturing business requirements for a BI application for the Business Unit.
- Developed methods for data consistency and validity checking to assure accurate and meaningful results.
- Used Ariba to automate vendor bidding and selection process and generate Ariba’s Spend Visibility reports for the procurement department.
Confidential
Business Data Analyst/Coordinator
Responsibilities:
- Worked as a Data Analyst/Project Coordinator in the Global Data Management Systems team for a Master Data Management ( Confidential ) Project
- Validated customer data using Oracle (Siebel) CRM for validating customer accounts based on pre-set criteria using structured questionnaire, decision trees & process charts.
- Identified and addressed data issues that affect data integrity.
- Identified data points to support business need requirements.
- Conducted impact analysis of data on downstream & upstream processes/systems.
- Summarized/aggregated data for consolidating information and presented to senior management.
- Identified and escalated issues impacting business when required.
- Coordinated with project team and proposed changes for improving data analysis and management.
- Created project documentation regarding data quality and validity.
Confidential
Business Data Analysis and SQL Support
Center City, Philadelphia
- Performed Data Analysis for the Cable and High Speed Business Unit involving periodic & random/ad-hoc requests from management.
- Translated business logic into complex SQL for processing ad-hoc requests for middle and upper management.
- Identified data points to support business need requirements.
- Conducted impact analysis of changes to customer’s existing account setups (speeds, tiers etc.).
- Extracted query results into excel spreadsheets and created reports/dashboards using Excel (Pivots, Formulas, Charts)
- Used Tableau for advanced data visualizations and dashboard development
- Uploaded, and managed reports on company’s SharePoint portal
- Mapping of Production Server with the Reporting Server on key reporting fields, identifying target and source fields using SQL and answering business requests.
- Developing complex queries for identifying business issues related to services being rendered such as tiers, speeds, devices and variance occurrences.
- Performance source to target mappings.
- Enriched and improved existing meta data (data dictionaries and related documentation)
- Assisted ETL teams in data migration using SSIS
Confidential
Business Data Analyst
Raritan, NJ
Responsibilities:
- Managed data and assisted in reporting and analytical functions/operations on a multi-tiered integrated Data Warehouse, reporting and analytics environment.
- Performed data analysis and synthesis of customer data.
- Identified data points affecting customers in terms of Service Level Agreements (SLAs).
- Performed data scrubbing/cleansing as part of building up a department wise data mart on top of the current enterprise level data warehouse.
- Created data dictionary (metadata) for the data mart.
- Collaborated with Data Mart team through my reporting manager in their ETL (Informatica) operations.
- Extracted predictive/trending reports using IBM Cognos.
- Generated routine and random custom reports using SQL in TOAD for Oracle (Siebel).
- Extracted and manipulated large customer/health care service providers’ data sets using SQL and created multi-dimensional visualizations/dashboards/pivot tables/charts and used advance functions in MS Excel.
- Uploaded the reports on Secure Exchange Servers and SharePoint for external and internal team partners.
- Coordinated and kept liaison with multiple stakeholders on technical issues pertaining to reporting.