Analyst Programmer Resume
VA
SUMMARY
- 12+ years of experience designing and developing ETL solutions for data applications on Oracle, Netezza, SQL Server and DB2 using Informatica Power Center.
- 1+ years of experience extracting, parsing and transforming data from file formats such as XML, JSON and web logs using Python, Spark (PySpark) and Hive, with HDFS as the data storage system.
- Cloudera Certified Hadoop and Apache Spark Developer (CCA175)
- Fair amount of experience in building ETL/Ingestion jobs using Jupyter notebooks with Apache Spark.
- Experienced in writing Shell/Awk scripts for data movement and file parsing.
- Good understanding of AWS components such as EMR, EC2, EBS, VPC, S3 and Redshift
- Good understanding of most components of the Hadoop ecosystem (HDFS/Hive/Sqoop/Spark/Impala/HBase).
- Good understanding of NoSQL DBs such as HBase and DynamoDB
- Strong understanding of MPP Databases (Redshift, Netezza)
- Experienced in writing complex SQLs for Analysis and Data Extraction.
- Experienced in BI Reporting tools such as Hyperion Interactive Reporting (Brio) and SSAS (SQL Server Analysis Services).
- Experienced in creating SSIS packages
- Amazon certified AWS developer associate
- Excellent understanding of Data Warehousing concepts and design
- Experienced in working in fast-paced Agile environments. Worked with both Scrum and Kanban methodologies.
- Excellent team player with good interpersonal and analytical skills
- Fair Understanding of Regression Models
TECHNICAL SKILLS
ETL tools: Informatica Power Center, SSIS (Exposure)
Programming Languages: Python, Shell Scripting, AWK.
Relational Databases: Netezza, Oracle, SQL Server, DB2, Amazon Redshift.
Non RDBMS: HBase, DynamoDB (Exposure)
AWS Cloud: EC2, S3, VPC, EBS, EMR, Redshift
Query Languages: SQL, PL/SQL
Big Data: Hadoop (HDFS and Hive), Sqoop, Spark (PySpark), Impala
Data warehouse design methodologies: Star and Snowflake Schema, Kimball and Inmon Methodologies
PROFESSIONAL EXPERIENCE
Confidential Irvine, CA
Data Warehouse Engineer
Responsibilities:
- Develop File parsers using Shell/AWK/Python
- Extract and load data from HIVE to Netezza
- Create Reusable components to ingest data from a variety of different file formats.
- Develop ETL jobs using PySpark with Jupyter notebooks on an on-premise cluster for specific transformation needs, with HDFS as the data storage system.
- Developed automated ETL Routines in Python to extract data from third party APIs like CDK and Outlook Server and load it into Valuations Data Warehouse.
- Developed Automated scripts and Alerts to monitor Production Processes
- Develop Stored Procedures and User Defined functions in Netezza.
- Worked on Migrating EDW from Windows to Linux during Datacenter move
- Provide support to the Decision Sciences and Predictive Analytics teams with various queries regarding data.
- Developed outlier detection routines for Decision Sciences and Predictive Analytics teams.
- Currently involved in developing Prototypes for migrating the On-Premise Valuations Data Warehouse to AWS Cloud
- Performance Tuning of Data Loads and Query Optimization
Confidential
Tech Lead
Responsibilities:
- Performed Analysis of Legacy code and created reverse engineering documents
- Developed Source to Target Mapping documents for various ETL routines
- Designed and developed ETL Mappings using Informatica Power Center to Extract, Transform and load data into Unified Sales Data warehouse
- Developed Extracts to be fed into Logility Planning tool
- Developed UNIX Shell/AWK Scripts to parse complex files
- Delegated and distributed Tasks among team members, with regular follow-ups to make sure the timelines were met
- Identified priorities in consultation with the client
- Performance tuning of the Mappings/Workflows and SQLs
- Analysis and Effort estimation of Change Requests.
- Production support
Confidential
Data Warehouse Developer
Responsibilities:
- Developed SSIS Packages to load data from Oracle to SQL Server for Cube generation
- Created ETL Mappings using Informatica Power Center to load POS, OBS and Inventory data marts.
- Developed OLAP data Cubes using SSAS and ESSBASE
- Developed Hyperion IR Reports and Jobs
- Developed Shell and PL/SQL Scripts to facilitate data loads
- Control user access to various reports and cubes based on business divisions using Hyperion Shared Services and SQL Server
- Performed Install and Upgrade of Informatica and Hyperion Software as needed
- Production Support
Confidential
Data Warehouse Developer
Responsibilities:
- Data Modelling
- Designed ETL Framework for AOL's European BI datamart
- Analyzed Legacy programs and re-engineered them using Ab-Initio graphs and Unix Shell Scripts
- Developed a custom tool to perform automated coding standards verification
- Developed Autosys Job schedules for batch processing
- Performed Adhoc data analysis to support queries from Business users
- Production Support
Confidential, VA
Analyst Programmer
Responsibilities:
- Developed ETL mappings for TED and PEPR projects using INFORMATICA Power Center 7.x
- Performed analysis of legacy COBOL programs
- Developed DB2 stored procedures to facilitate data cleansing and data loads
- Used DB2 bulk load utility to load large volumes of data into ODS
- Developed Backend Module for HA/TA Audit systems
- Created Audit samples using Base SAS for HA/TA applications
- Developed Reports using DB2 stored procedures and Business Objects.
- Developed shell scripts in UNIX.
- Coordinated with the Offshore Counterparts and Client
- Coordinated the deployment and Migration of Unix, DB2 and INFORMATICA code to production environment
- Production Support to TED, PEPR and HA/TA applications
Trainee
Confidential
Responsibilities:
- SDLC Concepts
- UNIX
- C
- Operating System concepts
- RDBMS
- Software Engineering
