Senior Big Data Hadoop/ETL Developer Resume
Houston
SUMMARY
- Built large-scale big data processing systems and served as the expert in data warehouse design, integration of state-of-the-art Big Data technologies, and acceleration of Big Data processing requirements.
- Involved in the successful development of business intelligence solutions and delivery of Big Data services for many Fortune 500 companies in industries such as retail, manufacturing, and financial services.
- Highly skilled in SQL Server/Oracle database development and administration; ETL job development using Informatica, Talend, ECL, T-SQL, SQL, and PL/SQL; and report creation using SSRS and Oracle BI, with solid knowledge of data modeling and data warehouse concepts.
- Produces data deliverables such as dashboards, KPIs, data visualizations, analytics, reports, and data feeds according to the software development lifecycle.
- Create designs that clearly identify how standard issues will be addressed. Work with other team members (Data Analytics, Data SMEs, and Data Architects/Modelers) to understand the source of the required data and the details of its meaning and structure necessary to use it properly.
- Extensive coding experience, including Java, Scala, ASP.NET, C#, PowerShell, and shell scripting.
- Advanced knowledge of Structured Query Language (SQL) including Data Manipulation Language (DML) and Data Definition Language (DDL).
- Solid skills in designing and implementing data warehouses and data marts using components of Kimball Methodology, like Data Warehouse Bus, Conformed Facts & Dimensions, Slowly Changing Dimensions, Surrogate Keys, Star Schema, Snowflake Schema, etc.
- Expert technical knowledge of extract/transform/load (ETL) solutions.
- Experience in end-to-end implementation of Data Warehouse, Data Mart, Business Intelligence, and transactional applications.
- Designing and optimizing complex stored procedures to meet data management and data integration objectives, maintaining ETL processes using SQL Server SSIS, designing dashboards, and designing SSAS OLAP cubes.
- Skilled in installing databases (Oracle, SQL Server) along with JDK and WebLogic, designing the data model, and developing schemas and other SQL and PL/SQL database objects.
- Experience with the creation of SSIS packages in Data Extraction, Transforming, and Loading (ETL) from various sources using SQL Server Integration Services, Bulk Insert, and Bulk Copy Program (BCP).
- Expert in data modeling (physical and logical), ER diagrams, data dictionaries, data maps, normalization/denormalization, and agile data modeling.
- Skilled in ETL development (Talend, Informatica), ECL (Big Data), SQL, PL/SQL, MySQL, Oracle, SQL Server (DBA, development, and reporting), Java, JSP, Servlets, and HTML.
- Test and validate all data flows, prepare ETL processes according to business requirements, and incorporate those requirements into design specifications.
- 7+ years of Intelligent Data Integration and ETL experience in application development with large Enterprise Data Warehouse and Business Intelligence systems.
- Proficient in leveraging the latest databases (Oracle 12c, Teradata, and Netezza) to achieve better performance through partitioning, optimizers, and MPP (Massively Parallel Processing) architecture.
TECHNICAL SKILLS
Database: Microsoft SQL Server 2012/2014/2016, Oracle, MySQL, HBase, HDFS, PostgreSQL, Hive, Teradata, DB2, NoSQL
Software Language: SQL, C/C++/C#, PowerShell, Java, HTML5/CSS/JavaScript/XML, VBA, JDBC, Python, Perl, Shell, R, Elasticsearch, Apache Kafka, Apache Storm, Apache HBase, Apache Hadoop, Apache Spark
Development: MS VSTS 2013/2015, Agile/Scrum, JIRA/Rally, SVN, Git/GitHub/Bitbucket/Bamboo
Tools: Power BI, Ad hoc tools, MicroStrategy, Qlik, Tableau, Universe Designer, Web Intelligence, Xcelsius, DataStage, DOORS 9.5, MAXIMO 7.6, ECL IDE, Informatica PowerCenter 9.x/8.x/7.x, MySQL Query Browser, SQLyog, DbVisualizer, PuTTY, Mogwai, Toad, KenanArbor (Telecom Billing)
Domain: Banking, Finance, Healthcare, Telecommunications, Transportation, Insurance and Research
PROFESSIONAL EXPERIENCE
Confidential, Houston
Senior Big Data Hadoop/ETL Developer
Responsibilities:
- Designed and developed a Big Data analytics platform for processing structured and unstructured data using Spark, Java, Hadoop, Hive, and Pig.
- Performed code reviews, analyzed execution plans, refactored inefficient code following data standards, resolved data issues, and completed unit testing and system documentation for ETL processes.
- Created Spark ETL jobs to load Big Data into the Hadoop Distributed File System (HDFS), then aggregated and integrated the data into a PostgreSQL database.
- Led several data extraction, warehousing design, and analytics initiatives that enhanced ETL processes and performance.
- Extensive experience in data architecture, data quality management, reference and master data management, data integration, data governance, database development and design.
- Developed ETL jobs using SSIS and reports using SSRS and Oracle BIEE; worked on PL/SQL objects such as procedures, functions, and triggers for loading data into staging tables.
- Created Session (Static & Dynamic) and Repository (System and Non-system) variables for reports. Responsible for merging data from multiple sources into a data warehouse. Designed the warehouse and cleaned, standardized, and scrubbed data before loading.
- Analyze and interpret complex data on all target systems, provide resolutions to data issues, and coordinate with data analysts to validate all requirements.
- Strong knowledge of contemporary Data Warehousing trends: Bill Inmon and Ralph Kimball methodologies, Star Schema, Snowflake Schema, ODS, EDW, DM, OLAP Dimensions and Facts.
- Analyze, validate, and refine business and functional requirements to implement the strategic Data Warehouse life cycle in support of efficient Business Intelligence.
Confidential, Plano, TX
Senior Software Engineer - ETL
Responsibilities:
- Developed, enhanced, tested, supported, maintained, and debugged database software applications using Oracle SQL and PL/SQL in support of business units and support functions.
- Installed and maintained databases (Oracle, SQL Server) along with JDK and WebLogic; designed the data model and developed schemas and other SQL and PL/SQL database objects.
- Performed analysis and data mapping for the new source systems with respect to legacy systems.
- Worked with multiple programming teams to establish ECL programming constructs and standards.
- Expert in writing functional documentation as a Business Analyst as well as preparing and presenting the User Training Manuals and project related documents.
- Performed code reviews, analyzed execution plans, refactored inefficient code following data standards, resolved data issues, and completed unit testing and system documentation for ETL processes.
- Worked as a key contributor to the design, development and implementation of critical projects within the Application Delivery organization.
- Worked closely with the Data Architect, business users, application architects, and other developers to model, implement and improve databases used in mission-critical applications within the organization.
- Transformed business requirements into logical and physical data models by conducting data profiling/analysis. Maintained and published data mappings of data elements across systems, created and maintained relevant architecture artifacts.
- Automated activities such as database backups and sequential runs of SSIS packages using SQL Server Agent jobs.
Confidential, Richardson, TX
PL/SQL and ETL Developer
Responsibilities:
- Developed complex stored procedures, views, and user-defined functions on MS SQL Server; identified weaknesses in T-SQL code and corrected them for future releases.
- Worked on Oracle SQL queries and PL/SQL objects such as functions, procedures, packages, triggers, views, and cursors.
- Analyzed and developed complex stored procedures, functions, indexes, triggers, cursors, tables, constraints, joins, subqueries, and CTEs in SQL to facilitate efficient data manipulation and data consistency.
- Developed code using Core Java and Servlets; contributed to ETL design, database design, and performance tuning.
- Designed data models using ERwin and designed data warehouse dimensional models and data marts used for reporting.
- Integrate heterogeneous sources (Oracle, flat files, XML, and CSV files) through Informatica Mappings, Sessions, and Workflows.
- Work with ETL admins in setting up Operating System Profiles (OSP), Red Hat Linux NAS/SAN locations, Domain, Node and Repository Configurations, Version Control and Deployments to higher environments.
- Develop Slowly Changing Dimension (SCD Type 1 or Type 2), Star Schema, or Snowflake Schema techniques and implement History or Incremental Load using a Change Data Capture (CDC) mechanism.
- Prepare Source to Target Data Mappings, ETL Design, and technical specification documents in compliance with data governance standards and best practices, and review them with the Enterprise Architects and ETL Admins.
- Implement Error Logging and Alert/Abort processes for any data flow issues and enable restartability/recovery techniques through Informatica PowerCenter.
- Perform Data Validation and Reconciliation checks for each data movement process and log the results for auditing purposes.
