Sr. Etl/data Architect Consultantconfidential Resume
Raleigh, NC
SUMMARY:
- Over 14 years of very strong hands on experience in Data Warehousing/Business Intelligence System Analysis, Design, Development and Implementation on Client - Server environment.
- Ability to perform detailed analysis of business problems and technical environments and use this in designing the BI solution.
- Expert in Developing Data Warehousing / Business Intelligence applications using IBM Information Server - 11.5/9.1, IBM DataStage - 7.5/7.0, Informatica PowerCenter 7.x/8.x, Cognos 8.4 Report Studio, Query Studio, Analysis Studio, Transformer 8.4.
- Strong Analytical skills in ETL, SQL, PL/SQL, Stored Procedures. Performed Debugging, Troubleshooting and Performance Tuning.
- Extensively worked with IBM Data Stage Parallel Extender (Enterprise Edition) to run parallel instants to achieve better performance while working with massive number of data.
- Better understanding of Relational and Dimensional data modeling, Logical and Physical data modeling, Star Schema, Snowflake Schema, Fact and Dimension table design.
- Expert in understanding of Data Warehousing design techniques, to implement Slowly Changing Dimension phenomenon, Surrogate key assignment, Audit key assignment and Change Data Capture.
- Expertise in Data Analysis, Data Cleansing, Data Migration, Data Mining and Database Design.
- Good working knowledge with various Databases like Oracle 11g/10g/9i/8i, Netezza 6.0.5, DB2 UDB 9.1/8.1/7.2 and SQL Server 8.X/7.X, Sybase, Teradata.
- Working knowledge on various operating systems like Red Hat Linux 5.5, Sun Solaris 9.0/8.0,IBM AIX 6.0/5.2/5.1 and Windows 2000/XP/NT.
- Expert in Unit testing, System Integration testing, Functional testing, and Performance testing.
- Unique ability to Analysis, Address and Resolve Project Development issues at all levels, Also Proficiency in design Production support frame-work.
- Excellent analytical problem solver and a comprehensive team player.
- Ability to work independently with minimal supervision.
TECHNICAL SKILLS:
Developement Tools: IBM Information Server- 11.5/9.1/8.7/8.5/8.1 , Ascential DataStage- 7.5/7.0, Informatica - 8.X/7.X, Talend Data Integration, Business Objects 6.5, Cognos 8.4, Report Studio 8.4, Query Studio 8.4, Analysis Studio 8.4, Micro Strategy 8.x, AutoSys, TOAD, DBArtisan
RDMS: Oracle 11g/10g/9i/8.x, Netezza 6.0.5, DB2 UDB 9.1/8.1/7.2, SQL Server 8.x/7.x, Sybase 12.5/11, TeraData 6.2 Cassandra, MongoDB
No SQL: IBM AIX 6.0/5.2, Sun Solaris 8.0/5.9/5.8, Windows XP/2000/98/95, Windows
Operating Systems: NT 4.0, MS-Dos, Red Hat Linux
Languages: C, C++, Basic, SQL, PL/SQL, Visual Basic
Protocols: TCP/IP, FTP, NDM, SMTP, HTTP
Version Controls: Rational Clear Case, CVS, PVCS, DataStage version control
PROFESSIONAL EXPERIENCE:
Sr. ETL/Data Architect Consultant
Confidential
Responsibilities:
- Working with business users to understand and implement data flow, data enrichment, data consolidation, change data capture and complex insurance and banking business transformation for customer self-service reporting.
- Determines database structural requirements by analyzing client operations, applications, and programming; reviewing objectives with clients; evaluating current systems.
- Directly involve with the technical team for ETL development and maintenance of the Data Warehouse application.
- Working with business team to understand end user requirements and doing effort analysis for new requirements/enhancement and change requests.
- Supporting FASG application which is aimed at deriving different share of relationship metrics which helps members to make their financial planning.
- Works closely with the project manager and developers to develop project plans, schedules, releases and production support model.
- Lead efforts to design data integration, metadata lineage and governance architecture.
Environment: IBM InfoSphere Information Server 11.5, Informatica Power Center 9, Netezza, DB2, Hortonworks, Spark-SQL.
Business Analytics & Information Management Mainline Information Systems
Confidential, Raleigh, NC
Responsibilities:
- Work with client team members to define business requirements and translate into Data Warehouse Business Intelligence (BI) technical requirements.
- Analyze the healthcare service provider’s business process and design & develop custom software solutions on IBM PureData (Netezza) System.
- Architect ETL framework and responsible for managing implementation and supporting ETL deliverables for different source systems.
- Designed and developed Spark integration to extract relational data into Cassandra for TDM use.
- Designed integration models, specifications, and other artifacts needed for the Data Services team to be able to execute based on the existing and future states.
- Implement products within the InfoSphere suite on Unix or Linux platform, including installation, patching and troubleshooting; creation of user guides, best practice, policies and procedures for the InfoSphere products.
- Define ETL requirements; prepare ETL specifications and functional requirements including audit and support documentation.
- Integrated with Salesforce cloud to extract CRM data for reporting datamart.
- Optimizing of existing algorithms in Hadoop using Spark Context, Spark-SQL, Data Frames and Pair RDD's.
- Responsible for developing to estimated timelines or updating management on deviations to these estimates.
- Raised critical technical issues to the project manager and team that may impact on the project planning and/or timeline.
- Participated and contributed in quality assurance walk-throughs of ETL components.
- Ensured the ETL code delivered is running, confirms to specifications and design guidelines.
Environment: IBM InfoSphere Information Server 11.5/9.1 (DataStage, FastTrack), IDA (InfoSphere Data Architect), Netezza 6.0.5, Cassandra, Spark, SQL Server, Hadoop, MapReduce, HDFS.
Lead Technical Consultant
Confidential, Charlotte, NC
Responsibilities:
- Involve in the requirement scope, definition and analysis in support of Data Warehousing efforts.
- Work with project team for resource planning and allocation to meet multiple project milestones.
- Design ETL process flow to populate Atomic Data Warehouse.
- Document ETL specs which address overall ETL development strategy, data transformation methodology for Patient atomic level data extracted from source systems (Cerner, Quadramed, EMPI)
- Architect ETL framework and responsible for managing implementation and supporting ETL deliverables for different source systems.
- Established job boundaries, to break an EE job into more manageable ETL components.
- Design and develop DataStage parallel extender jobs for splitting bulk data and process dataset across all available nodes to achieve best job performance.
- Implemented daily Incremental load process using DataStage jobs.
- Developed ELT process flow to load initial bulk data into Data Warehouse.
- Developed and scheduled the UNIX shell script to automate multi instance of loading processes for Patient data into AWM.
- Worked with end user to perform user acceptance test.
Environment: IBM InfoSphere Information Server 9.1 (DataStage /BusinessGlossary/Information Analyzer/FastTrack), IDA (InfoSphere Data Architect), Netezza 6.0.5, Oracle 11g, Cerner, Quadramed, EMPI, IBM AIX, Shell Scripting.
Systems Intgrt Sr Spec
Confidential, Dallas, TX
Responsibilities:
- Worked with project-team members to define the infrastructure and system requirements to support ETL environment.
- Documented the ETL architectures and installations.
- Installed, configured, tuned, and maintained DataStage application instances and supporting underlying database and red hat linux operating system environments.
- Worked with DBA, System Admin team to configure Datastage with Oracle and DB2.
- Installed Clear-Case and configured with IDA (InfoSphere Data Architect).
- Installed Hyperion Essbase connector and configured with DataStage.
- Did configuration of the nodes (both physical and logical) in DataStage EE server and memory allocation for various processes and log.
- Involved in project technical team meeting to plan, coordinate, and implement system patches and upgrades to ETL & EDW database environments, to maintain the integrity of the systems.
- Prepared high level design doc which address overall ETL development strategy to extract patient/financial data coming from different source systems like Cerner, DAAC, Medhost and McKesson.
- Assisting onsite/offshore development team in trouble shooting of DataStage issues/jobs.
- Performed user and security administration for the Information Server.
- Responsible for set up the projects, roles, privileges in different environments (Dev,Model, Prod).
- Implemented procedures for maintenance, monitoring, backup, and recovery operations for the Information Server.
- Ensured delivery of scheduled and manual jobs within the defined SLA’s in production environment.
- Worked with IBM for trouble shooting issues and PMR management.
Environment: IBM InfoSphere Information Server 8.5 (DataStage/QualityStage/BusinessGlossary/Information Analyzer/FastTrack), IDA (InfoSphere Data Architect), DAAC, Cerner, ClearCase, Cognos 8, Oracle 11g, Informatica 8, Red Hat Linux 5.5, Windows Vista/7, Shell Scripting, XML, PL/SQL, Daptiv, Showcase.
Sr. DataStage Developer/Project Lead
Confidential, Boston, MA
Responsibilities:
- Worked closely with Business Analyst team to understand specific end-user requirement.
- Excellent understanding of DataStage Parallel Runtime Engine architect, which used to design DataStage EE jobs (ETL) framework.
- Worked with different sets of Configuration file, to identify degree of parallelism for each and every EE job.
- Expert in understanding of OSH (Orchestrate Shell Script), to debug DataStage EE job compilation error.
- Extensively work on “stage to operator mapping”, to understand E2E DataStage job execution flow.
- Better knowledge of DataStage supported Data-Types which helps to identify run time Data-Types conversation (implicitly/explicitly) and NULL- Handling.
- Increased EE job performance by Optimizing (not maximize) all available hardware resources.
- Designed and Developed, DataStage Sequence jobs for Batch processing.
- Involved with project management team for status tracking, according identified short term/long term goal.
- Did effort analysis for new requirements/enhancement.
- Did production support (defect analysis, requirement/process-flow analysis, bug fixing) for ETL jobs.
- Wrote user defined routines, to implement complex business rules in DataStage application.
- Created SQL server stored procedures, views for loading the staging tables.
- Developed the automated and scheduled load processes using UNIX shell scripting.
- Developed JIL scripts to scheduled DataStage jobs using Autosys.
- Designed and Developed DataStage jobs, to process XML data source file.
- Published DataStage job as web service, for internal state street client, using DataStage RTI plug-ins.
Environment: IBM Data Stage 7.5.1 (EE), IBM Information Server 8.1, Oracle 10g, MS SQL Server 8.x, Sun Solaris 5.8 Windows XP, Shell Scripting, XML, Web Service, MQ-series, RTI, AutoSys
Sr. DataStage Developer/Administrator
Confidential, Ashburn, VA
Responsibilities:
- Coordinated with Business Analyst team to understand specific business requirements.
- Designed complete ETL specification charts for off-shore development team.
- Implemented the standards of creating DataStage workflows.
- Responsible for preparing ETL strategies for extracting data from XML source file and loading into Oracle db table.
- Developed DataStage server job to send notification, read response back from web services and update status into Oracle db table.
- Designed and Developed DataStage dimension jobs to implement slowly changing dimension phenomenon.
- Designed, Developed and Documented ETL jobs on both Server and Parallel DataStage Environments.
- Used business knowledge, to tune SQL statements in existing programs and created new SQL queries for projects.
- Design parallel jobs using enterprise edition (EE) to utilize the system resource at efficient level.
- Configured IBM Data Stage server 7.5.2(EE) with Oracle 10g and Web services.
- Designed and Configured DataStage server directory structure by keeping performance and scalability in mind.
- Export/Import code between different environments and projects to maintain jobs in synchronous and to track the real time defect.
- Successfully executed build deployments and maintenance of different code objects by working across the group (system administrators and DBAs).
- Successfully implemented onsite-off shore delivery model.
Environment: IBM Data Stage 7.5.2 (EE), Oracle 10g, Sun Solaris 5.9, Windows XP, Shell Scripting, XML, Web Service, RTI, TOAD, Actuate
DataStage Developer
Confidential, San Bruno, CA
Responsibilities:
- Developed and documented DataStage jobs according to business requirements.
- Worked with DataStage EE to implement pipeline and partition parallelism techniques to improve job performance while working with bulk data sources.
- Developed DataStage ETL jobs using Join, Look Up and Transformer stages to meet business requirements.
- Wrote shell script to call particular Bteq (Basic TeraData Query) script to load Teradata table.
Environment: IBM Data Stage 7.5 (EE), TeraData 6.2, Sun Solaris 8.0, Cognos Report Studio, Micro Strategy 8.0.1, Shell Scripting
Database Developer
Confidential, Dallas, TX
Responsibilities:
- Supported Integration Test activities for various electronic systems and architectures.
- Inspecting samples of finished products and recording defects, Wrote custom BI reports to statistically analyze quality control problems.
- Planning production schedule within budgetary limitations and time constraints.
- Developed efficient and error free production process for electro-mechanical assembly to meet client’s requirement.
- Worked closely with customer to identify their engineering needs.
- Manage inventory to efficient level to minimize cost yet maximize flow and delivery performance of finished products.
- Designed and Developed SQL server store procedure for maintain CRM database.
- Developed custom ETL process flows to load excel data into Material Requirement Planning (MRP) database; generate daily status report to track assemblies.
- Wrote UNIX shell scripts for backing up programming data resides on company wide network
Environment: Oracle 8i, SQL Server 2000, MS-excel, Korn Shell, WindowsNT4.0, UNIX.
Test Engineer
Confidential
Responsibilities:
- Activities included gathering test requirements.
- Supported Integration test activities.
- Performed system test for software development.
- Involved in verification, validation testing.
- Performed black box testing.
Design Engineer
Confidential
Responsibilities:
- Reviewed engineering design to supply defect free product to customer.
- Design and development of all aspects of company products and design of printed circuits boards.
- Design, manufacture and test prototypes.
- Planning and implementation of product schedules to meet delivery deadlines.
- Preparation of technical documentation to support design of company products.
- Electrical testing, troubleshooting and repair of all designed products.