Data Architect Resume

SUMMARY

SQL reviews and performance tuning suggestions.
Knowledge of or experience with IBM / BMC tools (Catalog Manager, DASD Manager, Change Manager, Apptune, MainView, SQL Explorer, etc.) used for database support work.
Familiar with Teradata architecture and PostgreSQL.
Familiar with the Hadoop stack (Flume, Sqoop, Pig, Spark, Hive and Storm).
Data governance, including building a customer-centric data glossary and change management workflows.
Architecting data modernization, ensuring that Master Data and Core Data are defined from a business and technical perspective.
Exposure to designing and implementing table structures in AWS Redshift and Google Cloud BigQuery.
Exposure to creating and resizing Redshift clusters, and to taking and restoring snapshots.
Exposure to AWS services: EC2, EMR, SQS, SNS, Lambda, S3, Glacier.
Familiar with the SSIS and SSDT tools for Integration Services packages, Analysis Services data models and Reporting Services reports.
Experience with distributed, scalable databases such as Cassandra and HBase.
Manage extracting, loading and transforming data in and out of Hadoop, primarily using Hive.
Familiar with SAS, R and Easytrieve programming.
Good exposure to writing complex SQL scripts.
Optimize and tune the Hadoop environment to meet the performance requirements.
Good knowledge of HP Service Manager 9.3.

TECHNICAL SKILLS

Languages: COBOL, C, JCL, Pascal, PL/1, REXX, SAS, CICS, MQSeries, Map R and IMS-DC

Software: MS Office, MS Project, Lotus Notes and Primavera 3

Operating systems: Windows 7, z/OS, Linux, Unix

Hadoop: HBase, HDFS, MapReduce, Kafka, Flume, YARN, Oozie, Pig, Hive, Spark, Storm

Hadoop Distribution: Hortonworks, MapR and Cloudera

Databases: DB2, IMS DB, Cassandra and HBase

Testing Tools: HP Quality Center, TestLink, Trac and SharePoint

Utilities: ISPF, SPUFI and Sort

Analysis tool: Tableau

Scheduling tool: Control-M

Configuration tools: Endevor and Panvalet

Tools: DB2 Catalog Manager, DB2 Apptune, IBM Fault Analyzer, File-AID, IBM InfoSphere BigInsights, CA ERwin Data Modeler, Talend Open Studio and File Manager

PROFESSIONAL EXPERIENCE

Confidential

Environment: z/OS, Windows 7, Hadoop HDFS cluster with Cloudera

Data Architect

Responsibilities:

Guiding the full lifecycle of a Hadoop solution and image migration, including requirements analysis, governance, capacity requirements, technical architecture design (hardware, OS, network topology), application design, testing and deployment, and executing a process to move images from FileNet imaging services to the Hadoop environment.
Monitoring the ingestion, retrieval, deletion and export processes in Hadoop to ensure they complete successfully.
Storing indexes inside Hadoop using Solr on HBase and ingesting metadata, including delete, update and remove operations on the index metadata.
Validating the layout of pre-processing migration data, such as the header (lob, batchname, etc.) and body (ISEIT key, salt-GUID, etc.), on the ingest node.
Data modeling and schema design for NoSQL (HBase) and Impala tables.
Near real-time analytics using a framework built on Flume, Kafka and Spark Streaming.
Understanding requirements, developing proofs of concept (POCs) and prototypes, and validating architectural decisions.
Working with offshore teams and various stakeholders.

Confidential

Environment: z/OS, Windows 7, DB2, IMS DB, Cassandra and Hadoop HDFS.

Data Architect

Responsibilities:

Understand inbound and outbound data flow requirements, data models for landing, staging and base objects, mapping documents, and match and merge rules.
Create logical and physical database models and designs.
Extract and analyze the data before loading it into the cluster.
Data quality, metadata, data lineage, data transformation, modeling, analysis, reporting, etc.
HCatalog/Hive: deploying HBase with other Hadoop components, using Sqoop effectively to load data, and writing to and reading from HBase.
DataStax Cassandra distribution.
Apply knowledge of Cassandra's strengths and weaknesses to produce efficient schema designs that serve effective, high-performance queries.
Design data models based on the performance, scalability, flexibility, complexity and functionality needs of the business.
Monitor database integrity, investigate inconsistencies, and recommend the best method to rectify data inconsistencies.
Distributed database design, data modeling, development and support on the DataStax Cassandra distribution.
Talend for data preparation, integration, quality and MDM.
Design, deploy and support the Hadoop cluster and Cassandra database in the production environment, including data migration.
Interact with business analysts (BAs) to gather functional requirements.
Document logical, physical and dimensional data models and review the models with the technical team.
Coordinate with the offshore team.

Confidential

EDW Big Data Analytics

Responsibilities:

Confidential services with a focus on big data analytics, enterprise data warehouse and business intelligence solutions to ensure optimal architecture, scalability, flexibility, availability and performance, and to provide meaningful and valuable information for better decision-making.
Work with other teams to collect and analyze business requirements; develop, debug and test multi-processing pool / multi-threading, parallel, fault-tolerant and scalable data processing and integration/ETL programs in Python and Java to continuously acquire and process big data from various sources, ensuring fast processing, reliable data quality and consistency across the board.
Design and implement big data life cycle management; utilize time-series to partition big tables; conduct capacity planning and performance benchmark analysis to ensure the solution meets future growth requirements in a cost-effective manner.
Develop and follow standards and best practices; promptly investigate and identify root causes and resolve complex issues whenever they occur.
Set up alarms, monitoring and CloudWatch services; ensure optimal operations and meet SLAs.
Document on wiki pages; leverage agile methodology; mentor / help others on DBs as needed.
Explore big data analytics Hadoop ecosystem such as Spark, Elastic MapReduce (EMR), Hive, Apache Pig, Kafka, Redis, Cassandra, near real-time data processing, and machine learning.

Confidential

Infrastructure Engineer

Responsibilities:

Build test environments for the mainframe and the Hadoop cluster.
Provisioning, installing, configuring, monitoring, and maintaining HDFS, YARN, HBase, Flume, Sqoop, Oozie, Pig, Hive, and Storm-YARN.
HCatalog/Hive administration, deploying HBase with other Hadoop components, using Sqoop effectively to load data, and writing to and reading from HBase.
Backup, recovery and maintenance.
Design and deploy the Hadoop cluster production environment.
Manage HDFS, Hue and all the related Hadoop tools.
Manage the backup and disaster recovery for Hadoop data.
Optimize and tune the Hadoop environment to meet the performance requirements.
Work with big data developers, designers and scientists in troubleshooting MapReduce job failures and issues with Hive, Pig, HBase, Flume, etc.
Set up Kerberos for security.
Configuring Cloudera Manager and cluster management.

Confidential

Team Lead

Responsibilities:

Quote and request enrollment in the DSS OnStar product via all Internet, Agency and Operations Center channels.
Request and receive odometer information from OnStar monthly for all vehicles receiving the discount.
Calculate annualized mileage using odometer readings and dates from OnStar.
Determine the DSS adjustment based upon data available to State Farm.
Operations and Agency display of information supporting the DSS adjustment.
Reflect DSS adjustment on applicable policyholder communications.
Pass odometer information to the analytical environment.
Prepare test cases, test plans, summary reports and test strategies.
Coding in various modules.
Preparing and executing batch jobs in Auto Daily Flows.
Conduct code reviews and meetings with OSC.
Prepare utilization reports and status reports.
Troubleshooting defects.
Raising defects in ClearQuest/Lotus Notes.

Confidential

Team Lead

Responsibilities:

Prepare test cases, test plans, summary reports and test strategies.
Coding in various modules.
Preparing and executing batch jobs in Auto Daily Flows.
Conduct code reviews and meetings with OSC.
Prepare utilization reports and status reports.
Troubleshooting defects.
Raising defects in ClearQuest/Lotus Notes.

Confidential

Team Lead

Responsibilities:

Coding of various programs
Preparation of test specifications and test data
Recording test results

Confidential

Environment: PL/1, DB2

Responsibilities:

Preparation of low level design
Coding of various programs
Preparation of test specifications and test data
Recording test results
Verification of results.

Confidential

Software Consultant (Team member)

Responsibilities:

Coding and testing of edited programs
Designed and updated procedures and jobs
Reviewing the program for the requirement as well as quality standards.
Preparing the unit test plan which checks all the conditions.

Confidential

Software Consultant (Team member)

Responsibilities:

Conducted system analysis, coding and testing of edited programs
Designed and updated procedures and jobs
Involved in preparing test cases
