Data Architect Resume
SUMMARY
- SQL reviews and Performance tuning suggestions
- Knowledge of or experience with IBM / BMC tools (Catalog Manager, DASD Manager, Change Manager, App Tune, Mainview, SQL Explore, etc.) used for database support work.
- Familiar with Teradata Architecture, Postgre .
- Familiar with Hadoop stack( Flume,Sqoop,Pig,spark,Hive and storm)
- Data Governance and building the customer - centric data glossary and change management workflows.
- Architecting data modernization, ensuring that Master Data and Core Data are defined from a business and technical perspective
- Exposure in Designing and implement Table structures in AWS Redshift and Google cloud bigquery.
- Exposure on Creating Redshift cluster, resizing cluster, snapshots and restore.
- Exposure on AWS Services EC2, EMR, SQS, SNS, Lambda, S3, Glacier.
- Familiar with SSIS and SSDT tool using integration services packages, analysis services data models and reporting services reports.
- Experience with distributed, scalable databases such as Cassandra, and HBASE Manage extracting, loading and transforming data in and out of Hadoop, primarily using HIVE.
- Familiar with SAS, R and Easytrieve programming
- Good exposure in writing complex SQL scripts.
- ·Optimize and tune the Hadoop environment to meet the performance requirements.
- Good knowledge in Hp service manager 9.3
TECHNICAL SKILLS
Language: Cobol, C, JCL, Pascal, Pl/1, Rexx, SAS,CICS, Mqseries,Map R and IMS- DC
Software: MS Office, MS-Project, Lotus note and Prime vera-3
Operating system: Windows 7,Z/OS, Linux, Unix
Hadoop: HBase, HDFS, Map/Reduce, Kafka,Flume,Yarn, Oozie, Pig, Hive, Spark, Kafka, Storm
Hadoop Distribution: Hadoop with Harton works, MapR and Cloudera
Databases: DB2, IMS DB, Cassandra and Hbase
Testing Tools: Hp quality center,Test-link,Trac and Share point
Utilities: ISPF, SPUFI and Sort
Analyzing Tool: Tableau
Schedule tool: Control-M
Configuration tool: Endeover and Panvelet
Tools: DB2 catalog emanager,DB2 Apptune, IBM Fault analyzer, File-aid, IBM Infosphere Biginsight, CA Erwin data model, Talend open studio and File manager
PROFESSIONAL EXPERIENCE
Confidential
Environment: Z/OS, Windows 7,Hadoop HDFS cluster with Cloudera
Data Architect
Responsibilities:
- Guiding the full lifecycle of a Hadoop solution and Image Migration including requirements analysis, governance, capacity requirements, technical architecture design (including hardware, OS, network topology), application design, testing, deployment and executes a process to move images from FileNet imaging services to Hadoop environment.
- Monitoring the process of ingestion, retrieval, deletion and export into Hadoop to ensure process are successful.
- Index store inside Hadoop using Solr on Hbase and ingest metadata including (delete, update and remove on the index meta data).
- Validate preprocessing migration data layout like header(lob,batchname etc) and body(ISEIT key,salt-GUID etc) on ingest node.
- Data modeling, schema designing for no-sql(HBase) and impala tables.
- Near real-time analytical using framework with Flume, Kafka and spark streaming.
- Involve understanding requirements and developing proof of concepts(POC) and proto-types and validate architectural decisions.
- Working with Offshore and different stake holders.
Confidential
Environment: Z/OS, Windows 7, DB2,IMS-DB,Cassandra and Hadoop HDFS.
Data architect
Responsibilities:
- Understand Inbound and outbound data flow requirements, data models for Landing, Staging and base objects, Mapping documents, Match and Merge rules
- Creating logical and physical database modeling and designs.
- Extract and analysis the data before load into cluster.
- Data Quality, Metadata, Data Lineage, Data Transformation, Modeling, Analysis, Reporting etc
- Hcatalog/Hive deploying HBase with other Hadoop components, Using Sqoop effectively to load data, writing to and reading from Hbase.
- Datastax Cassandra Distribution
- Cassandraproducts strengths and weakness to produce efficient schema designs that serves effective and high performance queries
- Data model based on Performance, Scalability, Flexibility, Complexity and Functionality of the Business.
- Monitor database integrity, investigate inconsistencies, and make recommendations regarding best method to rectify data inconsistencies
- Distributed database Design, Data modeling, Development and Support in Datastax Cassandra distribution.
- Talend with data preparation, integration, quality and MDM.
- Cassandraproducts strengths and weakness to produce efficient schema designs that serves effective and high performance queries.
- Design, support, data migration and deploy Hadoop cluster and Cassandra database in production environment.
- Interact with BA to get the functional requirements.
- Document logical, physical and dimensional datamodels and review the models with technical team.
- Coordination with offshore .
Confidential
EDW Big Data Analytics
Responsibilities:
- Confidential services with focus on big data analytics / enterprise data warehouse and business intelligence solutions to ensure optimal architecture, scalability, flexibility, availability, performance, and to provide meaningful and valuable information for better decision-making.
- Work with other teams to collect and analyze business requirements; develop, debug and test multi-processing pool / multi-threading parallel fault tolerant and scalable data processing, integration/ETL programs in Python and Java to continuously acquire and process big data from various sources to ensure fast process, reliable data quality and consistent across-the-board.
- Design and implement big data life cycle management; utilize time-series to partition big tables; conduct capacity planning and performance benchmark analysis to ensure the solution meets future growth requirements in a cost-effective manner.
- Develop and follow standards and best practices; promptly investigate/identify root causes and resolve complex issues/problems whenever occur.
- Set up alarm, monitoring and Cloud Watch services; ensure optimal operations and meet SLAs.
- Document on wiki pages; leverage agile methodology; mentor / help others on DBs as needed.
- Explore big data analytics Hadoop ecosystem such as Spark, Elastic MapReduce (EMR), Hive, Apache Pig, Kafka, Redis, Cassandra, near real-time data processing, and machine learning.
Confidential
Infrastructure engineer.
Responsibilities:
- Build testing environment for Mainframe and Hadoop cluster.
- Provisioning, installing, configuring, monitoring, and maintaining HDFS, Yarn, HBase, Flume, Sqoop, Oozie, Pig, Hive, and Storm-Yarn .
- Hcatalog/Hive Administration, deploying HBase with other Hadoop components, Using Sqoop effectively to load data, writing to and reading from Hbase
- Backup, recovery and maintenance.
- Design and deploy Hadoop cluster production environment
- Manage HDFS, Hue and all the related Hadoop tools
- Manage the backup and disaster recovery for Hadoop data
- ·Optimize and tune the Hadoop environment to meet the performance requirements
- Work with big data developers, designers and scientists in troubleshooting map reduce job failures and issues with Hive, Pig, HBASE, Flume etc.
- Setup the Kerberos for security
- Configuring Cloudera manager and cluster management.
Confidential
Team Lead
Responsibilities:
- Quote and request enrollment in the DSS OnStar product via all Internet, Agency and Operations Center channels.
- Request and receive odometer information from OnStar monthly for all vehicles receiving the discount.
- Calculate annualized mileage using odometer readings and dates from OnStar.
- Determine the DSS adjustment based upon data available to State Farm.
- Operations and Agency display of information supporting the DSS adjustment.
- Reflect DSS adjustment on applicable policyholder communications.
- Pass odometer information to the analytical environment.
- To prepare a Test Case, Test Plan, Summary report and Test Strategy
- Coding in various modules.
- Preparing and Executing a Batch job in Auto Daily Flows.
- To conduct the Code review and meeting with OSC
- To prepare utilization report and status report.
- Trouble shooting the defect
- Raising the defect in Clear Quest/Lotus Notes
Confidential
Team Lead
Responsibilities:
- To prepare a Test Case, Test Plan, Summary report and Test Strategy
- Coding in various modules.
- Preparing and Executing a Batch job in Auto Daily Flows.
- To conduct the Code review and meeting with OSC
- To prepare utilization report and status report.
- Trouble shooting the defect
- Raising the defect in Clear Quest/Lotus Notes
Confidential
Team Lead
Responsibilities:
- Coding of various programs
- Preparation of test specifications and test data
- Recording test results
Confidential
Environment: PL/1, DB2
Responsibilities:
- Preparation of low level design
- Coding of various programs
- Preparation of test specifications and test data
- Recording test results
- Verification of results.
Confidential
Software Consultant(Team member)
Responsibilities:
- Coding and testing of edited programs
- Designed and updated procedures and jobs
- Reviewing the program for the requirement as well as quality standards.
- Preparing the unit test plan which checks all the conditions.
Confidential
Software Consultant (Team member)
Responsibilities:
- Conducted system analysis, coding and testing of edited programs
- Designed and updated procedures and jobs
- Involved in prepared test cases
