Data Architect Resume

SUMMARY

  • SQL reviews and Performance tuning suggestions
  • Knowledge of or experience with IBM / BMC tools (Catalog Manager, DASD Manager, Change Manager, APPTUNE, MainView, SQL Explorer, etc.) used for database support work.
  • Familiar with Teradata architecture and PostgreSQL.
  • Familiar with the Hadoop stack (Flume, Sqoop, Pig, Spark, Hive and Storm).
  • Data governance, including building a customer-centric data glossary and change management workflows.
  • Architecting data modernization, ensuring that Master Data and Core Data are defined from a business and technical perspective
  • Exposure to designing and implementing table structures in AWS Redshift and Google Cloud BigQuery.
  • Exposure to creating and resizing Redshift clusters, and to taking and restoring snapshots.
  • Exposure to AWS services: EC2, EMR, SQS, SNS, Lambda, S3 and Glacier.
  • Familiar with SSIS and the SSDT tooling for Integration Services packages, Analysis Services data models and Reporting Services reports.
  • Experience with distributed, scalable databases such as Cassandra and HBase.
  • Managing the extraction, loading and transformation of data in and out of Hadoop, primarily using Hive.
  • Familiar with SAS, R and Easytrieve programming
  • Good exposure to writing complex SQL scripts.
  • Optimizing and tuning the Hadoop environment to meet performance requirements.
  • Good knowledge of HP Service Manager 9.3

TECHNICAL SKILLS

Languages: COBOL, C, JCL, Pascal, PL/1, REXX, SAS, CICS, MQSeries, MapR and IMS-DC

Software: MS Office, MS Project, Lotus Notes and Primavera P3

Operating systems: Windows 7, z/OS, Linux, Unix

Hadoop: HBase, HDFS, MapReduce, Kafka, Flume, YARN, Oozie, Pig, Hive, Spark, Storm

Hadoop Distribution: Hortonworks, MapR and Cloudera

Databases: DB2, IMS DB, Cassandra and HBase

Testing Tools: HP Quality Center, TestLink, Trac and SharePoint

Utilities: ISPF, SPUFI and Sort

Analyzing Tool: Tableau

Schedule tool: Control-M

Configuration tool: Endevor and Panvalet

Tools: DB2 Catalog Manager, DB2 APPTUNE, IBM Fault Analyzer, File-AID, IBM InfoSphere BigInsights, CA ERwin Data Modeler, Talend Open Studio and File Manager

PROFESSIONAL EXPERIENCE

Confidential

Environment: z/OS, Windows 7, Hadoop HDFS cluster with Cloudera

Data Architect

Responsibilities:

  • Guiding the full lifecycle of a Hadoop solution and image migration, including requirements analysis, governance, capacity requirements, technical architecture design (hardware, OS, network topology), application design, testing and deployment, and executing the process that moves images from FileNet imaging services to the Hadoop environment.
  • Monitoring ingestion, retrieval, deletion and export in Hadoop to ensure each process completes successfully.
  • Maintaining the index store inside Hadoop using Solr on HBase and ingesting metadata, including delete, update and remove operations on the index metadata.
  • Validating the pre-migration data layout, such as the header (LOB, batchname, etc.) and body (ISEIT key, salt-GUID, etc.), on the ingest node.
  • Data modeling and schema design for NoSQL (HBase) and Impala tables.
  • Near real-time analytics using a framework built on Flume, Kafka and Spark Streaming.
  • Understanding requirements, developing proofs of concept (POCs) and prototypes, and validating architectural decisions.
  • Working with offshore teams and various stakeholders.

Confidential

Environment: z/OS, Windows 7, DB2, IMS-DB, Cassandra and Hadoop HDFS.

Data Architect

Responsibilities:

  • Understand Inbound and outbound data flow requirements, data models for Landing, Staging and base objects, Mapping documents, Match and Merge rules
  • Creating logical and physical database modeling and designs.
  • Extract and analyze the data before loading it into the cluster.
  • Data quality, metadata, data lineage, data transformation, modeling, analysis, reporting, etc.
  • HCatalog/Hive: deploying HBase with other Hadoop components, using Sqoop effectively to load data, and writing to and reading from HBase.
  • DataStax Cassandra distribution
  • Using the strengths and weaknesses of Cassandra products to produce efficient schema designs that serve effective, high-performance queries
  • Data model based on Performance, Scalability, Flexibility, Complexity and Functionality of the Business.
  • Monitor database integrity, investigate inconsistencies, and make recommendations regarding best method to rectify data inconsistencies
  • Distributed database design, data modeling, development and support in the DataStax Cassandra distribution.
  • Talend for data preparation, integration, quality and MDM.
  • Design, deploy and support the Hadoop cluster and Cassandra database in the production environment, including data migration.
  • Interact with business analysts to gather the functional requirements.
  • Document logical, physical and dimensional data models and review the models with the technical team.
  • Coordination with the offshore team.

Confidential

EDW Big Data Analytics

Responsibilities:

  • Confidential services with focus on big data analytics / enterprise data warehouse and business intelligence solutions to ensure optimal architecture, scalability, flexibility, availability, performance, and to provide meaningful and valuable information for better decision-making.
  • Work with other teams to collect and analyze business requirements; develop, debug and test parallel, fault-tolerant and scalable data processing and integration/ETL programs in Python and Java, using multiprocessing pools and multithreading, to continuously acquire and process big data from various sources and ensure fast processing, reliable data quality and consistency across the board.
  • Design and implement big data life cycle management; partition big tables by time series; conduct capacity planning and performance benchmark analysis to ensure the solution meets future growth requirements in a cost-effective manner.
  • Develop and follow standards and best practices; promptly investigate and identify root causes and resolve complex issues whenever they occur.
  • Set up alarms, monitoring and CloudWatch services; ensure optimal operations and meet SLAs.
  • Document on wiki pages; leverage agile methodology; mentor / help others on DBs as needed.
  • Explore big data analytics Hadoop ecosystem such as Spark, Elastic MapReduce (EMR), Hive, Apache Pig, Kafka, Redis, Cassandra, near real-time data processing, and machine learning.

Confidential

Infrastructure Engineer

Responsibilities:

  • Build testing environments for the mainframe and Hadoop cluster.
  • Provisioning, installing, configuring, monitoring, and maintaining HDFS, YARN, HBase, Flume, Sqoop, Oozie, Pig, Hive, and Storm-YARN.
  • HCatalog/Hive administration, deploying HBase with other Hadoop components, using Sqoop effectively to load data, and writing to and reading from HBase
  • Backup, recovery and maintenance.
  • Design and deploy Hadoop cluster production environment
  • Manage HDFS, Hue and all the related Hadoop tools
  • Manage the backup and disaster recovery for Hadoop data
  • Optimize and tune the Hadoop environment to meet the performance requirements
  • Work with big data developers, designers and scientists in troubleshooting MapReduce job failures and issues with Hive, Pig, HBase, Flume, etc.
  • Set up Kerberos for security
  • Configuring Cloudera Manager and cluster management.

Confidential

Team Lead

Responsibilities:

  • Quote and request enrollment in the DSS OnStar product via all Internet, Agency and Operations Center channels.
  • Request and receive odometer information from OnStar monthly for all vehicles receiving the discount.
  • Calculate annualized mileage using odometer readings and dates from OnStar.
  • Determine the DSS adjustment based upon data available to State Farm.
  • Operations and Agency display of information supporting the DSS adjustment.
  • Reflect DSS adjustment on applicable policyholder communications.
  • Pass odometer information to the analytical environment.
  • Preparing test cases, test plans, summary reports and test strategies
  • Coding in various modules.
  • Preparing and executing batch jobs in Auto Daily Flows.
  • Conducting code reviews and meetings with OSC
  • Preparing utilization and status reports.
  • Troubleshooting defects
  • Raising defects in ClearQuest/Lotus Notes

Confidential

Team Lead

Responsibilities:

  • Preparing test cases, test plans, summary reports and test strategies
  • Coding in various modules.
  • Preparing and executing batch jobs in Auto Daily Flows.
  • Conducting code reviews and meetings with OSC
  • Preparing utilization and status reports.
  • Troubleshooting defects
  • Raising defects in ClearQuest/Lotus Notes

Confidential

Team Lead

Responsibilities:

  • Coding of various programs
  • Preparation of test specifications and test data
  • Recording test results

Confidential

Environment: PL/1, DB2

Responsibilities:

  • Preparation of low level design
  • Coding of various programs
  • Preparation of test specifications and test data
  • Recording test results
  • Verification of results.

Confidential

Software Consultant (Team member)

Responsibilities:

  • Coding and testing of edited programs
  • Designed and updated procedures and jobs
  • Reviewing the program for the requirement as well as quality standards.
  • Preparing the unit test plan which checks all the conditions.

Confidential

Software Consultant (Team member)

Responsibilities:

  • Conducted system analysis, coding and testing of edited programs
  • Designed and updated procedures and jobs
  • Involved in preparing test cases
