We provide IT Staff Augmentation Services!

Big Data Architect / Hadoop Administrator &lead Developer Resume

3.00/5 (Submit Your Rating)

Cincinnati, OH

SUMMARY:

  • Certified Cloudera Hadoop Admin and Developer having over 15 years of experience in IT - Software Engineering with current expertise being in Big Data/Hadoop ecosystems and NoSQL (Cassandra, Mongo DB, and Hbase)
  • Hands on experience in Hadoop Ecosystem components such as Hive, Pig, Sqoop, Flume, Zookeeper/Kafka and Hbase and MapReduce
  • Hands on experience in HDFS, Big Data, ETL for data warehouse projects
  • Strong SQL programing, Hiveql for Hive, Pig and Hbase
  • Strong programming skills in Spark/Scala
  • Strong knowledge of Hadoop Architecture and Daemons such as HDFS, Job Tracker, Task Tracker, Namenode, Datanode and Yarn.
  • Involved in designing and implementation in Kafka to extract data from Teradata database into Hadoop cluster
  • Implemented Sqoop for large dataset transfer between Hadoop and RDBMS
  • Skilled in performing real time analytics on HDFS using HBase
  • Involved in designing and implementation in Kafka for real- time transformation data to Hadoop cluster using Talend
  • Experience in working with Cloudera, Hortonworks Hadoop Distributions and Datastax Cassandra
  • Hands on experience on AWS infrastructure services Amazon Simple Storage Service (Amazon S3) and Amazon Elastic Compute Cloud (Amazon EC2)
  • Worked with Zookeeper/Kafka to manage the flow of jobs and coordination in the cluster
  • Experience in performance tuning, monitoring the Hadoop cluster by gathering and analyzing the existing infrastructure using Cloudera manager, Ambari, or Opscenter
  • Strong Red Hat, Linux Admin skills, working with Unix/Linux for last 14 years
  • Oracle DBA: including Erwin database design, development, testing, implementation and data warehousing
  • Data Architect: system design, data model design, backup/recovery strategies, OLAP WH project
  • Strong database performance tuning skills

TECHNICAL SKILLS:

Languages: Spark / Scala, PL/SQL, COBOL/COBOL II, Shell script

OS: Red Hat, IBM AIX, Oracle Linux - RHEL 6, HP/UX 11, Ubuntu 12.4 Centos 6.4

Networks: TCP/IP, Windows NT 4.0 Server

Software: Cloudera/Hadoop manager 4.5, Oracle Real Application Cluster (RAC) 10g 11gr2, Data Guard, 10g/11g database, Oracle Enterprise Manager Grid Control (OEM) 12c, GoldenGate 12c, CA Erwin R7

Hardware/software:: ASM for Oracle RAC, NFS

PROFESSIONAL EXPERIENCE:

Confidential, Jersey City, NJ

IntellectDesign

  • Installed, configured Cloudera Hadoop and Hotonworks HDP 2.x clusters 5.x
  • Setup kafka cluster with multiple brokers, partitions and producer, consumer groups
  • Installed and configured mongodb multiple nodes sharding, integreted with Hadoop cluster system
  • Data streaming from kafka to mongodb by Spark Scala program and big data load testing
  • Provided detailed instruction on installation, configuration and workflow to the team
  • Performance tuning and troubleshooting on clusters and recommended solutions
Confidential, Cincinnati, OH

Big Data Architect / Hadoop Administrator & Lead Developer

  • Installed, configured Cloudera Hadoop components and Hortonworks HDP 2.x
  • Installed, configured Cassandra cluster by Opscenter on AWS/EC2/S3
  • Worked on analytics as lead Big Data/Scala developer
  • Provided procedure definition and design of solution
  • Evaluated performance on different Big Data platforms
  • Performance tuning on peak time processes on Hadoop cluster
  • Worked on CDH, Ambri and configured ecosystem tools like Hive/hiveql, Hbase, Sqoop, spark SQL/scala, Kafka
  • Configured ETL tools and for performance with Teradata databases
  • Worked on data models and ingested DWH data to Hadoop cluster
  • Monitoring and troubleshooting on production cluster.
  • Installed and configure macine learning tool -- Alpine data lab and data models
  • Worked on a datapipeline project using flume, kafka, spark/scala/dataframes and store in hive.
  • Generated BI report by Tableau
Confidential, Seattle, WA

Data Architect / Hadoop Administrator & Developer

  • Actively involved in design, review, implementing and optimizing data transformation processes In the Hadoop ecosystems.
  • Lead several Hadoop data extraction, warehousing and analytics tasks
  • Coordinated with offshore team on development task and troubleshooting
  • Install, manage and support Linux operating systems, ex. RHEL, CentOS, Ubuntu.
  • Installed configured Hadoop cloudera CDH5, setup Hadoop cloudera distribution system and monitor by Hadoop Cloudera manager.
  • Hands on experience with Amazon web services, created EC2 (Elastic Compute Cloud) cluster instances, setup data buckets on S3 (Simple Storage Service), set EMR (Elastic MapReduce) with Hive scripts to process big data.
  • Worked on Pig and Hiveql. Involved in data warehouse, schemas creation and management.
  • Involved in writing Shell Scripts.
  • Setup and optimize the development and production environment.
Confidential, Sunnyvale, CA

Hadoop Administrator / Developer and Oracle DBA support.

  • Installed and configured Red Hat/CentOS and Ubuntu/Cloudera manager with Hadoop multiple nodes
  • Collected data from different databases( i.e. Teradata, Oracle, MySQL) to Hadoop
  • Installed, configured and created Hbase, Hive, Pig and MapReduce scripts
  • Worked on Hive/Hbase vs RDBMS, imported data to Hive, created tables, partitions, indexes, views, queries and reports for BI data analysis
  • Involved in writing Shell Scripts
  • Conducted introductory classes on Hadoop admin and Hadoop developer
  • Troubleshooting and performance tuning on Hadoop system
  • Installed, refreshed, upgraded 11g databases
  • Production Support for any OLTP database issue
  • Work closely with Application teams to resolve performance issues
  • Installed Oracle 11gr2 RAC on EMC/ASM storage, installed Data guard
  • Installed and configured Oracle 12c and GoldenGate 12c
  • Installed and setup Oracle 12c Enterprise Manager and Mongo database
Confidential, Midland, MI

Oracle RAC Admin and OEM 12C Admin

  • Installed and configured 11gr2 RAC and Oracle Enterprise Manager 12c (OEM) to monitor all 24 nodes 24/7 on cluster, ASM, listeners, databases, agents and performance tuning
  • Setup notifications, admin groups, monitor templates, incidents rules, scheduled jobs for backups, database clone, SQL performance, AWR reports
  • Production Support for RAC/SAP system
  • Documented troubleshooting procedures
Confidential, Dodgeville, WI

Oracle Architect/DBA

  • Performance tune production database: Solaris 10 kernel and Java application
  • Created performance health check report using OEM Grid control to analyze data in conjunction with AWR analysis by ADDM
  • Heavy user interface with marketing team to reorg data warehouse to support the Business Intelligence model, also completed many SQL store procedures for analysis and market trending and cleaned up, refreshed all images for company online sale.
  • Installed and configured Oracle Golden Gate for bi-directional replication
  • Production Support for all Erwin data models for company databases. Managed requirements and releases for all international marketing work tracks
  • Designed new production databases merge strategies
  • Configured Nagios for system-wide monitoring
  • Re-designed company development architecture and refresh procedures on VMware
Confidential, Seaside, CA

Senior Oracle DBA

  • Installed and configured OEM (Grid Control) for security system
  • Designed and maintained relational database model, with many sub models
  • Monitored and performed troubleshooting for all US Air Force Base PIPS databases
  • Performed SQL tuning for system statistics reports
  • Worked on Data warehouse project, ETL scripts and implemented.
  • Setup policy for database security, such as audit vault, FGA and encryption
  • Optimized system configuration related to daily performance and maintenance
  • Provided solutions for data guard performance on network issues
  • Installed Oracle 11g RAC on Linux RH, ASM
  • Installed and configured Golden Gate to replicate Oracle 11g databases
Confidential

Senior Oracle DBA/HP-UX Administrator

  • Provided architectural design for implementation of data warehouse to support massive credit card data repository
  • Designed data ware house model, transformed it to a physical database and configured the system/database
  • Worked closely with the development team to tune ETL scripts. Performed verification on all database stored procedures, functions and triggers
  • Created baselines for Unix servers
  • Production Support for OLTP databases
Confidential, Mountain View, CA

Senior Oracle DBA

  • Production and Development Support for PPM, worked with 70 java developers
  • Installed Sun Solaris 9 and Linux Red Hat 4.1 operating system
  • System admin for UNIX Solaris, Linux, HP-UX, IBM IAX, Windows NT, 2003, XP
  • Installed OEM Grid Control to monitor production Real Application Cluster (RAC)
  • Completed Sql package, procedures and functions for upgrade software

We'd love your feedback!