Senior Hadoop Administrator Resume

Alpharetta, GA

PROFESSIONAL SUMMARY:

  • Total 13 years of hands-on IT experience including 4 years of extensive exposure in Big Data Hadoop Administration and 7 years in Linux/Unix/Windows Administration Technologies.
  • Possesses strong abilities in Designing, Planning, Building, Configuring, Administering, Troubleshooting, Maintenance, Performance Monitoring and Fine-tuning of large scale Hadoop production Clusters using Apache, Cloudera and Hortonworks, on Physical as well as Cloud AWS servers.

TECHNICAL SKILLS:

Hadoop/Big Data:  HDFS, MapReduce, HBase, Kafka, Storm, Spark, Ranger, NiFi, Pig, Hive, Sqoop, Flume, Hue, Oozie, ZooKeeper, Apache Phoenix, Knox

Operating System:  RHEL/SUSE/CentOS/Ubuntu/Windows

Cloud & Virtualization:  EC2, S3, SQS, Lambda, Auto Scaling, EBS volumes, RDS, Redshift, ELB, VPC, Security Groups, IAM roles, Policies, EMR and DynamoDB

DevOps Tools:  Jenkins, Gradle, Chef, Ansible, Git, GitHub, SVN, Docker, Maven, Agile, Scrum

Programming languages:  Python, Linux shell scripts, Java, Scala

Web Servers:  WebLogic, WebSphere, Apache Tomcat

Network Protocols:  TCP/IP, HTTP, DNS, DHCP, NTP, SFTP, LDAP, SMTP, FTP, Kerberos

Database:  Oracle/MySQL/HBase

Scheduling:  Oozie Coordinator, Autosys

QUALIFICATIONS:

  • Total 13 years of IT experience including 4 years of experience as Hadoop Administrator and 7 years as Linux/Unix/Windows Administrator.
  • Experience in Capacity Planning, installation, configuration and support of Hortonworks and Cloudera CDH Clusters
  • Experience in Installation and Configuration of Hadoop Ecosystems - HDFS, YARN, MapReduce, HBase, Storm, Kafka, Ranger, Spark, Pig, Hive, Sqoop, Flume, Tez, ZooKeeper, Oozie
  • Experience in setting up Hadoop High Availability (NameNode, ResourceManager, Hive, HBase) and Disaster Recovery.
  • Strong experience with Hadoop Security and Governance using Ranger, Kerberos, Security Concepts-Best Practices, Falcon.
  • Good experience in BI, data warehousing, analytics, and databases
  • Good experience with data analytics tools such as Splunk, Cognos, Tableau
  • Strong experience with cluster security tools such as Kerberos, Ranger and Knox.
  • Good knowledge of Apache Spark, Spark Streaming, Apache Kafka
  • Experience on product upgrades, rollbacks, and updating patch fixes between different product versions.
  • Experience in Commissioning and Decommissioning of nodes within a cluster.
  • Experience in job automation using Oozie, cluster coordination through Zookeeper and MapReduce job scheduling using Capacity Schedulers.
  • Experience in tool Integration, automation, configuration management in GIT, SVN, Jira platforms.
  • Experience in setting up monitoring and alerts for the Hadoop cluster, and creating dashboards and weekly status reports for uptime, usage, issues, etc.
  • Experience in setting up Sqoop to ingest RDBMS data into Hive and vice versa.
  • Experience in setting up Flume and Kafka to ingest log data into HDFS.
  • Expertise in Hadoop Cluster Performance Tuning and Troubleshooting.
  • Experience setting up Hadoop Clusters on EC2 instances for Product POCs.
  • Experience in Red Hat Enterprise Linux Administration and DevOps tools such as Puppet, Chef, and Jenkins.
  • Monitor the cluster (jobs and performance) and fine-tune when necessary using tools such as Ambari, Autosys, and AppDynamics.
  • Design, implement, test and document performance benchmarking strategy for the platform as well as for each use case.
  • Experience with the Continuous Integration and Continuous Deployment pipeline ecosystem including tools such as Maven, Gradle, Jenkins and Puppet configuration tools.
  • Hands-on experience in AWS provisioning and good knowledge of AWS services like EC2, S3, VPC, IAM, ELB
  • Maintain hardware-level stability and availability, including all break/fix issues, hardware replacement, hardware modifications, and hardware/server configurations
  • Participate in a 24x7 on-call support rotation

Environment: Hortonworks, Ambari, YARN, MapReduce2, HBase, Hive, Tez, Pig, Knox, Sqoop, Oozie, ZooKeeper, Kerberos, Ambari Metrics, Ranger, Phoenix and Spark

PROFESSIONAL EXPERIENCE:

Senior Hadoop Administrator

Confidential, Alpharetta, GA

Responsibilities:

  • Responsible for Managing large scale Hadoop cluster environment, handling all Hadoop environment builds, including design, capacity planning, cluster setup, performance tuning and ongoing monitoring.
  • Implemented large scale Hadoop (Hortonworks HDP 2.4 Stack) enterprise Data Lake for Prod, DEV, and UAT Environment.
  • Upgraded Hortonworks Ambari and HDP Stack from 2.3 to 2.4 Version in Dev, DR and Prod Environment.
  • DataNode commissioning and decommissioning. HDFS disk rebalancing.
  • Changing Hadoop, YARN and HBase configuration based on issues and performance. Configured YARN queues based on Capacity Scheduler for resource management.
  • Hadoop Security - Kerberos - Setting up Generic, Headless and Service keytabs. Setting up Kerberos principals. Create user access and user directories, allocate space quotas and resolve user permission issues. Create strategy and maintain Ranger policies for HDFS, Hive and HBase.
  • Setting up High-Availability for NameNode, Hive and YARN (ResourceManager)
  • Monitor job performances, file system/disk-space management, cluster and database connectivity, log files, management of backup/security and troubleshooting various user issues.
  • Configuring HBase replication between Production and Disaster Recovery Cluster. HBase Performance and High Availability, Peer Replication testing.
  • Importing Data using Apache Phoenix with sqlline.py and psql.py
  • Full shutdown backup using the DistCp tool and restoring the data from backup. Snapshot setup and creation.
  • Cluster status monitoring by Ambari cluster Management and HBase Master Web UI.
  • Importing and exporting data into HDFS and Hive using Sqoop. Transfer and load Memo and Payment datasets
  • Involved in creating Hive tables, loading them with data and writing Hive queries that run internally as MapReduce jobs.
  • Analyzing various Hadoop log files for troubleshooting.
  • Design, implement, test and document performance benchmarking strategy for the platform as well as for each use case.
  • Prepared Architecture documents and detailed configuration documents. Maintain HDFS directory structure and access as per the standard.
  • Support the Application team through incident management tools such as ServiceNow and fix various issues related to the Hadoop platform.
  • Hands-on experience in diagnosing and troubleshooting various networking, hardware and Linux server service issues and performing preventive maintenance.
  • Participate in a 24x7 on-call support rotation and off-hours maintenance windows.

Environment: Hortonworks, Ambari, YARN, MapReduce2, HBase, Hive, Tez, Pig, MySQL, DB2, Sqoop, Oozie, ZooKeeper, Kerberos, Ambari Metrics, Ranger, Phoenix and Spark

Hadoop Administrator

Confidential, Bentonville, AR

Responsibilities:

  • Responsible for installation, configuration, supporting and managing Hadoop Clusters.
  • Responsible for Cluster maintenance, Monitoring, commissioning and decommissioning Data nodes, Troubleshooting, Manage and review data backups, Manage & review log files.
  • Collaborating with application teams to install operating system and Hadoop updates, patches, version upgrades.
  • Configuring Flume for efficiently collecting, aggregating and moving large amounts of log data from many different sources to HDFS.
  • Importing and exporting structured data from different relational databases into HDFS and Hive using Sqoop.
  • Setting up High-Availability for Namenode and ResourceManager
  • Secured production environments by setting up Linux users, setting up Kerberos principals.
  • Configured YARN queues - based on Capacity Scheduler for resource management.
  • Installed Oozie workflow engine to schedule Hive and PIG scripts.
  • Hands-on experience with ZooKeeper and ZKFC in configuring and managing NameNode failure scenarios.
  • Used Spark to build fast analytics for the ETL process and constructed an ingest pipeline using Spark Streaming.
  • Hands-on experience in diagnosing and troubleshooting various networking, hardware and Linux server service issues and performing preventive maintenance.
  • Participate in a 24x7 on-call support rotation and off-hours maintenance windows.

Environment: Java MapReduce, Scala Spark, HDFS, Hive, Pig, MySQL, DB2, Sqoop, Flume, Oozie, Eclipse, SVN, Maven, Jenkins.

Linux/Hadoop Administrator

Confidential

Responsibilities:

  • Responsible for cluster maintenance, adding and removing cluster nodes, cluster monitoring and troubleshooting, manage and review data backups, manage and review Hadoop log files.
  • Continuous monitoring and managing the Hadoop cluster through Cloudera Manager.
  • Installed Oozie workflow engine to run multiple Hive and Pig jobs.
  • Configuring Flume for efficiently collecting, aggregating and moving large amounts of log data from many different sources to HDFS.
  • Importing and exporting structured data from different relational databases into HDFS and Hive using Sqoop.
  • Managed and monitored disks and file systems using LVM on Linux.
  • Solve production problems when needed 24x7. Develop and document best practices.
  • Planning, installation, configuration, management and troubleshooting of Red Hat Enterprise Linux platform for test, development and production servers
  • Monitor Linux Server for CPU utilization, Memory Utilization and Disk Utilization for performance monitoring.
  • Maintain hardware-level stability and availability, including all break/fix issues, hardware replacement, hardware modifications, and hardware/server configurations
  • Worked with Linux, Oracle Database, and Network teams to ensure the smooth relocation of the servers.
  • Perform physical hardware installation and configuration according to project requirements
  • Provision and manage Amazon Web Services resources for Production, QA, and Development
  • Manage off-site team for monitoring and managed hosting services (verify OS patches and backups)
  • Responsible for 24x7 Global on call support for production Issues.

Linux/Unix Administrator

Confidential, Wilmington, DE

Responsibilities:

  • Handling the on-call and resolving the critical tickets after business hours.
  • Preparing the SLA justification for missed SLAs (Sev 1 & 2 only)
  • Handling the restoration of the files from TSM and Networker Backup.
  • Attending Bridge call for sev1 & sev2 issues & working till issues get resolved.
  • Working on ticketing process based on ITIL (IT Infrastructure Library). Working on Automated and Manual Tickets
  • Replacement of H/W (Motherboard, Memory, Media-Drives, NIC Cards, HBA) by coordinating with onsite team/vendor.
  • Installing patches on all servers every quarter as per customer OLA
  • Performance tuning and monitoring using netstat, iostat, vmstat and sar.
  • Supported and administered Veritas Volume Manager and Veritas Cluster products.

Linux/Windows Administrator

Confidential, NY

Responsibilities:

  • Configuring and troubleshooting Linux and Solaris machines.
  • Configuring and troubleshooting LVM (Linux) and SVM (Solaris)
  • User administration (Add, Delete, Modify) on Linux and Solaris
  • Configuring and troubleshooting NIS client (Linux and Solaris)
  • Configuring and troubleshooting NFS (Linux and Solaris).
  • Configuring and troubleshooting Apache and Samba servers.
  • Taking backups of production servers using Tivoli Storage Manager
  • Configuring and troubleshooting RAID (0, 1 and 5)
