Hadoop Administrator Resume
Chicago, IL
SUMMARY:
- Around 8 + years of IT experience including 4 years of experience with Hadoop Ecosystem in installation and configuration of different Hadoop eco - system components in teh existing cluster and 4 years of experience on Linux administration .
- Experience in deploying and managing teh multi-node development, testing and production Hadoop cluster with different Hadoop components (HIVE, PIG, Spark, SQOOP, OOZIE, FLUME, RANGER,KNOX, HBASE, ZOOKEEPER) using Apache Ambari.
- Experience on Horton works and Cloudera manager Strong knowledge on Hadoop HDFS architecture and Map-Reduce framework.
- Experience in improving teh Hadoop cluster performance by considering teh OS kernel, Storage, Networking, Hadoop HDFS and Map-Reduce by setting appropriate configuration parameters.
- Experience in administering teh Linux systems to deploy Hadoop cluster and monitoring teh cluster using Ambari.
- Experience in upgrading Hadoop cluster from current version to minor version upgrade as well as to major versions.
- Experience in using Zookeeper for coordinating teh distributed applications.
- Experience in managing Hadoop infrastructure like commissioning, decommissioning, log rotation, rack topology implementation.
- Experience in managing teh cluster resources by implementing fair and capacity scheduler.
- Experience in scheduling jobs using OOZIE workflow.
- Scheduling jobs using crontab.
- Experience in benchmarking, performing backup and disaster recovery of Name Node metadata and important sensitive data residing on cluster.
- Strong knowledge in configuring Name Node High Availability.
- Experience in configuring Hadoop Security (Ranger and Knox gateway).
- Experience in handling multiple relational databases: MySQL, SQL Server.
- Assisted Developers with problem resolution.
- Ability to play a key role in teh team and communicates across teh team.
- Global Service Delivery experience by bringing together resources to accomplish organizational goals using ITIL framework.
- TEMPEffective problem solving skills and outstanding interpersonal skills. Ability to work independently as well as within a team environment. Driven to meet deadlines. Ability to learn and use new technologies quickly.
- Worked on setting up Name Node high availability for major production cluster and designed Automatic failover control using zookeeper and quorum journal nodes.
- Setting up automated 24x7 monitoring and escalation infrastructure for Hadoop cluster using Ambari.
- Experienced in Linux Administration and TSM Administration
TECHNICAL SKILLS:
Hadoop Ecosystems: HDFS, Hive, Sqoop, Spark, Splunk, Zookeeper, HBase, Oozie, Kerberos, Ranger, Knox
Operating System: Windows, Linux, AIX, Ubuntu, AWS
RDBMS: MYSQL, Oracle, DB2, MSSQL SERVER.
Languages: C, C++, Java, shell scripting, Python, SQL
Other Tools: Service Now, VPN, Win SCP, Putty,Edit++ and Notepad++
PROFESSIONAL EXPERIENCE:
Confidential, Chicago, IL
Hadoop Administrator
Responsibilities:
- Responsible for architecting Hadoop clusters Translation of functional and technical requirements into detailed architecture and design.
- Installed and configured multi-nodes fully distributed Hadoop cluster of large number of nodes.
- Addressing and Troubleshooting issues on a daily basis.
- File system management and monitoring
- Provided Hadoop, OS, Hardware optimizations.
- worked with popular hadoop distribution like Hortonworks.
- Worked independently with Hortonworks support for any issue/concerns with Hadoop cluster.
- Implementing Hadoop Security on Hortonworks Cluster.
- Installed and configured Hadoop ecosystem components like Map Reduce, Hive, Pig, Sqoop, HBase, Zookeeper and Oozie.
- Involved in testing HDFS, Hive, Pig and Map Reduce access for teh new users.
- Cluster maintenance as well as creation and removal of nodes using Apache Ambari
- Worked on setting up high availability for major production cluster and designed automatic failover control using zookeeper and quorum journal nodes.
- Implemented capacity scheduler to allocate fair amount of resources to small jobs.
- Performed operating system installation, Hadoop version updates using automation tools.
- Configured Oozie for workflow automation and coordination.
- Implemented rack aware topology on teh Hadoop cluster.
- Importing and exporting structured data from different relational databases into HDFS and Hive using Sqoop.
- Configured Zookeeper to implement node coordination, in clustering support.
- Rebalancing teh Hadoop Cluster.
Environment: Hortonworks (HDP 2.5), Ambari 2.4, HDFS, Java, Shell Scripting, Splunk, Python, Hive, Spark, Sqoop, Linux, SQL, Cloudera, Zookeeper, AWS, HBase, Oozie, Kerberos, Ranger
Confidential Charlotte NC
Hadoop Administrator
Responsibilities:
- Handle teh installation and configuration of a Hadoop cluster.
- Build and maintain scalable data using teh Hadoop ecosystem and other open source components like Hive and HBase.
- Monitor teh data streaming between web sources and HDFS.
- Close monitoring and analysis of teh Map Reduce job executions on cluster at task level.
- Inputs to development regarding teh efficient utilization of resources like memory and CPU utilization based on teh running statistics of Map and Reduce tasks.
- Changes to teh configuration properties of teh cluster based on volume of teh data being processed and performance of teh cluster.
- Setting up Identity, Autantication and Authorization.
- Maintaining Cluster in order to remain healthy and in optimal working condition.
- Handle teh upgrades and Patch updates.
- Set up automated processes to analyze teh System and Hadoop log files for predefined errors and send alerts to appropriate groups.
Environment: Horton works (HDP 2.2), HDFS, Hive, Spark, Sqoop, SQL, Splunk, Linux, Zookeeper, HBase, Oozie.
Confidential
Hadoop Administrator
Responsibilities:
- Allocating teh name and space Quotas to teh users in case of space problems.
- Installed and configured Hadoop security tools Knox, Ranger and enabled Kerberos
- Managing cluster performance issues.
- creating snapshots and restoring snapshots.
- Good experience in troubleshoot production level issues in teh cluster and its functionality.
- Backed up data on regular basis to a remote cluster using DistCp.
- Regular Commissioning and Decommissioning of nodes depending upon teh amount of data.
- Maintaining Cluster in order to remain healthy and in optimal working condition.
- Handle teh upgrades and Patch updates.
- Inputs to development regarding teh efficient utilization of resources like memory and CPU utilization.
- Based on teh running statistics of Map and Reduce tasks.
- Balancing HDFS manually to decrease network utilization and increase job performance.
- Commission and decommission teh Data nodes from cluster in case of problems.
- Set up automated processes to archive/clean teh unwanted data on teh cluster, in particular on Name node and Secondary name node.
- Discussions with other technical teams on regular basis regarding upgrades, Process changes, any special processing and feedback.
Confidential
Technical Specialist
Responsibilities:
- Monitoring Hadoop Ambari cluster and jobs
- Working as HWX consultant at teh client location
- Working on 13 Hadoop clusters of Horton works distribution
- Troubleshooting failed jobs, Performance Tuning, Automation using shell script
- Performed various Ambari and HDP upgrades
- Worked on different Hadoop eco system tools like HDFS, YARN, Hive, Spark and HBase.
- Documentation of infrastructure, software, systems configuration, process and policies.
- Experience in installation, configuration, deployment and management, setting up teh Development, Testing, Staging and Production environments of Oracle Weblogic 10/11g Server, SOA 11g and WebSphere Application Server on AIX, Unix, Sun Solaris, Windows servers.
- Experienced in SOA Suite, WebCenter and Oracle Application Server administration and deployments.
- HP Blade System administration is teh management of all C-Class HP servers for teh C-TNOSC.
- Configuring management and monitoring components for: HP Onboard Administrator, HP Integrated Lights-Out 2, HP Blade System instrumentation, Performance Monitoring and Blade Server deployment.
- Responsible for configuring HP Insight Control tools featuring HP Systems Insight Manager and HP Rapid Deployment tools.
- Configuring library and library paths, drives and drive paths, device class.
- Audit Library if any mismatch in LIBRARY inventory and Backup inventory.
- Health checking of File systems.
- Maintaining minimum amount of File system Space.
- Coordinating with Backup & Storage team for any Backup Failures.
Confidential
System Administrator
Responsibilities:
- Installed and Configured Red Hat Linux servers.
- Worked on more than 600 Linux servers to install and configure applications.
- Decommissioned around 300 Red Hat Linux servers.
- Actively engaged in power maintenance and network maintenance calls where I am responsible to fix issues on Red Hat Linux Servers and Solaris Servers.
- V2V'Ed Red Hat Linux Servers from old IP addresses to new IP addresses using VMware.
- Installation, configuration, and Operating System upgrade on, Red Hat Linux 5, 6.
- Installed and Configured teh Web and investigate teh configuration changes in teh production environment.
- Launching Amazon EC2 Cloud Instances using Amazon Images (Linux/ Ubuntu) and Configuring launched instances with respect to specific applications.
- Experience in creation of environments on virtual machines to be handed over to development and QA teams
- Linux technical support and prepared technical documentation for decommissioning.
- Regular backing up of critical data and restoring backed up data & Server’s verification.
- Responsible for day-day administration of Linux System and middleware. Administration of RHEL 5, 6, CentOS with work related to install, test, tune, upgrade, troubleshoots server’s issues and load patches.
- Performed data and activities inside teh data warehouse.
Confidential
System Administrator
Responsibilities:
- Involved in capacity planning and requirements gathering for multi datacenter Cassandra cluster.
- Implemented backing map to evict data from coherence solutions to Cassandra layer.
- Implemented spark solution to enable real time reports from Cassandra data.
- Designed and implemented dual datacenter setup for all Cassandra clusters.
- Experience in Administration of WebLogic Server on Unix/Linux OS.
- Performed WebLogic Server administration tasks such as installations (12c), upgrading, patching, deployments, configuration, monitoring, and performance tuning of WebLogic application server in multi cluster/server environment.
- Expertise in various methods of AIX/Linux/Solaris OS installation/ Migration using Network Installation Manager (NIM), Jumpstart and Kick start.
- Daily level 3 problem solving and tasks such as Oracle ASM disks, LVM and CIFS file system management, break/fix issues, user administration, etc.
- Worked in a team for installing and configuring Zabbix monitoring system for monitoring Service Availability, Disk Space Usage & availability and Host Availability.
- Adhered to change management and audit guidelines as well as security requirements for each customer's servers.
- Completed documentation and preparing procedures for various UNIX tasks.
- Assisted with Unix Server maintenance activities such as patching, disaster recovery practice, etc.
- Setup and maintained PowerHA/HAMCP clusters and Oracle ASM clusters on a server level.
- Setup Webmin server and client.
- Served on multiple on-call schedules, including 24x7 on-call rotation.
