We provide IT Staff Augmentation Services!

Hadoop Administrator Resume

0/5 (Submit Your Rating)

NC

SUMMARY

  • Over all 8+ years of experience in Systems & Network Administrative Support, Infrastructure Design, Database Design, Development and Implementation.
  • 5 years Linux Systems Administration experience.
  • 3+ years of Hadoop Deployment + Administration (CDH2/3) experience
  • Experience with complete Software Design Lifecycle including design, development, testing and implementation of moderate to advanced complex systems.
  • Strong knowledge on Cluster setup, installation, upgradation & administration on distributions like Cloudera (CDH), Horton works (HDP).
  • Strong knowledge on Hadoop ecosystem components HDFS, Hive, Pig, Map reduce, Job Tracker, Task Tracker, Name Node, Data Node and YARN architecture.
  • Experienced in developing Map Reduce programs using Apache Hadoop for working with Big Data
  • Excellent understanding / knowledge of Big Data and Hadoop architecture and various components such as Hive, HBase, Hive integration, Pig, Sqoop, Flume and knowledge of Mapper/Reducer/HDFS Framework.
  • Expertise in Installing, Updating Hadoop and its related components in multi - node Cluster environment.
  • Installation and Configuring schedulers, Flume, Scoop, Oozie.
  • Loading logs data directly into HDFS using Flume.
  • Experience in big data domains like Shared Service (Hadoop Clusters, Operational Model, Inter-Company Charge back, Life cycle Management).
  • Experience in Cloudera HadoopUpgrades and Patches and Installation of Ecosystem Products through Cloudera manager along with Cloudera Manager Upgrade.
  • Experience in understanding and managing Hadoop Log Files, managing Hadoop infrastructure with Cloudera Manager and involved in building Big Data cluster and successfully performed installation of CHD using Cloudera manager.
  • Experience in configuring clusters, failovers, back up policies for different distributions like Horton works and MapR distributions.
  • Skilled in analyzing the clients existing Hadoop infrastructure and understand the performance bottlenecks and provide the performance tuning accordingly
  • Solid understanding on configuring infrastructure monitoring using Ganglia & Nagios.
  • Strong knowledge on shell scripting and SQL.
  • Experienced in performing typical cluster related activates like Storage capacity management, performance tuning.
  • Involved in implementing High Availability and automatic failover infrastructure to overcome single point of failure for Name node utilizing zookeeper services.
  • Experienced in setting up security related activities for Hadoop clusters using Kerberos and integrated with LDAP at enterprise level.
  • Strong experience on Hadoop distributions like Cloudera & Hortonworks.
  • Extensive experience in Cluster capacity planning, performance tuning, cluster Monitoring, Troubleshooting (Hadoop Operations).
  • Experience in Sqoop configuration to import/export data to/from Teradata/MySQL databases.
  • Expertise in Installing, Configuration and Managing Red hat Linux 4, 5.
  • Good exposure implementing and maintaining Hadoop Security and Hive security.
  • Excellent communication skills, hardworking and good team player with ability to work under pressure in a highly visible role.
  • Hands on experience in application development using Java, RDSBM, and Linux shell scripting.

TECHNICAL SKILLS

SKILLS: Big Data, HDFS, Hive, Pig, Hbase, Sqoop, mahout, Hadoop components, Linux, Windows XP, Server 2008, MySQL.C, JAVA, SQL, PL/SQL, PIG LATIN, UNIX shell scripting

Platform/OS: Red Hat Enterprise (RHEL) / CentOS, Debian / Ubuntu, Mac OSX, Windows XP /Vista /7, Windows Server 2003/2008, UNIX, VMWare ESXi

PROFESSIONAL EXPERIENCE

Confidential, NC

Hadoop Administrator

Responsibilities:

  • Expertise in recommending hardware configuration set up bench mark for cluster set up for Hadoop cluster.
  • Installing, Upgrading and Managing Hadoop Cluster on Cloudera distribution across data centers.
  • Automated all the jobs, for pulling data from FTP server to load data into Hive tables, Using Oozie workflows.
  • Installed, managed and configured domains, server instances in Web logic/Tomcat application servers
  • Managing and reviewing Hadoop and HBase log files for debugging and error analyzing process.
  • Loading logs data directly into HDFS using Flume
  • Experience with implementing UNIX shell scripts to manage and setup data nodes space across cluster.
  • Practical knowledge on functionalities of every Hadoopdaemons, interaction between them, resource utilizations and dynamic tuning to make cluster available and efficient
  • Performed Importing and exporting data into HDFS and Hive using Sqoop.
  • Created, Managing tasks like listing jobs, killing jobs and failing jobs
  • Mystifying and demystifying nodes from the Cluster environment. Experience in day to day production support of Hadoop infrastructure like HDFS maintenance, Backups, manage and review Hadoop log files
  • Implemented load balancing across clusters as part of commission/decommission nodes to cluster.
  • Patching and upgrading Cloudera and Hortonworks clusters
  • Recovering from node failures and troubleshooting common Hadoop cluster issues
  • Scripting Hadoop package installation and configuration to support fully-automated deployments
  • Maintain system integrity of all sub-components (primarily HDFS, MR, HBase, Flume, Oozie, Scoop)
  • Supporting Hadoop developers and assisting in optimization of map reduce jobs, Pig Latin scripts, Hive Scripts, and HBase ingest Required

Confidential, Ashburn VA

Hadoop Administrator

Responsibilities:

  • Installed/Configured/Maintained Apache Hadoop clusters for application development and Hadoop tools like Hive, Pig, HBase, Zookeeper and Sqoop.
  • Worked on importing and exporting data from Oracle and DB2 into HDFS and HIVE using Sqoop.
  • Developed simple to complex MapReduce jobs using Hive and Pig.
  • Configured various property files like core-site.xml, hdfs-site.xml, mapred-site.xml and Hadoop-env.xml based upon the job requirement.
  • Loading log data directly into HDFS using Flume.
  • Performed Name node recoveries from previous backups
  • Responsible for troubleshooting issues in the execution of Map Reduce jobs by inspecting and reviewing log files
  • Conducted root cause analysis and worked with users to troubleshoot map reduce job failures and issues with Hive and Map Reduce.
  • Involved in implementing High Availability and automatic failover infrastructure to overcome single point of failure for Name node utilizing zookeeper services.
  • Performed a Major upgrade in production environment from HDP 1.3 to HDP 2.2.

Confidential, Chicago IL

Hadoop Administrator

Responsibilities:

  • Designing, development, monitor & maintain Hadoop eco system with High Availability.
  • Responsible for architecting Hadoopclusters Translation of functional and technical requirements into detailed architecture and design
  • Exported the analyzed data to the relational databases using Sqoop for visualization and to generate reports for the BI team.
  • Provide technical assistance for configuration, administration and monitoring of Hadoop cluster.
  • Analyzing and recommending the Big Data solutions.
  • Configuring TLS/SSL for the Hadoopcomponents/BDA cluster to provide data security in transit
  • Interacting with customers, business users and technical team for smooth execution of project
  • Manage & coordinate a team of 5 members with offshore-onshore model & track automation progress.
  • Implemented Kerberos for authenticating all the services Hadoop Cluster.
  • Involved in setting up Rack topology
  • Launched R-statistical tool for statistical computing and Graphics.

Confidential, Germantown MD

Linux Administrator

Responsibilities:

  • Performed data analytics using Hive HQL
  • Installing, configuring and updating Solaris 7, 8, Red Hat 7.x, 8, 9, Windows NT/2000 Systems using media and Jumpstart and Kick start.
  • Installed Linux using Pre-Execution environment boot and Kick-start method on multiple servers.
  • Provide Remote support for Linuxand Windows Servers
  • Configuring Samba, NFS, DHCP server in Linux& Windows
  • Working knowledge on the TCP/IP protocols RSH, SSH, RCP, SCP.
  • Adding, removing, or updating user account information, resetting passwords, etc
  • Re-compiling Linux kernel to remove services and applications that are not required.
  • Worked on day to day administration tasks and resolve tickets using Remedy
  • Monitored server and application performance & tuning via various stat commands(top,mpstat, prstat, nfsstat, prtconf, prtdiag, iostat, top, printmgr, hpimliviewdmidecode, smc etc) and tuned I/O, memory etc) for SUN Solaris and RHEL Servers
  • Maintains a disaster recovery plan. Creates backup capabilities adequate for the recoveryf data and understands concepts and processes of replication for disaster recovery
  • Debug and correct installed system software as required.

Confidential

Linux Administrator

Responsibilities:

  • High skill with of most UNIX/Windows Server commands/utilities
  • Installed and configured new hard drives and memory.
  • Involved in preparation of functional and system specifications. Estimated storage requirements for applications.
  • Configured firewall based on Redhat Linuxand FreeBSD 4.x that has three network interfaces.
  • Expertise in Linuxbackup/restore with tar including disk partitioning and formatting.
  • Planned, scheduled and Implemented OS patches on both Solaris & Linux boxes as a part of proactive maintenance
  • Involved in Building and configuring Solaris 8/9/10 using Jump start server and Red Hat Linux Servers Using Kick Start server as required for the project
  • Managing file systems and disk management using Solstice Disk suite.
  • Worked on Logical volume manager to create file systems as per user and database requirements.
  • Experience in Servers consolidation and virtualization using UML Linux, XEN and VMware virtual infrastructure, VMware ESX 2.x, VMware V center.
  • Installed Web Logic 8.1 with SP5 Server and configured Domains, Admin and managed LinuxServers for Applications to be deployed.

We'd love your feedback!