We provide IT Staff Augmentation Services!

Hadoop / Big Data Administrator Resume

5.00/5 (Submit Your Rating)

SUMMARY

  • Overall 10+ years of extensive IT experience that includes 2 years of Hadoop / Big Data Administration, monitoring & trouble shooting of Hadoop cluster and mainframe applications development.
  • Experience in installing apache Hadoop,Cloudera and Hartonworks Hadoopdistribution and configuring the cluster of nodes.
  • Experience with configuration ofHadoop Ecosystem components: Hive, HBase, Pig, flume and Sqoop.
  • Excellent troubleshooting skills in Hardware, Software, Application and Network.
  • In depth understanding/knowledge ofHadoopArchitecture and various components such as HDFS, Yarn, Job tracker, Task tracker, Name node, Data node and Map reduce concepts.
  • Experience in building, maintaining multipleHadoopclusters (PROD, UAT, DEV) of different sizes and configuration
  • Experience in managing and reviewingHadooplog files.
  • Experience in designing and building disaster recovery planning across the data centers to provide business continuity.
  • Monitored the cluster resources & configured the Alerts using Cloudera Manager for theHadoop Cluster.
  • Manage nodes onHadoopcluster, Hadoopcluster connectivity check, Implement newHadoophardware infrastructure, OS integration and application installation, HDFS support and maintenance, Adding/Removing a Node, Data Rebalancing, Maintaining backups for name node, Setup of standby name node, HadoopUpgrades
  • Proficient in Shell Scripting, familiarity with Pig, Python, Perl and Java
  • Implemented custom shell scripts to monitor the system and setup alerts to notify the appropriate teams to take actions
  • Monitored network performance using Netstat, Traceroute and tuned network parameters.
  • Excellent communication and interpersonal skills and outstanding team player with an attitude to learn.
  • Energetic self - starter with excellent analytical and organizational skills. Achieves goals, objectives and milestones in an accurate and consistent manner.

TECHNICAL SKILLS

Hadoop: Apache Hadoop 1.x, 2.x(Yarn, Resource Manager, application Manager, Node Manager, Container, Application Master), Hive, Pig, Sqoop, Zookeeper, Oozie, Flume, Cloudera and Hartonworks Distribution

Programming Languages: Java, J2EE, EJB, Hibernate, Java Beans, Spring, Struts, Web services, Python.Net, C#, VB.Net, ASP.Net, Perl

Database: Oracle, MS SQL SERVER, Hbase, Mongo DB, Sybase, DB2, Teradata.

ETL & BI Tools: OWB, OBIEE, Informatica, Power Center, ETL, IBM Infosphere Information Server Data Stage.

Operating Systems: Windows, Red Hat Linux, Solaris, AIX

Mainframes: Cobol, CICS,MVS, PMS, FOCUS, VSAM

Version Control: SVN

Change Management: JIRA, SM9

Release Management: Jenkins

Schedulers: ZEKE, TIDAL

Others: MS Access, MS Word, MS Excel.

PROFESSIONAL EXPERIENCE

Hadoop / Big Data Administrator

Confidential

Responsibilities:

  • Installation and configuration of 20 node cluster, maintenance, monitoring, performance tuning and troubleshootingHadoop clusters in Development, UAT and Production.
  • Expertise in Capacity Planning, defining hardware and software prerequisites and small to medium Hadoopclusters
  • Experience in Performance tuning the Hadoopcluster
  • Good experience in understanding Hadoop logs and resolving the issues.
  • Experience in fine tuning configuration parameters.
  • Defined file system layout and directory permissions.
  • Monitor Hadoop cluster job performance and capacity planning
  • Implemented best practices for HDFS file system
  • Created groups and users and provided HDFS directory access to the clients.
  • Worked on High Availability for Name Node using Cloudera Manager to avoid single point of failure.
  • Manage and review data backups and log files.
  • Deployed map reduce applications on cluster.
  • Deployed Pig, Hive, Sqoop code into UAT and into PROD
  • Commissioning and Decommissioning Nodes from time to time.
  • Worked withHadoopdevelopers, designers in troubleshooting map reduce job failures and issues and helping to developers.
  • Work with network and Linux system engineers to define optimum network configurations, server hardware and operating system.
  • Production support responsibilities include cluster maintenance and managing Hadoop cluster and admin guide management.

Environment: CDH 4.x,Hadoop, HDFS, Map Reduce, Hive, HBase, Pig, Sqoop, Flume, Redhat Linux, Oracle, Java, JUnit, Agile methodology.

Hadoop / Big Data Administrator

Confidential, MI

Responsibilities:

  • Worked as a Hadoop administrator to setup cluster for Analytics
  • Installing, configuring, administering, debugging and troubleshooting Hadoop clusters.
  • InstalledHADOOPcluster in fully distributed Mode with Name Node, Secondary Name node, resource manager, node manager and data nodes.
  • Created groups and users and provided HDFS directory access to the clients.
  • Granted required permissions to the HDFS file system.
  • Deployed Map Reduce code into UAT from DEV and from UAT to PROD.
  • Deployed Pig, Hive, Sqoop code into UAT and into PROD
  • Tuned theHadoopClusters and monitored the cluster using Clodera manager.
  • Created users to access the HDFS filesystem using Web HDFS
  • Adding new data nodes and decommissioning data nodes and performed cluster load balance
  • Installed and configured Federation of Name Nodes for Name Node high availability for various business team.
  • Managed and Reviewed Hadooplog files
  • Worked on OWB ETL tool to load data into Oracle staging from flat files.

Environment: CDH 4.x,Hadoop 2.2, Hive, Sqoop, Flume, Oozie, Oracle, Redhat Linux, OWB, Java, Agile methodology.

Lead Mainframe Support

Confidential

Responsibilities:

  • Provided technical support and development on the existingmainframePAXUS General system.
  • Provided daytime and night time on-call production support.
  • Provided analysis and solutions for batch abends in order to establish permanent fixes.
  • Coded COBOL programs using the PAXUS SUPRA database.
  • Maintained and handled batch jobs using SORT and File-Aid tools.
  • Worked in some of the online screens using SDF.
  • Supported the EDI business.
  • Worked on a project for the old PMS system to provide a user requirement.
  • Standalone programs and used mainly for creating reports or extract files as required in batch jobs.

Environment: IBMMainframe3390, MVS/ESA, COBOL, JCL, TSO, ISPF, SUPRA, DB2, VSAM, File-Aid, Abend-Aid, Sort, CICS, Alchemist and ZEKE.

MainframeDeveloper

Confidential

Responsibilities:

  • Involved in Analysis and Design, discussed with client requirements, prepared necessary specifications, prepared analysis and design documents.
  • Developed and maintained the mainframe online systems.
  • Migration of applications from CSP V3.3 to CSP V4.1 and CICS/ESA TO CICS TS FOR S/390
  • Analyzing existing COBOL programs and created system documents.
  • Development and Unit testing for COBOL and FOCUS components.
  • Worked on JCL, SORT steps and PROC for batch applications.
  • Reviewing of Programs, Test plans and Test Results.

Environment: MVS, COBOL, JCL, DB2, FOCUS.

We'd love your feedback!