Hadoop / Big Data Administrator Resume
SUMMARY
- Overall 10+ years of extensive IT experience that includes 2 years of Hadoop / Big Data Administration, monitoring & trouble shooting of Hadoop cluster and mainframe applications development.
- Experience in installing apache Hadoop,Cloudera and Hartonworks Hadoopdistribution and configuring the cluster of nodes.
- Experience with configuration ofHadoop Ecosystem components: Hive, HBase, Pig, flume and Sqoop.
- Excellent troubleshooting skills in Hardware, Software, Application and Network.
- In depth understanding/knowledge ofHadoopArchitecture and various components such as HDFS, Yarn, Job tracker, Task tracker, Name node, Data node and Map reduce concepts.
- Experience in building, maintaining multipleHadoopclusters (PROD, UAT, DEV) of different sizes and configuration
- Experience in managing and reviewingHadooplog files.
- Experience in designing and building disaster recovery planning across the data centers to provide business continuity.
- Monitored the cluster resources & configured the Alerts using Cloudera Manager for theHadoop Cluster.
- Manage nodes onHadoopcluster, Hadoopcluster connectivity check, Implement newHadoophardware infrastructure, OS integration and application installation, HDFS support and maintenance, Adding/Removing a Node, Data Rebalancing, Maintaining backups for name node, Setup of standby name node, HadoopUpgrades
- Proficient in Shell Scripting, familiarity with Pig, Python, Perl and Java
- Implemented custom shell scripts to monitor the system and setup alerts to notify the appropriate teams to take actions
- Monitored network performance using Netstat, Traceroute and tuned network parameters.
- Excellent communication and interpersonal skills and outstanding team player with an attitude to learn.
- Energetic self - starter with excellent analytical and organizational skills. Achieves goals, objectives and milestones in an accurate and consistent manner.
TECHNICAL SKILLS
Hadoop: Apache Hadoop 1.x, 2.x(Yarn, Resource Manager, application Manager, Node Manager, Container, Application Master), Hive, Pig, Sqoop, Zookeeper, Oozie, Flume, Cloudera and Hartonworks Distribution
Programming Languages: Java, J2EE, EJB, Hibernate, Java Beans, Spring, Struts, Web services, Python.Net, C#, VB.Net, ASP.Net, Perl
Database: Oracle, MS SQL SERVER, Hbase, Mongo DB, Sybase, DB2, Teradata.
ETL & BI Tools: OWB, OBIEE, Informatica, Power Center, ETL, IBM Infosphere Information Server Data Stage.
Operating Systems: Windows, Red Hat Linux, Solaris, AIX
Mainframes: Cobol, CICS,MVS, PMS, FOCUS, VSAM
Version Control: SVN
Change Management: JIRA, SM9
Release Management: Jenkins
Schedulers: ZEKE, TIDAL
Others: MS Access, MS Word, MS Excel.
PROFESSIONAL EXPERIENCE
Hadoop / Big Data Administrator
Confidential
Responsibilities:
- Installation and configuration of 20 node cluster, maintenance, monitoring, performance tuning and troubleshootingHadoop clusters in Development, UAT and Production.
- Expertise in Capacity Planning, defining hardware and software prerequisites and small to medium Hadoopclusters
- Experience in Performance tuning the Hadoopcluster
- Good experience in understanding Hadoop logs and resolving the issues.
- Experience in fine tuning configuration parameters.
- Defined file system layout and directory permissions.
- Monitor Hadoop cluster job performance and capacity planning
- Implemented best practices for HDFS file system
- Created groups and users and provided HDFS directory access to the clients.
- Worked on High Availability for Name Node using Cloudera Manager to avoid single point of failure.
- Manage and review data backups and log files.
- Deployed map reduce applications on cluster.
- Deployed Pig, Hive, Sqoop code into UAT and into PROD
- Commissioning and Decommissioning Nodes from time to time.
- Worked withHadoopdevelopers, designers in troubleshooting map reduce job failures and issues and helping to developers.
- Work with network and Linux system engineers to define optimum network configurations, server hardware and operating system.
- Production support responsibilities include cluster maintenance and managing Hadoop cluster and admin guide management.
Environment: CDH 4.x,Hadoop, HDFS, Map Reduce, Hive, HBase, Pig, Sqoop, Flume, Redhat Linux, Oracle, Java, JUnit, Agile methodology.
Hadoop / Big Data Administrator
Confidential, MI
Responsibilities:
- Worked as a Hadoop administrator to setup cluster for Analytics
- Installing, configuring, administering, debugging and troubleshooting Hadoop clusters.
- InstalledHADOOPcluster in fully distributed Mode with Name Node, Secondary Name node, resource manager, node manager and data nodes.
- Created groups and users and provided HDFS directory access to the clients.
- Granted required permissions to the HDFS file system.
- Deployed Map Reduce code into UAT from DEV and from UAT to PROD.
- Deployed Pig, Hive, Sqoop code into UAT and into PROD
- Tuned theHadoopClusters and monitored the cluster using Clodera manager.
- Created users to access the HDFS filesystem using Web HDFS
- Adding new data nodes and decommissioning data nodes and performed cluster load balance
- Installed and configured Federation of Name Nodes for Name Node high availability for various business team.
- Managed and Reviewed Hadooplog files
- Worked on OWB ETL tool to load data into Oracle staging from flat files.
Environment: CDH 4.x,Hadoop 2.2, Hive, Sqoop, Flume, Oozie, Oracle, Redhat Linux, OWB, Java, Agile methodology.
Lead Mainframe Support
Confidential
Responsibilities:
- Provided technical support and development on the existingmainframePAXUS General system.
- Provided daytime and night time on-call production support.
- Provided analysis and solutions for batch abends in order to establish permanent fixes.
- Coded COBOL programs using the PAXUS SUPRA database.
- Maintained and handled batch jobs using SORT and File-Aid tools.
- Worked in some of the online screens using SDF.
- Supported the EDI business.
- Worked on a project for the old PMS system to provide a user requirement.
- Standalone programs and used mainly for creating reports or extract files as required in batch jobs.
Environment: IBMMainframe3390, MVS/ESA, COBOL, JCL, TSO, ISPF, SUPRA, DB2, VSAM, File-Aid, Abend-Aid, Sort, CICS, Alchemist and ZEKE.
MainframeDeveloper
Confidential
Responsibilities:
- Involved in Analysis and Design, discussed with client requirements, prepared necessary specifications, prepared analysis and design documents.
- Developed and maintained the mainframe online systems.
- Migration of applications from CSP V3.3 to CSP V4.1 and CICS/ESA TO CICS TS FOR S/390
- Analyzing existing COBOL programs and created system documents.
- Development and Unit testing for COBOL and FOCUS components.
- Worked on JCL, SORT steps and PROC for batch applications.
- Reviewing of Programs, Test plans and Test Results.
Environment: MVS, COBOL, JCL, DB2, FOCUS.
