Hadoop Administrator Resume
Nashville, TN
SUMMARY
- Around 9 years of experience in Design, Implementation and Administration experience in providing storage solutions for Hadoop, SAN/NAS and system administration.
- Hands on experience with HDFS, MapReduce and Hadoop Ecosystem (Pig, Hive, Oozie, Hbase, Flume, and Sqoop).
- Working knowledge in Hadoop HDFS Shell commands.
- Experience in NoSQL database HBASE.
- Created HBase tables to store variable data formats.
- Involved in adding huge volumes of data in rows and columns to store data in HBase.
- Provide support data analysts in running Pig and Hive queries.
- Hands on experience in installing, configuring, and using Hadoop ecosystem components like Hadoop Map Reduce, HDFS, HBase, Oozie, Hive, Sqoop, Pig, Zoo keeper and Flume.
- Deploy Pivotal/ HAWQ (Hadoop Distribution). Migrate data to HAWQ for analytics.
- MapReduce programs to load data and perform analytics using generic filters, sort, and aggregates, develop mapreduce patterns. Use maven build and packaging tool and eclipse IDE. Deploying jar files.
- Defining supporting models.
- Preparing documentation and power points and presentations for clear understanding of the process.
- Interacted with Informatica Azure.
- Write HiveQL, Pig scripts to do analytics across retirement portfolio. Write pig latin macros. Pig udfs using java Mapreduce. Integrate java mapreduce classes with pig scripts. Develop pig scripts as prototype for generating analytics. Convert pig scripts to java mapreduce programs. Write generic mapreduce filter using java generics, generic parser using regex pattern.
- Write pgplsql ETL programs, shell scripts, Data Validation / Comparison scripts
- Configure and Implement daily and monthly cycle.
- Enhance DB2 database applications. Write stored procedures and triggers.
- Experience in implementation of remote replication using Confidential SRDF suite for disaster recovery using SRDF/A and SRDF/S based on the application requirements also worked on BCV and clones
- Experience in the data migration for tech refresh projects, using both array based (Universal Volume Manager, Volume Migration, SRDF, SAN Copy and Open Replicator) and Host based (Unix native tools, Open migrator, Robocopy, LVM Mirror).
- Allocated storage to Unix/Intel environment from IBM SVC, XIV arrays based on the application requirements.
- Migrated TBs of data from IBM SVC to VMAX3 and VMAX arrays using image mode.
- Experience in installation and configuration of Brocade/McData/Cisco enterprise directors and departmental switches.
- Experience in Design, administrating, maintaining and troubleshooting Confidential VMAX3, VMAX10K/20K/40K/DMX Arrays using Solution Enabler, Unisphere and SMC.
- Experience in implementation, Administration and migration of Data from old VMX arrays to new VMAX3 arrays using SRDF. Allocated storage to Unix, Linux, Windows and ESX hosts/clusters from VMAX3, VMAX10K/20K/40K arrays.
- Experienced in configuring and administering Fabric Switches using both CLI and Connectrix Manager/Web Tools/Fabric Manger.
- Experienced in UNIX Administration including network configuration on various operating systems Solaris/Linux/HP/AIX and Windows.
- Proficient in SAN related host connectivity for various Operating systems, Solaris/Linux/HPUX/AIX/Windows operating systems.
PROFESSIONAL EXPERIENCE
Confidential, Nashville, TN
Hadoop Administrator
Responsibilities:
- SKILLS: Big Data: Cloudera CDH, Apache Hadoop. Big Data Ecosystem: HDFS, Map Reduce 2.0, Sqoop, Flume, Zoo
- Hands on experience in installing, configuring, and using Hadoop ecosystem components like Hadoop MapReduce, HDFS, HBase, Oozie, Hive, Sqoop, Pig, Zoo keeper and Flume.
- Well versed with installation, configuration, supporting and managingthe Cloudera - CDH platformwith clusters.
- Excellent understanding / knowledge of Hadoop architecture and various components such as HDFS, Job Tracker, Task Tracker, Name Node, Data Node and Map Reduce.
- Experience in managing Hadoop infrastructure like commissioning, decommissioning, rack topology implementation.
- Experience in managing the cluster resources by implementing fair scheduler and capacity scheduler.
- Experience in Implementing High Availability of Name Node and Hadoop Cluster capacity planning .
- Developed automated scripts using Unix Shell for running Balancer, file system health check and User/Group creation on HDFS.
- Experience in managing and reviewing Hadoop log files.
- Experience in upgrading Hadoop cluster from current version to minor version upgrade as well as to major versions.
- Experience in designing and building disaster recovery planning across the data centers to provide business continuity.
- Monitored the cluster resources & Configured the Alerts using Cloudera Manager for the Hadoop cluster.
- Experience in Complete development life (SDLC) cycle from gathering Requirements, Testing, Implementation and Post Implementation Support.
- Expertise in design and implementation of Slowly Changing Dimensions (SCD) type1, type2 and type3.
- Experience in creating RDD (Requirements Definition Document),HLDD(High level Design document), DDD(detailed design document) ETLMapping sheets and Unit testing document.
- Experience inDataIntegration of various data sources from databases such asXML files, Oracle, Teradata, flat filesandCSV files.
- Experience in requirement analysis, client interaction, system design, development, testing, documentation and implementation with extensive coding standards.
- Involved in Unix to Linux & PL-SQL code to AbInitio Graphs migration set of projects.
- Coded well-tuned SQL/PL-SQL scripts for warehouse instance.
- Coded well-tuned Unix Korn Shell scripts, Wrapper scripts for high volume data warehouse instances.
- Prominent use of AbInitio in every walk of Development, testing and other phases.
- Worked on scheduling tools like Maestro,Crontab and Autosys.
- Provided on 24x7 Production support for daily, weekly and monthly incremental and complete refresh warehouse environment.
- Experience in Product migration and Release management Process.
- Extensive experience in analysis of Source Systems,Staging area& TargetWarehouse and Data Martsystems.
- Excellent analytical and Communication Skills, Ability to work independently and good team player. Keeper, Oozie, Hive, Pig. ETL Tools: AbInitio (GDE 3.1, Co-Operating System 3.1). Database: IBM DB2, Oracle.
- Programming Languages: JDBC, XML and Web Services. Scheduling Tools: Maestro, Autosys, Cron Tab.
- Operating Systems: AIX,Linux, Cent OS, Red Hat Linux, MS Windows family. Concepts: OLTP, OLAP, Data Marts, Dimensional Modeling. Scripting Languages: SQL, PL/SQL, UNIX Shell Scripting (Korn).
- Experience in setup, configuration and management of security for Cloudera Hadoop clusters.
- Responsible for building scalable distributed data solutions using Hadoop.
- Responsible for day-to-day activities which includes HDFS support and maintenance, Cluster maintenance, creation/removal of nodes, Cluster Monitoring/ Troubleshooting.
- Involved in manage and review the Hadoop log files, Backup and restoring, capacity planning
- Worked with Hadoop developers and operating system admins in designing scalable supportable infrastructure for Hadoop.
- Responsible for deciding the hardware configurations for the cluster along with other teams.
- Implemented the Cluster High Availability incase of crash or planned maintenance.
- Responsible for scheduling jobs in Hadoop using Fair scheduler .
- Involved in configuring Oozie workflow engine to run multiple Hive jobs.
- Configured Sqoop and developed scripts to extract data from DB2 into HDFS.
- Worked extensively with Sqoop for importing metadata from DB2.
- Involved in administration, configuration management, monitoring, debugging and performance tuning of Hadoop environments.
- Continuous monitoring and managing the Hadoop cluster through Cloudera Manager.
- Configured the Alerts using Cloudera Manager for the Hadoop cluster.
- Installed and configured Flume, Hive, Sqoop,Zookeeper and Oozie on the Hadoop cluster.
- Maintain extensive documentation on Hadoop cluster, policies and configurations .
- Defining supporting models
- Preparing documentation and power points and presentations for clear understanding of the process
- Interacted with Informatica
- Diligently teaming with the infrastructure, network, database, application and business intelligence teams to guarantee high data quality and availability.
Confidential, Nashville, TN
Sr Storage Engineer
Environment: Confidential VMAX3,VMAX40K/20K/10K, DMX-4, DMX3, CX4,CX3-80, Ibm SVC, XIV Arrays, Cisco MDS (9500 series), Brocade 48K, 4100, SYMCLI Solutions Enabler, Unisphere, Navisphere Manager, Cisco CLI and Cisco DCNM, Fabric Manager, Aix, Solaris, Linux, Windows, and ESX clusters, Unix Servers, and VMWare
Responsibilities:
- Part of the tech refresh project team and migrated PBs of storage from old DMX, VMAX arrays to new VMAX3 using SRDF. Migrated storage from IBM SVC arrays to the VMAX3 arrays involving various host environments including AIX, Solaris, Linux, windows and ESX clusters.
- SAN and storage administration environment consisting of VMAX, DMX-3/DMX-4, Clariion CX 3-80, Cisco 9513, and 9509 Enterprise directors and Brocade 48k, 4900, 3900.
- Provisioned storage for multi-vendor host environments from DMX using Confidential Control Center symcli. Also, provisioned storage from Clariion CX 3-80 using Confidential Navisphere manager and Navicli.
- Extensively used Symcli/SMC for virtual provisioning like creating the thin pools, data devices and adding data devices to thin pools and creating thin devices and allocating to various types of hosts from V-max
- Using migration techniques like Array based i.e. Open replicator and SRDF and Host based i.e. Open Migrator/LVM Mirroring to migrate the data from DMX1000/DMX2000/CX600/700 to DMX4/VMAX storage system in UNIX/Windows environment for online data migration with less downtime
- Created aliases, zone and added newly created zones to existing active configuration and enabled it by using CLI and web tools for brocade switches.
- Created the VSANS and added the interfaces, created the device-aliases zones and added the zones to zonesets and activated the zonesets using CLI fabric and device manager for MDS cisco switches
- Provided direct technical support for the coordination and implementation of releases, upgrades or changes to the Windows and unix server environments.
- Working on VMAX 3. Mostly migrating the servers from different Array to VMAX3 using SRDF and allocating storage from VMAX3 Array
- Assigned all Flash XtremIO storage to critical hosts (Windows/Unix/ESX)
- Worked on XtremIO with four X-Brick cluster worked on VPLEX Dual and quad engines
- Discoverd storage from backend storage created extents, devices and Virtual volumes and assigned to various type of hosts (unix/windows/ESX)
- Worked on configuration with VPLEX METRO and GEO.
- Migrated backend storage without downtime using VPLEX data mobility for lease rollover of storage arrays.
- Experience in administration of NetApps FAS filers (7-mode FAS 3270 and cluster mode FAS 8040) such as creating SVMs, aggregates, volumes, configuring CIFS/NFS exports, enabling de-duplication and implementation of local and remote replication for backup/DR purpose.
- Experienced in creating Storage Virtual Machines (SVMs), creating aggregates, volumes, export policies, protection policies for local and remote replication for the 8-mode cdot clusters using Oncommand System Manager.
- Experienced in Configuration of NFS and CIFS shares on NetApp FAS filers using CLI/FilerView/Oncommand System manager.
- Migrated NFS/CIFS shares from old NAS filers (FAS 3070) to new FAS filers (FAS 3270)
- Allocated storage to new-builds/existing servers/clusters of various host OS flavors (AIX, Solaris, Linux, VMWare ESX nodes and Windows) from Confidential, NetApp and IBM enterprise storage arrays.
- Implemented local replication by configuring snapshots for backups and snapmirror for Disaster Recovery for the production NAS volumes
- Participated and supported multiple enterprise DR tests by breaking and resynching snapmirror relation for the production NAS volumes when needed for testing.
- Provisioned Block Storage from NetApp FAS filers (FAS 6080) for Unix/Windows servers by creating volumes, LUNs and mapping luns to the hosts.
- Prepared and documented Standard Operating procedures for Storage/NAS and SAN related tasks.
- Worked on various Pri-1Pri-2 issues related to SAN/NAS and storage issues and coordinated with vendors for any hardware replacements and code upgrades.
- Experienced in Troubleshooting various performance issues in SAN/NAS/Storage
Confidential, Nashville, TN
Sr. SAN Storage Engineer
Responsibilities:
- Successfully allocated Storage using SMC (7.1), ECC 6.0, Symcli, Navisphere and Navicli for open systems storage environment from CX3, CX4, DMX-3, DMX-4 and VMAX to Windows, Solaris and HPUX/AIX servers
- Basic experience with ISILON
- Extensively used Symcli/SMC for virtual provisioning like creating the thin pools, data devices and adding data devices to thin pools and creating thin devices and allocating to various types of hosts from V-max
- Extensively used symcli and SMC for auto provisioning like creating the initiator group, port group, storage group and adding them to masking views.
- Took VMAX training from Confidential .
- Configuration and allocation of storage for production and corporate VMware hosts from DMX-3/DMX-4 and Clariion including configuration of specific port settings for FA ports.
- Tested and implemented SRDF/A for selected high performance Tier 1 applications. Involved in the pilot testing of SRDF/A and certifying it for production use.
- Upgraded hosts as per the Confidential HEAT report and installed / upgraded and configured Power Path for AIX / Solaris and Windows Operating Systems before migrating to the new storage array and installed Symcli software.
- Used migration techniques like Array based i.e. Open replicator and SRDF and Host based i.e. Open Migrator/LVM Mirroring to migrate the data from DMX1000/DMX2000/CX600/700 to DMX3/DMX4 storage system in UNIX environment for online data migration with no downtime.
- Migrated data successfully from CX600/CX700 to CX4-240 using Mirror View/S and SAN Copy and CX600/CX700 to DMX-3 to DMX-4 using Open Replicator /Open Migrator.
- Installed and configured Power Path for AIX / Solaris /HP and Windows Operating Systems to support the load balancing and failover features among the HBAs on the system and also installed Symcli software.
- Implemented and tested Disaster Recovery using Mirror view/S and Mirror view/A for production.
- Provisioned storage from CX3 and CX4 Clarrions to windows, AIX, HP-UX, Solaris and VMware using Navisphere Manager. Responsibilities also included creating Raid groups, binding the LUNs Creating storage groups.
- Responsibilities included zoning using Cisco Device Manager/Fabric Manager/Web Tools/CLI and ECC.
- Administration of NSX Celerra systems including creation of file systems, exporting and mounting CIFS/NFS shares and configuring network interfaces.
- Configured Celerra gateway with Symmetrix and Clarion arrays
- Allocated storage from DMX -3/4 to Celerra NS40G/80G and NSX series.
- Configured DNS, NIS and NTP servers for the Data movers.
- Extensive Experience in providing file system creation and file system export for both Windows and UNIX clients using Celerra CLI and Celerra manager and managing the file system quotas.
- Configured CIFS servers and VDM’s for Windows only environment.
- Maintained all network file system (NFS) mounts and also monitored and resolved troubleshooting issues and as an administrator created users and user groups, provided access rights and privileges to users.
- Experience in Data Migration from NS704G to NS80G and NSX using CDMS tool, Celerra Replicator and SRDF.
- Implemented SRDF/A for Celerra for disaster recovery.
- Extensive Experience in providing file system creation and file system export for both Windows and UNIX clients using Celerra CLI and Celerra manager and managing the file system quotas.
- Identify underutilized resources (FA ports and host based assets) using Storage scope reporting and proactively resolved the capacity allocation issues.
- Create/update/maintain all storage related documentation and diagrams of the storage hardware.
- Troubleshooting routine critical issues including threshold optimization, server throughput, ports availability, meeting zoning requirements, one-path down, host not seeing storage and storage management problems.
Confidential
Sr. Storage Administrator
Responsibilities:
- Responsible for administration of Confidential SAN environment in data center, which contain data in Confidential Symmetrix DMX2000 and DMX3s, Clariion connected to 250 UNIX and Windows servers.
- Prepared data migration procedures by discussing with “systems administrator”, “database administrator” and “application administrators”.
- Implemented the data migration seamless with proper plan with no or minimal downtime using data migration from either Confidential or UNIX native tools.
- Implemented storage refreshes (data Migration) from Symmetrix to Symmetrix using SRDF, Clariion to Clariion using SANCopy.
- Prepared all the necessary documents (host list, required software updates, required disk capacity planning) prior to the migration by discussing with various people in the organization.
- Implemented Campus/MAN DR Solution using SRDF/S for a machine critical application.
- Involved in a “consistent data replication” for a machine critical application with oracle database from DMX-3 to DMX-3 using SRDF/A Technology.
- Implemented of Confidential Time Finder Clones for development and testing environments (UNIX and Windows) and set them to do a periodic refresh from the production.
- Followed proper Change management procedures, well prepared and submitted all the required documents for all SAN/Storage migration projects.
- Designed, implemented and managed the Backup and Recovery environment utilizing Time Finder Clones.
- Prepared proper documentation for all the SAN/Storages systems and transferred the knowledge and procedure to the storage team.
- Took part in planning, commissioning of a new DMX-3’s and deployed all the agents for ECC administration.
- Installed and configured Cisco MDS 9509 switches
- Cisco MDS SAN configuration (VSAN and zoning within the VSAN)
- Generating reports for SAN and storage system usage also collects the history for future capacity planning.
- Well versed with CLI tools (both SAN and Storage) usage in case if GUI/ECC servers are down for maintenance.
- Implemented SAN expansion by joining the new switches to the existing fabric or merging two fabrics with minimal or no disturbance to the production.
- Generating health reports for the storages and SAN proactively and take necessary action.
- Installation, configuration and driver upgrades of HBA's (Emulex & Qlogic) on UNIX and Windows servers.
- Create RAID Groups, BIND LUNs, and Create Storage Groups using Navisphere Manager.
- Perform Zoning for SANCopy. Create scripts using Navicli commands to perform migration.
- Involved in scheduling and coordinating between various technical resources to make the migration go successful without any issues/problems.
- Day-to-day administration of SAN and storage infrastructure using ECC, Symcli, Navisphere Manager.
Environment: Storage: Confidential Symmetrix DMX2000, DMX-3 Clariion CX500/700,IBM DS8300, HDS 9580, 9960,9200
Confidential
Storage Administrator
Responsibilities:
- As a storage administrator, taking care of the day to day storage provisions from both Symmetrix and Clarions.
- Creating Raid groups, storage groups and binding the Clariion luns to the hosts.
- Creating bigger luns (metas) to support the application needs using SYMCLI
- Converting the STDs to BCVs using symconfigure, create and manage device groups by establish, split and setup the BCVs for the backup operations.
- Periodic data refresh on the for test environment from the production using Time Finder Technology.
- Restoring the data from the BCVs in case of a corruption/deletion on the production environment.
- Power path software installations, configurations and administration on various operating systems to balance the I/O load among multiple HBAs and also responsible to setup the Path failover for all the production hosts.
- Taking care of all the issues related to the host connectivity including the HBA installations and configurations.
- Supporting both production and testing environments.
- Well versed with SYMCLI software heavily used to maintain the storage environment.
- Monitor and reporting the health of the Fabrics.
- Responsible for providing user support which included systems set up, hardware/software installation, configuration and general maintenance.
- Analyzed, tracked and resolved complex software/hardware matters of significance pertaining to UNIX servers and data center operations. Coordinated hardware/software installations and upgrades to ensure work is properly performed in accordance with company policy.
- Hardware and Software applications installation and upgrading servers with latest software’s and service packs.
Environment: Storage: Confidential DMX2000, Symmetrix 8830/8730, Clariion CX600
