Sr. Data Architect Resume
Phoenix, AZ
SUMMARY
- Cloudera Certified Hadoop Administrator with 9+ years of professional IT experience, including 3 years in Big Data ecosystem technologies.
- Excellent understanding of Hadoop architecture and its components, such as HDFS, CLDB, RM, NameNode, DataNodes, Pig, Hive, Sqoop, Oozie, HBase, YARN and the MapReduce programming paradigm.
- Hands-on experience installing, configuring, and using Hadoop ecosystem components such as HDFS, HBase, ZooKeeper, Oozie, Spark, Hive, Sqoop, Pig, Impala and Flume.
- Well versed in installing, configuring, managing and supporting Hadoop clusters on distributions such as Apache Hadoop, Cloudera CDH, Hortonworks HDP and MapR.
- Very strong experience using Ansible playbooks and Jenkins to automate tasks across the cluster.
- Used Splunk extensively to read logs and set up proactive alerts.
- Helped with the planning, development and architecture of the Hadoop ecosystem.
- Experience designing, configuring and managing backup and disaster recovery using snapshots and mirror volumes.
- Experience securing Hortonworks clusters with Kerberos and configuring MapR security on MapR clusters.
- Experience in Hadoop cluster maintenance, including data and metadata backups, file system checks, commissioning and decommissioning nodes, and upgrades.
- Expertise in cluster benchmarking and configuring memory settings.
- Monitoring and managing Linux servers (hardware profiles, resource usage, service status, etc.).
- Configuring and using cluster monitoring tools such as Ganglia, Grafana, Nagios and Icinga.
- Worked on data lake (ETLP) concepts on most of the big data projects.
- Good knowledge of development methodologies such as Agile, Scrum and Waterfall.
- Experience in verticals including retail, telecom, finance and insurance.
- Familiarity and experience with data warehousing and ETL tools.
- Major strengths include familiarity with multiple software systems, the ability to learn new technologies quickly and adapt to new environments, self-motivation, teamwork, and excellent interpersonal, technical and communication skills.
TECHNICAL SKILLS
Big Data Technologies: HDFS, YARN, MapReduce, Pig, Hive, Sqoop, Spark.
Big Data Distributions: MapR, Hortonworks, Cloudera.
Installation: Jenkins, Ansible, GitLab
Batch scheduling tool: BMC Control-M, Oozie
Scripting Languages: Shell, Bash
Monitoring tools: Icinga, Grafana, Ganglia, Nagios, Ambari, Splunk, Netcool.
Reporting Tools: Service-Now, Tableau, Jaspersoft
Programming Languages: SQL, PL/SQL, Core Java, Chef, Puppet, Basic Python
Application Servers: Apache Tomcat, WebLogic Server, WebSphere, JBoss
Databases: Oracle 9.x, 10g, 11g, MySQL Server, DB2, HBase, MapR DB.
Networking & Protocols: TCP/IP, Telnet, HTTP, HTTPS, FTP, SNMP, DNS.
Operating System: Linux, UNIX, MAC, Windows.
PROFESSIONAL EXPERIENCE
Confidential, Phoenix, AZ
Sr. Data Architect
Responsibilities:
- Involved in delivering new project solutions based on the company's technology and ensuring infrastructure services were planned according to standards.
- Implemented the MapR security authentication protocol for an existing cluster.
- Performed benchmarking and stress testing with TeraGen, TeraSort, TestDFSIO and Hive aggregation.
- Managed data backups and disaster recovery at both cluster and volume level using snapshots and mirroring.
- Coordinated with the MapR team on root cause analysis and bug fixes.
- Proactively set up alerts for the smooth running of a 650-node production cluster.
- Configured high availability for the ResourceManager to prevent a single point of failure.
- Created a self-healing mechanism to keep real-time services such as Tomcat up and running.
- Built Splunk dashboards to monitor cluster utilization and services.
- Wrote Bash scripts frequently, depending on project requirements.
- Exported the generated results to Tableau for testing by connecting to the corresponding Hive tables using the Hive ODBC connector.
- Worked on Oozie scheduler workflows to automate Sqoop, Hive and Pig jobs that extract data in a timely manner, and to gather cluster stats.
- Configured Hive and tuned it to improve performance.
- Analyzed the jobs run in each queue and advised the application team on improving cluster performance as part of fair scheduler maintenance.
- Ensured jobs did not impact other real-time applications running on shared DataNodes.
- Troubleshot failed Hadoop jobs and provided recommendations to the application team.
- Followed change management and incident management processes in line with company standards.
Environment: MapR 5.2, MCS, MapR DB, Apache HBase, Spark, Hive, Sqoop, Unravel, Icinga 2.7, Ganglia 3.7.1, Nagios 4.3.4, Splunk 7.0.2, ServiceNow, Cisco UCS Manager.
Confidential, Sunnyvale, CA
Sr. Hadoop Administrator
Responsibilities:
- Primarily responsible for keeping Hadoop clusters with 6000+ nodes up and running.
- Built Hadoop clusters using Ambari and performed express upgrades.
- Changed configurations based on requirements to improve job performance, and tuned dynamically to keep the cluster available and efficient.
- Set up NameNode high availability on a major production cluster, with automatic failover control using ZooKeeper and quorum journal nodes.
- Configured queues as part of the Capacity Scheduler in different environments.
- Commissioned and decommissioned DataNodes as part of ad hoc or maintenance activities.
- Used Ansible Tower and wrote playbooks to automate repetitive tasks, quickly deploy critical applications and make proactive changes.
- Developed benchmark scripts using TestDFSIO, TeraGen, TeraSort and WordCount.
- Configured YARN and fine-tuned YARN settings to improve performance.
- Opened cases with Hortonworks for bugs reported in the clusters.
- Formulated procedures for installation of Hadoop patches, updates and version upgrades.
- Implemented Kerberos security authentication protocol for production cluster.
- Secured the Hadoop cluster using Active Directory/LDAP and TLS/SSL.
- Monitored host resources like CPU, RAM, HDD/mounts and security logs through Splunk.
- Wrote automated scripts to monitor jobs.
- Set up project volumes for new projects.
- Built a Hadoop cluster on Amazon Web Services as part of a POC by launching EC2 instances and using Elastic Load Balancer, with S3 for storage.
- Experienced in managing and reviewing log files.
- Involved in configuring Oozie workflow engine to run multiple Hive jobs.
- Worked with Hadoop developers and designers to troubleshoot Hive job failures and issues.
Environment: HDP 1.x and 2.x, Ambari 2.x, CDH 5.7, HDFS, MapReduce, YARN, Hive, Oozie, ZooKeeper, Red Hat/CentOS 6.5, Grafana 3.0.4, Nagios 3.5, Splunk 6.3, Ansible 2.4.3, GitLab, Espresso, Central-Station.
Confidential, NY
Control-M Lead / Project Coordinator
Responsibilities:
- Implemented the complete end-to-end installation of the BMC Control-M tool for batch scheduling.
- Acquired, organized and managed team members.
- Tracked and reported project metrics and measurements against the SLA and delivered on committed dates.
- Conducted weekly project status meetings and published status reports to the onsite team and the PMO.
- Coordinated with the onsite team to gather requirements and resolve issues.
- Tracked issues and discrepancies through Service Center and followed them through to resolution.
- Analyzed requirements and converted them into high-level design documents.
- Interacted with the onsite team to obtain approvals for the design, coding, testing and implementation plans.
- Scheduled, prioritized and distributed work to the offshore team.
- Developed the knowledge base within the project for a quick reference.
- Executed the project in accordance with CMM quality standards and PMP norms.
- Performed walkthroughs of the low-level designs, unit test plans and implementation plans prepared by the team at various stages of the project.
- Planned and executed defect prevention with monitoring.
- Planned quantitative resource management for smooth batch workflow in Control-M.
- Delivered quality products as per the implementation plans and within deadlines.
- Provided production and test support with a quick turnaround time.
Confidential
Linux Administrator
Responsibilities:
- Administered RHEL 5.4, including installation, testing, tuning, upgrading, patching and troubleshooting server issues.
- Configured Linux and VMware infrastructure through the existing Kickstart infrastructure.
- Configured Linux guests in a VMware ESX environment.
- Understanding of server virtualization technologies such as VMware.
- Worked on Cisco UCS, virtual infrastructure on VMware, storage migration and installations.
- Installed, configured and custom-built Oracle 10g and prepared servers for the database, including adding kernel parameters, software installation, permissions, etc.
- Implemented multitier application provisioning in OpenStack cloud, integrating with Puppet.
- Involved in integrating the vSphere hypervisor with OpenStack.
- Configured and maintained FTP, DNS, NFS and DHCP servers.
- Configured, maintained and troubleshot local development servers.
- Configured standard Linux and network protocols, such as SMTP, DHCP, DNS, LDAP, NFS, HTTP, SNMP and others.
- Wrote shell scripts for automation.
- Worked on decommissioning virtual and physical Linux hosts.
- Administered Tomcat servers serving dynamic servlet and JSP requests.
- Managed cron jobs, batch processing and job scheduling.
- Worked on planning for the recovery of critical IT systems and services in a fallback situation following a disaster that overwhelms the resilience arrangements.
- Monitored system activity such as CPU, memory, disk and swap space usage to avoid performance issues.
- Tuned kernel parameters to improve application performance.
- Provided 24x7 on-call production and customer support, including troubleshooting.
Environment: LINUX, FTP, Shell, UNIX, VMware, NFS, TCP/IP, Puppet, Oracle, Red Hat Linux.
Confidential
VSAT Engineer
Responsibilities:
- Developed software to control a VSAT antenna tracking a communication satellite.
- Installed, operated and maintained PowerVu equipment such as high power amplifiers, up converters, modulators and antenna position controllers.
- Tested and installed different types of VSAT antennas, such as prime focus, offset, Cassegrain and Gregorian, as per business need.
- Optimized radiation patterns for all antenna types and handled NOCC testing and demonstration of the entire system, for diameters from 0.8 meters to 9.3 meters.
- Experience with RF test and measurement equipment, such as General Dynamics spectrum analyzers, network analyzers, signal generators and power meters, and the Promax satellite level meter.
Environment: GeniDAQ, ADAM modules, Encoders.
