We provide IT Staff Augmentation Services!

Sr. Data Architect Resume

3.00/5 (Submit Your Rating)

Phoenix, AZ

SUMMARY

  • Cloudera Certified Hadoop Administrator with 9+ years of professional IT experience which includes 3 years of experience in Big Data ecosystem related technologies.
  • Excellent understanding, knowledge of Hadoop architecture and various components such as HDFS, CLDB, RM, Namenode, Datanodes, Pig, Hive, Sqoop, Oozie, HBase, Yarn and MapReduce programming paradigm.
  • Hands on experience in installing, configuring, and using Hadoop ecosystem components like HDFS, HBase, Zookeeper, Oozie, Spark, Hive, Sqoop, Pig, Impala and Flume.
  • Well versed with installation, configuration, managing and supporting Hadoop cluster using various distributions like Apache Hadoop, Cloudera - CDH and Hortonworks HDP and MapR.
  • Very strong experience working with Ansible playbooks and Jenkins for automating the tasks to execute across the cluster.
  • Extensively used Splunk for reading logs and setup of proactive alerts.
  • Helped in planning, development and architecture of Hadoop ecosystem.
  • Experience on design, configure and manage backup and disaster recovery using snapshots and mirror volumes.
  • Experience in providing security for Hortonworks cluster with Kerberos also configuring Mapr Security in MapR cluster.
  • Experience on Hadoop cluster maintenance, including data and metadata backups, file system checks, commissioning and decommissioning nodes and upgrades.
  • Expertise in cluster benchmark and configure the memory settings.
  • Monitor and manage Linux servers (Hardware profiles, Resource usage, Service status etc).
  • Configuring and using cluster monitoring tools like Ganglia, Grafana, Nagios and Icinga.
  • Worked on data lake (ETLP) concepts on most of the big data projects.
  • Good knowledge in different working strategies like Agile, Waterfall and Scrum methodologies.
  • Experience in Verticals including Retail, Telecom, finance and insurance domains.
  • Familiarity and experience with data warehousing and ETL tools.
  • Major strengths are familiarity with multiple software systems, ability to learn quickly new technologies, adapt to new environments, self-motivated, team player, focused adaptive and quick learner with excellent interpersonal, technical and communication skills.

TECHNICAL SKILLS

Big Data Technologies: HDFS, YARN, MapReduce, Pig, Hive, Sqoop, Spark.

Big Data Distributions: MapR, Hortonworks, Cloudera.

Installation: Jenkins, Ansible, GitLab

Batch scheduling tool: BMC Control-M, Oozie

Scripting Languages: Shell, Bash

Monitoring tools: Icinga, Grafana, Ganglia, Nagios, Ambari, Splunk, Netcool.

Reporting Tools: Service-Now, Tableau, Jaspersoft

Programming Languages: SQL, PL/SQL,Core Java, Chef, Puppet, Basic Python

Application Servers: Apache Tomcat, WebLogic Server, WebSphere, JBoss

Databases: Oracle9.x, 10g, 11g, MySQL Server, DB2, HBase, MaprDB.

Networking & Protocols: TCP/IP, Telnet, HTTP, HTTPS, FTP, SNMP, DNS.

Operating System: Linux, UNIX, MAC, Windows.

PROFESSIONAL EXPERIENCE

Confidential, Phoenix, AZ

Sr. Data Architect

Responsibilities:

  • Involved in delivery of new project solution based on company's technology and ensuring infrastructure services are projected based on standards.
  • Implemented MapR security authentication protocol for existing cluster.
  • Benchmarking and Stress Testing with TeraGen TeraSort, TestDFSIO and Hive aggregation.
  • Managing data backups and disaster recovery for both cluster and volume level using snapshots and mirroring techniques.
  • Coordinated with MapR team for root cause analysis and bug fixes.
  • Proactive setup of alerts for smoot run of 650 node production cluster.
  • Configure high availability for the ResourceManager to prevent single point of failure.
  • Created self-healing mechanism to ensure realtime service to up and running like tomcat.
  • Building splunk dashboards to monitor cluster utilization and services.
  • Writing bash scripts frequently, depending on the project requirements
  • Exported the generated results to Tableau for testing by connecting to the corresponding Hive tables using the Hive ODBC connector.
  • Worked on an Oozie scheduler workflows to automate Sqoop, Hive and Pig jobs that extract the data in a timely manner also for cluster stats.
  • Configured HIVE and tweaking it to improve performance.
  • Analyzed overall jobs run in each queue and advising application team to improve cluster performance as part of fair scheduler maintenance.
  • Ensuring jobs not to impact other realtime application as on shared datanodes.
  • Troubleshooting Hadoop failed jobs and providing recommendations to application team.
  • Using change management and incident management process to follow company standards.

Environment: MapR 5.2, MCS, MapR DB, Apache HBase, Spark, Hive, Sqoop, Unravel, Icinga 2.7, Ganglia 3.7.1, Nagios 4.3.4, Splunk 7.0.2, ServiceNow, Cisco UCS Manager.

Confidential, Sunnyvale, CA

Sr. Hadoop Administrator

Responsibilities:

  • Primary responsible to keep Hadoop clusteres with 6000+ nodes, up and running.
  • Built Hadoop clusters by using Ambari and express upgrades.
  • Changed the configurations based on the requirements for the better performance of the jobs and dynamic tuning to make cluster available and efficient.
  • Worked on setting up namenode high availability on major production cluster and automatic failover control using zookeeper and quorum journal nodes.
  • Configured the queues as part of capacity scheduler in different environments.
  • Commissioned and decommissioned of datanodes as part of ad-hoc or maintenance activity.
  • Used Ansible Tower and wrote playbooks for automate repetitive tasks, quickly deploys of critical applications and proactive change.
  • Developed scripts for benchmark in the form of DFSIO, teragen, terasort and wordcount.
  • Configured YARN and fine-tuned YARN settings to improve performance.
  • Creating case with Hortonworks for the bugs reported in the clusters.
  • Formulated procedures for installation of Hadoop patches, updates and version upgrades.
  • Implemented Kerberos security authentication protocol for production cluster.
  • Provided security for Hadoop cluster Active Directory/LDAP, and TLS/SSL utilizations.
  • Monitored host resources like CPU, RAM, HDD/mounts and security logs through Splunk.
  • Written automated script to monitor jobs.
  • Experienced in setting up project volume for the new projects.
  • Built Hadoop cluster on Amazon Web Services by launching EC2 instances and using Elastic Load Balancer, S3 for storage as part of POC.
  • Experienced in managing and reviewing log files.
  • Involved in configuring Oozie workflow engine to run multiple Hive jobs.
  • Worked with Hadoop developers, designers in troubleshooting Hive job failures and issues and helping to developers.

Environment: HDP 1.x and 2.x, Ambari 2.x, CDH 5.7, HDFS, MapReduce, Yarn, Hive, Oozie, Zookeeper, Redhat/Centos 6.5, Grafana 3.0.4, Nagios 3.5, Splunk 6.3, Ansible 2.4.3, GitLab, Espresso, Central-Station.

Confidential, NY

Control-M Lead / Project Coordinator

Responsibility:

  • Implemented complete end-to-end instillation of BMC Control M tool for batch scheduling.
  • Acquired, organized and managed of team members.
  • Tracked and reported project metrics, measurements based on the SLA and deliver on committed dates.
  • Conduct weekly project status meetings and publish status reports to the onsite team, PMO office.
  • Co-ordinated with onsite team to gather requirements and issue resolution.
  • Tracked issues and discrepancies through Service Center and ensure till resolution.
  • Analyzed the requirements and convert them into the High-level design documents.
  • Interact with onsite team to get the approvals for the design, coding, testing and implementation plans.
  • Scheduled, prioritize and distribute the work to offshore team.
  • Developed the knowledge base within the project for a quick reference.
  • Executed the project by abiding to the CMM quality standards and PMP norms.
  • Performed walkthrough on low-level design, unit test plans and implementation plan at various stages of the project prepared by the team.
  • Planned and executed of defect prevention with monitoring.
  • Planned quantitative resources management for smooth batch workflow in Control-M.
  • Delivered the quality products as per the implementation plans with in deadlines.
  • Provided production and test support with a quick turnaround time.

Confidential

Linux Administrator

Responsibilities:

  • Administration of RHEL 5.4 which includes installation, testing, tuning, upgrading and loading patches, troubleshooting server issues.
  • Configure ofLinuxand VMware infrastructure through our existing Kickstart infrastructure.
  • ConfigureLinuxguests in a VMware ESX environment.
  • Understand server virtualization technology such as VMware.
  • Worked on Cisco USC, virtual infra on VMware, Storage migration and installations.
  • Installing, configuring, custom building Oracle10g and preparing servers for database includes adding kernel parameters, software installation, permissions etc.
  • Implemented multitier application provisioning in OpenStack cloud, integrating with Puppet.
  • Involved in integrated Vsphere hypervisor with OpenStack.
  • Configure and maintained FTP, DNS, NFS and DHCP servers.
  • Configuring, maintaining and troubleshooting of local development servers.
  • Performed configuration of standardLinuxand network protocols, such as SMTP, DHCP, DNS, LDAP, NFS, SMTP, HTTP, SNMP and others.
  • Written shell scripting for automation.
  • Worked on virtual and physicalLinuxhost for decommission.
  • ServerAdministratorTomcat, Tomcat serving dynamic servlet and JSP requests.
  • Managing cron jobs, batch processing and job scheduling.
  • Worked on planning for the recovery of critical IT systems and services in a fallback situation following a disaster that overwhelms the resilience arrangements.
  • Monitoring system activities like CPU, memory, disk and swap space usage to avoid any performance issues.
  • Tuning the Kernel parameters for the better performance of applications.
  • Provided 24X7 on-calls production and customer support including trouble shooting problems.

Environment: LINUX, FTP, Shell, UNIX, VMware, NFS, TCP/IP, Puppet, Oracle, Red Hat Linux.

Confidential

VSAT Engineer

Responsibilities:

  • Developed software to control VSAT antenna to track communication satellite.
  • Installation, operational and maintenance of PowerVu equipment like high power amplifiers, up converters, modulators and antenna position controllers.
  • Tested and installed different types of VSAT antenna such as prime focus, offset, cassegrain and gregorian as per business need.
  • Radiation pattern optimization for all types of antennas and processing for NOCC test and demonstration of entire system from 0.8-meter till 9.3-meter diameter.
  • Experience on RF Test and measurement equipment like General Dynamics make Spectrum Analyzer, Network Analyzer, Signal Generator and Power Meter and Promax make Satellite Level Meter.

Environment: GeniDAQ, ADAM modules, Encoders.

We'd love your feedback!