Cassandra Admin Resume
Bridgewater, NJ
OBJECTIVE
- To pursue an outstanding career in the present fast growing highly competitive world of Information Technology and seeking a challenging position in Software Design and Development in an organization dat provides congenial work environment and scope for mutual value addition
SUMMARY
- Over 7 years of experience in Big Data, NoSQL and RDBMS.
- 3+ Years of experience in physical design, planning, administration and performance tuning in Apache and DSE Cassandra and Hadoop(Cloudera).
- 5 year’s production DBA support on IBM DB2 UDB, SQL SERVER, MYSQL and SYBASE
- 2+ years of experience in handling complete life cycle of software projects involving requirements study, analysis, design, product conceptualization, estimation, solution architecture, coding, project management and delivery.
- 2 years of experience in the IT infrastructure management.
- 2 years of experience in Java
- 2+ years of experience in System Administration with AIX (User Management, LVM management)
- In depth noledge of DataStax Cassandra and experience with installing, configuring and monitoring cluster
- Experience in managing multi - tenant Cassandra clusters on public cloud environment - Amazon Web Services (AWS)-EC2, Rackspace and on private cloud infrastructure - OpenStack cloud platform
- Designing and working with Cassandra Query Language noledge in Cassandra read and write paths and internal architecture
- Implemented multi-data center and multi-rack Cassandra cluster
- Extensive noledge of Hadoop Architecture and various components such as HDFS, MapReduce, NameNode, DataNode, Secondary NameNode, JobTracker, TaskTracker.
- Hands on experience in installing and using Hadoop ecosystem components Hive, Pig, Impala, Oozie, Sqoop, Flume, HUE, HCatalog, ZooKeeper
- Experience in administering thes Linux systems to deploy Hadoop cluster and monitoring the cluster using Nagios and Ganglia.
- Experience using DSE Sqoop for importing data from RDBMS to Cassandra
- Experience in Designing, Installing, Configuring, Capacity Planning and administrating Hadoop Cluster of major Hadoop distributions - Cloudera Manager, Hortonworks Ambari & Apache Hadoop.
- Experience in DB migrations From AIX to LINUX and Data Dumps From Mainframe to Linux
- Excellent technical, analytical, interpersonal, problem solving and organizational skills
- Experience in Writing and maintaining Shell Scripts, DB2 Stored Procedures, Triggers.
- Experience in Java Programming and JDBC.
- Adherence of ITIL/ITSM Practices: ITIL-based IT Services Management
- Hands on Experience with supporting ERP products PeopleSoft, SAP (DBA cockpit)
TECHNICAL SKILLS
OS: AIX, Solaris, Linux, Windows (2000, XP)
S/W Languages: Visual Basic 6.0, JAVA
Big Data: Cassandra, Hadoop, HDFS, MapReduce, Hive, Pig, Sqoop, Flume, Zookeeper.
Databases: DB2 UDB 9.x (For Windows, Linux,AIX), Sybase 15, Sql Server, MySQL,MS-ACCESS
Security: Kerberos
Scripting Language: Bash, Python, Ruby .
BI Tools: Crystal Report 7.0, Brio, COGNOS, SAP BW
Data Modeling Tools: ERWIN
Monitoring Tools: SPLUNK, FOGLIGHT, GUARDIUM
External Archiving Storage: Tivoli Storage Management
Disaster Recovery & Failover Methods: HADR, HACMP
Application Servers: IIS 5.0, Apache Tomcat, IBM net.commerce, Websphere, Weblogic
Build Tools: Maven, SBT, Gradle, ANT
PROFESSIONAL EXPERIENCE
Confidential, Bridgewater, NJ
Cassandra Admin
Responsibilities:
- Administered and Maintained a 1000+ Node Cassandra Inventory.
- Design/Build Cassandra Clusters both on the physical machines and on AWS
- Capacity Planning and its growth for one year for the new cluster Builds
- Worked on Re-designing the clusters in to multiple Datacenters for performance improvement.
- Worked on Cassandra upgrades from DSE 4.6 to Apache 2.1
- Developed a Manual compaction strategy to reclaim space until the new hardware is added
- Re-Designed the existing Data model on one of our Cluster which has the largest partition.
- Developed a Sub-range repair mechanism to avoid over-streaming of data between nodes.
- Worked on Regressive Testing on Apache Cassandra 2.1 and later versions to record Benchmarking stats on our Personal cloud and Messaging applications.
- Worked on adding a huge set of 80 nodes in to one of our Multi Datacenter Cluster.
- Worked on JIRA tickets to fix bugs/issues encountered by our customers.
- Build bash scripts for Cassandra monitoring on the Cassandra cluster to track Latencies/Corrupted files/Cluster health
- Decommissioned Ops center and worked on Finding an open source alternative for Cassandra Alerting
- Created MOP’s on the Cassandra Version upgrades, Adding nodes, Building new DC
- Worked on Various Test cases to identify bugs and find a solution in the Test Environment
- Participated in 24/7 on call rotation.
- Participated in AWS patching with Site reliability engineers
- Worked with offshore DBA’s on day to day Maintenance activities.
Confidential, Atlanta, GA
Cassandra Architect
Responsibilities:
- Maintained a 40 Node Cassandra Cluster With Spark and Graph Integration for a Service Assurance applications and a 6 Node ring for video on Demand(VOD) service.
- Did an assessment on the Cassandra clusters for performance improvements and to knock out the existing time outs.
- Design/Build a non Vnode Cassandra Ring for a Service assurance application on VM’s for non-prod and Physical machines for a Production Ring.
- DSE version Upgrades from 4.7 to 4.8
- Designed a Data Model as cassandra to be a Data store with Spark integration for an application to support Real time analytics and correlation system to provide the ability to stitch multiple events and Data sources together in to service level view for a single subscriber.
- Adding/removing nodes from the both Vnodes and non Vnodes based clusters.
- Build bash scripts for Proactive monitoring on the Cassandra cluster by moving the Cassandra Mbeans to a monitoring Tool Cacti and setup the alerts for tasks like Threadpools/Read/Write Latencies/Compaction Statistics
- Implemented a Backup/restore Model for Cassandra.
- Created MOP’s on the Cassandra Version upgrades from 1.0 to 1.2
- Worked with DataStax on considering moving from an open source apache Version of Cassandra to the Datastax enterprise Edition.
- Used ANSIBLE and automation tool to automate the Cassandra Tasks such as New installtions/configurations and Basic Server Level Checks.
- Worked on a Vendor product ADRENALIN which is a Multi-screen video back office dat provides a management solution for delivery of Video-on-Demand(VOD).
- Assessment on a Mysql Databases to improve performance such as modifying Innodb Buffer and Cpu/Memory Improvements and Proactive tasks by enabling Binarylogs/slow log queries.
- Troubleshooting SqlServer Databases on a failed replication/mirroring Tasks between two Databases
- Administered Unix/Linux Servers.
- Worked on Applying server patching on a monthly Basis.
- Worked on Splunk, Created Dashboards for the Video On demand Service, on the Customer streaming at all our 20 Sites
- Managed various in house applications such as compendium which aggregates the information from Splunk and adrenalin to present the user.
Confidential, Provo, UT
Cassandra Admin
Responsibilities:
- Administered and Maintained a Multi-Datacenter Cassandra cluster
- Trained the team on the concepts of Cassandra and how to build applications on top of it
- Experience in performance tuning a Cassandra cluster to optimize it for writes and reads
- Involved in the process of data modeling Cassandra Schema
- Installed and Configured DataStax OpsCenter and Nagios for Cassandra Cluster maintenance and alerts.
- Benchmarked Cassandra cluster based on the expected traffic for the use case and optimized for low latency
- Built Cassandra Cluster both on the physical machines and on AWS
- Exported the business required information to RDBMS using Sqoop to make the data available for BI team to generate reports based on data
- Actively participated with the development team to meet the specific customer requirements and proposed effective Hadoop solutions
- Designed a Stargate solution for the search requirements on top of the Cassandra Cluster
- Data modeled the new solution based on Cassandra and on the use case
- Scheduled repair and cleanup process in production environment
- Provided security to the cluster by implementing Kerberos for Hadoop and Cassandra clusters
- Tested the high-availability architecture with enabling/disabling SSL, enabling/disabling Kerberos,
- Management Services directories as NFS mount
Confidential, Fremont, CA
Cassandra and Hadoop Admin
Responsibilities:
- Designed, Automated the process of installation and configuration of secure DataStax Enterprise Cassandra cluster using puppet on Linux platform
- Performed Cassandra Cluster benchmarking and tuned accordingly for better performance
- Designed and configured gateway node to the cluster
- Commissioning and Decommissioning cluster nodes
- Configured Log Stash for centralized log monitoring
- Deployed a Cassandra cluster in cloud environment
- Experience using SStableloader to load data into Cassandra cluster
- Designed and implemented a strategy to upgrade the DataStax enterprise cluster from 3.1.4 to 4.0.1
- Designed and implemented a strategy to move production data to a different datacenter without downtime
- Implemented Cassandra data replication
- Managed Cassandra cluster using nodetool utility and also used sstableloader to load external data in to another cluster with a different number nodes or replication strategy
- Designed and implemented a strategy to securely move production data to Development for testing purposes using sstableloader
- Configured authorization to Cassandra cluster using Password Authenticator and Kerberos for Hadoop cluster
- Used Datastax Java driver to connect to Cassandra Cluster
- Worked on requirements from Engineering and Reporting teams to design solutions on the Hadoop ecosystem
- Worked on performing major upgrade of cluster from CDH3u6 to CDH4.4.0
- Implemented Namenode HA and automatic failover infrastructure to overcome single point of failure for Namenode utilizing Zookeeper services.
- Tuned the cluster by Commissioning and decommissioning the DataNodes
- Deployed remote Hive Metastore using MySQL
- Migration of Oracle PL-SQLs queries to Hive using inbuilt and custom UDFs
- Developed Pig scripts for handling threat analysis raw data to be analyzed
- Ensured our Hadoop clusters are built and tuned in the most optimal way to support the activities of our Big Data teams
Confidential, Reston, VA
DB2/SYBASE DBA
Responsibilities:
- Hands on work, DB migrations From AIX to Linux (DB2 V 8.1 to V 9.7.4) & (DB2 V 9.7.4 to 10.1.2)
- Certified DB2 V 10 FP 2 in the Linux Env Which includes compatibility testing with the 3rd party tools such as Foglight, Guardium, VAS Before migrating the DB’s from 9.7.4
- Data Migration From Mainframe To Linux Through Flat Files
- Maintaining the production and Development database servers
- Support 24 hours on call
- Environment build (DB2 and Sybase)
- Maximize database performance by setting OS/DB/DBM and storage parameters, allocate system resources and improve performance. Capacity planning and estimation (database size, log size, tablespace page size, containers size, bufferpool calculations) Responsible for application, memory, IO and storage tuning, query optimization
- Designing of Linux VM and Capacity planning
- Investigate performance issues and other database problems using snapshots, event-monitors and other DB2 tools like db2pd, db2top
- Database Design, migration, installation
- Do Production Deployments during maintenance windows.
- Resource development includes training, coaching and evaluation
- Identifying the technical gaps, planning and implementation of best practices to overcome the issues
- Root cause Analysis /Problem Troubleshooting
- Automation to reduce manual efforts.
Confidential, Atlanta, GA
DB2 DBA
Responsibilities:
- Design and deploy a complete Enterprise Wide Information Data Warehouse Application.
- Design and deploy a complete Enterprise Wide Information Data Warehouse Application.
- A global Application spreading across all three continents, with data in a centralized DB2/AIX partitioned environment on v8.1
- Maximize database performance by setting installation and storage parameters, allocate system resources and improve performance Estimate database size, log size, tablespace page size, containers size, bufferpool calculations, Responsible for application, memory, IO and storage tuning, query optimization.
- Investigate performance and other database problems using Quest Central Tool, snapshots, event-monitors, Linux System Tools, Data Palette (Database Monitoring tool designed by this Organization)
- Worked extensively on large DPF environment running on DB2 v8.1
- Maintaining the development, test and production database servers
- Designing and developing Data Recovery strategies
- Database migration, installation
- Technical documentation & Best practice documentation
- Experience in DB migrations From AIX to LINUX and Data Dumps From Mainframe to Linux
- Worked extensively in setting up the maintenance shell scripts
- Experienced with Storage Administration like EMC SAN Administration,
- Debugging their application problems while developing the application
- Worked extensively in setting VMware, NetApp, NAS Administration, VERITAS Net Backup and Legato
- Providing Technical support and halping users on designed screens