We provide IT Staff Augmentation Services!

Cassandra/mongodb/arangodb/hadoop/oracle Expert/architect Resume

Atlanta, GA

SUMMARY:

  • Seasoned professional with expertise in advanced NoSQL/BigData/Cloud technologies and Project Management
  • Resourceful leader and performer with proven ability to strengthen business operations and solutions through extensive experience of 18+ years in IT
  • Proven Engineering lead in architecting database, AWS, BigData, Automation, NoSQL and Traditional databases/applications at large scale for large businesses.
  • Strong project manager with proficiency in communicating effectively with clients and optimizing performance of talented technical staff.

TECHNICAL SKILLS:

  • DataStax (DSE4.x/5.x/6.x) Cassandra
  • Apache Cassandra
  • OpsCenter
  • Cloudera/Hortonworks
  • Hadoop 1.x/2.x
  • Hive
  • HBase
  • CouchBase
  • MongoDB
  • ArangoDB
  • Oracle 8i/9i/10g/11g,12c, 13c
  • SQL Server
  • MySQL
  • Postgresql
  • Neo4J
  • DynamoDB
  • Redis
  • RDS
  • Pig
  • Impala
  • Splunk
  • AppD
  • Dynatrace
  • Grafana
  • Spark
  • Solr
  • Kafka
  • AWS - EC2/S3/VPN
  • Performance and Troubleshooting issues
  • Backup/Recovery
  • Datos IO
  • Ansible/Chef/Puppet
  • Java, J2EE, Hibernate
  • Python
  • Scala
  • Project Management
  • Atlassian tools(Confluence,JIRA,Bitbucket,Hipchat)
  • Git, Maven, Tomcat, Jenkins
  • Zookeeper
  • Oozie

PROFESSIONAL EXPERIENCE:

Confidential - Atlanta, GA

Cassandra/MongoDB/ArangoDB/Hadoop/Oracle Expert/Architect

  • Architect and Designed Data Model for Cassandra/ArangoDB/MongoDB/NoSQL databases for OMS/Pricing environment for many use cases with Application teams/Devops; Design and implement NoSQL database systems of Cassandra database clusters THD Pricing/Sales analytical systems;
  • Designed/implemented multi-datacenter Cassandra clusters environment of 50 node on 3 regions; Designed and configured various Cassandra database parameters for multi-datacenter environment including sizing the memory (JVM), sizing the cluster (number of nodes), sizing the storage for Cassandra;
  • Automated mass deployment of Cassandra database builds using Ansible playbooks; Designed Ansible playbooks and scripts for mass infrastructure deployments, builds, maintenance activities such as Cassandra cluster health checks, backups, repairs, analysis, log aggregation and etc;
  • Designed the plans for upgrading On-Premise Multi-datacenter Cassandra clusters to AWS EC2/S3 environments on cloud; Designed and Migrated Data from Oracle to Cassandra using Spark jobs;
  • Experience in bootstrapping, decommissioning, removing, replacing, and repairing nodes; Experience in Performance tuning Apache Cassandra cluster to optimize writes and reads; Troubleshoot read/write latency and timeout issues in Cassandra; Experience in setting up the required replication factors for various use cases;
  • Designed data modeling for Cassandra environment for OMS and Inventory applications; keyspace/column family design, database coding, and database performance tuning and making schematic for Cassandra database architecture; Tuned Cassandra systems with various tuning techniques such as manual compaction, diagnosing freezing/unresponsive Cassandra peers;
  • Architect Multi-Data Center Cassandra clusters for THD environment; Architect multi-data center clusters with various snitches; Achieved data ingestion from Oracle into Cassandra 2.x/3.x; Created Cassandra column familes and tuned queries in CQL; Managed and Tuned batch/scheduled jobs on Cassandra databases and re-engineered the data model to avoid adhoc and batch jobs on Cassandra;
  • Architect/Tuned Cassandra clusters/nodes with garbage collection tuning, appropriate compaction strategy setup, tuning the large number of tombstones, tuned large SSTables and partitions; Studied performance model of Cassandra clusters and analyzed the queries for tunable consistency with various consistent levels;
  • Possess in depth knowledge of Apache Cassandra and Datastax Enterprise Cassandra DSE 5.0/6.0, In depth knowledge in Cassandra read, writes paths, internal architecture, repair, compaction, designing read/write consistency level and GC Pauses and tuning; Good knowledge on Datastax Spark/Search/ Solr in indexing and managing searches;
  • Possess hands-on with Cassandra nodetool utility; Attended and resolved various production issues in depth analysis using Nodetool, system logs, debug logs and cassandra traces;
  • Worked closely with DataStax for various production, lower and database modeling issues and resolved;
  • Automation: Automated DB tasks to scale not limiting to below (using Ansible, python, Shell, Groovy and Jenkins)
  • Install/Upgrade DSE Cassandra 6.x/Apache Cassandra, Automate Multi-DC, Spark Job Server Update/Re-deploy, Configs/ Parameters Update in AWS, Add/Remove Nodes to Data Center, Parameters for tuning - sysctl.conf, Cassandra.yaml, jvm.options, Cassandra - Repair Scripts, Cassandra - Health Checks

ENVIRONMENT: DSE 4.x/5.x/6.x,Cassandra 1.0/2.0/3.0/4.0, OPS Center 6.0, Ansible, Spark, Kafka, AWS S3/EC2, AppDynamics, Splunk, HBase, Hadoop 1.x, 2.x.x, Hive 0.13/0.14/2. x, Sqoop, Map Reduce, Golden Gate 10/11, KSH/BASH Shell, NoSQL, Git, Jenkins

Confidential, Brisbane, CA

DBA Architect - Cassandra/Hadoop/Oracle

  • Architect Multi-Data Center Cassandra clusters for Confidential environment; Architect multi-data center clusters with various snitches; Achieved data ingestion from Oracle into Apache Cassandra 1.x; Created Cassandra column families and tuned queries in CQL; Managed and Tuned batch/scheduled jobs on Cassandra databases and re-engineered the data model to avoid adhoc and batch jobs on Cassandra;
  • Architect/Tuned Cassandra clusters/nodes with garbage collection tuning, appropriate compaction strategy setup, tuning the large number of tombstones, tuned large SSTables and partitions; Studied performance model of Cassandra clusters and analyzed the queries for tunable consistency with various consistent levels;
  • Responsibilities included design, development and support production 24x7 Oracle 9i, 10g & 11g databases in Confidential, Samsclub.com, CANADA GM; Providing Database build/support project plans, LOEs for Confidential US GM e-commerce for database scaling, changes, migrations and upgrades;
  • Implemented ORACLE REAL APPLICATION CLUSTER technology to Confidential and MTEP tenants for scaling database hardware resources and increasing the availability (HA Solutions) of the database systems; Participated on all the critical issues encountered during various phases of discharging RAC for various applications such as Order/Inventory/Catalog applications;
  • Designed and documented blueprints for HA (RAC) solutions, Disaster recovery (between East and North data centers), Replication technologies for Confidential, US GM and CANADA GM; Created process standards for database maintenance such as entire site down, application down
  • Designed the process for Holiday Performance/Capacity Planning for all Confidential e-commerce tenants; Arranged multiple levels of Stress/Load tests on Database and application with keynotes and other application testing vendors on the production site; Identified the key bottlenecks during the site stress tests and generated postmortem results with detailed analysis based on order processed/min and other businesses requirements
  • Designed Site monitoring strategy on Nagios/Proactive Net for Order Management System, Catalog Inventory, Payment Gateway, Credit card authorization, Customer Relationship Management databases for Siteops/Database Admins to support the database 24x7 with pagers and alerts; Written SQL/PLSQL scripts and BASH/KSH/SH shell programs for monitoring RAC, ASM and Oracle databases;
  • Analyzed and resolved issues such as DDL/DML locks and instance level locks and latch contention, enqueues and waitevents, ITL issues, Invalid objects, hourly and daily AWR reports (Automatic Workload Repository), Tablespace High Water Mark, Max datafiles limit, Top SQLs with 10046/10053 trace analysis, Top Active Sessions, blocking sessions, highly utilized datafile volumes, dump file growth, ORA errors, database parameter changes, database memory hit ratios, Partition growth, Indexes utilization/tuning, Soft/Hard parsing;

ENVIRONMENT: Oracle 9i/10g/11g/12c; Cassandra 1.0, Hadoop 1.x, HBase, Hive, Sqoop, Map Reduce, Golden Gate 10/11, KSH/BASH Shell, NoSQL, Git, Jenkins

Confidential

Senior Database Administrator

  • Supported Data Conversion Phase I implementation to production. Built a staff calendar webpage using PL/SQL and Oracle Web Server 3.0. Tested the feasibility of Oracle Parallel Server on 8i for PDM. Implemented strategies to load 757 model data using ERWin from the legacy systems into Production.
  • Involved Conceptual and Physical Database design; Normalized Table (Commodity) containing 360 columns into 12 different tables and changed the all the SQLs and DMLs running against the table commodity which was primary accessed by CBEC (Central Board of Excise and Customs); Partitioned the table commodity with local and global indexes to avoid FTS
  • Designed and Written 15000 lines of BASH/CSH/KSH Unix Shell Scripts/SQL/PLSQL to construct Manual Replication Environment using Oracle Materialized views and PLSQL stored procedures to replicate Master Site (NRM - National Risk Manager which should have all the data) and Client ( LRM Local Risk Manager - 25 locations)
  • Accomplished the initial data load into the RMS which was about to take 5-7 days of data; Adopted various techniques on site to make the data loader faster such as adding parallel threads, huge DB BUFFER BLOCKs, More I/O slaves for each disk, High network bandwidth; Populated data from Embedded system applications (ICE GATE) into RMS (Risk management system) Database after the initial data load;
  • Designed jobs to collect the daily statistics of tables and indexes to get the best explain plan by CBO; Facilitated application team to design complex queries and PL/SQL blocks;
  • Designed the entire database structure of iJET application; Involved in the Requirements gathering phase from the Business Users, Data integration and document design; Provided document for data dictionary (Contains explanations about all database schema objects such as Tables, Indexes, Clusters, Materialized views, Packages, Stored procedures and Triggers) to application team and database team to understand about the database architecture
  • Improved performance by exhaustive performance tuning which included moving complicated business logic from the application to database, identifying poorly written SQL, SQL without bind variables, and appropriate indexing. Recommended Multipool configuration for weblogic applications; Tuned complex queries using Explain Plan, SQL TRACE with TKProf and dynamic performance views to improve the performance of query processing
  • Administered binary and raw file in database using Large object using DBMS LOBS package; Applied object oriented concepts to utilize the oracle’s resource both collection and object views; Maintained database Integrity through Key Constraints and Check Constraints

Hire Now