We provide IT Staff Augmentation Services!

 hadoop Developer/sql Dba  Resume

5.00/5 (Submit Your Rating)

White Plain New, YorK

PROFESSIONAL SUMMARY

  • Around 9 years of IT experience as a SQL DBA, Administration & Development with around 4 years of experience in the Big Data like Hadoop and it’s Ecosystem in Development, Test and Production Environment on various business domains like Health, Insurance and Telecommunication using jave, scala and SQL.
  • Excellent understanding of Hadoop architecture and its components such as HDFS, JobTracker, TaskTracker, NameNode, DataNode, Resource Manager, Node Manager and MapReduce programming paradigm. 
  • Hands on experience in installing, configuring and using Hadoop Ecosystem - HDFS, MapReduce, Pig, Hive, Flume, Hbase, Spark, Sqoop, Flume and Oozie.
  • Hands on experience using Cloudera, HortonWorks and Appache Hadoop distributions.
  • Ingested the data from various file system to HDFS using UNIX command line utilities. 
  • Data was ingested into the system using Kakfa that receives data from various database providers onto HDFS for analysis and data processing.
  • Importing and exporting data job's, to perform operations like copying data from HDFS and to HDFS using Sqoop.
  • Experience with complex data processing pipelines, including ETL and data ingestion dealing with structural, unstructural and semi-structural data. 
  • Exposure to Cloudera development environment and management using Cloudera Manager.
  • Strong understanding of various Hadoop services, MapReduce and YARN architecture.
  • Responsible for writing Map Reduce programs.
  • Experience with Spark, Spark SQL and Spark Streaming.
  • Experience with various performance optimization like using distributed cache for small dataset, partitioning, bucketing in hive and map side joins when writing map reduce jobs
  • Hands on experience loading data to Hive partitions using different condition and managing buckets in Hive as well.
  • Design and developed Map Reduce jobs to automate transfer the data from HBase.
  • Expertise in OLAP as analysis processes using PIG and in OLTP using HIVE and MapReduce.
  • Experienced in developing UDFs for Hive, Pig, MapReduce using Java.
  • Excellent understanding of NoSQL databases like HBase, MongoDB & Cassandra.
  • Understanding of Oozie to schedule all Hive/sqoop/Hbase jobs.
  • Experience in Microsoft cloud and setting cluster in Amazon EC2 & S3 including the automation of setting & extending the clusters in AWS Amazon cloud.
  • Hands on expertise in real time analytics with Apache Spark (RDD, DataFrames andStreamingAPI). 
  • Experience for using RDD lineage to reconstruct lost data, when partition is lost.
  • Used Spark DataFrames API over Cloudera platform to perform analytics on Hive data.
  • Realtime experience in integrating Hadoop with Apache Storm and Kafka. Expertise in uploading Click stream data from Kafka to HDFS, HBase and Hive by integrating with Storm.
  • Background with traditional databases such as SQL Server, Oracle, MySQL. 
  • Extensive experience in MSSQL database administration, management, migration, upgrade, production, and design with MS SQL Server 2016/ 2014/2012 (with always on high availability and Azure SQL)2008r2/2005 using SSMS, SSIS, SSAS, SSRS in Multiple industries.
  • Highly experienced in T-SQL, Database Designing, implementation, Deployment and System Administration using Microsoft SQL Server as well as Windows 2003, 2008, 2012 and 2012r2 with VMware Virtualization.
  • Experience using Performance Monitor/Profiler to solve Dead Locks, long running queries.
  • Extensively used the native tools like Index Tuning Wizard, Database Tuning Advisor, Profiler, Performance Monitor, Activity Monitor, quest Fog light and Event Viewer for performance analysis.
  • Highly experience in Data Migration between SQL to Azure and SQL to SQL by using side-by-side upgrade process Management.
  • Experience in creating and managing database objects like tables, views, stored procedures, Triggers, user defined data types and functions.
  • Expertise in implementing and maintaining efficient Disaster Recovery Strategies and High availability like Replication (Snapshot and Transactional with updatable subscriptions), Log shipping, Clustering with VMware and Hyper-V (Active-Active and Active Passive).
  • Experience with Database consistency checks using DBCC & DMV’s Dynamic Management Views.
  • Real time experience in performing integrity checks. Methods include configuring the database maintenance plan wizard and DBCC.
  • Experience in troubleshooting SQL issues by using SQL Tools Execution plan, Trace.
  • Have experience in Performance Tuning, T-SQL Query Optimization and Solving Blocking issues.
  • Expertise in Identifying the Bottlenecks caused by complex queries using Activity Monitor/SQL Profiler and handling them by implementing better Query Execution plans.
  • Extensive experience about query optimization, complex Scripts, Batches, Stored Procedures, Cursor, Triggers, Tables, Views, User Defined Functions, Data Integrity, and SQL joins.
  • Experience in ETL between Homogenous and Heterogeneous System (Oracle/Access/Excel/CSV/XML) using SQL tools (SSIS, DTS, Bulk Insert, BCP) and providing a definite solution.
  • Experience of SQL Server Database Administration tasks such as planning, scheduling jobs, backup and recovery strategies, index tuning and performance tuning, and archiving data.
  • Experience in monitoring SQL server, capturing performance problem and improve performance by tuning Databases.
  • Experience in creating and maintaining Backup and Restore, Mirroring and Log-Shipping strategy as a part of Disaster recovery.
  • Major strengths are familiarity with multiple software systems, ability to learn quickly new technologies, adapt to new environments, self-motivated, team player, focused adaptive and quick learner with excellent interpersonal, technical and communication skills.

TECHNICAL SKILLS:

Database: MS SQL Server 2005/2008/2012/2014/2016 , Oracle11g,Mysql, Hadoop HDFS and Map reduce, sqoop, Hive, Spark and Pig, Kafka, Oozie.

Language: T-SQL, PL/SQL, Linux/Unix, CMD, Java, python and Scala.

Server: Windows Server 2003/2008/2012/2012 r2,Unix, Linux

DB Tools: Lite Speed (Backup Tools), SQL native Monitoring tools, NT Performance monitor/System monitor, TSQL, SQL Profiler and SQL Query Analyzer

Reporting Tools: Crystal Reports, BID, SQL Server Report Builder, Visual Studio

Data Migration: MSDTS, SSIS, ETL, BCP, Data Migration, Import/Export Wizard, Sqoop.

PROFESSIONAL EXPERIENCE

Confidential,White Plain, New York

 Hadoop Developer/SQL DBA 

Responsibilities

  • Involved in requirement gathering, Business Analysis and translated business requirement into technical design in Hadoop and Big data
  • Configure and working with multi nodes Hadoop cluster, Installed Cloudera, Apache Hadoop, Hive, Pig and Spark and commissioning & decommissioning of datanode, namenode recovery, capacity planning, and slots configuration.
  • Responsible for analyzing Hadoop cluster and different big data analytic tools including Pig, Hive and Spark.
  • Responsible for building scalable distributed data solutions using Hadoop.
  • Importing and exporting data into HDFS from different database and vice versa using SQOOP .
  • loading data from Local file system to HDFS and HDFS to LINUX file system.
  • Perform architecture design, data modeling, and implementation of SQL, Big Data platform and analytic applications for the consumer products.
  • Implemented test scripts to support test driven development and continuous integration.
  • Working on troubleshooting, monitoring, tuning the performance of Mapreduce Jobs.
  • Responsible to manage data coming from different sources.
  • Transformed the data by applying ETL processes using Hive with large sets of structured, semi structured and unstructured data.
  • Exported the analyzed data to the relational databases using Sqoop for visualization and to generate reports using SQL BID or Data Tools.
  • Experience in managing and reviewing Hadoop log files.
  • Managing different jobs using Fair scheduler.
  • Using PIG predefined functions to convert the fixed width file to delimited file.
  • Responsible for Optimizing and tuning Hive, Pig and Spark to improve performance and solve performance related issues in Hive and Pig scripts with good understanding of Joins, Group and aggregation.
  • Involved in scheduling Oozie workflow engine to run multiple MR, Hive and Pig job.
  • Managing SQL and Hadoop cluster, adding and removing cluster nodes, cluster monitoring and troubleshooting, manage and review data backups, manage and review Hadoop log files and manual fail over to check cluster functioning.
  • Querying Spark code using Scala and Spark-SQL for faster testing and data processing.
  • Develop Spark Streaming application for one of the data source using Scala, Spark by applying the transformations.
  • Export and Import the data from different sources like HDFS/HBase into SparkRDD and spark to different sorces.
  • Experienced with Spark Context, Spark-SQL, Data Frame, Pair RDD's, Spark YARN.
  • Involved in converting Hive/SQL queries into Spark transformations using Spark RDD, Scala.
  • Utilized kafka for messaging and subscribing to topic, where the producer produce a topic and consumer consumes the data via subscription.
  • Maintain various databases for Production, and in Installation, Configuration and upgradation of SQL Server 2008r 2/2012/2014/2016 technology, service packs and hot fixes for MS SQL Server 2008/2012.
  • Responsible and maintenance of different level SQL Server High availability solutions with SQL Server Failover Clustering, Replication,Database Snapshot, Log shipping, and Database Mirroring.
  • Responsible for Capacity planning, immediate performance solution, Performance Tuning, Troubleshooting, Disaster Recovery, backup and restore procedures.
  • Solid hand on performance monitoring with Activity monitor, SQL Profiler, Performance monitor, Database Tuning Advisor, DMVs and SQL Diagnostic Manager.
  • Responsible for implementing different types of Replication Models such as Transactional, Snapshot, Merge and Peer to Peer.
  • Extract, Transform, and Load data (ETL) from Big data sources to SQL Server using SQL Server Integration Services (SSIS) Packages on BID and SQL Data Tools.
  • Responsible for Database and Log Backup & point in time Restoration Process, Backup Strategies and Scheduling Backups.
  • According to Backup strategies and schedule, using third party tool like Lite Speed, as a part of database maintenance plans.
  • Responsible for Security Administration- creating users and assigning them roles and privileges.
  • Migrating contents from SQL server 2005/2008 /2008 r2to SQL Server 2014 using migration wizard.
  • Performance tuning of Queries and Stored Procedures using graphical execution plan, Query analyzer, monitor performance using SQL Server Profiler and Database Engine Tuning advisor.
  • Experience in configuration of Report Server, Report Manager, Report scheduling, give permissions to different level of users in SSRS 2008/2008R2/2005.
  • Use of DBCC and DMV Utilities to maintain the consistency and integrity of each database in the production servers.
  • Conducted root cause analysis of application availability and narrow down the issues related to coding practices, Database Bottlenecks, or Network Latency.
  • Create the MS SSIS packages for executing the required tasks. Created the Jobs and scheduled for daily running.
  • Design and configure Database, tables, indexes, store procedures, functions and triggers.
  • Experience in table/indexing, partitioning and full text search and tuning the Production server to get the performance improvement.
  • Maintaining both DEV/QA/Test and PRODUCTION Servers in sync. Installed and reviewed SQL server patches as well as service packs.
  • Troubleshot database status, performance, Replication, Log-shipping as well as various errors on production, development and UAT servers.
  • Migrate/converting DTS package to SSIS package and using those package to import and export the data from SQL Servers.

Environment: Hadoop, HDFS, MR,Hive, Pig, Spark,Sqoop, HBase, Java, Scala, Shell Scripting, Linux Red Hat. Multi node Hadoop cluster, SQL Cluster, SQL Server 2012/2014(SQL Server Management Studio, Query Analyzer, SQL Profiler, Index Tuning Wizard, SSIS, MSSQL Server 2012 Analysis Server), Microsoft Visual studio 2012 MS-Reporting Services, ADO.NET, XML.

Confidential,Warren, New Jersey

Hadoop Developer/SQL DBA 

Responsibilities

  • Configure and working with multi nodes Hadoop cluster, Installed Cloudera, Apache Hadoop, Hive, Pig and commissioning & decommissioning of datanode, namenode recovery, capacity planning, and slots configuration.
  • Developed PIG scripts to transform the raw data into intelligent data as specified by business users.
  • Worked in AWS environment for development and deployment of Custom Hadoop Applications.
  • Worked closely with the data modellers to model the new incoming data sets.
  • Involved in start to end process of Hadoop jobs that used various technologies such as Sqoop, PIG, Hive, Map Reduce and Shell scripts (for scheduling of few jobs.
  • Expertise in designing and deployment of Hadoop cluster and different Big Data analytic tools including Pig, Hive, Oozie, Zookeeper, SQOOP, flume, Spark, Impala, Cassandra with Horton work Distribution.
  • Maintenance Hadoop, Map Reduce processes, HDFS, AWS platform and developed multiple Map Reduce jobs in PIG and Hive for data cleaning and pre-processing.
  • Involved in creating Hive tables, Pig tables, and loading data and writing hive queries and pig scripts
  • Assisted in upgrading, configuration and maintenance of various Hadoop infrastructures like Pig, Hive, and Sqoop.
  • Import the data from different sources like RDBMS/Hbase into Hadoop.
  • Transformed the data by applying ETL processes using Hive with large sets of structured, semi structured and unstructured data.
  • Exported the analyzed data to the relational databases using Sqoop for visualization and to generate reports using SQL BID or Data Tools.
  • Configured deployed and maintained multi-node Dev and Test Kafka Clusters.
  • Performed transformations, cleaning and filtering on imported data using Hive, Map Reduce, and loaded final data into HDFS.
  • Experience in Oozie and workflow scheduler to manage Hadoop jobs by Direct Acyclic Graph (DAG) of actions with control flows.
  • Worked on tuning Hive and Pig to improve performance and solve performance related issues in Hive and Pig scripts with good understanding of Joins, Group and aggregation and how it does Map Reduce jobs
  • Create and maintain various databases for Production, Development and Testing Servers using MS SQL Server2005, 2008 and 2012 with always on high availability groups.
  • Extract, Transform, and Load data (ETL) from heterogeneous data sources to SQL Server using SQL Server Integration Services (SSIS) Packages.
  • Responsible for Database and Log Backup & Restoration Process, Backup Strategies and Scheduling Backups.
  • Responsible for Security Administration- creating users and assigning them roles and privileges.
  • Migrating contents from SQL server 2005/2008 to SQL Server 2008 R2 using migration wizard.
  • Performance tuning of Queries and Stored Procedures using graphical execution plan, Query analyzer, monitor performance using SQL Server Profiler and Database Engine Tuning Advisor.
  • Implemented transactional replication between Primary Server and Read Only servers.
  • Experience in configuration of Report Server, Report Manager, Report scheduling, give permissions to different level of users in SSRS 2008/2008R2/2005.
  • Responsible for implementing and maintaining efficient Disaster Recovery Strategies and High availability like Replication (Snapshot and Transactional with updatable subscriptions), Log shipping, Clustering with VMware (Active-Active and Active Passive).
  • Use of DBCC Utilities to maintain the consistency and integrity of each database in the production server.
  • Conducted root cause analysis of application availability and narrow down the issues related to coding practices, Database Bottlenecks, or Network Latency.
  • Used SSIS to create ETL packages (.dtsx files) to validate, extract, transform and load data to data warehouse databases and process SSAS cubes.
  • Experience in table/index partitioning and full text search and tuning the Production server to get the performance improvement.
  • Maintaining both DEV/Test and PRODUCTION Servers in sync. Installed and reviewed SQL server patches as well as service packs.
  • Creating Backup strategies and scheduling using third party tool like Lite Speed, as a part of database maintenance plans.
  • Created the MS SSIS packages for executing the required tasks. Created the Jobs and scheduled for daily running.
  • Design and configure Database, tables, indexes, store procedures, functions and triggers
  • Troubleshot database status, performance, Replication, Log-shipping as well as various errors on production, development and UAT servers.
  • Experienced in Point-in-time restoring databases in production and development servers.
  • Migrate DTS and SSIS to import and export the data from SQL Servers.
  • Monitoring SQL servers for performance ad-hoc and pro-actively.
  • Design and implement T-SQL Scripts and Stored Procedures and tune up them.
  • Providing 24/7 supports to SQL application developers in implementing the applications on production server.

Environment: Apache Hadoop, HDFS, MapReduce, Sqoop, Flume, Pig, Hive, HBase, Oozie, Java,Scala, Kafka, Linux, SQL Server 2012(SQL Server Management Studio, Query Analyzer, SQL Profiler, Index Tuning Wizard, SSIS, MSSQL Server 2012 Analysis Server), Microsoft Visual studio 2012 MS-Reporting Services, ADO.NET, XML.

Confidential,Marietta, Ohio

SQL DBA

Responsibilities:

  • Create and maintain various databases for Production, Development and Testing Servers using MS SQL Server 2005. Planning the location of data and Transaction log files on the disk.
  • Install and configure SQL Server 2005/ 2008 on the server and client machines.
  • Extract, Transform, and Load data from heterogeneous data sources to SQL Server using SQL Server Integration Services (SSIS) Packages.
  • Experience of DBCC Utilities to maintain the consistency and integrity of each database in the production server.
  • Conducted root cause analysis of application availability and narrow down the issues related to coding practices, Database Bottlenecks, or network latency.
  • Resolve locking and blocking issues.
  • Experience with Database consistency checks using Store Procedure, DBCC & (Dynamic Management Views) DMV’s.
  • Experience in table/index partitioning and full text search.
  • Tuning the Production server to get the performance improvement.
  • Running Index tuning wizard to identify missing indexes.
  • Use of Microsoft Diagnostic utilities to take memory dumps and automated traces.
  • Implementing and Monitoring SQL Server jobs using custom scripts.
  • Monitoring SQL Server LOGS, Application logs and analyzing the logs.
  • Connecting SQL Server remotely using Terminal services.
  • Providing 24*7 supports to SQL application developers in implementing the applications on production server.
  • Maintaining both DEV/QA/Test and PRODUCTION Servers in sync.
  • Creating users and assigning appropriate permissions.
  • Creating Backup strategies and scheduling using third party tool like Lite Speed, as a part of database maintenance plans.
  • Created the MS SSIS packages for executing the required tasks. Created the Jobs and scheduled for daily running.
  • Involved in Performance Tuning for source databases, target databases.
  • Maintained BizTalk Application Servers and BizTalk SQL Servers 2008.

Environment: SQL Server (SQL Server Management Studio, Query Analyzer, SQL Profiler, Index Tuning Wizard, SSIS, Visual Studio.

Confidential,New York, New York

 Network Admin/SQL DBA

  • Install and configure workstation and windows server 2003.
  • Installation, Configuration and up gradation of SQL Server 2000/2005/2008 technology, service packs and hot fixes using MS SQL Server 2008.
  • MS SQL Server 2005, Planning the location of data and Transaction log files on the disk.
  • Responsible and knowledge of high availability of SQL Server solutions with SQL Server Failover Clustering, Database Snapshot, Log shipping, and Database Mirroring.
  • Responsible for Capacity planning, Performance Tuning, Disaster Recovery trouble shooting, backup and restore procedures.
  • Solid hands on performance monitoring with Activity monitor, SQL Profiler, Performance monitor, and maintain various databases for Production, Development and testing servers, using tuning advisor, SP commands, DMVs, and DMFs.
  • Responsible for implementing different types of Replication Models such as Transactional, Snapshot, Merge and Peer to Peer.
  • Extract, Transform, and Load data from heterogeneous data sources to SQL Server using SQL Server Integration Services (DTS) Packages.
  • Implementing DBCC Utilities to maintain the consistency and integrity of each database in the production server.
  • Conducted root cause analysis of application availability and narrow down the issues related to coding practices, Database Bottlenecks, or network latency.
  • Resolve all locking, blocking and deadlocking issues.
  • Experience with Database consistency checks using DBCC & Dynamic Management Views using DMV’s.
  • Suggested Disk Capacity, Processors and Memory based on the Capacity planning.
  • Experience about Desktop/Laptop’s Network connectivity and support to end users.
  • All System configuration, update patches, drivers, setup user and administrative accounts, assign permissions and security.
  • Provide local and remote help desk support to all staff members, resolving various technical issue.
  • Maintain and install applications/devices on Local and Wide Area Network level Server 2003 Windows XP, MS OFFICE LAN/WAN & WLAN connectivity, Network Setup, IP configuration (HTTP, TCP/IP, DNS, DHCP and WINS), Network Printer configuration, share files folders.
  • Deployment and manage Exchange Server including server backup, email backup and support end users.
  • Create and manage Server 2003 security, such as monitoring system performance, Event Viewer log, Firewall, Virus protection, Windows update, Spyware Protection.

Environments: SQL Server (SQL Server Management Studio, Query Analyzer, SQL Profiler, Index Tuning Wizard, SSRS, SSIS, Visual Studio.

We'd love your feedback!