We provide IT Staff Augmentation Services!

Hadoop Consultant Resume

3.00/5 (Submit Your Rating)

TX

SUMMARY:

  • Software development, programming design and systems management and implementation using Oracle, Informix, MySQL databases in UNIX Client - Server environment.
  • Big Data Ecosystems including Hadoop MapReduce, Pig, Hive, Impala, Sqoop, Flume, Oozie, Zookeeper, Spark, Scala.
  • Excellent knowledge of Hadoop Ecosystem Confidential and components such as Hadoop Distributed File System (HDFS), MapReduce 1(Job Tracker) and MapReduce 2(YARN).
  • Hands on experience in importing and exporting data from different databases like Oracle, MySQL into HDFS using Sqoop.
  • Expertise in developing Pig Latin scripts and using Hive Query Language. Used Pig as ETL tool to do transformations, event joins, filter and some pre-aggregation.
  • Extensive knowledge of UNIX/Linux OS, System administration skills, Utilities and Shell scripts.
  • Strong critical thinking and analytical skills wif problem solving and root cause analysis experience.
  • Good working knowledge of Tableau visualization tool.
  • Strong knowledge of system and software quality assurance best practices and methodologies. Knowledge of applicable data privacy practices and laws - CPI-810, SOX.

TECHNICAL SKILLS:

Big Data Ecosystems: Hadoop, HDFS, MapReduce, HBase, Hive, Pig, Scala, Sqoop, Flume, Oozie, Spark, Zookeeper

Databases: Oracle, MySQL, SQL Server, Informix, SAP HANA, DB2

Operating Systems: Linux, UNIX, AIX, HPUX, Windows Server 2008

Tools Eclipse IDE, PuTTY, SFTP, TCP/IP, Ant, CVS, Toad for Oracle, Informatica, HP Quality Center, SharePoint: Programming Languages

C, Java, SQL, NoSQL, PL/SQL, R, Pig Latin, HiveQL, Shell scripts: Packages

PROFESSIONAL EXPERIENCE:

Confidential, TX

Hadoop Consultant

Responsibilities:

  • Involved in data ingestion from relational databases into HDFS using Sqoop. Created shell scripts to ftp POS and Product reviews data into HDFS.
  • Configured Flume to capture the textual and weblog data into HDFS.
  • Data cleansing and data enrichment to remove duplicates, null values was done using Pig Latin and HiveQL.
  • Build exception files for all non-compliant data using Pig. Responsible for managing data from various sources.
  • Created Hive External tables for Semantic data and loaded the data into tables and query data using HiveQL.
  • Good experience in monitoring and managing Hadoop cluster using Cloudera Manager.
  • Worked on Impala for exposing data for further analysis and for generating transforming files from different analytical formats to text files.
  • Generate final reporting data using Tableau for testing by connecting to the corresponding Hive tables using Hive ODBC connector.

Environment: Cloudera CDH5.7, MapReduce, Flume, Sqoop, Hive, Pig, Oozie, Impala, Tableau, Eclipse IDE, Java, Oracle, MySQL.

Confidential, TX

Responsibilities:

  • Extracted the data from Oracle into HDFS using Sqoop.
  • Experience in writing Pig scripts to transform raw data from several data sources into forming baseline data.
  • Solved performance issues in Hive and Pig scripts wif understanding of Joins, Group and aggregation and how it translates to MapReduce jobs.
  • Used Hive to analyze the partitioned and bucketed data and compute various metrics for reporting.
  • Worked wif UDFs as and when necessary to use in Pig and Hive queries.
  • Managed and reviewed Hadoop log files. Tested raw data and executed performance scripts.

Environment: Hadoop, Sqoop, MapReduce, Hive, Pig, PuTTY, HP QC, Java, Oracle.

Confidential

Software Engineer

Responsibilities:

  • Lead and played key role in the successful migration of SIGS West local Ordering system to xRM to create one Unified Ordering system for processing both West and East region Local Service Requests (LSRs).
  • Tech Lead for Flow Through team in Wholesale Local group, responsible for automation of the Local Service Requests (LSR). Mentor Development team through design, development and deployment phases.
  • Worked wif Business and cross functional teams in the successful implementation of various releases to increase Flow Thru percentage rate from 50% to 90% and maintain SLA for local metrics.
  • Successfully completed project related to the sale of 14 states residential land lines and small business assets to Frontier Communications - worked closely wif business, infrastructure and DBA teams in the data sizing, servers allocation/repurpose, migration of customer and transaction data. This project resulted in the yearly savings of approx. $1.5 Million by repurposing/elimination of hardware and elimination of software licensing.
  • Worked wif SQL queries, Triggers, Stored Procedures and wrote interfaces to backend Mainframe DB2 Order processing system NOCV. Supported NMC team in resolving customer issues. Worked closely wif DSS team for the generation of real time reports of LSR orders.
  • Proactively worked wif testing team to complete progression testing prior to release code freeze.
  • Met CPI-810 compliance remediation for logging and password access, RCO and apps scans items. Supported to improve predictability in IT production releases and continue to reduce release cycle time.

Environment: C, Java, AIX, Informix, SQL, SQL Server, Eclipse IDE, HP QC, Informatica

We'd love your feedback!