Lead Big Data Consultant Resume
North Carolina
SUMMARY
- Passionate, experienced Big Data technologist with 10+ years of overall experience, including 5+ years in Big Data.
- Well versed in web development using the Microsoft stack and Java technologies; moved into the Big Data domain and currently developing applications on the Hadoop ecosystem.
- Continuously adapting to new technologies, with a good understanding of the cloud (AWS/Azure), streaming frameworks such as Spark, and languages with functional features such as Scala and Python.
TECHNICAL SKILLS
Big Data Ecosystems: Hadoop, MapReduce, HDFS, YARN, HBase, Spark, Kafka, ZooKeeper, Hive, Pig, Sqoop, Cassandra, Oozie, Flume, CDH (Cloudera), Solr, Protobuf, Avro, Kite, Vertica, Tableau
Cloud: Working knowledge of AWS, Azure
Programming Languages: Java, Spark, C#, C/C++, VB
Scripting Languages: JSP & Servlets, PHP, JavaScript, XML, HTML, Python, Bash
Databases: NoSQL, Oracle, MS SQL Server, MySQL
Tools: Eclipse, IntelliJ, MS Visual Studio
OS: Linux (Yum, RPM), Windows XP/7, MS-DOS
Continuous Integration: JIRA, Jenkins, Crucible, Git, Chef
PROFESSIONAL EXPERIENCE
Confidential
Lead Big Data Consultant
Responsibilities:
- Led and mentored 4-member teams on two projects; owned the KPI being reported on in the first, and owned the Electronic Data Interchange (EDI) transaction type whose failure points we were reconciling in the second.
- Initiated and conducted a series of cross-team requirement-gathering sessions to understand workflows, then broke them down into technical JIRA tickets in the backlog.
- Designed and developed MapReduce pipelines using the Apache Crunch Java library and the Spark framework; ingested data from HDFS/HBase/Kafka; serialized data with Google Protobuf/Avro; wrote transformed data into HDFS/HBase/Vertica using database schema migration tooling.
- Solved a skewed data storage problem on HBase (hotspotting) using a hash-key salting strategy.
- Simplified troubleshooting by creating a utility that extends an existing API to further deserialize Avro records and print their contents.
- Extensively used Oozie to schedule, chain, and monitor jobs; used the Kite SDK to partition and store data efficiently on HDFS.
- Performed extensive troubleshooting using Hive.
Confidential, North Carolina
Big Data Consultant
Responsibilities:
- Worked on the Hadoop ecosystem, developing pipelines using the MapReduce framework.
- Wrote to Cassandra both from the pipelines and directly from Oracle to expose large volumes of data to a transactional web application.
- Used Hive to create internal/external tables and perform optimized joins; stored the results in a final Hive table that analysts queried with HiveQL.
Confidential
Technology Consultant
Responsibilities:
- Designed, developed, and supported various applications, including web applications, console applications, and web services.
- Worked closely with project managers, business users, and business analysts; involved in requirement gathering and in designing technical specification documents.
- Extensively used the Microsoft stack, the Java stack, and third-party libraries and services such as Google Maps API, Confidential Rad Controls, SAP Web Services (BAPI), and Salesforce.
