We provide IT Staff Augmentation Services!

Big Data Architect And Lead Resume

4.00/5 (Submit Your Rating)

Houston, TX

SUMMARY:

  • Accomplished technology professional with over 16+ years of experience in IT industry, in the areas of Architecture, Solutioning, Delivery Excellence - Project Management and Centre of Excellence
  • In-depth and hands-on experience across a variety of Technology platforms viz. Big data and Hadoop, In-memory data grid, Enterprise Content Management and Business Process Management
  • Big Data Architect and part of the Confidential BigData team with key focus on devising Big Data Architecture & Solutions across the Big Data Application Layer (Real-time - Streaming, In-memory data grid) and Data Layer (Hadoop, NOSQL, MPP)
  • Experience understanding and analyzing the requirements, providing solution, technical architecture, design of application, development, testing and deployment of the solution using Big Data Technology stack
  • Cloudera Certified Developer for Apache Hadoop (CCDH V5) with excellent experience on Hadoop Ecosystem (HDFS, YARN, MapReduce, Spark, HBase, HIVE, PIG, Sqoop, Oozie, Flume and Zookeeper), No SQL (HBase), Java and Scala
  • Extensive development experience and hands on experience on Hadoop ecosystem --developing ingestion strategies and build transformations using Spark, PIG, Hive and MR jobs
  • Recent experience has been on Apache Spark 1.2.0, Spark SQL, Hive and HBase
  • Good understanding of AWS components and its overall architecture with some prototyping experience on the same

TECHNICAL SKILLS:

Roles Played: Bigdata Architect, Documentum Architect, Technical Consulting, Technical Lead, Project Manager, Practice Lead, COE Lead

Hadoop Ecosystem: HDFS, Yarn, Spark, MR, Hive, HBase, Pig, Kafka, Oozie, Flume, Sqoop, Storm,In-Memory

Data Grid: GemfireCloud: AWS - EC2, EMR, Redshift, S3, etc Java, Scala, Python, Shell scripts

Documentum: xCP, D2, Webtop, DCM

ECM: OpenText Livelink, Lotus Notes, Core Java, Spring, HTML, JavaScript, XML, JSON

PROFESSIONAL EXPERIENCE:

Confidential, Houston, TX

Big Data Architect and Lead

Responsibilities:

  • Attend client workshops and understand the requirements along with Engineering and Data Science team to design a solution
  • Work with the client infrastructure and admin teams to design and build cluster needed
  • Involved in design and implement Data ingestion strategy using Flume and Sqoop
  • Design considerations for file format and implemented parquet format
  • Designed and implemented the parquet processing using Spark SQL and Dataframes
  • Designed and implemented data validation, checks and transformations using Spark jobs
  • Designed and implemented Operational store that stores operational and error information about jobs, data, processing, etc
  • Worked with the data science team to build queries on Hive and identify transformations to be applied
  • Design the schema for the target HBase schema
  • Design and implement the as-is data and transformed data to push into HBase
  • Verify that the data is pushed correctly in HBase

Environment: -Hortonworks distribution 2.3, Spark, Hive, HBase, Phoenix, Oozie, Sqoop, Flume, Parquet Format, Java, Scala

Confidential, NYC, NY

Big Data Architect and Lead

Responsibilities:

  • Attend client workshops to understand the business requirements
  • Design a solution for the requirements
  • Work with the client design team to develop the design
  • Technical guidance to the development team
  • Ensure that the developed design and code meet the standards and client requirements

Environment: -Hortonworks distribution 2.3, Gemfire 8.1, HBase, Phoenix, Spring Data, Java

Confidential, NYC, NY

Big Data Architect

Responsibilities:

  • Attend client workshops to understand the requirements
  • Work with the IBM CDC team on the dynamics of the full/delta data to be ingested to Hadoop
  • Design a Framework which is scalable and cater to multiple sources in time.
  • Design schema validation and also checks for data checks via MapReduce jobs
  • Design a strategy for data archival and data lineage
  • Design handling of full refresh and delta changes on an hourly basis
  • Design a daily reconciliation process to handle the changes
  • Innovatively designed the operational data model to store the job, task and error details.
  • Technical guidance to the development team
  • Ensure that the developed design and code meet the standards and client requirements

Environment: -Hortonworks distribution 2.0, Hadoop - HDFS, MR, Hive, Pig, Flume, Sqoop, Oozie, Java

Confidential, Atlanta, GA

Big Data Architect

Responsibilities:

  • Led the Architecture workshops with the customer team to understand the requirements and put up the Bigdata Component Stack and Design.
  • Deliver High Level proposed Architecture stack
  • Led the low level design and execution of the overall engagement.

Environment: -Cloudera distribution 5.3, Kafka, Spark 1.2, HBase, Hadoop, Java

Confidential

Big Data Architect

Responsibilities:

  • Design and develop various PoC’s - batch and realtime processing
  • Part of the R&D team to explore different tools / products in Big Data space
  • Involved to fix issues on projects that had technical issues
  • Work on RFP’s with the pre-sales team to provide solutions to the Big Data proposals
  • Perform customer visits and do a pre-sales on Big Data

Confidential, Atlanta, GA

Big Data Architect

Responsibilities:

  • Design a new generation low latency high throughput In-memory Grid for SWA’s Cargo unit.
  • Model the NoSQL Data Grid as required by the access patterns as well as transactions requirements.
  • Design and configure the various regions to store the data
  • Configure the regions in replicated and partitioned based on the use cases
  • Design and configure different eviction conditions and for the data to persist on disk for overflow cases
  • Configure various regions for indexing that is built in

Environment: -Pivotal Gemfire 7.0.2

Confidential, Atlanta, GA

Big Data Architect

Responsibilities:

  • Understand the requirements
  • Worked with Senior Architect to create ingestion strategy from various sources into Hadoop.
  • Guide the team during project development
  • Ensure the design and developed code meet the requirements and standards

Environment: -Cloudera distribution 4, Hadoop - HDFS, MR, Hive, Pig, Flume, Sqoop, Oozie, Java

Confidential

Documentum Architect and Technical Manager

Responsibilities:

  • Attend requirement workshops directly with the stakeholders along with the Bureau Veritas IT team to understand the requirements and provide a solution
  • Provide a roadmap for Documentum applicaton landscape
  • Provide solution to New applications, Upgrades and Migration projects
  • Monitor and control the progress of various applications with the help of leads.

We'd love your feedback!