Big Data Architect and Lead Resume Houston, TX - Hire IT People

SUMMARY:

Accomplished technology professional with over 16+ years of experience in IT industry, in the areas of Architecture, Solutioning, Delivery Excellence - Project Management and Centre of Excellence
In-depth and hands-on experience across a variety of Technology platforms viz. Big data and Hadoop, In-memory data grid, Enterprise Content Management and Business Process Management
Big Data Architect and part of the Confidential BigData team with key focus on devising Big Data Architecture & Solutions across the Big Data Application Layer (Real-time - Streaming, In-memory data grid) and Data Layer (Hadoop, NOSQL, MPP)
Experience understanding and analyzing the requirements, providing solution, technical architecture, design of application, development, testing and deployment of the solution using Big Data Technology stack
Cloudera Certified Developer for Apache Hadoop (CCDH V5) with excellent experience on Hadoop Ecosystem (HDFS, YARN, MapReduce, Spark, HBase, HIVE, PIG, Sqoop, Oozie, Flume and Zookeeper), No SQL (HBase), Java and Scala
Extensive development experience and hands on experience on Hadoop ecosystem --developing ingestion strategies and build transformations using Spark, PIG, Hive and MR jobs
Recent experience has been on Apache Spark 1.2.0, Spark SQL, Hive and HBase
Good understanding of AWS components and its overall architecture with some prototyping experience on the same

TECHNICAL SKILLS:

Roles Played: Bigdata Architect, Documentum Architect, Technical Consulting, Technical Lead, Project Manager, Practice Lead, COE Lead

Hadoop Ecosystem: HDFS, Yarn, Spark, MR, Hive, HBase, Pig, Kafka, Oozie, Flume, Sqoop, Storm,In-Memory

Data Grid: GemfireCloud: AWS - EC2, EMR, Redshift, S3, etc Java, Scala, Python, Shell scripts

Documentum: xCP, D2, Webtop, DCM

ECM: OpenText Livelink, Lotus Notes, Core Java, Spring, HTML, JavaScript, XML, JSON

PROFESSIONAL EXPERIENCE:

Confidential, Houston, TX

Big Data Architect and Lead

Responsibilities:

Attend client workshops and understand the requirements along with Engineering and Data Science team to design a solution
Work with the client infrastructure and admin teams to design and build cluster needed
Involved in design and implement Data ingestion strategy using Flume and Sqoop
Design considerations for file format and implemented parquet format
Designed and implemented the parquet processing using Spark SQL and Dataframes
Designed and implemented data validation, checks and transformations using Spark jobs
Designed and implemented Operational store that stores operational and error information about jobs, data, processing, etc
Worked with the data science team to build queries on Hive and identify transformations to be applied
Design the schema for the target HBase schema
Design and implement the as-is data and transformed data to push into HBase
Verify that the data is pushed correctly in HBase

Environment: -Hortonworks distribution 2.3, Spark, Hive, HBase, Phoenix, Oozie, Sqoop, Flume, Parquet Format, Java, Scala

Confidential, NYC, NY

Big Data Architect and Lead

Responsibilities:

Attend client workshops to understand the business requirements
Design a solution for the requirements
Work with the client design team to develop the design
Technical guidance to the development team
Ensure that the developed design and code meet the standards and client requirements

Environment: -Hortonworks distribution 2.3, Gemfire 8.1, HBase, Phoenix, Spring Data, Java

Confidential, NYC, NY

Big Data Architect

Responsibilities:

Attend client workshops to understand the requirements
Work with the IBM CDC team on the dynamics of the full/delta data to be ingested to Hadoop
Design a Framework which is scalable and cater to multiple sources in time.
Design schema validation and also checks for data checks via MapReduce jobs
Design a strategy for data archival and data lineage
Design handling of full refresh and delta changes on an hourly basis
Design a daily reconciliation process to handle the changes
Innovatively designed the operational data model to store the job, task and error details.
Technical guidance to the development team
Ensure that the developed design and code meet the standards and client requirements

Environment: -Hortonworks distribution 2.0, Hadoop - HDFS, MR, Hive, Pig, Flume, Sqoop, Oozie, Java

Confidential, Atlanta, GA

Big Data Architect

Responsibilities:

Led the Architecture workshops with the customer team to understand the requirements and put up the Bigdata Component Stack and Design.
Deliver High Level proposed Architecture stack
Led the low level design and execution of the overall engagement.

Environment: -Cloudera distribution 5.3, Kafka, Spark 1.2, HBase, Hadoop, Java

Confidential

Big Data Architect

Responsibilities:

Design and develop various PoC’s - batch and realtime processing
Part of the R&D team to explore different tools / products in Big Data space
Involved to fix issues on projects that had technical issues
Work on RFP’s with the pre-sales team to provide solutions to the Big Data proposals
Perform customer visits and do a pre-sales on Big Data

Confidential, Atlanta, GA

Big Data Architect

Responsibilities:

Design a new generation low latency high throughput In-memory Grid for SWA’s Cargo unit.
Model the NoSQL Data Grid as required by the access patterns as well as transactions requirements.
Design and configure the various regions to store the data
Configure the regions in replicated and partitioned based on the use cases
Design and configure different eviction conditions and for the data to persist on disk for overflow cases
Configure various regions for indexing that is built in

Environment: -Pivotal Gemfire 7.0.2

Confidential, Atlanta, GA

Big Data Architect

Responsibilities:

Understand the requirements
Worked with Senior Architect to create ingestion strategy from various sources into Hadoop.
Guide the team during project development
Ensure the design and developed code meet the requirements and standards

Environment: -Cloudera distribution 4, Hadoop - HDFS, MR, Hive, Pig, Flume, Sqoop, Oozie, Java

Confidential

Documentum Architect and Technical Manager

Responsibilities:

Attend requirement workshops directly with the stakeholders along with the Bureau Veritas IT team to understand the requirements and provide a solution
Provide a roadmap for Documentum applicaton landscape
Provide solution to New applications, Upgrades and Migration projects
Monitor and control the progress of various applications with the help of leads.

We provide IT Staff Augmentation Services!

Big Data Architect And Lead Resume

Houston, TX

We'd love your feedback!

Resume Categories

Client Services

Job Seekers

Visa Sponsorship