Big Data & Analytic Architect / Team Leader Resume
SUMMARY
- Design and development of relational (such as Microsoft SQL Server, Oracle) and NoSQL (such as Cassandra, HBase) databases exploiting and enhancing the Enterprise data models.
- Database and data warehouse design, and migration from relational databases to NoSQL databases and the Hadoop (HDFS) ecosystem.
- Working closely with data science and data analysis teams on data discovery, data mining and machine learning projects.
- Choosing and optimizing file formats and proper compression types in Hadoop.
- Message broker implementations (with Kafka) with customized consumer development for HDFS, HBase and Cassandra ingestion, as well as streaming/messaging solutions with Storm and Spark Streaming.
- Spark based big data development with Scala and related technologies like Spark SQL, Spark Streaming.
- HBase and Cassandra schema design and physical implementation by dimensioning data with fact tables and time series, in both batch and real-time models.
- Hands-on end to end Lambda architecture design and implementation of both batch and stream data processing.
- Hands-on Kappa architecture design and implementation experience.
- Facilitate scrum ceremonies (grooming, sprint planning, retrospectives, daily stand-ups, etc.)
- Coordinate and manage code releases.
- Communicate with other managers, architects, scrum masters, developers, product managers and support specialists on product issues.
TECHNICAL SKILLS
Languages: Scala, Java, SQL, C#
Unit Testing: JUnit, ScalaTest, NUnit, Microsoft Unit Testing Framework, MRUnit
Databases: Microsoft SQL Server, Oracle, MySQL, Cassandra, HBase, Redis, MemSQL, VoltDB, Crate.IO
RDBMS: Microsoft SSIS/SSRS, Oracle BI/OBIEE, MicroStrategy, Redshift with BI integration, Hive (with MR and Tez), Drill
Cluster Computing: MapReduce jobs (Java based), Sqoop, HBase, Oozie, Flume, YARN, Spark Core (Scala based), Spark Streaming (Scala based), Storm, Alluxio (formerly Tachyon), Ignite, Tez, Drill, Hive LLAP
Message Brokers: Kafka, RabbitMQ, 0MQ, MSMQ, Kinesis
Version/Build/Deployment: Git, SVN, Maven, sbt, Gradle, Jenkins
PROFESSIONAL EXPERIENCE
Big Data & Analytic Architect / Team Leader
Confidential
Responsibilities:
- Multiple data lake and data warehouse solutions in the big data domain.
- Creating, running and monitoring teams for big data and data science activities.
- Driving and helping decision making across the teams.
- Creating and implementing a training model for developers, analysts and business stakeholders covering the big data domain and data solutions.
- Responsible for hands-on activities to create POCs, POTs and prototypes.
- Managing the life-cycle of code development, from brainstorming to sprints to deployment.
Big Data Architect & Developer
Confidential
Responsibilities:
- Big Data application architecture and development for both batch and fast data processing projects with Kafka, Spark Core, Spark Streaming, Elasticsearch, Spark SQL and HBase, as well as Flume and Spark based data ingestion and customized ETL/ELT pipelines on HDFS.
- Oozie based customized data ingestion orchestration.
- Also responsible for architecture and development of Tachyon and Ignite based in-memory data service layers.
- Hands-on experience on VoltDB, MemSQL, Crate.IO with Spark integration.
Big Data Architect & Developer SystemCore
Confidential
Responsibilities:
- Data ingestion solutions from relational, file, log or streaming based sources to Hadoop ecosystem.
- MapReduce development by Java coding.
- Cassandra & Hadoop and Cassandra & Spark integrations.
- Customized Kafka implementations.
- Java and C# based SOAP and RESTful web service development.
- Database design on relational database engines, with normalization/denormalization for ETL and DW operations.
- Message broker layer implementation.
Senior .NET & Big Data Developer
Confidential, Belmont, CA
Responsibilities:
- Implemented a new data flow from RabbitMQ to Cassandra/MS SQL Server using C#.
- Built new data API and service engines to expose data to mixed client apps.
- Created MapReduce jobs against a Hadoop cluster using Java and Hive as a POC app.
- Re-engineered the existing MS SQL Server database objects (stored procedures and functions) to increase performance in the BI and app layers.
Architect & Sr. Developer
Confidential, Mill Valley, CA
Responsibilities:
- Working with stakeholders to create logical layers for internal applications requested by business.
- Creating use cases with analysts and main stakeholders.
- Coordinating with the analyst team and project manager to translate business requirements into achievable goals through actionable and governable roadmaps and designs.
- Planning the tiers according to logical layers.
- Coding tiers in compliance with corresponding logical layers.
- Designing and coding relational database layers and objects.
- Creating MS SQL Server SSIS packages with C# coding to meet data transfer requirements.
- Developing finance and mortgage applications with data modeling depending on industry requirements.
- Designed and wrote a file transfer application using SFTP (SSH File Transfer Protocol).
- Creating variable report files according to business requirements.
- Designed and created an enterprise-level error handling and monitoring application.
Architect & Sr. Developer
Confidential
Responsibilities:
- Served as senior developer, team leader and architect on multiple projects, including web-based B2B and B2C e-commerce, desktop, service and n-tier projects.
- Completed several projects in the SaaS, cloud computing and SOA spaces that provide BI solutions in the cloud (as SaaS).
- Used Visual Basic 6, Java, ASP, ASP.NET, C# and JavaScript as programming languages/frameworks and MS Access, MS SQL Server, Oracle, MySQL, Redis, Memcached, Cassandra etc. as databases.