We provide IT Staff Augmentation Services!

Lead Big Data Consultant Resume

2.00/5 (Submit Your Rating)

SUMMARY

  • 14 + years of IT experience along with 4 years of HADOOP ecosystem including Spark/Scala
  • Actively involved in the life cycle of project that includes implementing the Systems, Designing, Development, Testing and Documentation.
  • Possess in depth knowledge in HADOOP/HDFS Architecture.
  • Experienced Big Data developer with good knowledge of HDFS and Spark components such as Spark SQL, Spark Streaming, and related API’s
  • Exposure to HDFS eco systems such as Pig, Hive, Impala, Sqoop, Yarn & Cloudera
  • Exposure in developing Hive scripts and accessing them via user defined functions
  • Expert in utilizing RDD’s, Data frames and Datasets to solve complex Spark solutions
  • Experience in analyzing high volume streaming data sources such as Twitter, Kafka and AKKA streams.
  • Experience in configuring and utilizing 5 node Kafka cluster followed by utilizing Kafka streams through Scala
  • Sound knowledge in utilizing Amazon AWS S3 cloud cluster executing spark code to perform sales data analysis on the sales data using Spark through Scala
  • Experience in using the no sql database: Cassandra
  • Experience in using editors such as Eclipse, Scale IDE and Canopy to develop and debug Python based Big data solutions
  • Possess minimal knowledge in job/workflow scheduling and monitoring using OOZIE
  • Background with traditional databases such as Oracle, SQL Server, MySQL
  • Have very good experience in developing web based applications using C#, HTML, CSS on .Net framework including SharePoint

TECHNICAL SKILLS

Big Data Eco systems: HDFS, MapReduce, Pig, Hive, Spark 2.0, Spark - SQL, Spark Streaming, Kafka, AKKA

Operating System: Windows 10, Knowledge in Linux

Programming Languages: Scala, Python, C#

Cluster/Cloud Systems: Amazon Web Services (AWS S3 Cloud)

No-SQL Database: Cassandra

PROFESSIONAL EXPERIENCE

Confidential

Lead Big Data consultant

Responsibilities:

  • Responsible for building scalable distributed data solutions using Spark and Hadoop Ecosystem
  • Responsible for design & development of spark SQL using Scala based on functional specs.
  • Preformed real-time analysis of the incoming data using Kafka consumer API, Kafka topics, spark structured streaming.
  • Monitoring Kafka Data pipeline using Chronograph along with Influx DB and Influx sink connector
  • Using spark structure streaming to consume the data from Topics.
  • Water Marking for Session state and data lag to reduce the loss of data.

Technologies: HDFS, Scala, Spark structured streaming, Spark-SQL, Apache Kafka, Hive, Kibana and Shell Script.

Confidential

Big Data Consultant

Responsibilities:

  • Interact with business users; finalize scope, requirements and release phase schedule.
  • Custom producer is built for various online data sources
  • Set up and configure a 5 node Kafka cluster and monitor the broker activities
  • Monitoring Kafka Metrics at different levels such as broker, producer and consumer.
  • Configure Producers, set partitions and corresponding replication to handle effective fault tolerance
  • Using Spark stream as Consumer to store in Data lakes like S3 Storage, HDFS and MySQL with Dynamic partitions
  • Aggregate, analyze and assess the input data to perform effective RDD transformation and action
  • Implement the best spark practices such as persist(), one time collect(), etc., to have effective performance
  • Effective utilization of Data Frame and Dataset API’s of Spark 2.0 and save data into Cassandra
  • Applied catalyst optimize both cost and rule base using query.
  • Building Data pipeline and involved in Postgres

Technologies: Spark Streaming, Scala IDE, Kafka, Cassandra

Confidential

Big Data Consultant

Responsibilities:

  • Involve actively in setting up a Kerberised cluster followed by building a HADOOP ecosystem in compliance with best practices
  • Perform data export from SQL Server database into HDFS through SQOOP commands.
  • Perform proof of concepts in identifying the effective usage of Map Reduce, Pig and Hive.
  • Create Hive Internal tables with appropriate partitions and buckets
  • Develop user-defined functions using eclipse to access scripts developed using Hive and monitor performance
  • Perform proof of concepts in storing the analytics data into a no sql database. Eventually, came up with Cassandra.
  • Perform overall research, design and development to take Big Data technologies deeper into the organization.

Technologies: Spark Streaming, Scala IDE, AKKA, Kafka, Map Reduce, Pig, Hive, Cassandra

Confidential, Mt Laurel, NJ

Big Data/SharePoint Analyst

Responsibilities:

  • Performed proof of concepts in utilizing Amazon’s AWS environment to start up big data analytics
  • Responsible for migrating the processed form data into HDFS using SQOOP.
  • Perform Kerberos authentication to access HDFS.
  • Effectively use Map Reduce algorithms to analyze data and generate sales reports
  • Interact with the business analysts, design flowcharts covering the entire flow cycle of the form, create user groups, set user roles and assign corresponding permission levels.
  • Design request forms using InfoPath 2010 based on the requirement flow charts.
  • Design and Develop Workflows to automate the business process using Nintex 2010.

Technologies: Map Reduce, Flume, Kerberos, SharePoint 2010, InfoPath 2010, Nintex Workflows 2010

Confidential, Port Washington, NY

SharePoint Developer

Responsibilities:

  • Gathering Requirements from Client and feasible study on requirements.
  • Designing the SharePoint InfoPath forms.
  • Design and maintaining the Database Tables and database Objects.
  • Used Web Service to submit and receive the data from database.
  • Visual Studio Tools for Application (VSTA) is used to write C# code at behind InfoPath forms.
  • Handling Events, setting default values and retrieving the values are done in object Models.
  • Adapted Scrum Agile Methodology for Project management.
  • Used Test Driven Development method on testing InfoPath forms.
  • Used Rhino Mock unit testing.
  • Developing Reports in Sql Server Reporting Service and deployed in SharePoint Portal.
  • Responsible for Creating Report models in SSRS.
  • Created Triggers, Views and Procedures in Sql Server 2005/2008.
  • Worked on various ASP .NET/ ASP .Net MVC websites
  • Extensive use of Repeating Tables and Repeating section Tools of InfoPath forms.

Environment: MOSS 2007 and 2010, WSS 3.0, SharePoint Designer 2007, InfoPath forms and Services, .Net 3.5.

Confidential, Warren, NJ

SharePoint Developer

Responsibilities:

  • Analyzed the existing development environment and guided them to set up SharePoint
  • Installed and Configured MOSS 2007 on development, QA and Production environments along with DR server with best practices in place
  • Integrating Sql Server Reporting Services with Microsoft office SharePoint 2007.
  • Installed and Configured Active Directory for adding users and roles.
  • Adapted Scrum Agile Methodology for Project management.
  • Implemented Document library organization, its structures and uses
  • Generate the report in different format e.g. PDF, excel in SharePoint
  • SharePoint Object Model.
  • Creating KPI based on excel data.
  • Implemented MVC as GUI framework MFC called document/view architecture.
  • Schedule the reports on monthly and weekly basis in Share Point.
  • Integration of Excel Services and displaying the charts in portal.
  • Configured Out of Box XML, Form and Dataview WebParts.
  • Assign Roles and Permission to sites, lists and list items.
  • Created Features for new Content Type and Site Columns, and incorporated them into a site definition
  • Created Custom Site Definition using Feature Stapling.
  • Created Features for new Content Type and Site Columns, and incorporated them into a site definition
  • Developed Custom Themes.CSS and deployed in SharePoint
  • Involved in customizing and branding Web content Management.
  • Established SharePoint standards and best practices for SharePoint application development and developed a SharePoint design and development review check.

Environment: MS Office SharePoint Server 2007, Visual Studio 2008 with C#, SQL Server Reporting Services, Excel Services, Active Directory, C#. Net, ASP. Net 3.5, VM Ware Work station, MS-InfoPath 2007,Nintex Workflow, Visual SourceSafe 2005, ADO, ADO.NET 2.0, IIS 6.0, SQL Server 2005, HTML, DHTML, XML,Windows Server 2003.

Confidential, Warren, NJ

SharePoint Developer

Responsibilities:

  • Established SharePoint standards and best practices for SharePoint application development and developed a SharePoint design and development review check lists.
  • Preparing approach document.
  • Development and Unit Testing
  • Created SharePoint Web Parts using Visual Studio 2005 with Microsoft. SharePoint and Microsoft.SharePoint.Webpartpages object model
  • Configured Out of Box Page viewer Webparts to display linked contents.
  • SharePoint Object Model
  • Branding of Sites
  • SharePoint Designer Workflow Development
  • SharePoint Custom Workflow Development
  • Created Features for new Content Type and Site Columns, and incorporated them into a site definition.
  • Implemented MVC architectural pattern.
  • Developed a custom SharePoint application template
  • Used U2UCAML query builder for writing CAML queries.
  • Created User Controls to be used in custom applications pages.
  • Backup and restore of Site Collection and Sites.
  • Self -Service Site Creation from the Configure Self-Service Site Creation page for the virtual server.
  • Designed and created new workflows to automate business processes related to several different types of corporate documents using SharePoint Designer 2007.
  • Published InfoPath forms directly to a forms library in a SharePoint portal
  • Used audience targeting.
  • Performed application tuning including monitoring, troubleshooting and optimizing performance
  • Post-Release maintenance, bug fixing and adding new features, based on user requests.

Environment: Microsoft Office SharePoint Server 2007, Visual Studio 2008 with C#, Workflows, STSADM, C#. Net, ASP. Net 3.5, VM Ware Work station, MS-InfoPath 2007, Visual SourceSafe 2005, ADO, ADO.NET 2.0, IIS 6.0, SQL Server 2005, CSS

Confidential

WCF Developer

Responsibilities:

  • Design and development of WCF services.
  • Development of COM interop from VB to WCF services.
  • Responsible for integrating all Services.
  • Development of new enhancements in Landscape VB Application

Environment: C#, WCF, COM Interop, VB, SQL Server 2005

We'd love your feedback!