Lead Big Data Consultant Resume
SUMMARY
- 14 + years of IT experience along with 4 years of HADOOP ecosystem including Spark/Scala
- Actively involved in the life cycle of project that includes implementing the Systems, Designing, Development, Testing and Documentation.
- Possess in depth knowledge in HADOOP/HDFS Architecture.
- Experienced Big Data developer with good knowledge of HDFS and Spark components such as Spark SQL, Spark Streaming, and related API’s
- Exposure to HDFS eco systems such as Pig, Hive, Impala, Sqoop, Yarn & Cloudera
- Exposure in developing Hive scripts and accessing them via user defined functions
- Expert in utilizing RDD’s, Data frames and Datasets to solve complex Spark solutions
- Experience in analyzing high volume streaming data sources such as Twitter, Kafka and AKKA streams.
- Experience in configuring and utilizing 5 node Kafka cluster followed by utilizing Kafka streams through Scala
- Sound knowledge in utilizing Amazon AWS S3 cloud cluster executing spark code to perform sales data analysis on the sales data using Spark through Scala
- Experience in using the no sql database: Cassandra
- Experience in using editors such as Eclipse, Scale IDE and Canopy to develop and debug Python based Big data solutions
- Possess minimal knowledge in job/workflow scheduling and monitoring using OOZIE
- Background with traditional databases such as Oracle, SQL Server, MySQL
- Have very good experience in developing web based applications using C#, HTML, CSS on .Net framework including SharePoint
TECHNICAL SKILLS
Big Data Eco systems: HDFS, MapReduce, Pig, Hive, Spark 2.0, Spark - SQL, Spark Streaming, Kafka, AKKA
Operating System: Windows 10, Knowledge in Linux
Programming Languages: Scala, Python, C#
Cluster/Cloud Systems: Amazon Web Services (AWS S3 Cloud)
No-SQL Database: Cassandra
PROFESSIONAL EXPERIENCE
Confidential
Lead Big Data consultant
Responsibilities:
- Responsible for building scalable distributed data solutions using Spark and Hadoop Ecosystem
- Responsible for design & development of spark SQL using Scala based on functional specs.
- Preformed real-time analysis of the incoming data using Kafka consumer API, Kafka topics, spark structured streaming.
- Monitoring Kafka Data pipeline using Chronograph along with Influx DB and Influx sink connector
- Using spark structure streaming to consume the data from Topics.
- Water Marking for Session state and data lag to reduce the loss of data.
Technologies: HDFS, Scala, Spark structured streaming, Spark-SQL, Apache Kafka, Hive, Kibana and Shell Script.
Confidential
Big Data Consultant
Responsibilities:
- Interact with business users; finalize scope, requirements and release phase schedule.
- Custom producer is built for various online data sources
- Set up and configure a 5 node Kafka cluster and monitor the broker activities
- Monitoring Kafka Metrics at different levels such as broker, producer and consumer.
- Configure Producers, set partitions and corresponding replication to handle effective fault tolerance
- Using Spark stream as Consumer to store in Data lakes like S3 Storage, HDFS and MySQL with Dynamic partitions
- Aggregate, analyze and assess the input data to perform effective RDD transformation and action
- Implement the best spark practices such as persist(), one time collect(), etc., to have effective performance
- Effective utilization of Data Frame and Dataset API’s of Spark 2.0 and save data into Cassandra
- Applied catalyst optimize both cost and rule base using query.
- Building Data pipeline and involved in Postgres
Technologies: Spark Streaming, Scala IDE, Kafka, Cassandra
Confidential
Big Data Consultant
Responsibilities:
- Involve actively in setting up a Kerberised cluster followed by building a HADOOP ecosystem in compliance with best practices
- Perform data export from SQL Server database into HDFS through SQOOP commands.
- Perform proof of concepts in identifying the effective usage of Map Reduce, Pig and Hive.
- Create Hive Internal tables with appropriate partitions and buckets
- Develop user-defined functions using eclipse to access scripts developed using Hive and monitor performance
- Perform proof of concepts in storing the analytics data into a no sql database. Eventually, came up with Cassandra.
- Perform overall research, design and development to take Big Data technologies deeper into the organization.
Technologies: Spark Streaming, Scala IDE, AKKA, Kafka, Map Reduce, Pig, Hive, Cassandra
Confidential, Mt Laurel, NJ
Big Data/SharePoint Analyst
Responsibilities:
- Performed proof of concepts in utilizing Amazon’s AWS environment to start up big data analytics
- Responsible for migrating the processed form data into HDFS using SQOOP.
- Perform Kerberos authentication to access HDFS.
- Effectively use Map Reduce algorithms to analyze data and generate sales reports
- Interact with the business analysts, design flowcharts covering the entire flow cycle of the form, create user groups, set user roles and assign corresponding permission levels.
- Design request forms using InfoPath 2010 based on the requirement flow charts.
- Design and Develop Workflows to automate the business process using Nintex 2010.
Technologies: Map Reduce, Flume, Kerberos, SharePoint 2010, InfoPath 2010, Nintex Workflows 2010
Confidential, Port Washington, NY
SharePoint Developer
Responsibilities:
- Gathering Requirements from Client and feasible study on requirements.
- Designing the SharePoint InfoPath forms.
- Design and maintaining the Database Tables and database Objects.
- Used Web Service to submit and receive the data from database.
- Visual Studio Tools for Application (VSTA) is used to write C# code at behind InfoPath forms.
- Handling Events, setting default values and retrieving the values are done in object Models.
- Adapted Scrum Agile Methodology for Project management.
- Used Test Driven Development method on testing InfoPath forms.
- Used Rhino Mock unit testing.
- Developing Reports in Sql Server Reporting Service and deployed in SharePoint Portal.
- Responsible for Creating Report models in SSRS.
- Created Triggers, Views and Procedures in Sql Server 2005/2008.
- Worked on various ASP .NET/ ASP .Net MVC websites
- Extensive use of Repeating Tables and Repeating section Tools of InfoPath forms.
Environment: MOSS 2007 and 2010, WSS 3.0, SharePoint Designer 2007, InfoPath forms and Services, .Net 3.5.
Confidential, Warren, NJ
SharePoint Developer
Responsibilities:
- Analyzed the existing development environment and guided them to set up SharePoint
- Installed and Configured MOSS 2007 on development, QA and Production environments along with DR server with best practices in place
- Integrating Sql Server Reporting Services with Microsoft office SharePoint 2007.
- Installed and Configured Active Directory for adding users and roles.
- Adapted Scrum Agile Methodology for Project management.
- Implemented Document library organization, its structures and uses
- Generate the report in different format e.g. PDF, excel in SharePoint
- SharePoint Object Model.
- Creating KPI based on excel data.
- Implemented MVC as GUI framework MFC called document/view architecture.
- Schedule the reports on monthly and weekly basis in Share Point.
- Integration of Excel Services and displaying the charts in portal.
- Configured Out of Box XML, Form and Dataview WebParts.
- Assign Roles and Permission to sites, lists and list items.
- Created Features for new Content Type and Site Columns, and incorporated them into a site definition
- Created Custom Site Definition using Feature Stapling.
- Created Features for new Content Type and Site Columns, and incorporated them into a site definition
- Developed Custom Themes.CSS and deployed in SharePoint
- Involved in customizing and branding Web content Management.
- Established SharePoint standards and best practices for SharePoint application development and developed a SharePoint design and development review check.
Environment: MS Office SharePoint Server 2007, Visual Studio 2008 with C#, SQL Server Reporting Services, Excel Services, Active Directory, C#. Net, ASP. Net 3.5, VM Ware Work station, MS-InfoPath 2007,Nintex Workflow, Visual SourceSafe 2005, ADO, ADO.NET 2.0, IIS 6.0, SQL Server 2005, HTML, DHTML, XML,Windows Server 2003.
Confidential, Warren, NJ
SharePoint Developer
Responsibilities:
- Established SharePoint standards and best practices for SharePoint application development and developed a SharePoint design and development review check lists.
- Preparing approach document.
- Development and Unit Testing
- Created SharePoint Web Parts using Visual Studio 2005 with Microsoft. SharePoint and Microsoft.SharePoint.Webpartpages object model
- Configured Out of Box Page viewer Webparts to display linked contents.
- SharePoint Object Model
- Branding of Sites
- SharePoint Designer Workflow Development
- SharePoint Custom Workflow Development
- Created Features for new Content Type and Site Columns, and incorporated them into a site definition.
- Implemented MVC architectural pattern.
- Developed a custom SharePoint application template
- Used U2UCAML query builder for writing CAML queries.
- Created User Controls to be used in custom applications pages.
- Backup and restore of Site Collection and Sites.
- Self -Service Site Creation from the Configure Self-Service Site Creation page for the virtual server.
- Designed and created new workflows to automate business processes related to several different types of corporate documents using SharePoint Designer 2007.
- Published InfoPath forms directly to a forms library in a SharePoint portal
- Used audience targeting.
- Performed application tuning including monitoring, troubleshooting and optimizing performance
- Post-Release maintenance, bug fixing and adding new features, based on user requests.
Environment: Microsoft Office SharePoint Server 2007, Visual Studio 2008 with C#, Workflows, STSADM, C#. Net, ASP. Net 3.5, VM Ware Work station, MS-InfoPath 2007, Visual SourceSafe 2005, ADO, ADO.NET 2.0, IIS 6.0, SQL Server 2005, CSS
Confidential
WCF Developer
Responsibilities:
- Design and development of WCF services.
- Development of COM interop from VB to WCF services.
- Responsible for integrating all Services.
- Development of new enhancements in Landscape VB Application
Environment: C#, WCF, COM Interop, VB, SQL Server 2005