We provide IT Staff Augmentation Services!

Big Data Architect Resume

5.00/5 (Submit Your Rating)

Cincinnati, OH

SUMMARY

  • About 15 years of experience in various roles as Big Data Architect/Data Engineer, Senior Developer and Onsite Coordinator. Handled various Web Applications (Java/J2EE/Spring MVC/AngularJS/ASP.NET/SharePoint 2007/2010/2013 ) based collaborative and publishing environment in Retail, HealthCare, Insurance and Networking.
  • Versatile team player with excellent communication and interpersonal skills.
  • Extensively Worked on Hadoop BigData Platform (Apache Spark, Hive, Pig Script etc), J2EE, Java, ASP.NET, C#, Office 365 and Microsoft Office SharePoint 2007/2010/2013 with Onsite/Offshore Model.

TECHNICAL SKILLS

Skill Sets: HTML5, CSS3, Javascript, ASP.NET 2.0, C#, Microsoft Team Foundation Server(TFS) 2008/2010, Microsoft Office SharePoint Server(MOSS) 2007/2010/2013 , WSS 3.0, SharePoint Designer 2007/2010/2013 , Silverlight 4, BizTalk 2006, Oracle 9i/10g, SQL Server 2005/2008/2012 , MySQL, DB2, IIS 5.1/6.0/7.0, COBOL, AJAX, HTML, Classic ASP, Windows Communication Foundation (WCF), Windows Workflow Foundation (WWF), Microsoft Office InfoPath Forms 2007, NUnit, Collaborative Application Markup Language (CAML), Telerik, Metalogix(Axceler), K2, AvePoint, Nintex and Bamboo Solutions, PowerShell scripting, Language - Integrated Query (LINQ), TOAD, Apache Lucene, SAP HANA, JQuery, AngularJS, NodeJS.

Big Data Skills: Azure HD Insight, Cloudera, AWS EMR Platform & S3, Cloud Oracle Exadata Services, HortonWorks, Databricks, Alteryx, Hive, MapReduce, Ambari, YARN, Apache Spark with Python & Scala, Clojure (Functional Programming) Talend 6, R Studio, H2O, Oozie, Flume, HDFS, Avro File formats, SQOOP, Hbase, HCatalog, Scala, R Language, Cassandra, Azure Eventhub, Azure Analytics, Azure DataLake, Apache Kafka, SAS with Hadoop and Pig Scripts.

Machine Learning & Deep Learning Skills: TensorFlow with Python, MicrosoftML and Spark ML

Data Analytics & Modeling skills: SAS, SPSS, Alteryx.

WebServer: Apache Tomcat 7/8 & Websphere Application Server 8.5.

Framework: J2EE, .NET 2.0/3.5/4.0

Tools: Ecllipse, Visual Source Safe 2005, Spyder, R Studio, SubVersion (SVN), Visual Studio Team Foundation 2005/2008/2010 , VISIO, Microsoft Enterprise 2007/2003, Office Communication Server, CLARITY, REMEDY,Git.

PROFESSIONAL EXPERIENCE

Confidential, Cincinnati, OH

Big Data Architect

Environment: Cloudera, AWS, Linux 12 SUSE, Redhat Linux Server 7.2, Windows Server 2012/2016, Apache Tomcat 7/8, MongoDB, Microsoft Azure, Cloudera, HDInsight in MS Azure, Azure Eventhub, Azure Analytics, SAS on Hadoop, ElasticSearch, Angular2, J2EE with Spring MVC framework, Maven, H2O, MlLib, Shiny for R Studio and HighCharts.

Responsibilities:

  • Overall Solution Architect activities that involves:
  • High Level Technical architecture during design phase and mentor the Leads to perform low level designs.
  • Interact with P&G sales team and perform end-to-end solution design on Big Data Environment.
  • Monitor development guidelines process during development and fix any issues that arise.
  • Release management activities using Versioning tools (Git, SVN, TFS etc) to maintain healthier environment.
  • As part of Solution Architecture Group, implement TOGAF architecture in process for various data science projects.
  • Data Modeling and building DataMarts using Hive & Spark Jobs.
  • Models Implemented - Data Scientist - Initially developed in R Studio and migrated to Spark:
  • Logistic Regression - To find selective demographic features for Product Category based on sales data.
  • ARIMA (AutoRegressive Integrated Moving Average) for predictive analytics on Time Series data.
  • GLMNet (Lasso Regression) to find lift in sales based on promotions.
  • Random Forest on POS data to find growing & potential sales of UPCs.
  • Image recognition models using TensorFlow to identify products in shelf.
  • Used Clojure API for Machine Learning Models.
  • Hands on development of several Apache Spark Jobs using Scala, Java & Python.
  • Hands On with Impala & Hive queries to perform daily tasks.
  • Experience in creating and implementing data architecture standards for master data management, data quality, data dictionary, meta data and data security.
  • Performed data ingestion using AWS Cloud (EC2, S3, Lambda, EMR, RDS, Redshift).
  • Implementation of Client Side Encryption for AWS S3 file handling.
  • Integration of Amazon S3 and AWS Lambda to invoke Lambda functions using S3 bucket notification for Image Processing based on targeted device.
  • Created execution roles for AWS Lambda function.
  • Scheduled jobs using AWS Batch for Deep Learning Training (Image Processing).
  • Creation of Docker Images to install and configure Kafka Cluster.
  • Experience in writing Unix Shell scripting for automated environment setup configuration.
  • Experience in using Avro, ORC and Parquet File formats in Spark jobs and identification of appropriate file formats based on Business Requirements.
  • Procurement and configuration of AWS EMR environment with necessary Hadoop components (Apache Spark, MapReduce, YARN, HBase, HDFS & Pig Script) based on Business Requirements.
  • Procured Big Data on AWS EC2 and configured Hadoop components based on Business requirements using Ambari.
  • Developed Stream Analytics using Kafka for realtime anomaly detection using Kafka pipeline and estabilished Kafka Mirrormaker.
  • Created Data Pipeline with the Kafka Connect API .
  • Integration with On-Prem storage system to migrate data into Hadoop platform and processing with Spark.
  • Apache Spark integrated with Google Tesseract to perform OCR on PDF forms using Scala. Later enhanced with Spark Streaming.
  • Performed ETL operation of ingestion of Data into HFDS using Alteryx.
  • Development of Talend data intake, cleansing and transformation process in a Cloudera DataLake framework
  • Using Talend performed:
  • File management: open, move, compress, decompress without scripting
  • Control and orchestrate data flows and data integrations with master jobs
  • Map, aggregate, sort, enrich, and merge data
  • Implemented Machine learning using H2O integrated with R Server and rendered JSON output for HighChart visualization.
  • Performed text search with Elasticsearchusing JSON and Restful API on TeraByte of POS data to perform propensity model.
  • Data Ingestion into HBase from on-prem File System (FTP, Network Drive etc) using Flume, Apache Spark and MapReduce.
  • Maintenance of Azure Data Factory using activities and pipelines.

Confidential, Sunnyvale, CA

Technical Lead

Environment: J2EE, Java, Spring MVC, Hibernate, Apache Tomcat 6, ASP .Net, C#, WCF, WWF, Microsoft Office SharePoint 2010, SQL Server 2005/2008, SQL Server Reporting Services (SSRS) 2008, Oracle 10g, MS-ILM2, Win Server 2003, IIS 6.0/7.0, AJAX, AjaxControlToolKit, Team Foundation Server 2010, NUnit, LINQ and Silverlight 4.

Responsibilities:

  • Typical development and enhancement of J2EE portals that involve development and deployment.
  • Gathering requirements from Business users and training them on SharePoint Web Applications usability.
  • Development of complex InfoPath forms for data entry screens.
  • Development of Customized Windows Workflow Foundation (WWF) based Document Approval Workflows.
  • Automated the migration of Excel Documents to SharePoint Lists based data.
  • Development of Reports with SharePoint Lists backend using Report Builder and SSRS.
  • Using BCS (Business Connectivity services) for integration with Oracle and REMEDY.
  • Developed SilverLight application for Issue Tracker and Fast Pass Request.
  • Developed components for Document Conversion to PDF using Document Conversion Services.
  • Automated the Weekly Newsletter and Notification process by generating HTML encoded emails and sending to various departments users. Also created WebPart to display them in Sites Homepage.
  • Implementation of Blog and Social Networking sites using Newsgator.
  • Implemented LINQ for querying data from SharePoint Lists.
  • Customization of Search Results to display them in rich UI using Custom XSLT.
  • Implementation of Faceted Search using Fast Search using Custom Managed Metadata mapping with Remedy.
  • Creation and Deployment of WSPs using PowerShell scripts.
  • Troubleshooting of various environment based issues with SharePoint.
  • Overall Project maintenance and execution using Team Foundation Server (TFS).
  • Single Sign on integration with REMEDY using Secure Store Services.
  • Implementation of High Availability of SQL Server using Database Mirroring.
  • Worked with DocAve tool on scheduling backup and granular restoration.
  • User profile extension with Active Directory field mapping and profile sync scheduling.
  • Implementation of Code Signing certificate for InfoPath forms.

Confidential, Philadelphia, PA

SharePoint Architect

Environment: ASP .Net, C#, WCF, WWF, Microsoft Office SharePoint 2007/2010, SQL Server 2005/2008, Oracle 10g, MS-ILM2, Win Server 2003, IIS 6.0/7.0, AJAX, AjaxControlToolKit, NUnit, WebTrends, Google Analytics, Team Foundation Server 2010 and Silverlight 4.

Responsibilities:

  • Gathering requirements from Business users and training them on SharePoint Web Applications usability.
  • Coordination with Offshore team on daily basis and providing technical advice on any development issues.
  • Setting up environment for SharePoint 2010.
  • Migration of SharePoint 2003 and 2007 sites into SharePoint 2010.
  • Technical design and coding of various complex webparts.
  • Integration with Project Server and Excel Services.
  • Using BCS (Business Connectivity services)for integration with Oracle and Documentum.
  • XSLT customization for Data View webparts.
  • JQuery based implementation for UI rich screens.
  • KPI usage with Gantt chart preparation for FootPrints application tickets.
  • Code Versioning and Task allocation with Team Foundation Server (TFS).

Confidential, Raritan, NJ

Technical Lead/Architect

Environment: ASP .Net, C#, WCF, WWF, Microsoft Office SharePoint 2007/2010, SQL Server 2005/2008, SQL Server Reporting Services 2005, Oracle 10g, MS-ILM2, Win Server 2003, IIS 6.0/7.0, AJAX, AjaxControlToolKit, NUnit, WebTrends, Google Analytics, Team Foundation Server 2008 and Silverlight 4.

Responsibilities:

  • Gathering requirements from Business users and training them on SharePoint Web Applications usability.
  • Coordination with Offshore team on daily basis and providing technical advice on any development issues.
  • Involved in SharePoint Farm Topology Design team for SharePoint 2010 implementation.
  • Configuration and SharePoint Farm Remediation activities.
  • Creation of Customized AJAX Based SharePoint WebParts.
  • Defining Enterprise Content Management (ECM) for document management and publishing sites.
  • Handled integration with other content management tools like Documentum and eRoom.
  • Publishing Site Pages Branding using Master Pages based on Wireframes.
  • User Profile extensions using SharePoint Shared Services.
  • Design with CAML query for filtering lists using SharePoint Object Model.
  • Preparation of Technical Specifications.
  • Integration of CodePlex solutions with OOB SharePoint Functionality.
  • Creation of Application Pages.
  • Complex Stored Procedures with Query Optimization.
  • Created Application Definition files for BDC using MetaMan to fetch profile metadata from Oracle.
  • Packaging and Deployment of WebParts with WSP Builder.
  • Participate day-to-day AGILE(SCRUM) Meeting and forward progress of the project to Sprint Owners.
  • Developed mobile based SharePoint sites with customized compact.browser files.
  • Processes and controls change approvals as necessary to accommodate Backlog request.
  • Coordinate the administration of the project (meetings, conference calls).
  • Monitor critical path of project activities and institutes risk management initiatives with mitigation plan.
  • Provide Technical guidance and Feasibility reports to the project team to inform decision making.
  • Implemented Capture Server with KnowledgeLake for Document capture and indexing.
  • Used PowerShell scripting for administrative activities.
  • Developed Functoids and Correlated card approval process using BizTalk.
  • SharePoint Admin activities for Components deployment and Site Migration.
  • Using AvePoint for Site Backup and restoration schedule.
  • Implemented faceted search using Fast Search.
  • Debugging and bug fixing during UAT.
  • Designed simple forms and SharePoint integration with Silverlight 4.
  • Maintaining code versions and automating deployments using Team Foundation Server (TFS).
  • Usage of third party WebParts from Bamboo Solutions.
  • Customized Pod cast and Workflow development using Nintex.
  • Handled integration with MS Office tolls like Excel and Outlook 2007.

We'd love your feedback!