Big Data Manager Resume
Reston, VA
SUMMARY:
- 20+ years of enterprise - level application programming, architecture and management experience with deep knowledge in Big Data Technology Stack covering Data Science, Data Engineering, Machine Learning & Cloud Computing Technologies
- Accomplished patented work with data analytics, natural language processing, streaming data platform along with NoSQL schema design, infrastructure setup, development and architecture patterns to provide DaaS & SaaS solutions
- Worked with highly unstructured and semi-structured multilingual datasets over range of tools covering Cloudera’s CDH Hadoop instances, Amazon’s AWS services, RStudio, Python etc.
TECHNICAL SKILLS:
Big Data tools: Hadoop, HDFS, HBase, Confluent, Kafka, Impala, Zookeeper, Hive, Pig, Sqoop, Flume, Solr, Oozie, MapReduce, HCatalog, Cloudera, Hortonworks, R, RStudio, Databricks, Jupyter Notebook
DevOps: GitHub, Git, AWS Confidential Pipeline, MSBuild, JUnit, Docker, Jenkins, Visual Studio, Vagrant, Amazon Web Services, Azure, Google Cloud, New Relic, Elastic Search, Jira, TFS, PowerShell
Languages: Python, C#, JAVA, Scala, VB.NET, ASP.NET, C/C++
Scripts: Python, Shell (bash), Perl, JavaScript, HTML, UML, XML, VB Script
Databases: Amazon Redshift, PostgreSQL, Neo4j, NoSQL, MS SQL Server, Oracle 10g, MS-AccessPowerBuilder, Informix 7.1
Development tools: Eclipse, Maven, Tomcat, Visual Studio, Ant, Visio 2010, Apache Tomcat, IIS, Web Services, XMLJavaScript, HTML, DHTML, XML/XSLT, CSS
Other tools: MATLAB, IoT, BOT, TensorFlow, TensorBoard, Cloud, COGNOS, Rational Rose, Domo, Cassandra, Centerview, MangoDB, ArcGIS, ArcInfo, Spark, Splunk, MarkLogic, Dockers, Vagrant, Tableau, Qlik Sense, AlteryX
WORK EXPERIENCE:
Big Data Manager
Confidential, Reston, VA
Responsibilities:
- Analyzed and optimized Confidential ’s Data Games application to remove bottlenecks from real-time transactions using Python and Apache Kafka/Confluent providing Docker-based implementation for team and mothership servers
- Implemented Continuous Integration and Continuous Deployment ( Confidential ) pipeline integrated with GitHub for Tech College curriculum’s project submission and its auto assessment
- Developed an NLP driven voice-based intelligent system (patent applied) where executives can obtain real-time deep dive analysis data over an IoT system rather than working with dashboard or dependent on supporting teams
- Demonstrated a practice in evaluating and selecting machine learning models for building predictive analytics and analyzing dependent parameters, for proactive mitigation steps
Technologies: IoT, TensorFlow, TensorBoard, NLP, Apache Kafka/Confluent, AWS Services, Confidential pipeline, PostgreSQL, Redshift, Neo4j, R, RStudio, Python, Docker, GitLab
Senior Systems Architect
Confidential, Mclean, VA
Responsibilities:
- Developed and implemented Big Data and Cloud computing technologies stack for the company’s services products infrastructure needs - Web Services for Human Geographic analysis
- Performed data studies, data discoveries and data preparation using varying tools including BigQuery, Scraping tools, R, HDFS, HBase, SOLR, Gnip API etc.
- Designed NoSQL schema design and developed data models for data integration including third-party cloud services and technologies for social media analytics and sentimental analysis using NLP and Soundex services
Technologies: HDFS, HBase, Impala, PIG, HIVE, EC2, Solr, PostgreSQL, Neo4j, Cloudera, BigQuery, Google maps, .NET, R, RStudio, Python, Hadoop, MapReduce
Project Lead
Confidential, Fairfax, VA
Responsibilities:
- Built, supported and managed Dashboard applications for Confidential to meet their decision-making needs, successfully delivered 5+ major releases and 50+ minor releases
- Designed and developed web services for AJAX, session clean up and file security implementations; to be consumed by 80+ modules within HPMS (Health Plan Management System)
- Using TFS implemented software application lifecycle system for continuous integration, TDD (Test Driven Development), project management, version control
- Proactively involved the team and the client on how Big Data can help to evolve better solutions for HPMS applications
- Developed and integrated Confidential Bank) portal with the Microsoft’s HealthVault cloud services to secure the Microsoft Gold partnership for the startup Audacious Inquiry(AI)
- Developed the SOAP/REST services for the sales teams’ analysis. Consumed the REST services data feed from different provider for the Confidential health campaign websites
- Led development, software and database design, development and implementation architecture, third-party integration all throughout the SDLC implementation for range of projects
Technologies: Cloud Computing - Microsoft Azure, ASP.NET 2.0/3.5, AJAX, DotNetNuke, Visual Studio 2005, SQL Server 2005, HealthVault, HL7, SharePoint, Windows XP
Lead Systems Engineer
Confidential
Responsibilities:
- Developed an end-to-end automated system in .NET - C#, ASP.NET to feed data on a day-to-day basis from a UNIX-based host system to Oracle database
- Developed components and supported project including Future Business Model (FBM) intended to bring in new and improve processes for mortgage business at Confidential ; Hotel Management System (HMS) at Hilton, First Trust data migration services which had myriad of client-servers and required data consolidation
- Integrated functional prototypes with mortgage enterprise and other interface-based OOD, custom user, server controls
- Supervised data management for the migration of some of the existing reporting structure to VS 2005 using SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), and Crystal Reports
- Led as an onsite coordinator for full software lifecycle implementation of the HRMS intranet and desktop applications for Bank of Bahrain covering requirement-gathering, holistic design with respect to client system integration, implementation, data migration, testing, and onsite support.
- Wrote requirements in UML, created Use Cases, Sequence, Object Model, Component UML diagrams in Rational Rose. Validated OOA model against use cases, developed new use cases, elaborated analysis model using specification-actual design pattern in Rational Rose/UML
- Developed and supported Crystal Reports (ad hoc and standard report sets); Cross-tab, Conditional, Drill-down, OLAP type
- Established defect closures and causal analysis process to determine the defect root cause, effort required to implement process improvements and the expected impact on software quality as part of CMM capability maturity model
- Created/implemented performance testing using Mercury Interactive’s LoadRunner and WinRunner, writing scripts and monitoring resources to identify performance; overall, maintained the regression suite/repository for the project
- Involved in requirements gathering, restructuring of implementation strategy, designing data base schema, creating catalogs (Access and Oracle database), generating queries, and developing cube architecture for drill down reports and developing/modifying COGNOS reports.
Technologies: C#, VB.NET, ASP/ASP.NET/HTML, Jscript, VB Script, Java, Crystal Report, Rational Suite, SQL Server 2000, Oracle 9i, Load/Win Runner, IIS, Informix, Crystal Report8.0/9.0, PowerBuilder, Business Object, COGNOS - Impromptu, Power Play, MOVEit products, Empower.NET, Infragistic’s controls, Perl, Shell Scripting, Windows 2003, SSIS, SSRS, UNIX, FxCop 1.312, NUnit 2.2, NDoc 1.3.1
Research Engineer
Confidential
Responsibilities:
- Developed the Decision Support System (DSS) for demand site energy planning focusing on renewable sources using the regional datasets
- Successfully simulated and implemented the numerical model for Gasification process by extensively using supporting tools for Confidential analysis and mapping to build the databases, perform analysis, and generate maps by joining the spatial and attribute data in ArcInfo 2.04, ArcView 2.04 and GRAM ++
- Developed, documented, and presentation papers/reports/project methodologies using the forecasted scenario datasets for the individual/aggregated modules in the form of pictorial graphs and geographical maps
- Researched and designed to build the mathematical model for demand site energy managements using Gasification & Solar as an energy source
Technologies: MATLAB 5, Mathematica, Access 98, Linux, Dbase, Excel, GRAM, Confidential - ArcView / ArcInfo 2.04, VC++ 6.0, Windows