Big Data Solutions Architect/data Scientist/consultant Resume
Irving, TX
EXECUTIVE SUMMARY:
- I have 12+ years’ Experience in Business, Technology and Complex Program Management across Multiple Industry Domains has facilitated a Deep Direct as well as Intuitive Understanding of the Value of Data (incl. Big Data, MPP and NoSQL Technologies) Where, How and Why to Apply Technology (incl. New and Emerging Technologies) to Solve Business’s Problems and identify Opportunities.
- Proof of concept (POC), work breakdown structure, Agile Methodologies, Massive Parallel Processing (MPP), Cluster Analysis, Development, Predictive Analytics, Data Science, Data mining, Social, Churn Analysis.
- Architecture - Deep technology understanding, architecture and design skills, hands on experience.
- Leadership & Co-ordination - Successfully led and mentored multiple teams in multiple portfolio’s in Big Data and Web based Business Applications
- Technical Expertise - As an Architect for the Analytical/Services/Applications, I have shown great technical expertise in developing the System for the customer with very aggressive deadlines.
AREAS OF EXPERTISE:
- Big Data Solutions Architecture, Analytics
- Project Management
- Infrastructure Management
- Onsite and Offshore Management
EXPERIENCE AND SKILLS:
Languages: Java, Scala.
Hadoop Core: HDFS, MapReduce, YARN.
Hadoop ecosystem: Hive, Pig, Hbase, Sqoop, Flume, ZooKeeper, Oozie, Ganglia, Nagios, R, RHadoop, Mahout, Kafka, Storm, Spark, Solr, Banana, Drill, Knox, Tableau.
Hadoop Cluster: Setup, Chef, Installation, maintainance, Cloudera, Hortonworks, Apache Hadoop, Amazon EC2
Web tech: J2EE, Web Services, REST, HTTP, XML, Javascript, CSS.
Mobile Tech: MobileWeb.
Databases: HBase, Cassandra, MongoDB, Riak, Sql Server, MySQL, Oracle.
Web Servers: Web Sphere, Tomcat, Jboss.
Version Tools: Visual Source Safe, SVN.
OS: Windows, Linux/Unix.
Process: CMM, ISO, Agile.
PROFESSIONAL EXPERIENCE:
Confidential, Irving, TX
Big Data Solutions Architect/Data Scientist/Consultant
Responsibilities:
- Video analytics, customer call care search analytics.
- Real-Time In-Memory and Batch processing implemented for VHO, Billing, Ordering and Vision data sources.
- Implementation of the next generation architecture for more efficient data ingestion and enrichment.
- Data Ingestion with Kafka, Transformations in Spark, Storage in Cassandra, Indexing on Solr and search engine.
- Predictive Analytics with Decision trees, RandomForest, Bayes models and clustering with K-means/ SparkMllib.
- Sentiment Analysis, Emotion, Text mining, Supervised and Un Supervised Machin Learning.
- Prototype on Call recordings, Sentiment, Churn Analysis and Next Gen Data Warehousing.
- Extensive Hands-on experience with the big data echo system tools.
Environment: DataStax, Hortonworks, Cassandra, Hadoop, HDFS, Flume, Kafka, Spark MLlib, Solr, Banana, hue, Scala, R, Machin Learning, CentOS/Linux.
Confidential, Plano, TX
Big Data Solutions Architect
Responsibilities:
- Managed teams spanning multiple continents to develop a multi-petabyte of machine-data collection and analytics solutions for a leading food & beverages client, facilitated and development of a Big Data ‘Center of Excellence’.
- Architecture, End to End platform implementation as per established standards.
- Evaluated proof of concepts for use cases, executed prototypes, and documented, best practices.
- Distributed, Real-Time In-Memory and Batch process frameworks implemented for multiple data sources.
- Devised, lead implementation of the next generation architecture for more efficient data ingestion and enrichment.
- Text mining, Supervised and Un Supervised Machin Learning, Predictive Analytics.
- Provided multiple visualization interfaces for various users and business partners.
- Implemented agile process, Scrum technics, Story reviews.
- Ingestion, Data Storage, Analysis, Visualization, and Security formulated and optimized.
Environment: Hortonworks, Ambari, Hadoop, HDFS, Map Reduce, Hive, Flume, Kafka, Storm, Spark, Solr, Banana, HBase, Oozie, Java, Scala, Tableau, R, Mahout, Machin Learning, Revolution R, Drill, Knox, CentOS/SELS.
Confidential, Bothell, WA
Big Data Infrastructure Architect
Environment: Hadoop, HDFS, Map Reduce, Chef, Ruby, Ganglia, Hive, Flume, Oozie, Java, Riak, Linux CentOS.
Responsibilities:
- Architecture end to end Admin, development and support of Hadoop, Riak.
- Worked on Hadoop 2.x of Hadoop.
- Experienced in Cluster Setup, Chef provisioning and administering Hadoop, Riak clusters.
- Administrating core components Name Node, Resource Manager, Node Manager, Data Node and SNN.
- Managing Hadoop and Riak Clusters, checking cluster Health, Version Upgrades.
- Strong experience in performance tuning, troubleshooting.
- Setting permissions, Commissioning, Decommissioning, Balancing, and Managing Nodes.
- Setup and monitoring tools Ganglia and Nagios
- Analytics implemented with using Map Reduce programming (MPP), Java, Hive scripting.
- Worked on Continuous data ingestion/integration tools with Jenkins, Oozie, and Flume.
- Experience on Disaster Recovery(DistCp) implementation for Hadoop
- Involved in setup and maintenance of Hadoop clusters for Dev and Prod environments.
Confidential, Bloomington, IL
Big Data Consultant
Environment: Hadoop, HDFS, Map Reduce, Hive, Pig, HBase, Sqoop, Flume, Oozie, Java, J2EE, ZooKeeper, Splunk, Cassandra, Chef, CDH3/4, Amazon EC2, Cloud Computing, R&D, Linux CentOS, Windows.
Responsibilities:
- Installation, Admin, development and support of Hadoop, hive, Sqoop, HBase, Oozie & related Hadoop stack.
- Providing analytical dash boards, reports and POC’s to the Directors, Service Managers and stack holders. And participate and presenting the work progress of the System.
- Worked on Design patterns like summarization, bloom filtering, join and Meta Patterns.
- Analytics implemented with using Map Reduce programming (MPP), Java, Hive and Pig scripting.
- Worked on Continuous data ingestion/integration tools with Sqoop, Flume.
- Experience on HBase, MongoDB and Cassandra NoSQL databases.
- Solid understanding and experience with extract, transform, load (ETL) methodologies.
- Worked on Cloudera distribution of Hadoop.
- Experienced in installing, Chef configuring, monitoring and administering Hadoop, Cassandra clusters.
- Administrating core components Name Node, Job Tracker, Task Tracker, Data Node and SNN.
- Managing Hadoop Cluster, checking cluster Health, Version Upgrades.
- Tuning and performance improvement of cluster
- Migrating data from one cluster to another using distcp
- Involved in setup and maintenance of Hadoop clusters for distributed dev/staging/production.
- Created scripts to form Amazon EC2 clusters and for processing to the internal resources.
- Onsite & Offshore team management and call rotation
Confidential
Lead Consultant
Environment: Java, J2ee, Spring MVC, Mobile Web, JavaScript, jQuery, CSS3, IBM RSA, JAX-RS, Jersey, Web Sphere, SVN, Web Services, AJAX, UI, XML, HTML, Agile/Scrum, Windows XP/2007.
Supporting Tools: HP Service Manager, Diagnostics, BAC, Splunk Business Intelligence and Warehouse analysis, THS Tracker, JRF Data Tool, and Tealeaf.
Responsibilities:
- Proposed design patterns and emerging technologies to support application development projects.
- Worked on front end tools JQuery, JSON, and JSTL and with style sheets CSS3.
- Creating and consuming REST and SOAP Web Services.
- Performing the maintenance work of the j2ee applications as per the stipulations listed by the firm.
- Worked with JEE controller frameworks (Spring MVC, DDUI, Struts).
- Supporting and monitoring live-deployed production environments supporting high concurrent user load.
- Triaging Auto Quote Purchase prod issues through Technical Observation Post.
- Production support, Root Cause Analysis, Clustering, Assigning tasks for related teams and escalation.
- SME in Service, Problem, Incident, and Request tickets Management, Metrics and reports.
- Checkout for infrastructure, implementation and maintenance windows.
- Coordinated with development and testing teams to complete applications. Onsite & Offshore team management and call rotation.
Confidential, South San Francisco, CA
Social Telephony Architect
Environment: Java, J2EE, Python, Web Services, Face book API, Visio, MySQL, .Net CF, Windows Mobile SDK, Windows XP/2003, Linux/Unix, WAS, JavaScript, CSS.
Responsibilities:
- Architecture, Created detailed workflows, prototypes and the requirements that effectively communicate the interactive design.
- Understand and explore the market products and Come up with innovative ideas.
- Created the formulated product visually with Visio, PowerPoint tools.
- Implemented the Call flow Apps like Community Call, Click 2 call, Messaging, Voice monetization, Parental control, Voice micro blogging, Virtual numbers.
- Created Traceability matrix, Reviewed the tasks time to time.
- Face Book, Twitter, Pudding media Integration.
- Developed all Apps for Windows mobile environment and Implemented web service.
- Implemented web application for managing subscriber's information.
Confidential, Sacramento, CA
Programmer
Environment: Java, J2EE, Web Services, SQL Server, Windows, Web Sphere, JavaScript, HTML, VSS
Responsibilities:
- Created composite web custom controls for easy reuse.
- Server -side validation utilizing J2ee controls.
- Extensive use of Web services.
- Involved in migration of ASP application to J2EE
- Design and development of Web pages using JSP, JScript, HTML, DHTML, JDBC and XML.
- Extensive user of Stored Procedures, Triggers.
