Data Scientist/big Data Architect/sr Software Engineer Resume
Indianapolis, IN
SUMMARY
- Solutions - focused, cutting-edge Senior Data Scientist, Big Data Architect, Solution Architect with more that 4 years of extensive experience working on Big Data with profound expertise in Big Data Infrastructure & Analytics, Data Science & Machine Learning, Statistics, and Distributed and Parallel computing and 12 plus years of experience doing Software Development & Architecture in various industry domains like Product Development, Consulting, Financial, HealthCare, Retail, Real Estate, Pharmaceutical, Insurance, Retirement Systems, Search Engine Optimization.
- Strong combination of senior level architecting, developing, and team management skills.
- Confident professional able to direct top performing teams through an entire project lifecycle, from start to launch.
- Skilled in translating business needs into high quality software products.
- 12 plus years of experience in full life cycle development involving architecting, analysis, design (System/OO), development, deployment, testing, documentation, implementation & maintenance of application software facilitating corporate growth and cost saving.
- 4 plus years of experience in Architecting/Developing/Deploying scalable Big Data solutions using Predictive analytics, Statistics, Machine learning.
- 5 plus years of team management experience in leading product development onsite and offshore teams ranging from 3 to 15 members.
- Expertise in architecting & implementing scalable Big Data/NoSQL solutions ranging anywhere from 100 terabytes to 1000 terabytes of unstructured data using Apache Hadoop, Hortonworks/Cloudera Hadoop, Apache Hive, Apache Pig, Hbase, Cassandra.
- Expertise in implementing data science solutions using Machine Learning Packages like R, Python-Scikit learn, Mahout for Data Science, Predictive Analytics and Recommendations.
- Experience in architecting & implementing ETL process using technologies like Apache Flume, Sqoop, as well as doing real time analytics using Storm,Spark.
- Experienced in implementing analytic solutions using Machine learning algorithms like Naïve Bayes, Decision trees, k-Nearest Neighbors(KNN), k-Means clustering, Support Vector Machines(SVM),Latent Dirichlet Allocation(LDA),Collaborative Filtering.
- Experienced in implementing text mining using NLP (Natural Language Processing) packages like Apache OpenNLP & NLTK.
- Experienced in implementing full text search functionality using search engines like Apache Lucene, Apache Solr.
- Experienced in implementing solutions using Functional Programming technologies like Scala, Akka.
- Experienced in architecting, managing and deploying Big Data clusters ranging from 10-100 nodes in Cloud like Amazon EC2 as well as in production environments using Cloudera Manager, Apache Ambari, Whirr.
- Experienced in developing cutting edge solutions using Data structures & Algorithms .
- Experienced in implementing Object oriented methodologies(OOAD) and Core Java & J2EE design patterns like Singleton, Factory Pattern, Abstract Factory, Service Locator, Business Delegate, Data Access Layer(DAO), Session Façade, and Value object pattern(DTO) etc.
- Expertise in developing server side applications using Java/J2EE technologies such as Java 5.0,Java Reflection, RMI, Java Threads, Java Collections, Generics, JDBC, Java Servlets.
- Experienced in developing applications using dynamic languages like Groovy, Grails & Python.
- Experienced in developing Web Services using SOAP, RESTFUL, WSDL, JAX-RPC, JAX-RS,JAX-WS, Apache Axis 1&2.
- Experience in various industry domains like Financial, HealthCare, Retail, Real Estate, Search Engine Optimization, Web Crawler, Insurance, Retirement Systems and Pharmaceutical.
- Experienced in developing aspect oriented programming using AspectJ.
- Experienced in developing applications based on frameworks like Apache Struts, Spring Framework(IOC,JDBC/DAO, Spring MVC Web framework) and MVC framework.
- Experienced in implementing Data Access Layer using Object/Relational Mapping tool like Hibernate, GORM.
- Experienced with IDE like Eclipse(3.X/2.X), Spring Source Tool Suite, IBM WebSphere Application Developer(WSAD 5.0 & 6.0), IBM Rational Application Developer(RAD 6.0 & 7.0), JBuilder, NetBeans and Editplus.
- Experienced in deploying web or distributed application on Web-Application Servers like, Apache Tomcat, IBM WebSphere Application Server, JBOSS,BEA WebLogic, GlassFish,
- Experienced with XML related technologies such as XML, XML Schema, Namespace, XSLT, DOM, SAX, JAXP,JAXB.
- Experienced in using Middleware technologies like IBM WebSphere MQ Series along with Apache Camel for SOA integration with JMS as the messaging mechanism and IBM Data Power for mapping EDI to XML and XML to EDI formats.
- Experienced in developing applications for B2B integration using various EDI technologies.
- Experienced in implementing HIPAA based EDI transactions like 834,835,837 & 997.
- Expertise in developing enterprise web applications using HTML, DHTML, Javascript, Ajax, JSP, JSTL, Taglibs etc, Custom Tags, JSON, JQuery, Google Web Toolkit(GWT),DOJO.
- Expertise in RDBMS like Oracle(8i/9i/10g), SQL Server(8.0), DB2,Sybase, PostgreSQL, MySQL and also extensive experience in database development using SQL, PL/SQL and T-SQL stored procedures, triggers and functions.
- Experienced experience working on Unix, Linux and Windows operating systems.
- Experienced in developing build scripts using Apache Ant, Maven and also hands on experience with Log4j and JUnit.
- Experienced in designing UML diagrams like Use Case, Class, Sequence & Component diagrams using Rational Rose, Microsoft Visio, Free mind & Dia.
- Experienced in version manager tools like GIT, Rational ClearCase, CVS, PVCS & Subversion .
- Excellent analytical /problem solving skills on diverged technical environments.
TECHNICAL SKILLS
Application/Web Server: Apache Web server (1.3.19/1.3.12 ), JBoss Application Server, WebLogic 6.x/7.x/8.x, Web Sphere Application Server 3.x/4.x/5.x/6.x, Apache Tomcat Server 3.x/4.x/5.x.
Programming Languages: C, C++, Java.
JAVA/J2EE Technologies: Java(5.0/4.0/3.0) J2EE(1.4/1.3) RMI, JDBC, Servlets, JSP, JSTL EJB, JSF, TagLibs, Swing, Java Beans, Spring, Hibernate.
Web Related: HTML,XHTML,DHTML,XSLT,JAVASCRIPT,AJAX,JQUERY,GWT,JSON,GSP,JSTL,JSP.
Operating System: Windows 95/98/NT/2000/XP/Vista, MS-DOS, Linux, Unix (Putty, SSH ).
Protocols: TCP/IP, HTTP,FTP,SOAP.
IDEs: Rational Application Developer6.1.1 & 7.0,WSAD 6.0/5.0.X/5.1.X,Eclipse(3.x/2.x),Edit Plus,JBuilder, NetBeans, Emacs, Win Vi, STS 3.0.
Database: SQL server 8.x/7.x/6.x, Oracle 10i/9i/8i/8.x/7.x, DB2 9.x/8.x, Sybase 12.x, My SQL, PostgreSQL 7.x.
Patterns & Frameworks: Java & J2EE Design Patterns, Struts MVC Frameworks, Spring MVC & MVC - 2 Architecture.
O/R Framework: Hibernate(3.x/2.x)
Web Services: REST,SOAP,WSDL,JAX-WS,JAX-RPC,UDDI, Apache Axis.
Parsers & XML related: DOM,SAX,XSLT,JAXP,JAXB.
Build & Testing Tools: Ant 1.6, Maven, JUnit, Rational ClearQuest.
Packages: JDK 1.6/1.5/1.4/1.3/1.2.
Repositories: CVS, PVCS, Subversion, Visual Source Safe, Rational Clearcase
Directories: LDAP.
PROFESSIONAL EXPERIENCE
Confidential, Indianapolis, IN
Data Scientist/Big Data Architect/Sr Software Engineer
Responsibilities:
- Lead development of small projects spanning around 3-5 weeks and used Lean Agile methodologies like Kanban to do Weekly Sprints, Daily Standup & Pair Programming.
- Implemented Test Driven Development (TDD) for creating Java & Groovy test cases at component level & integration level using JUnit & JMock.
- Architect & Implemented various Confidential components using Java6, Groovy, Python, Spring Dependency Injection, Spring MVC, Spring RESTful Controller, Spring Web Flow, Grails and used Hibernate & GORM on the persistence layer.
- Lead the efforts for creating technical design documents involving high level architecture use case analysis, state & data model diagrams and publishing it on Wiki.
- Architect & Implemented SOAP & RESTful web services modules implementing SOA using JAX-WS,JAX-RS,JAX-RPC,JAXB,Apache-Axis2.
- Developed the Auto Classification & Clustering model on Confidential utilizing Naïve Bayes, Decision Trees, Random Forest & K-means clustering Machine Learning algorithm.
- Architect & Implemented the Commerce Engine platform responsible for placing the best ad attached to an answer being delivered using Hadoop/Cassandra.
- Created and configured Hadoop Distributed File Systems(HDFS) clusters running on Amazon EC2(Elastic Cloud) instances storing pegabytes amount of Confidential Answer engine data using map-reduce algorithms and also used NoSQL database like HBase, HIVE, Pig, Apache Flume, Zookeeper, Apache Whirr for creating Column oriented data storage to look up data for business analytics applications.
- Architect & Implemented Confidential Recommendation Engine using Apache Mahout Collaborative filtering technique running on Hadoop clusters.
- Architect & Implemented Enterprise Integration Framework leveraging asynchronous messaging queues using Apache Camel and ActiveMQ.
- Created Archival Storage & Indexing previous questions & answers for Full Text Search utilizing features like Tokenization, Stemming, Filters, Analyzers, Fuzzy Matching using Apache Solr & Lucene. Hibernate Search was used for index update, sharding & replication strategies.
- Used GoF design patters like Factory, Abstract Factory, Decorator across the system along with Enterprise Integration patterns on middle tier and Service Design Patterns on the Web Service Layer
- Developed the web layer using JQuery, JSON, GSP, JSP, AJAX, JSTL, GSTL.
- Used tools like Hudson, Maven for automated & continuous integrated build and GIT/GitHub for version control.
Environment: JAVA 1.6, J2EE1.5, Maven 3.0, JDBC, AspectJ 1.6, Spring 3.0,Hibernate 3.0, JSP,GSP, JSTL1.1, Groovy 1.7,Grails 1.3,jQuery 1.7, Ajax, JSON, JUnit, JMock, Spring WebFlow, Subversion 1.6,Apache Tomcat 7.0, STS 3.0,Hadoop 1.3,HBase 0.96,Apache Cassandra 1.2.6,Apache Hive, R 3.3,Apache Pig 0.9, Apache Flume 1.1, Zookeeper, Active MQ 5.0,Apache Camel 2.7,Apache Solr, Apache Lucene, Hibernate Search, MySQL 5.5, ApacheAxis2, SOAP, WSDL, JAX-WS,JAX-RS, JAXB, JAXP, Spring-WS, Storm, Spark, Python-Scikit Learn, Python, Scala, Akka, NLP, OpenNLP, NLTK, Cloudera Manager, Hortonworks Ambari, Whirr, Sqoop, FLUME, Oozie, RStudio.
Confidential, San Antonio, TX
Solution Archiect
Responsibilities:
- Used Agile methodologies like Scrum which involved participating in daily stand-up meeting, participating in Spring Planning Meeting to figure out stories that needs to be accomplished in the current sprint, Iterative Sprint Development spanning around 2 weeks, Pair Programming & Test Driven Development (TDD).
- Managed a team of 8 offshore resources based out of Shanghai, China.
- Designed UML diagrams like Use Case, Class, Sequential, State & Flow diagrams using MS Visio after the release planning meeting with all the key players from the business.
- Interacted with product owner, business analyst to discuss requirements and implemented task for those story points listed on the story board during the current spring in a iterative manner.
- Created test cases & mock objects to do unit testing & integration level testing in TDD manner using JUnit, JMock.
- Used Core Java6 features like Annotations, Collections, Generics, Enums, Concurrency & Threading and dynamic language like Groovy to develop the business layer, service layer, web service layer and the domain layer.
- Used GoF design patters like Factory, Abstract Factory, Command, Strategy, Decorator to implement sprint tasks.
- Used Spring Dependency Injection along with Groovy to implement the Dependency Injection principle across the entire application.
- Developed SOAP/HTTP based web service for asset based invoices and network invoices using JAX-WS, JAXB, JAXP, WSDL, Spring-WS, to receive invoices in xml format from the DST TA2000 and PowerSelect component.
- Developed the Web Layer using Spring MVC, Grails, Spring Web flow, Grails Web flow.
- Used Hibernate ORM on the persistence layer and used features like entity mapping strategies like one-to-many, many-to-one, many-to-many using hibernate annotations to map the domain layer efficiently with the database tables, different fetching and catching strategies to optimize loading of entity graph and avoid unnecessary calls to the database and Criteria & Querying modules at the repository layer for efficient database queries.
- Developed drool rules using Drools Rule Engine to do effectively validate invoices against conditions like duplicate payout for the same period/same FI, invalid service agreement, etc.
- Used AspectJ for aspect-oriented programming to handle cross-cutting concerns like logging, transactions.
- Used Quartz Scheduler to schedule validation jobs, export payout jobs, network payout transform jobs.
- Developed Ajax oriented web pages using jQuery, JSON,JSP,JSTL, GSTL Tags & Custom Tags.
- Used SpringSource Tool Suite (STS)/Eclipse IDE/Tomcat for local workspace and deployed the application on Dev, QA environment using WebLogic Application Server 9.0.
- Developed and SQL Schemas, Functions, Views & PL/SQL Stored Procedures using Oracle10g database.
- Used IBM Rational tools like CruiseControl for automated & continuous integrated build, ClearCase for version control and ClearQuest for bug tracking.
- Used Clover for getting coverage metrics to assess the impact of unit tests.
Environment: JAVA 1.6, J2EE 1.5, Ant 1.7, JDBC, AspectJ 1.6, Spring 2.5,Hibernate 3.0, JSP,GSP, JSTL1.1, Groovy 1.7,Grails 1.3,jQuery 1.5, Ajax, JSON, JUnit, JMock, Spring WebFlow, IBM Rational ClearCase 7.0, IBM Rational ClearQuest 7.0, STS 3.0, BEA Weblogic Application Server 9.0,Tomcat 6.0, Oracle 10g,Drools 4.0,JAX-WS,JAXB,JAXP,Spring-WS.