Big Data - Principal Software Engineer / Data Scientist (Sr. Manager) Resume
SUMMARY:
Provide leadership and architecture support to transform complex engineering and business logic into distributed, scalable, mission-critical applications. Lead teams to deliver solutions that follow software engineering best practices, including continuous integration and continuous delivery, in all phases of the software development life cycle.
TECHNOLOGY EXPOSURE:
Java EE: JPA (Hibernate), EJB 3.0, JMS, JSF 2.0, CDI, JTA, JSP, JNDI, JavaMail, WebSocket
Big Data: Hadoop (Oozie, Flume, Hive, Sqoop), Spark, Cassandra, Solr, Kafka, Spring XD, Elasticsearch, Akka
Integration: ActiveMQ, RabbitMQ, Camel, Spring Integration, TIBCO
Application Servers: Tomcat, Jetty, JBoss EAP 6.3, Drools
Caching: Ehcache, Infinispan, Redis
Web Services: RESTful (RESTEasy, Spring), SOAP (Apache CXF), IBM FileNet 5.2
RDBMS: MySQL, MS SQL, DB2, Oracle
Client frameworks: RCP, Vaadin, GWT, Groovy and Grails
JavaScript Libraries: AngularJS, Bootstrap, D3, Node
Servers and configurations: Windows Server 2012 R2, RHEL, Ubuntu, Vagrant, Docker
Testing: JUnit, Mockito, Spock
Build Tools: Maven, Gradle, Jenkins, CMake
Software development: SVN, Git, Penetration testing, TeamForge, Scrum, Agile, Software lifecycle management
MATHEMATICAL EXPERTISE:
Data analysis: Statistical data analysis, Time series analysis, Data mining, Design of experiments
Modeling: Stochastic processes, Simulation, Monte Carlo
Numerical calculation: Computational Fluid Dynamics
Optimization: Linear programming, Network optimization
TECHNICAL SKILLS:
Programming Languages: LabVIEW, Java, C#, Scala; basic exposure to C, C++, Fortran, PHP, Python, JavaScript
Simulation and Signal Processing: MATLAB, Mathematica, LabVIEW
Statistics Software: SAS, R, LINGO, Minitab, Design-Expert, Origin, Excel (add-in), Crystal Ball
Graphics and mobile: Unity, Blender, Xamarin
PROFESSIONAL EXPERIENCE:
Big Data - Principal Software Engineer / Data Scientist (Sr. Manager)
Confidential
Responsibilities:
- Enable Confidential to improve the member experience and increase HEDIS gap closures at the member level by sending the right message, at the right time, through the right channel for each member
- Developed a recommendation engine that recommends the optimal pharmacy based on demographic data, Rx claim data, geolocation and pricing information
- System design of Universe Generation for CMS Audit
- Data governance for the inputs of the auditing process
- Maintain policies and procedures
- Convert business requirements into technical requirements
- Provide implementation based on big data technology
- Provide user interface for reporting, alerts and scheduling
- Reusable Big Data and Machine Learning Modules
- Implemented machine learning and data manipulation Spark jobs
- Implemented a Spark-based geocoding blending job
- Continuous Integration setup in big data environment
- Use Jenkins to build, release and deploy all of the Spark jobs in the big data environment
- Set up Apache Solr for Natural Language Processing (NLP)
- Use Spark to do data ingestion into Solr instances
- Configure Solr to analyze and cluster text data
Consulting System Developer (Technical Lead)
Confidential
Responsibilities:
- Integrate RabbitMQ, Kafka, Cassandra and the ELK stack (Elasticsearch, Logstash, Kibana) through Spring Boot-based microservices.
- Use Camel and Spring Integration to connect different technologies.
- Use AMQP as the primary protocol to communicate with RabbitMQ, the front end of the ESB.
- Use Kafka to archive and play back messages.
- Use Cassandra to persist all of the train messages.
- Use ELK to aggregate all infrastructure and application-level logging in one place.
Confidential
Senior Data Analyst / Software Engineer
Responsibilities:
- Build control logic for the energy storage system, from microcontrollers to the energy dispatching system
- A clustered ActiveMQ environment lets microcontrollers send messages to and receive messages from backend Java applications over the AMQP protocol
- An energy dispatching system provides automation and optimization functionality based on electricity price and system availability
