We provide IT Staff Augmentation Services!

Principal Software Engineer Resume

0/5 (Submit Your Rating)

CA

SUMMARY:

  • Principal Software Engineer with masters + 10(+) years of experience in big data platforms (Spark - scala, AWS, Hadoop, Kafka, ELK, Hive), web platforms (Spring, tomcat, Apache, JBoss), API platforms (REST, SOAP, CXF, Akka) and machine learning platforms like MLlib, TensorFlow, R and scikit-learn leading developments in high volume trading engine, dynamic ad pricing, Massive big data ad platform (> 1 billion records/day), social media data-lake platform and dashboards, email campaign insights platform and sales reports and trends platform.

TECHNICAL SKILLS:

Big data: Hadoop, Spark, Hive, Kafka, Oozie, zookeeper, MongoDB, Azkaban, Pig, Yarn

Platforms: Spring, Struts, Hibernate, Akka, Play

Databases: MySQL, Postgres, Oracle, SQL Server

Spring: Spring MVC, Spring Boot, Spring Security

Servers: Raisin, Tomcat, JBoss, Apache

Tools: Eclipse, Intellij Idea

Languages: Java (certified,10 years),Python (3 years), Scala (3 years)

Dev Ops: Docker, Chef, puppet, Jenkins, Maven, AWS, Linux, Rancher

Finance: 2 courses: Principles of Finance, Corporate Finance and cofounder at fintech startup

WORK EXPERIENCE:

Principal Software Engineer

Confidential, SF, CA and LA, CA

Responsibilities:

  • Led team to develop high volume Akka/scala/docker micro-service architecture to integrate banks/custodial APIs
  • Led team of 6 to develop Spark/Scala/Cassandra/spark-stream lambda architecture application to process iTunes/App store data at Apple Inc. in near real time. Provided features like checkpointing, high volume state management and late data handling
  • Led team of 5 to develop Java/Scala/Python/Hive/Spark-SQL/spark-stream app to integrate POS sales data - Confidential
  • Architected Spark/Spring/Hive/EMR/S3 framework to parallel ingest high volume data from social media APIs and analyze using NLP techniques like classification and clustering, summarization and sentiment analysis at Confidential
  • Developed new email programs based on data derived from company wide “For Sale”/ Rental Property Solr search indices
  • Helped develop Java/Redis/JaxRS API platform that consumers from all over Confidential use to publish to Kafka.
  • Led development of ELK stack to monitor and deliver insights about high volume email / push campaigns.Solved complex high volume indexing problems using bulk API for indexing, custom analyzers, re-indexing large volumes of data or using aliases for time sensitive data to reduce needs for re-indexing, monitoring index merges so as to not affect search performance and so on. Developed logs/ email performance metrics dashboard in Kibana for continuous monitoring
  • Developed SQL /Hive based framework to provide Data warehouse team with periodic user Unsubscribe data
  • Helped devOps setup, configure and automate AWS big data services like EMR, EC2, S3 and Redshift/RDS.
  • Led Flume/HBase system to import tweets for selected artists and analyse using tableau
  • Enable faster execution of interactive queries by helping to introduce Presto and Spark SQL platform at Confidential
  • Led development ELK/Cloudera stack to monitor/deliver insights - high volume email campaigns and configured puppet
  • Led migration of Confidential movie sales data from MSSQL/Teradata to redshift
  • Helped develop Kafka/Java/Redis/Mongodb platform to supplement spark with near real time data ingests
  • Helped Introduce Movie Recommender service as a micro service architecture by enabling AWS auto scaling, Spring cloud config, Spring MVC, Sring Boot, Cloud Foundry and using design patterns like circuit breaker (Hysterix) for robustness and Eureka for service discovery
  • Helped develop complex tableau visualization using 6M data points (90000 artists / 90 days) to track artists trends across various social media channels like Soundcloud and Spotify

Tech stack: Spark/Hortonworks/MLLib/scikit-learn/Scala/Eclipse/AWS-EC2/S3/EMR/ELK/Kafka

Architect/Data Scientist

Confidential,San Mateo, CA

Responsibilities:

  • Led Hadoop/Spark/R/Mllib system to profile 700M users & recommend ads based on ALS, SVD & Naive Bayes
  • Led Hadoop/Hive/SQL pipeline to aggregate ads performance data and make it available for reporting
  • Helped CEO develop large scale LAMP web system to process > 1B requests/day with <90ms response times across 200 web servers
  • Introduced in memory DB Aerospike to replace legacy Map/reduce & reduced data pipeline from 1 hour to 10 minutes.
  • Helped introduce Big data analytics platform Platfora in the company

Data Integration Engineer - (J2EE/Scala)

Confidential, SF, CA

Responsibilities:

  • Developed POC to analyse high volume events data (>100 m views) using Scala/Lift/MongoDB/Mahout/Hadoop
  • Enable security for high volume web site using Spring Security
  • Automate and streamline data pipelines through Linux scripting and Jenkins
  • Develop J2EE/Spring/Axis API platform to import diverse data from partners with minimal config. change
  • Used Design patterns like Controller, separation of API and database logic that led to modular, easy to change code and allowed independent evolution of API and DB micro-services.

Tech stack: Scala/Raisin/Spring/Postgres/MongoDB/Talend

Research Scholar

Confidential, Pittsburgh PA

Responsibilities:

  • Setup Java/Python ML pipeline to execute sequence alignment models in parallel using Bootstrap Aggregating
  • Developed algorithm to infer information from multiple child trees and display in single parent tree in intuitive manner
  • Provided MVC architecture that separated data access (files/objects) and data view (Swing) code to simplify/speed dev

Tech stack: Swing/Java/Python/MySQL/Weka

J2EE System Engineer

Confidential

Responsibilities:

  • Led development of 6 people team for corporate wide quality control portal using Spring, Hibernate, Linux and Oracle
  • Led development of ESB framework to integrate isolated systems of TCS client. Nominated for "Best project"
  • Implemented Spring/JMS with MQ to integrate applications with Framework above
  • Utilized JavaScript/HTML/CSS to develop GUI that can be configured dynamically at run time
  • Train and mentor teams about statistical process control techniques using Excel and Minitab

Tech stack: Spring/JMS/Python/MySQL/Oracle/Websphere MQ/ESB

We'd love your feedback!