We provide IT Staff Augmentation Services!

Big Data/software Engineer Resume

3.00/5 (Submit Your Rating)

Philadelphia, PA

PROFESSIONAL SUMMARY:

  • 3+ years experiences in building big data application infrastructure and data modeling.
  • 2 - 3 years hands-on experiences in programming: Java, Scala, Python, PL/SQL, No-SQL.

TECHNICAL SKILLS:

Proficiency in framework tools: Spark 2.x, Spark Streaming, MLlib, Hadoop, Map-Reduce, ETL, RabbitMQ, Kafka, Zookeeper, AWS (EC2, Route 53), Cassandra, Mongo DB.

PROFESSIONAL EXPERIENCE:

Big Data/Software Engineer

Confidential, Philadelphia, PA

Responsibilities:

  • Create Data Lake by extracting data from real time customer Set-top box action into HDFS. Implement data ETL with Map-Reduce in Spark SQL. Build load balancer with HAproxy.
  • Build Data pipeline upon AWS EC2. Manage distributed cluster as AWS admin role.
  • Compose distribution system based on Apache Spark framework. Implement Spark Streaming application with Scala for customer request streaming management and modeling.
  • Use YARN as resource manager. Utilize Kafka on distributed streaming system and Zookeeper for configuration synchronization. Monitor production exceptions with Splunk.
  • Implement back-end server using core JAVA as producer of Spark platform.
  • Design Cassandra DB schemas to store customer information. Implement DB driver template.

Big Data Engineer

Confidential, Jersey City, NJ

Responsibilities:

  • Developed Enterprise Data Anomaly Detection application.
  • Designed distribution system for TB-scale enterprise data processing.
  • Utilized RabbitMQ on distributed streaming system.
  • Built Machine Learning Pipeline and ETL processing based on Spark distributed platform.
  • Applied Spark MLlib and Random Forest algorithm with Scala to detect anomaly data.
  • Managed and scheduled Spark Jobs on Hadoop cluster.
  • Developed back-end algorithm simulation service using SK-learn framework and Python.
  • Designed Mongo DB 3.x schemas for anomaly data storage. Implemented DB driver template using JAVA. Applied Agile development for entire project.

Data/Software Engineer

Confidential, Philadelphia, PA

Responsibilities:

  • Developed a user TV watching statistical application based on one million customers.
  • Created Data Warehouse by extracting data from customer watching history. Implemented ETL processing with Kettle.
  • Implemented statistical service with JAVA. Applied Spring framework for Web service.
  • Optimized Asynchronous application in Spring framework to process high concurrency request.
  • Managed system performance & capacity according to user watching habit.
  • Designed Oracle DB schemas. Implemented DB driver using Hibernate framework.

We'd love your feedback!