Big Data/software Engineer Resume
3.00/5 (Submit Your Rating)
Philadelphia, PA
PROFESSIONAL SUMMARY:
- 3+ years experiences in building big data application infrastructure and data modeling.
- 2 - 3 years hands-on experiences in programming: Java, Scala, Python, PL/SQL, No-SQL.
TECHNICAL SKILLS:
Proficiency in framework tools: Spark 2.x, Spark Streaming, MLlib, Hadoop, Map-Reduce, ETL, RabbitMQ, Kafka, Zookeeper, AWS (EC2, Route 53), Cassandra, Mongo DB.
PROFESSIONAL EXPERIENCE:
Big Data/Software Engineer
Confidential, Philadelphia, PA
Responsibilities:
- Create Data Lake by extracting data from real time customer Set-top box action into HDFS. Implement data ETL with Map-Reduce in Spark SQL. Build load balancer with HAproxy.
- Build Data pipeline upon AWS EC2. Manage distributed cluster as AWS admin role.
- Compose distribution system based on Apache Spark framework. Implement Spark Streaming application with Scala for customer request streaming management and modeling.
- Use YARN as resource manager. Utilize Kafka on distributed streaming system and Zookeeper for configuration synchronization. Monitor production exceptions with Splunk.
- Implement back-end server using core JAVA as producer of Spark platform.
- Design Cassandra DB schemas to store customer information. Implement DB driver template.
Big Data Engineer
Confidential, Jersey City, NJ
Responsibilities:
- Developed Enterprise Data Anomaly Detection application.
- Designed distribution system for TB-scale enterprise data processing.
- Utilized RabbitMQ on distributed streaming system.
- Built Machine Learning Pipeline and ETL processing based on Spark distributed platform.
- Applied Spark MLlib and Random Forest algorithm with Scala to detect anomaly data.
- Managed and scheduled Spark Jobs on Hadoop cluster.
- Developed back-end algorithm simulation service using SK-learn framework and Python.
- Designed Mongo DB 3.x schemas for anomaly data storage. Implemented DB driver template using JAVA. Applied Agile development for entire project.
Data/Software Engineer
Confidential, Philadelphia, PA
Responsibilities:
- Developed a user TV watching statistical application based on one million customers.
- Created Data Warehouse by extracting data from customer watching history. Implemented ETL processing with Kettle.
- Implemented statistical service with JAVA. Applied Spring framework for Web service.
- Optimized Asynchronous application in Spring framework to process high concurrency request.
- Managed system performance & capacity according to user watching habit.
- Designed Oracle DB schemas. Implemented DB driver using Hibernate framework.
