Big Data Engineer Resume

Bethlehem, PA

SUMMARY:

  • Technically skilled professional with 6+ years of experience in computer science, working with Big Data, Python, Hadoop, AWS, and Java technologies.

TECHNICAL SKILLS:

Languages and Frameworks: Python, Scala, Java, JavaScript, Hadoop, Spark, RESTful Web Services

Database: MySQL, PostgreSQL, Oracle SQL, SQLite3, MongoDB, Hive, HBase

Tools: PyCharm, Sqoop, Eclipse, IntelliJ IDEA, Atom, DataGrip, Toad, Git, JIRA

WORK EXPERIENCE:

Confidential, Bethlehem, PA

Big Data Engineer

Responsibilities:

  • Parsed logs from multiple files, segregated the data by selected attributes, and uploaded it to a DynamoDB table on Amazon Web Services. Wrote regular expressions to parse files of different formats.
  • Managed Hive tables in a Big Data environment and facilitated the transfer of data between HDFS and the local file system.
  • Converted files to Avro, Parquet, and JSON as required for further processing.
  • Created Hive staging tables as temporary storage for transferring data between the Hive database and the local file system.
  • Wrote a Python Spark application to join two disparate datasets and used Spark APIs to process and parse the data for insights. Applied test-driven development and unit testing to the Python code.
  • Used Spark RDDs, DataFrames, and SQLContext to process data from the local file system and HDFS. Transformed data to and from Python collections and converted it to (k, v) pairs for Spark API processing.
  • Provided service design and development in support of new application development and maintenance.
  • Experience with Java multithreading, the Collections Framework, interfaces, synchronization, and exception handling.
  • Utilized Apache Solr for tabular and text search on files and data stored in the Hadoop Distributed File System.
  • Used Flume and Kafka for streaming analytics processing of web server logs.
  • Wrote code for adding, querying, updating, and deleting data in AWS DynamoDB.

Confidential, Plano, TX

Application Programmer Analyst

Responsibilities:

  • Worked as a Python/Django developer. Built a back-end API with Django REST Framework to handle user accounts and registrations.
  • Collaborated with a senior developer to handle complicated issues related to the deployment of Django-based applications. Resolved ongoing problems and documented the progress of the Python project.
  • Developed pages for cross-browser and cross-platform compatibility. Utilized BrowserStack to test for compatibility. Have experience with Node.js and its framework Express.js.
  • Wrote a Python program to parse and upload CSV files into a PostgreSQL database. Used an HTTP request library for web API calls.
  • Maintained both Dev and Production databases in a PostgreSQL environment. The PostgreSQL database was set up in Amazon Web Services. Data was stored on HIPAA-compliant servers.
  • Deployed the application in Amazon Web Services. Have experience with Amazon EC2, RDS, SNS, SQS, S3, Lambda, Elastic Beanstalk, and Route 53. AWS CloudFront was used as a CDN to transfer data with low latency and high transfer speeds.

Confidential

Developer

Responsibilities:

  • Designed the database using different types of relationships.
  • Documented and maintained database system specifications, diagrams, and connectivity charts.
  • Worked with application developers to identify business needs and discuss solution options.
  • Designed and configured database and back-end applications and programs.
  • Built and maintained SQL scripts, indexes, and complex queries for data analysis and extraction. Ran SQL queries to back up data from handheld devices and perform maintenance.
  • Working experience in distributed object-oriented component analysis and design according to industry J2EE frameworks.
  • Experience developing applications in Java, writing core functions for the team.
  • Worked closely and effectively with vendors to replace/repair defective hardware and software.
