System Engineer Resume
4.00/5 (Submit Your Rating)
TECHNICAL SKILLS:
Skills: Hadoop, MapReduce, Confidential, YARN, Pig, Hive, Impala, Sqoop, Flume, Kafka, Basic proficiency in Spark, Scala, PySpark, NoSQL Databases (HBase, MongoDB), Java, Python, SQL, HiveQL
Tools: Tableau, QlikView, Eclipse, Intellij IDEA, MySQL, Oracle, Erwin, MS Excel, VM Ware
Others: Agile, Waterfall, Windows OS, Linux/UNIX Variants OS
WORK EXPERIENCE:
Confidential
System Engineer
Responsibilities:
- Created a Data lake in Confidential by loading the data from UNIX systems using shell scripting.
- Developed multiple MapReduce jobs for parsing, data cleansing and reorganizing the raw data
- Written Pig scripts using predefined functions, Confidential to specifically process and filter the MR output data
- Designed Hive table schemas using partitioning and bucketing.
- Developed HiveQL scripts to analyze the data
- Exported the analyzed data to the RDBMS using Sqoop for visualization and to generate reports for the BI team.
- Experience in using Sequence, RCFile, JSON, AVRO file formats and Hive SerDes
- Developed an application in JAVA, FTP, and SQL to facilitate the data transfers between a world’s leading Aircraft manufacturer and various Airline Companies
- Redesigned, developed an application and achieved increment in data transfer speeds up to 5X times
Confidential
Google Maps Analyst
Responsibilities:
- Designed high - quality map data and developed optimization features that are used by Googles map’s routing algorithms by using Google’s in-house tools and software
Confidential
System EngineerResponsibilities:
- Analyzed Twitter live stream data to discover most trending topics per day for a period of 1 week using Apache Flume, Confidential, Spark SQL/Shark and PySpark
Confidential
System EngineerResponsibilities:
- Identified customers with a high propensity for a subscription to a bank’s service by analyzing customer data set having 100,000+ records using various transformations and actions on a multi-node Spark cluster.
Confidential
System EngineerResponsibilities:
- Analyzed a YouTube’s public data set and gathered insights (top 5 categories of uploaded videos, top 10 rated videos, top 10 most viewed videos etc.) using Hadoop, Confidential, Pig, Sqoop and Hive
Confidential
System EngineerResponsibilities:
- Set up a distributed, 8 node Apache Hadoop cluster along with an Edge node running on CentOS machines.
- Also, installed and configured components like Pig, Hive, and Sqoop on the cluster