We provide IT Staff Augmentation Services!

Bigdata/hadoop Developer Resume

5.00/5 (Submit Your Rating)

Cleveland, OH

PROFESSIONAL SUMMARY:

  • 10+ years of total IT experience in analyzing, designing, administer, tuning, and developing Client/Server Applications and 3+ Years in Big Data - HadoopDevelopment and Ecosystem Analytics, Development and Design of Java based enterprise applications.
  • Experience on BIG DATA usingHADOOPframework and related technologies such as HDFS, HBase, MapReduce, Hive, Pig, Impala, Flume, Oozie, Sqoop, Spark and Zookeeper
  • Experience in working Cloudera CDH3, CDH4 and CDH5 distributions.Experience in working with Flume to load the log data from multiple sources directly into HDFS.
  • Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems (RDBMS) and from RDBMS to HDFS.
  • Experience in data analysis using Hive, Pig Latin, Impala, HBase and Custom Map Reduce programs in Java.
  • Experience in writing custom UDFs in Java for Hive and Pig to extend the functionality.
  • Experience on Spark, Scala, Data Frames.
  • Developed analytical components using Kafka, Scala, Spark SQL and Spark Stream.
  • Worked on the Spark SQL and Spark Streaming modules of Spark extensively and used Scala to write code for all Spark use cases.
  • Experience in load balancing and stress testing.
  • Experience with Sequence files and AVRO file formats and compression.
  • Experience in working with Amazon Web Services EC2 instances and S3 buckets.
  • Experience in designing both time driven and data driven automated workflows using Oozie.
  • ImplementedHadoopbased data warehouses, IntegratedHadoopwith Enterprise Data Warehouse systems.
  • Extensive experience in Data Ingestion, In-Stream data processing, Batch Analytics and Data Persistence Strategy.
  • Worked in Windows, Unix/Linux platforms.
  • Experience working in with ORACLE and My SQL databases.
  • Experience in creating Spark Contexts, Spark SQL Contexts, Spark Streaming Context to process huge sets of data
  • Promote full cycle approach including request analysis, creating/pulling dataset, report creation and implementation and providing final analysis to the requestor
  • Very Good understanding of SQL, ETL and Data Warehousing Technologies
  • Business Intelligence (BI) database applications and various segments of SDLC, using MS SQL Server 2014/2012/ 2008/2005/2000 , DTS/SSIS, and Reporting & Analysis Services.
  • Extensive experience with different phases of project (project initiation, project requirement and specification gathering, designing system, administer, coding, testing, and debugging new and existing client-server based applications).
  • Expertise in database optimization using tools like Database engine Tuning Advisor, SQL profiler, DBCC utilities and Windows Performance Monitor for monitoring and tuning MS SQL Server performance.
  • A well-organized, goal-oriented, highly motivated and effective team leader/member with excellent analytical, troubleshooting, and problem solving Skill.
  • Excellent Verbal & Written Communication skills and strong in Documentation.
  • Flexible, enthusiastic and project oriented team player with solid communication and leadership skills to develop creative solution for challenging client needs.

SKILLS:

BigData/HadoopFramework: HDFS, MapReduce, Pig, Hive, Sqoop, Oozie, Zookeeper, Flume and HBase, Amazon Web Services, Spark(Spark-Scala, Pyspark), Kafka, Hue Web, ImpalaClouderaHadoopCDH5, Cloudera Manager CM5

Databases: MS-SQL Server 2014/2012/ 2008/2005/2000 /7.0,VS2013/2010, Oracle 8i/9i/10g, Toad, MySQL

Languages: C, C++, Java, Scala, Python, SQL, Pig Latin, HiveQL T-SQL, PL/SQL, C, C++, C#, HTML, XML, Java ASP .NET, VB .NET

Operating System: CentOS, Linux, Windows 98/2000/2003/ XP/NT/Vista, Windows 2000Advanced Server, Windows 2003 Enterprise Server

Development tools: Microsoft SQL Studio, Eclipse, NetBeansDevelopment methodologies Agile/Scrum, Waterfall

Other tools: GIT,MSSQL Server Reporting Services 2008/2005/2000 (SSRS), MSSQL Server Analysis Services 2008/2005(SSAS), MSSQL Server Integration Services 2014/2012/ 2008/2005/2000 (SSIS), Data Transformation Services (DTS), ODBC, SQL Server Management studio (SSMS), Erwin 7.2/7.1/4.1 MS Visio 2007/2003, BCP, Active Directory, RS Utility, MS Office

PROFESSIONAL EXPERIENCE:

Confidential, Cleveland, OH

BigData/Hadoop Developer

Environment: HDFS, Map Reduce, Hive, HBase, Pig, Java, Oozie Scala, Kafka, Spark, Git, CentOS 6.4, SBT, Eclipse, RDBMS

Confidential, Cleveland, OH

Hadoop Developer

Environment: HDFS, MapReduce, Hive, HBase,Sqoop, Pig, Java, Scala,Pyspark, Kafka, Spark, Git, Eclipse, CentOS 6.4

Confidential, Cleveland, OH

Hadoop Developer

Environment: HadoopEcosystem, HDFS, Map Reduce, Pig, Hive, Sqoop, Eclipse, Shell Scripting, RDBMS

Confidential,Cleveland, OH

Sr. SQL DBA/ Sr. Database Consultant

Hardware/Software: MS SQL Server 2014/2012/2008 , 2005 T-SQL, SQL Server 2014/2 Integration Services (SSIS), TFS, SQL Server 2008 Reporting Services(SSRS),VS 2013/2010, MS Excel, Visio 2007, Erwin 7.2, SQL Server 2008 Analysis Services, Windows server 2008, BCP, Active Directory

Confidential,Honolulu, HI

Sr.Database Administrator (DBA) / Sr.SQL Database Consultant

Hardware/Software: MS SQL Server 2008, T-SQL, SQL Server 2008 Integration Services(SSIS), TFS, SQL Server 2008 Reporting Services(SSRS), MS Excel, Visio 2007, Erwin 7.2, SQL Server 2008 Analysis Services, Windows server 2008

Confidential,Newark, DE

Database Administrator (DBA) /Sr. MS SQL Server/SSIS/SSRS Developer

Hardware/Software: MS SQL Server 2008, T-SQL, SQL Server 2008 Integration Services(SSIS), TFS, SQL Server 2008 Reporting Services(SSRS), MS Excel, Visio 2007, Erwin 7.2, SQL Server 2008 Analysis Services, Windows server 2008

Responsibilities:

  • Experience on BIG DATA usingHADOOPframework and related technologies such as HDFS, HBase, MapReduce, Hive, Pig, Impala, Flume, Oozie, Sqoop, Spark and Zookeeper
  • Experience in working Cloudera CDH3, CDH4 and CDH5 distributions.Experience in working with Flume to load the log data from multiple sources directly into HDFS.
  • Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems (RDBMS) and from RDBMS to HDFS.
  • Experience in data analysis using Hive, Pig Latin, Impala, HBase and Custom Map Reduce programs in Java.
  • Experience in writing custom UDFs in Java for Hive and Pig to extend the functionality.
  • Experience on Spark, Scala, Data Frames.
  • Developed analytical components using Kafka, Scala, Spark SQL and Spark Stream.
  • Worked on the Spark SQL and Spark Streaming modules of Spark extensively and used Scala to write code for all Spark use cases.
  • Experience in load balancing and stress testing.
  • Experience with Sequence files and AVRO file formats and compression.
  • Experience in working with Amazon Web Services EC2 instances and S3 buckets.
  • Experience in designing both time driven and data driven automated workflows using Oozie.
  • ImplementedHadoopbased data warehouses, IntegratedHadoopwith Enterprise Data Warehouse systems.
  • Extensive experience in Data Ingestion, In-Stream data processing, Batch Analytics and Data Persistence Strategy.
  • Worked in Windows, Unix/Linux platforms.
  • Experience working in with ORACLE and My SQL databases.

We'd love your feedback!