Big Data/Hadoop Developer Resume
Cleveland, OH
PROFESSIONAL SUMMARY:
- 10+ years of total IT experience in analyzing, designing, administering, tuning, and developing client/server applications, including 3+ years in Big Data (Hadoop development and ecosystem analytics) and in the development and design of Java-based enterprise applications.
- Experience with Big Data using the Hadoop framework and related technologies such as HDFS, HBase, MapReduce, Hive, Pig, Impala, Flume, Oozie, Sqoop, Spark, and ZooKeeper.
- Experience working with Cloudera CDH3, CDH4, and CDH5 distributions.
- Experience working with Flume to load log data from multiple sources directly into HDFS.
- Experience in importing and exporting data using Sqoop from HDFS to Relational Database Systems (RDBMS) and from RDBMS to HDFS.
- Experience in data analysis using Hive, Pig Latin, Impala, HBase, and custom MapReduce programs in Java.
- Experience in writing custom UDFs in Java for Hive and Pig to extend the functionality.
- Experience with Spark, Scala, and DataFrames.
- Developed analytical components using Kafka, Scala, Spark SQL, and Spark Streaming.
- Worked on the Spark SQL and Spark Streaming modules of Spark extensively and used Scala to write code for all Spark use cases.
- Experience in load balancing and stress testing.
- Experience with SequenceFile and Avro file formats and compression.
- Experience in working with Amazon Web Services EC2 instances and S3 buckets.
- Experience in designing both time-driven and data-driven automated workflows using Oozie.
- Implemented Hadoop-based data warehouses and integrated Hadoop with Enterprise Data Warehouse systems.
- Extensive experience in Data Ingestion, In-Stream data processing, Batch Analytics and Data Persistence Strategy.
- Worked on Windows and Unix/Linux platforms.
- Experience working with Oracle and MySQL databases.
- Experience in creating Spark Contexts, Spark SQL Contexts, and Spark Streaming Contexts to process huge sets of data.
- Promote a full-cycle approach, including request analysis, creating/pulling datasets, report creation and implementation, and providing final analysis to the requestor.
- Very good understanding of SQL, ETL, and data warehousing technologies.
- Experience with Business Intelligence (BI) database applications and various segments of the SDLC, using MS SQL Server 2014/2012/2008/2005/2000, DTS/SSIS, and Reporting & Analysis Services.
- Extensive experience with different phases of a project (project initiation, requirement and specification gathering, system design, administration, coding, testing, and debugging of new and existing client/server applications).
- Expertise in database optimization using tools like Database Engine Tuning Advisor, SQL Profiler, DBCC utilities, and Windows Performance Monitor for monitoring and tuning MS SQL Server performance.
- A well-organized, goal-oriented, highly motivated, and effective team leader/member with excellent analytical, troubleshooting, and problem-solving skills.
- Excellent verbal and written communication skills and strong in documentation.
- Flexible, enthusiastic, and project-oriented team player with solid communication and leadership skills to develop creative solutions for challenging client needs.
SKILLS:
Big Data/Hadoop Framework: HDFS, MapReduce, Pig, Hive, Sqoop, Oozie, ZooKeeper, Flume, HBase, Amazon Web Services, Spark (Spark-Scala, PySpark), Kafka, Hue Web, Impala, Cloudera Hadoop CDH5, Cloudera Manager CM5
Databases: MS SQL Server 2014/2012/2008/2005/2000/7.0, VS 2013/2010, Oracle 8i/9i/10g, Toad, MySQL
Languages: C, C++, C#, Java, Scala, Python, SQL, Pig Latin, HiveQL, T-SQL, PL/SQL, HTML, XML, ASP.NET, VB.NET
Operating System: CentOS, Linux, Windows 98/2000/2003/XP/NT/Vista, Windows 2000 Advanced Server, Windows 2003 Enterprise Server
Development tools: Microsoft SQL Studio, Eclipse, NetBeans
Development methodologies: Agile/Scrum, Waterfall
Other tools: Git, MS SQL Server Reporting Services 2008/2005/2000 (SSRS), MS SQL Server Analysis Services 2008/2005 (SSAS), MS SQL Server Integration Services 2014/2012/2008/2005/2000 (SSIS), Data Transformation Services (DTS), ODBC, SQL Server Management Studio (SSMS), Erwin 7.2/7.1/4.1, MS Visio 2007/2003, BCP, Active Directory, RS Utility, MS Office
PROFESSIONAL EXPERIENCE:
Confidential, Cleveland, OH
BigData/Hadoop Developer
Environment: HDFS, MapReduce, Hive, HBase, Pig, Java, Oozie, Scala, Kafka, Spark, Git, CentOS 6.4, SBT, Eclipse, RDBMS
Confidential, Cleveland, OH
Hadoop Developer
Environment: HDFS, MapReduce, Hive, HBase, Sqoop, Pig, Java, Scala, PySpark, Kafka, Spark, Git, Eclipse, CentOS 6.4
Confidential, Cleveland, OH
Hadoop Developer
Environment: Hadoop Ecosystem, HDFS, MapReduce, Pig, Hive, Sqoop, Eclipse, Shell Scripting, RDBMS
Confidential, Cleveland, OH
Sr. SQL DBA/ Sr. Database Consultant
Hardware/Software: MS SQL Server 2014/2012/2008/2005, T-SQL, SQL Server 2014/2 Integration Services (SSIS), TFS, SQL Server 2008 Reporting Services (SSRS), VS 2013/2010, MS Excel, Visio 2007, Erwin 7.2, SQL Server 2008 Analysis Services, Windows Server 2008, BCP, Active Directory
Confidential, Honolulu, HI
Sr. Database Administrator (DBA) / Sr. SQL Database Consultant
Hardware/Software: MS SQL Server 2008, T-SQL, SQL Server 2008 Integration Services (SSIS), TFS, SQL Server 2008 Reporting Services (SSRS), MS Excel, Visio 2007, Erwin 7.2, SQL Server 2008 Analysis Services, Windows Server 2008
Confidential, Newark, DE
Database Administrator (DBA) / Sr. MS SQL Server/SSIS/SSRS Developer
Hardware/Software: MS SQL Server 2008, T-SQL, SQL Server 2008 Integration Services (SSIS), TFS, SQL Server 2008 Reporting Services (SSRS), MS Excel, Visio 2007, Erwin 7.2, SQL Server 2008 Analysis Services, Windows Server 2008
