We provide IT Staff Augmentation Services!

Hadoop Developer Resume

2.00/5 (Submit Your Rating)

Ashburn, VA

SUMMARY:

  • Having 8+ years of professional IT experience including 3 years in Big data ecosystems, experience in ingestion, storage, querying, processing and analysis of Big Data.
  • Hands on experience in using Hadoop ecosystem components like Hadoop Map Reduce, HDFS, Oozie, HiveQL, Sqoop, HBase, Zookeeper, Pig, and Flume with CDH3&4 clusters
  • Excellent understanding / knowledge of Hadoop architecture and various components such as HDFS, Job Tracker, Task Tracker, Name Node, Data Node and Map Reduce programming paradigms.
  • Hands - on experience with major components in Hadoop Ecosystem and knowledge of Mapper/Reducer/HDFS Frame work for scalability, distributed computing and high performance computing.
  • Experience in analyzing data using PIG Latin, HIVEQL and custom MapReduce programs in JAVA using Development tools like Eclipse and Visual Studio Extending HIVE and PIG core functionality by using custom UDFs also have very good working knowledge on Pentaho ETL, IBM Big Insight and Cassandra.
  • Experience in Designing, Developing and implementing connectivity products that allow efficient exchange of data between core database engine and Hadoop ecosystem.
  • Worked on NoSQL databases including HBase, and managing and reviewing Hadoop log files worked with HCatalog, to open up access to Hive’s Metastore.
  • Experience in importing /exporting the data using Sqoop from HDFS to Relational Database systems/mainframe and vice-versa. Used Hadoop Streaming utility well to run Map/Reduce Jobs.
  • Good Working knowledge on Hadoop Administration activities like installing cluster, commissioning & decommissioning of datanode, namenode recovery, capacity planning, and slots configuration.
  • Installing and configuring Hbase, HDFS, PIG, HIVE, and Hadoop MapReduce.
  • Experience in BI Design Development, Database Development and Administration using MS SQL Server 2000/2005/2008.
  • Good Experience in creating Business Intelligence solutions using SQL Server Database, SQL Server integration services (SSIS), SQL Server Analysis Services (SSAS) and SSRS
  • Experience in design, development and deployment of SSAS cubes with partitions, aggregations, actions, perspectives and KPIs. Good working knowledge in OLAP, MOLAP, and ROLAP.
  • Experience in design and development of reports like Chart reports, Drill down and Drill through Reports by passing parameters using SQL Server Reporting Services (SSRS). Used SQL and MDX for creating Report datasets.
  • Possess good data modeling skills like Relational and Dimensional (Star and Snowflake schema) modeling in understanding the data requirements and subsequently building the data model (both logical & physical)
  • Performed Database administration activities like backup, recovery, integrity check and index reorganization. Involved in working with disaster recovery solutions such asreplicationand log shipping.
  • Hands on experience in application development using Java, RDBMS, and Linux shell scripting.
  • Extensively worked on database applications using DB2 UDB, Oracle, SQL Server 2008/2005/2000 , PL/SQL and My SQL.
  • Highly motivated with strong problem solving skills with excellent communication and interpersonal skills.

SKILLS:

Big Data Technologies: Hadoop, HDFS, Hive, Map Reduce, Pig, Sqoop, Flume, Zookeeper, HBase, HCatalog, NoSQL, IBM Big Insight, Cassandra,MRUnit, YARN

BI Technologies: SSIS, SSAS, Pentaho, SSRS 2008, Agile BI, SQL Server 2008/2005/2000 , SSMS, SSIS, SSRS, SSAS, Visual Studio,Backup, Recovery, Replication, LogShipping

RDBMS: DB2, SQL Server, Oracle 9i/11g, My SQL

Operating Systems: Linux, Windows 98/00/NT/XP, Mac OS

Languages: VB, JAVA, COBOL, Unix Shell Scripting, SQL, Java Script, PL/SQL, C++

Web Technologies, IDE’s& Others: HTML, XML, REST, Eclipse

PROFESSIONAL EXPERIENCE:

Confidential, Ashburn, VA

Hadoop Developer

Responsibilities:

  • Involved in design and development phases ofSoftware Development Life Cycle (SDLC) usingScrummethodology
  • Developed data pipeline using Flume, Sqoop, Pig and Java MapReduce to ingest customer behavioral data and purchase histories into HDFS for analysis.
  • Developed job flows in Oozie to automate the workflow for extraction of data from warehouses and weblogs.
  • Used Pig as ETL tool to do transformations, event joins, and some pre-aggregations before storing the data onto HDFS
  • Used Hive to analyze the partitioned and bucketed data and compute various metrics for reporting on the dashboard.
  • Loaded the aggregated data onto DB2 for reporting on the dashboard.

Environment: JDK1.6,RedHat Linux, HDFS, Map-Reduce, Hive, UNIX, Java, Pig, Sqoop, Flume, Zookeeper, Oozie, DB2, HBase.

Confidential, IL

Hadoop Developer

Responsibilities:

  • Loaded the customer profiles data, customer claims information, billing information etc onto HDFS using Sqoop and Flume.
  • Built data pipeline using Pig and Java Mapreduce to store data onto HDFS.
  • Used Oozie to orchestrate the MapReduce jobs and worked with HCatalog, to open up access to Hive’s Metastore
  • Importing and exporting the data into HDFS and Hive using Sqoop.
  • Load and transform large sets of structured, semi structured and unstructured data.
  • Worked with running Hadoop streaming jobs to process terabytes of data also used Hadoop Streaming utility well to run Map/Reduce Jobs.
  • Used Pattern matching algorithms to recognize the fraudulent customer across different sources and built risk profiles for each customer using Hive and stored the results in HBase.
  • Performed unit testing using MRUnit

Environment: CDH4, Linux, Flume, Hive, Sqoop, Pig, Oozie, UNIX, Java, Pentaho, JDK1.6, Map reduce, HDFS, Hbase, MRUnit

Confidential, Edina, MN

Sr. SQL BI Developer

Responsibilities:

  • Actively participated in interaction with users, team lead, DBA’s and technical manager to fully understand the requirements of the system.
  • Tested and createdViews,User Defined FunctionsandStored Proceduresfor new and modified business requirements.
  • Worked effectively onSQL Profiler,Index TuningWizard,Estimated Query Planto optimize the performance tuning of SQL Queries and Stored Procedures.
  • Successfullymigrateddata between different heterogeneous sources such as flat file, Excel and SQL Server 2012/2008 usingSSIS,BCPandBulk Insert.
  • Created, tested, modified and scheduled SSIS packages to update the tables on a day to day basis.
  • DevelopedSSIS Packagesusing varioustasksin control flow and transformationsin dataflow.
  • Created master ETL packages which include various packages in them and scheduled them to execute based on the given requirement time.
  • Experience in troubleshooting andtuning the performanceof long running SSIS packages.
  • Developed the packages with monitoring features and logging so that audit information of the packages and their execution results are loaded in to the audit table.
  • Deployed Packages on both File system and SQL Server MSDB across Development, Test, Production using batch files and Execution Utility, and used Configuration files and Environment variables for production deployment.
  • Maintenance and Debugging of ETL packages andETL optimizationusing SQL Server best practices.
  • Createdindexeson selective columns tospeed up queriesin SQL Server Management Studio.
  • Created views andrestrict accessto data in a table forsecurity.
  • Involved in generatingMatrix reports,Sub reportsandcomplex reports with multi value parameters for the analysis ofPerformance.
  • Created Drill reports which gives all the details of various transactions like closed transactions,pending approvals,and summary of transactionsand scheduledthis report to run on quarterly basis usingSSRS(Reporting services).
  • Created various Daily, Weekly and Monthly reports showing detailed information usingSSRS.
  • Deployed the solutions on Web Server, Implemented security to restrict the access to users and to allow them to use only certain reports.
  • Created standard report Subscriptions and Data Driven report Subscriptions.
  • Documented the reports and the packages created.

Environment: SQL Server 2012/2008, Microsoft Business Intelligence Development Studio, T-SQL, SSIS, SSRS, SQL Server Management Studio (SSMS).

Confidential, Frederick, MD

SQL BI Developer

Responsibilities:

  • Involved in all stages of SDLC of Ledger and Covad systems & Designed and developed end to end BI solution for Ledger and Covad systems.
  • Created SSIS packages to Extract, Transform and load data using different transformations such as Lookup, Derived Columns, Condition Split, Aggregate, and Slowly Changing Dimension, Merge Join and Union all.
  • Used For-Each Loop Container, Sequence Container, Script task, Expressions, Execute SQL task, Variables, Send Mail Task, Package Execution task to achieve business needs.
  • Implemented SSIS Package (ETL) to Extract and Transform data from DB2 RDBMS, Flat File or CSV, SQL Server 2005 instances and to Load into Staging Database.
  • Implemented tables partitioning and created procedures to add and remove partitions.
  • Created SSIS packages to get data from different sources and populated Dimension Model. Implemented Error Handling, Package logging and Dynamic connections.
  • Implemented mechanism to collect package execution statistics at component level.
  • Developed SSIS packages for cube processing. Handled incremental process and process full options for cube partitions, process add and process update for dimensions.
  • Scheduled and maintain packages by daily, weekly and monthly using SQL Server Agent in SSMS.
  • Implemented Restart mechanism in SSIS packages. Designed and developed SSAS cubes for Ledger and Covad Systems.
  • Developed complex Stored Procedures to generate various Drill-through reports, Parameterized reports, Tabular reports, Matrix reports and linked reports using SSRS.
  • Wrote complex Expressions and Calculations for SSRS reports with Conditional Formatting.
  • Used SQL Server 2005 tools such as SQL Server Management Studio, SQL Server Profiler, SQL Server Agent, and Database Engine Tuning Advisor for day to day tasks including backup and restore.
  • Performance tuned SSIS packages making them efficient and tremendously increase in performance
  • Designed and Developed SSRS reports for Dashboard used Sub reports and Linked reports.
  • As a senior developer worked on designing schema, sub-schema for multiple platforms, into a single platform and delivered data integration capabilities and operational data structures, initiated optimizations and reconfigurations, as needed.
  • Developed MDX queries on OLAP cubes to present data in SSRS Reports.
  • Deployed all the database objects and BI components into production and other environments.

Environment: Microsoft SQL Server 2005 Enterprise Edition, Windows 2003 Server, SSMS, SQL Server Integration Services (SSIS), SQL Server Reporting Services (SSRS), SSAS, Business Intelligence Development Studio(BIDS), Microsoft Visual Studio 2005, Visual Source Safe 2005, SQL Profiler, C#.NET, VB.NET

Confidential, IL

SQL BI Developer

Responsibilities:

  • Worked with end-users, Business Analysts for requirement gatherings and specifications and database administrators, developers, report writers, and project teams to refine reporting requirements based on customer’s business needs.
  • Provided estimates of effort required to design and develop solutions and participated in design sessions with architects and developers
  • Involved inDesigning the architecture of the warehouse, Logical & PhysicalData modeling,data dictionary mappings
  • Writtenstored procedures, Triggers, User-defined Functions, Views and Cursorsfor report use
  • Did lot ofSQL performance monitoringandtuningof reporting data by optimizing indexes and stored procedures
  • Designed ETL Packages to bring data from existing OLTP databases over to the new data warehouse by performing different kinds of transformations using SSIS
  • Expertise in using different Transformationslike Lookups, Derived Column, Merge Join, Fuzzy Lookup, ForLoop, ForEachLoop, Conditional Split, Union all, Script componentand etc
  • Developed the packages with monitoring features andloggingso that audit information of the packages and their execution results are loaded in to the audit table.
  • Debugging and maintenance of ETL packages andETL optimizationusing SQL Server best practices (usingunblocking Transformationsandrow transformationsfor better Performance)
  • Createdmaster ETL packageswhich include various packages in them and scheduled them to execute based on the given requirement time.
  • UsedEventHandlers, check points, Stored Procedures forCustom Loggingand for various events (On Warning, On Pre and Post Execution, On Task Failedetc)
  • Worked on Dimensional Data Modeling usingStar and snowflake schemasforFact and Dimensiontables
  • Designed, developed, and deployed new reports and enhancements to existing reports, for various applications as assigned.
  • Created interactive reports for sorting, differentParameterized Reports which consist of report criteria in various reports to make minimize the report execution time and to limit the no of records required
  • Report parameters includedsingle valued parameters,multi-value parameterswhich also consisted of different parameter types like hidden, internal, default (queried and non queried parameters).
  • Worked on all types of report types liketables, matrix, charts, sub reports etc.
  • Created complex stored procedures to use as thedatasetsfor the Report Design, to generate Ad hoc reports usingSSRS
  • Involved in designing sub reports and linked reports depending on the user requirement and in order to limit the number ofData Sources.
  • Generated variousreports with drilldowns,drill-through,hyperlink, calculated members,and dropdownsfrom the cubes by connecting to Analysis server and SQL Server from SSRS.
  • Deployed the solutions on Web Server, Implemented security to restrict the access to users and to allow them to use only certain reports.
  • Provideddocumentationabout database/data warehouse structures and updated functional specification and technical design documents.

Environment: SQL Server 2008/2005, Oracle, Microsoft Business Intelligence Development Studio, T-SQL, SSIS, SSRS, Erwin, TFS, SQL Server Management Studio (SSMS)

We'd love your feedback!