Big Data Eng./architect Resume
5.00/5 (Submit Your Rating)
Emryville, CA
SUMMARY:
- Proven history of building large - scale data processing systems and serving as an expert in data warehousing solutions while working with a variety of database technologies.
- Experience architecting highly scalable, distributed systems using different open source tools as well as designing and optimizing large, multi-terabyte data warehouses.
- Able to integrate state-of-the-art Big Data technologies into the overall architecture and lead a team of developers through the construction, testing and implementation phase.
- Consulted with business partners and made recommendations to improve the effectiveness of Big Data systems, descriptive analytics systems, and prescriptive analytics systems.
- Integrated new tools and developed technology frameworks/prototypes to accelerate the data integration process and empower the deployment of predictive analytics.
- Working knowledge of machine learning and/or predictive modeling.
- Experience designing, reviewing, implementing and optimizing data transformation processes in the Hadoop and Informatica ecosystems.
- Able to consolidate, validate and cleanse data from a vast range of sources - from applications and databases to files and Web services.
- Capable of extracting data from an existing database, Web sources or APIs.
- Experience designing and implementing fast and efficient data acquisition using Big Data processing techniques and tools.
TECHNICAL SKILLS:
Tools: APIs and SDKs Interface
Databases and Tools: MySQL, MS SQL Server, Oracle, DB2, NoSQL HBase, HDFS, MongoDB,Netazza, Vertica, and Teradata
PROFESSIONAL EXPERIENCE:
Confidential, Emryville, CA
Big Data Eng./Architect
Responsibilities:
- Designed a large data warehouse using star schema, flow-flake.
- Designed and developed Big Data analytics platform for processing customer viewing preferences using Hadoop, Spark and Hive.
- Gather the data for analytics for client different sector like on retail, employee reporting
- Integrated Hadoop into Teradata accelerating the extraction, transformation, and loading of massive structured and unstructured data.
- Created Hive queries that helped market analysts spot emerging trends by comparing fresh data with EDW reference tables and historical metrics.
- Loaded the aggregate data into a relational database for reporting, dashboarding and ad-hoc analyses, which revealed ways to lower operating costs and offset the rising cost of programming.
- Developed Scala scripts, UDFs using both Data frames/SQL and RDD/MapReduce in Spark for Data Aggregation, queries and writing data back into OLTP
- Implemented test scripts to support test driven development and continuous integration.
- Experienced in performance tuning of Spark Applications for setting right Batch Interval time, correct level of Parallelism and memory tuning
- Created reports and dashboards using structured and unstructured data n different Tools like Tableau.
- Experience in designing and developing in Spark using Scala to compare the performance of Spark with Hive and SQL.
- Generated Dashboards with Quick filters, Parameters and sets to handle views more efficiently.
- Generated context filters and data source filters while handling huge volume of data.
- Built dashboards for measures with forecast, trend line and reference lines
- Experience in creating different Visualizations using Bars, Lines and Pies, Maps, Scatter plots, Gantts, Bubbles, Histograms, Bullets, Heat maps and Highlight tables.
Confidential, San Francisco, CA
Big Data Eng./Architect
Responsibilities:
- Responsible for the design, development, testing and documentation of the Informatica mappings, Reports and Workflows based on standards.
- Extensive experience in writing complex SQL queries and have worked with various data source types like Oracle, MS-SQL server, Teradata, Salesforce, Excel, Google analytics
- Perform Data analysis and understand Business requirements. Lead the effort on Data Validations. Implemented File management system including File Validation for source.
- Worked on maintaining the quality standards on ETL, Data and report level.
- Created views and aggregate tables to support reports, Wrote Database Scripts in order to support the project. Develop and reframe solutions based on performance Optimizations.
- Worked on Data Encryption, Validation and Standardization. Worked on Informatica Administration to manage repository and have experience on Admin Console administration experience.
- Worked on Informatica Server administration in the areas of backup repository, configure setting/performance tuning, and meta management.
- Performed Administrative talk as Informatica Administrator and participated on data-oriented tasks on Master Data projects especially members/Payment, like standardizing, cleansing, merging, de-duping rules along with UAT in each state.
- Generated Dashboards with Quick filters, Parameters and sets to handle views more efficiently.
- Create, update and maintain project documents including business requirements, functional and non-functional requirements, functional design, data mapping, etc.
- Identify opportunities for process optimization, process redesign and development of new process.
Confidential, Memphis, TN
SR BI Developer
Responsibilities:
- Gathered business requirements, definition and design of the data sourcing and data flows, data quality analysis, working in conjunction with the data warehouse architect on the development of logical data models.
- Used Erwin for data modeling.
- Created complex Stored Procedures, Triggers, Functions, Indexes, Tables, Views and other T-SQL code and SQL joins for applications.
- Implemented database standards and naming convention for the database objects. established data granularity standards, designed and built star and snowflake dimensional models
- Developed SSIS Packages to extract, transform and load (ETL) data into the data warehouse database from heterogeneous databases/data sources
- Designed Star Schema modeling creating Facts, Dimensions, Measures and Cubes in SSAS.
- Designed Aggregations and pre-calculating in SSAS.
- Analyzed the data for the billing and maintenance department.
- Analyzed and solved numerous data and software performance issues in the Utility reconciliation system. This included the development of complex SQL to both identify and fix data issues in the existing database
- Created SSRS reports using OLTP and OLAP data sources.
- Designing the Partition of the cube for the increasing the performance
- Implemented mechanisms to accelerate query performance, including aggregations, caching, and indexed data retrieval and optimizing the design of dimension attributes, cubes, and Multidimensional Expressions (MDX) queries.
Confidential, Newport Beach, CA
BI Developer
Responsibilities:
- Translated business needs into data analysis, business intelligence data sources and reporting solutions for the clients.
- Transformed data from different data sources like Oracle to SQL Server 2008/2005 using OLE DB connection providers by creating various SSIS packages. Data is transformed from data warehouse to hub.
- Performed performance tuning by analyzing indexes, queries, and table size etc Database Engine tuning advisor and query execution plan tools.Created complex functions and views based on the business requirements.
- Developed stored procedures for complex sub queries and joins using multiple tables based on the business requirements.
- Developed and maintained various cubes using MDX complex queries using SSAS.
- Coordinated with source system owners, day-to-day ETL progress monitoring, Data warehouse target schema design (Star Schema) and maintenance.
- Working with various business groups while developing their applications, assisting in Database model design.
- Designed and developed various Service Management Reports like Cascading Prompts, Sales Reports, Portfolio Report, sub-reports, charts, and parameterized reports, conditional and dynamic reports and deployed various reports on Share Point Portal.
- Created report models for ad hoc reports when the end user wants to see the reports on fly.
Confidential, MI
BI Developer
Responsibilities:
- Participated in the Requirement Gathering of the various work streams by meeting with the Business users, Base Lining the details and documenting for the development team.
- Responsible for Performance tuning and Optimization of SQL queries.
- Carrying out specific Benchmark SQL Queries, Query execution plans and locking issues.
- Writing SQL*Loader control scripts and stored procedures.
- Loaded data into database using stored procedures for retail data.
- Written Database Triggers, Stored Procedures using PL/SQL.
- Successfully created and debugged many PL/SQL procedures, Functions and Packages for the application.
- Providing meaningful analysis based on available data.Responsible for Analyzing and trouble-shooting technical issues related to data.
- Performed data Analysis to Assess Scope items. Pulling data from various sources in Oracle, Access.