Bi Architect/snr Data Engineer Resume
3.00/5 (Submit Your Rating)
SUMMARY
- Around 9 years of extensive work experience in Business Intelligence as an Snr Data Engineer/Snr ETL Developer and in - depth proficient experience in OLTP, Dimension Modeling, Data warehouse, T-SQL, OLAP and Database technologies.
- Around 3 years of experience in Machine Learning and Artificial Intelligence, Development of Chatbot, Face Recognition Algorithms, NLP.
- Experience in building chatbot application using Google Cloud (Hangout meet) and Azure Chatbot services.
- Knowledge in Google Cloud components like GSuite, DialogFlow etc
- Expertise in SQL Server2005/2008 R2/2012/2016, Business intelligence tools like Integration Services, Analysis Services and Reporting Services.
- Experience in creating Conceptual, logical and Physical data models .
- Experience in Informatica Big Data Management and Informatica Data Quality.
- Extensively used tools like SQL Profiler, Database Engine Tuning Advisor, Activity Monitor and Windows
- Performance Monitor for monitoring and tuning MS SQL Server Performance.
- Extensive experience in creating Tables, Views, Indexes, writing Stored Procedures, Triggers using T-SQL (DML and DDL) as well as analyzing and debugging existing complex stored procedures.
- Experience in writing T-SQL queries, Rank functions, CTE and derived table.
- An expert in creating, configuring and fine-tuning ETL workflows designed in SQL Server Integration Services.
- Extensively worked in SSIS Transformations like Lookup, Aggregate, Cache, character Map, Conditional Split, SCD, Data Conversion, Derived column, script component, pivot.
- Extensive experience in creating ETL solutions with different data sources like RDBMS, Flat file, XML, Excel.
- Experience in creating master and child packages, package configurations, logging and Error handling.
- Experience in creating Jobs, Alerts, SQL Mail Agent, and schedule SSIS Packagesusing SQL Server Job Agent and Windows Scheduled task.
- Experience in command line execution of SSIS Package.
- Expertise in SSIS Package deployment using Package configuration in Production.
- Expertise in developing SSAS Cube for business requirement.
- Created and Configured Data Source & Data Source Views, Dimensions, Cubes, Calculated Measures, Hierarchy, Partitions and KPI’s using SQL Server 2005/2008 R2 Analysis Services.
- Expertise in building MOLAP, ROLAP and HOLAP cubes in SSAS .
- Experience in analyzing data and creating Facts and Dimension tables.
- Expertise in Dimension Modeling .
- Experience in taking cube backup and restore.
- Experience in generating drill down reports, parameterized reports, matrix, charts in SSRS 2008.
- Experience in creating Transformations using the Pentaho data integration tool .
- Extensively Worked in Pentaho Transformation tasks like Mongo DB Input, Json Input, Table output, Modified Java Script etc.
- Experience in importing and exporting data using Sqoop from relational database to HDFS.
- Extensive work experience in MongoDB database.
- Proficient in Big Data, Hadoop Map Reduce Algorithms (2X Series)
- Experience in SQOOP, Hive, MySQL Database, HBase, MongoDB, Pig Latin.
- Proficient in creating customized Map-Reduce scripting, Partitioning, Combiner, Bucketing, and Input Splits.
- Proficient on Apache Spark, RDD and Scala.
- Knowledge in SHARK (Spark SQL), AKKA, KAFKA, MiLib (Machine Learning Algorithm)
- Experience in Cloud Services like AWS and Azure.
- Experience in AWS Rekognition, AWS Machine Learning, AWS Polly and other AWS Services.
- Experience in customizing “ ALEXA ” a voice assistant for business needs.
- Experience in working with NEO4J, a Graph Database.
- Experience in Azure Bot Service, Blob Tables/Files, Face API’s.
- Coordinated in requirements gathering effort to assure that client’s business needs are understood.
- Good team player, strong interpersonal combined with self-motivation, initiative and the ability to think outside the box.
- Ability and open to learn new technology in a short time and work in it.
TECHNICAL SKILLS
- RDBMS: MS - SQL Server 2005/2008 R2/2012/2016, Oracle 10g/11g, MS Access, MySQL
- NO SQL: Mongo DB, HBase, Hive, Informix
- Graph DB: Neo4j, AWS Neptune
- Languages: C#, T-SQL, Python, Java, Scala, Cypher Queries, Impala Queries, R, HTML
- Operating Systems: Windows 2008/XP/7/8/10, Ubuntu, Red hat, Amazon Linux, Cent OS 7
- BI/ETL Tools: SQL Server Integration Services ( SSIS ), Pentaho Data Integration (PDI), Talend Integration, Informatica 8.5, Informatica BDM 10.2, Informatica Data
- OLAP: SQL Server Analysis Services ( SSAS ), Cognos Dynamic Cube
- Reporting Tools: SQL Server Reporting Services ( SSRS ), Oracle Business Intelligence ( OBIEE ), Qlikview, Zoho Reports, Excel Reporting, Power BI, Tableau,
- Application Software: Visual Studio 2008/2012/2015, Toad, SQL Developer, Eclipse, Python IDE, MS Office Suite
PROFESSIONAL EXPERIENCE
Confidential
BI Architect/Snr Data Engineer
Responsibilities:
- Involved in Technical decisions for Business requirements, Interaction with Business Analysts to gather the requirements.
- Managed a team of size 5 in Offshore.
- Architect the design flow of the gathering the available information to one single repo.
- Designed the code baseline to download the files from Hadoop to Landing zone for SSIS to pick it up for ETL load using C#.
- Architected Complex SSIS ETL, Data Models and Created tables, Functions and Procedures on SQL Server.
- Responsible to maintain the data integrity and constraints.
- Designed the Data model for individual Data sources up to Data Warehouse.
- Responsible for Implementing the DSAR (GDPR) process to Delete, ROA, and DONOTSELL requests.
- Developed code base for Unification of Customers based on 10 different data parameters like First name, Last Name, Address etc and also built another process to distinguish the customer by their Emails.
- Developed an aggregated system to help business know the unified customer aggregated activities.
- Prepared Tech Design document, ETL Run books, project document and release notes.
- Created Deployment model and Scheduled the tasks to run in Dev and Prod using SSIS package Store
- Interacted with developers, Business & Management Teams and End Users.
Environment: C#, Azure Functions, Azure Bot Services, Python
Confidential
Senior Data Engineer
Responsibilities:
- Design and develop big data ingestion pipeline (mapping and workflows) in Informatica BDM (Big Data Management) to replace SQL Server Legacy system
- Design and develop Hive, Spark & Pig script, wherever required, and integrate with Informatica BDM's transformation for end-to-end integration through Orchestration
- Convert and merge data lake's delta tables from Avro to text for Greenplum distribution
- Aggregate and convert huge PIG Objects to BSON and push it Pig Mongo Connector and publish data in MongoDB
- Automate and orchestrate production build pipelines using DevOps tools like Jenkins and Git.
- Design and develop Oozie workflows for scheduling jobs in Production.
- Develop and Support REST Services to convert SQL Server Meta Data to be utilized for Big Data Pipeline.
- SSIS, Informatica Package and REST APIs in case of any issues
Environment: Apache Hadoop, Apache Hive, Spark,Scala, Python, Pig,Oozie, SQL Server, Informatica BDM
Confidential
Data Engineer
Responsibilities:
- Responsible for design, develop, test, maintenance, Production support and customize software and IT applications, Data/BI/ETL Architecture, Data Modeling, Dimension Modeling, and deploying Business Intelligence Analysis solutions using Informatica, Oracle, SQL Server SSIS, SSRS and SSAS.
- Architected Complex SSIS ETL, Data Models and Design develop, validation and deployment of Confidential ETL Inbound and Outbound interfaces using Informatica, Oracle, Unix scripts, Control M, UC4, Power exchange, Complex T-SQL queries and Shell Scripting, FileZilla, WinSCP, Putty, External tables and PL/SQL.
- Work on Gathering user requirements, Analyze, Database design, Architecture the Claims data warehouse using SQL developer tools.
- Create different Claims data model for legacy source data to Enterprise system, Validate and perform data cleansing and Convert EBCDIC format files using SSIS, C#, SQL Queries, Stored procedures and shell scripts.
- Duties also include Project Planning, Estimation, coordination, system integration, Defect/CR analysis and drafting the design specification documents for different ETL Projects.
Environment: C#, SQL Server (2014), T-SQL, Informatica, Windows Server 2012.
Confidential
Snr. BI/Data Engineer
Responsibilities:
- Involved in Technical decisions for Business requirements, Interaction with Business Analysts to gather the requirements.
- Architected Complex SSIS ETL, Data Models and design flow of the gathering the available information to one single repo.
- Designed the code baseline to build a chatbot using Azure cloud environment using C#.
- Created tables and Procedures on SQL Server and Blob Tables.
- Written python scripts to parse the text files to extract the game data and make it available for the chatbot to use.
- Prepared project documents and release notes.
- Created command-line execution of the scripts in a bat file and scheduled it in a Windows Scheduled task.
- Created Azure Functions to bring the data from SFTP to Blob Storages.
- Interacted with developers, Business & Management Teams and End Users.
Environment: C#, Azure Functions, Azure Bot Services, Python
Confidential, NY
Snr. BI/Data Engineer
Responsibilities:
- Involved in Technical decisions for Business requirements, Interaction with Business Analysts to gather the requirements.
- Architect the design flow of the gathering the available photos to one single repo.
- Designed the code baseline to train the photos of known people and celebrities using AWS Rekognition.
- Created Lambda functions to train the photos of known people in Confidential and did classification of the input photos
- Written python scripts to extract the faces among the group photos and did a comparison with trained model.
- Prepared project document and release notes.
- Created AWS Lambda to bring the data from SFTP to S3 Buckets and scheduled the jobs using Cloudwatch.
- Interacted with developers, Business & Management Teams and End Users.
Environment: Python, AWS Rekognition, AWS Lambda, S3, AWS Cloudwatch
