Lead Data Developer/software Associate Resume
SUMMARY
- Managed, implemented and transformed many data projects in Bigdata (EDL), Datawarehouse (EDW), Business Intelligence (BI),database developer (SQL) through SAFe/Agile methods.
- Managed team (Team size 10 to 20) and Datawarehouse or Bigdata projects for more than 7+ years.
- Conduct ETL (Teradata, Informatica, DataStage, SSIS), SQL performance tuning, troubleshooting, support, and capacity estimation to ensure highest data quality standards
- Industry experience with Bigdata technology (Hadoop, Hive, Spark, SQL, Kafka, Hadoop, Python, Scala etc.)
- Experience with real time data pipeline and architecture
- Conducted dimensional modelling, master data management, metadata management, data cleaning and warehouse querying
- Sound knowledge of Agile development (Scrum and Kanban), Waterfall and Safe methodology (JIRA, Confluence and HP quality Centre) and best practices (code reviews, testing, etc.) to develop and deliver data products
- Experience in handling unstructured data and building data pipelines
- Experience with Data modelling/Data Vault, Enterprise warehousing experience (Erwin tool), EDL (Data Lake)
- Strong proficiency with relational databases (Oracle, DB2, SQL, Teradata, HIVE etc.) and reading and writing SQL and implementing data pipelines to deal with incremental data size up to 100 TB with database tuning (Indexing strategies, partitioning)
- Strong understanding of Data Governance and Master Data Management and principles
- Experience in handling data visualization tools like Business Objects, Tableau
- Proficient in working with operating systems like Unix, Windows, Mainframe
TECHNICAL SKILLS
- Mapreduce, Spark (Scala and Python), Hive, Hadoop, Unix, VB scripting
- Teradata 14.0, Oracle, Hive
- Informatica, Datastage, ETL/ELT
- SVN, Clearcase, ESP workstation, Control - M
- Business objects, Tableau, SQL,T-SQL
- Data modelling, Erwin,Data Governance, MDM
- Finance, Transportation, Hospitality, Banking
- JIRA,Confluence, Agile, SAFe
PROFESSIONAL EXPERIENCE
Lead Data Developer/Software Associate
Confidential
Tools: and Technologies used: Teradata, Informatica/DataStage, UNIX, Hadoop, Business Objects (UDT and IDT), Tableau, Hadoop, Hive, Spark (with Scala and Python), Erwin for dimensional modelling. Extensively using T-SQL and Teradata utilities like MLOAD, FASTLOAD, FASTEXPORT, TPT and cloud technologies like IPAAS and AWS. JIRA, Confluence, Erwin, Star Schema, Snowflake. SVN, ClearCase, ESP workstation, Control-M
Responsibilities:
- Developed spark-Scala scripts to transform data extracts, manipulate data using spark-sql and loading back into hive datalake tables.
- Demonstrated capability by implementing real time streaming through Spark streaming (coded in scala) on twitter by finding out trending Hashtags
- Prepared Sales Repository for big data ecosystem and coded logic to implement surrogate key generation, SCD-2.
- Played an important role in doing Datawarehouse offloading to Big data ecosystem
- Mentoring juniors and given training to the teams on big data technologies
- Managed and guided team of 10 members
- As an Individual contributor, analyzed business requirements, created user stories, finalizing sprint requirements with project owner. Created tech specifications, data model, ETL jobs, control-M jobs, unit test plan documents and supporting pre-deployment and post deployment validations
- Developed Hive-QL queries to create comprehensive extracts containing quarterly finance data from 3 data marts- revenue, customer orders and customer invoice
- Designed reports in tableau and business objects.
- Interacting with customers & team for requirement gathering, risk assessment, and finalization of Architectural/Functional design
- Participated in technical design and coding of Software Applications, mapping requirements, and in the finalization of product specifications and selection of appropriate techniques
- Responsible for overall execution of projects, ensuring quality of deliverables & productivity improvements
- Developing plans & schedules, resource allocations, manpower deployment, and team meetings for individual projects, also worked with 3rd party vendors
- Involved in hiring and inducting the right talent & forming right teams for development
- Responsible for creating and maintaining design and support documentations
- GDPR implementation: Re-engineering of the systems to adapt to the GDPR for the European market.
- Product and Customer Data Management (PMDM and CMDM): Implemented across different data models to produce single version of truth in different denormalized tables.
- Digital Insight Integration: This project aimed to integrate DIGITAL INSIGHT data into Datawarehouse of NCR. New values related to GL, Inventory etc. would start flowing in EDW. Complete development was done under my leadership.
- Score Annuity Renewal Metrics: This project entailed redesigning geographical regions and theatres into new segments for the purposes of capitalizing on growth opportunities in emerging markets.
- T &T Business Management: The objective of this project was to gain visibility into complexities of NCR’s Spend pattern and to provide requisite data by merging more than 10 complex source systems in EDW.
- Hercules Funnel: This project was designed to figure out what are the potential opportunities which are in NCR funnel and to give 360 degree reporting based on sizing, estimation and to figure out which are more close to a win for NCR.
- Hercules HR workday: The primary purpose of this project was to provide NCR HR and HR customers an expanded dataset in the EDW environment.
- SIOP MEA WAVE 2: This Data Warehousing (DW) project was developed in an effort to streamline the entire process of three countries Saudi Arabia, Egypt, and Nigeria specific data and to keep real time track of various financial transactions.
Senior Data Developer
Confidential
Tools: and Technologies used: Teradata, Informatica/Datastage, UNIX, Hadoop, Business Objects (UDT and IDT), Tableau, Hadoop, Hive, Spark (with Scala and Python), Erwin for dimensional modelling. Extensively using T-SQL and Teradata utilities like MLOAD, FASTLOAD, FASTEXPORT, TPT and cloud technologies like IPAAS and AWS. JIRA, Confluence, Erwin, Star Schema, Snowflake. SVN, Clearcase, ESP workstation, Control-M
Responsibilities:
- Gather and define business requirements while managing the risks to improve business processes, thereby contributing to enterprise architecture development from a business needs point of view through business analysis and map processes
- Define the business mission and performance standards across all functional areas and periodically review performance with the deft application of concurrent management audit procedures
- Organize various training sessions for the team to enhance their performance and train them on hadoop
- Ensure technical solutions are designed for performance, reliability, scalability, maintainability, supportability, business continuity and business agility while leveraging industry’s best practices
- Deftly serve as ‘Single Point of Contact/Interface’ for supporting clients
- Conduct ‘SWOT’ analysis and utilize findings for designing customized strategies to enhance customer services
- Royal Mail group project: This project involved building a data mart which will be used by all mail applications.
Data Developer/Consultant
Confidential
Tools and Technologies used: Teradata, Informatica/DataStage, UNIX, Hadoop, Business Objects (UDT and IDT), Tableau, Hadoop, Hive, Spark (with Scala and Python), Erwin for dimensional modelling. Extensively using T-SQL and Teradata utilities like MLOAD, FASTLOAD, FASTEXPORT, TPT and cloud technologies like IPAAS and AWS. JIRA, Confluence, Erwin, Star Schema, Snowflake. SVN, ClearCase, ESP workstation, Control-MResponsibilities:
- Structured project proposals complete with details of activities, time frame, and required mix of resources. Made business presentations before the clients to generate value proposition and secure financial commitments. Worked as administrator for carrying out all kind of migration and for role, space and security implementation
- Excelled as tech leader while managing multiple projects ensuring successful completion of the projects and smooth execution and implementation of the projects
- Dealt with various technical aspects of the projects including analysis of project requirements, technical guidance, estimation, scheduling, and final delivery of the solutions while focusing on competence enhancement activities
- Collaborated with the team members and senior management to maintain a continuous stream of information regarding the project status and progress
- Catalyzed business growth with constant impetus of strategic initiatives across diverse functional domains
- Efficiently furnished guidance on the projects and its requirements to the clients over the technology, processes and applications while updating them on the regular project related developments
- Actively involved in preparing estimates for product testing activities, developing plans for testing and UAT while maintaining the resource matrix for task allocations
- The responsibilities range from developing and supporting the project artifacts of various domains, helping the enterprise in making the crucial decisions. Collaborate with developers, project managers, business analysts and business users in conceptualizing and developing data marts and enhancements. Deployment of code in testing and pre prod regions with maintaining different versions of code in clearcase.
- Implementation of new logic which will be delivering new files considering liquidity premium index. Requirement and feasibility analysis, design, documentation, development, performance testing, production implementation. Design and development of database structures and logic. Deployment of functional packages.
- Worked on a module which would be generating create table statement and where clauses dynamically. Its result set is used in making joins between fact table and dimension tables.
Data Developer/Assistant System Engineer
Confidential
Tools: and Technologies used: Teradata, Informatica/Datastage, UNIX, Hadoop, Business Objects (UDT and IDT), Tableau, Hadoop, Hive, Spark (with Scala and Python), Erwin for dimensional modelling. Extensively using T-SQL and Teradata utilities like MLOAD, FASTLOAD, FASTEXPORT, TPT and cloud technologies like IPAAS and AWS. JIRA, Confluence, Erwin, Star Schema, Snowflake. SVN, Clearcase, ESP workstation, Control-M
Responsibilities:
- Dexterously managed the development, execution, updating and reporting of project plans and schedules
- Coordinated significantly in various technical aspects of the projects, i.e. requirement analysis, proposal, design and development, quality, and defects monitoring
- Prepared and reviewed the test plans, test cases and test reports while performing mutual testing the processing for any defects
- Meticulously documented all major activities for effective reference and use
- Keenly participated in preparing approach document for new projects
- Assured both quality and customer service while managing advanced/complex development tasks and projects to successful completion
- Synchronized successfully with clients for test environment setup and data capture, coordination with user groups for input, feedback, acceptance of renovation
