Data Modeler/data Scientist/big Data Analyst Resume
Arlington, VA
PROFESSIONAL SUMMARY:
- Data Analytics and Software Development background with experience in Python, R, AWS, SQL & NoSQL Databases and Machine Learning.
- Technically proficient in Tableau platform, Power BI platform, SQL, HQL, Python, R, SQL Server, Oracle, and HDFS.
- Experience coding and modifying SQL/ETL/Alteryx based on dashboard requirements in Tableau Designing and developing data source, dashboards in Tableau.
- Design and Develop Hadoop ETL solutions to move data to the data lake using big data tools like Sqoop, Hive, Spark, HDFS, Talend etc.
- Developed HQL scripts in Hive & Spark SQL to perform transformation on relational data and Sqoop export data back to DB’s.
- Experienced in Application Development using Big Data/Hadoop - MapReduce, YARN & its Ecosystems like Hive, Sqoop, Spark, Scala, Oozie and flume.
- Experienced in Apache & Cloudera Hadoop Distributions.
- Capable of processing large sets of structured, semi-structured and unstructured data and supporting system application architecture.
- Experience in writing HIVE queries, very good understanding of partitions and concepts in hive and designed both Managed and External tables in hive for optimized performance.
- Good Experience in performing data quality checks on ingested data.
- Domain knowledge of Experience in both Banking and financial services& Health Insurance Domain.
- Good knowledge in CC&B modules like billing, payments, meter management, credit and collections.
- Good Experience in Configuration tools such as BO, MO, UI Map, Business script, BPA Script, Zones and Portals etc. & also good knowledge of OUAF (Oracle Utility Application Framework) 2.x.
- Involved in most of the SDLC phases (Design, Implementation, Maintenance and Testing)
- Good knowledge of various technologies/frameworks like Java SE, Java EE & XML
- Focused, quick learner and self-starter, possesses skills to work under pressure and utilize the learned concepts quickly in a productive manner
- Experienced in creating, designing, processing of cubes using SSAS.
- Responsible for creating and maintaining Analysis services objects such as cubes, dimensions, measures. Created report model on SSAS cubes as well as changing default configuration on existing cubes.
- Advanced in Tableau features including calculated fields, parameters, table calculations, row-level security, R integration, joins, data blending, and dashboard actions. Hands on experience with python programming.
- Developed Tableau dashboards using complex relational and multi-dimensional data sources like Teradata, Hadoop, Hive, and Hyperion Essbase.
- Analytics experience with forecasting methods, reporting packages and looking for trends, R, python.
- Analysis experience such as pulling data together from various sources, looking for new ways to analyze, looking for trends in data and writing use cases.
- Requirements analysis, Key Performance Indicators (KPI), metrics development, sourcing and gap analysis, OLAP concepts and methods, aggregates / materialized views and performance. Excellent experience in writing SQL queries to validate data movement between different layers in data warehouse environment.
- Practical understanding of the Data modeling (Dimensional & Relational) concepts like Star-Schema Modeling, Snowflake Schema Modeling, Fact and Dimension tables.
- Knowledge of Service Now system used for raising Data Quality tickets.
- Knowledge of Share point used for storing and extracting user’s documents.
- Efficient in converting user’s requirements into technical requirements and vice-versa.
- Very good exposure to the entire Software Development Life Cycle (SDLC) and created technical documentations.
TECHNICAL SKILLS:
Programming: C, C++, Java
Scripting: MATLAB, Net Logo 6.0.2, R, Python, Shell, Perl, Unix
DB and Modelling: Azure SQL, DB2, Mongo DB, MS Access, NoSQL, Oracle, SQL
Technologies: HTML, XML, NetBeans, ASP.Net
Data Analytical Tools: Alteryx, Apache Ni fi, Hadoop, Hive, Kafka Neo4j, Power BI, SAS, SAP, IBM SPSS, Spark, Tableau, Tensor Flow, Teradata, Wireshark, Weka
Cloud Technologies: AWS, MS Azure, Amazon Redshift, Azure Data Factory
Data Science Domain: Machine Learning, Deep Learning, Big Data, Artificial Intelligence
PROFESSIONAL EXPERIENCE:
Confidential, Arlington, VA
Data Modeler/Data Scientist/Big Data Analyst
Responsibilities:
- Experience in designing Data visualization using Tableau and publishing and presenting dashboards, Storyline on web and desktop platforms.
- Experience in designing stunning visualizations using Tableau software and publishing and presenting dashboards, Storyline on web and desktop platforms.
- Used Tableau and Power BI to flexibly create and edit the dashboards for analytical purposes.
- Used Python (NumPy, SciPy, pandas, sci-kit-learn, Seaborn, NLTK) and Spark (Py Spark, ML lib) to develop a variety of models and algorithms for analytic purposes.
- Worked closely with business, data governance, SMEs and vendors to define data requirements.
- Obtained Unstructured, semi structured data to learn about user behavior and merge data from multiple data sources.
- Used machine learning algorithms to identify and analyze the trends in the obtained data.
- Worked on python to clean the Dataset and understand the trends in the obtained data.
- Developed and maintained data dictionary to create metadata reports for technical and business purpose.
- Involved in reviewing business requirements and analyzing data sources from Excel/Oracle SQL server for design, Development, testing and production rollover of reporting and analysis.
- Involved on Prediction model building, Machine Learning, Business process improvements, Visualization & Process implementation with R Programming.
- Implementing Spark MLib utilities such as including classification, regression, clustering, collaborative filtering and dimensionality reduction.
- Developed Statistical Analysis and Response Modeling for Analytical Database contributors (logistic regression).
Confidential
Data Scientist/BI Developer
Responsibilities:
- Developed an Agent Based Modelling Prototype for the Confidential .
- Adopted the restaurant hub net model initially in the development and later switched to different versions.
- Model development was done in Net Logo, a free software available online and different versions had been developed
- Designed dashboards by joining multiple complex tables.
- Administration and development of Tableau dashboards with interactive views, trends, drill downs, user level security, scheduling, and web deployment.
- Expertise in employing various Tableau functionalities like Tableau Extracts, Parameters, Filters, Contexts, Data Source Filters, Actions, Functions, Trends, Hierarchies, Sets, Groups, Calculations, Data Blending and Maps
- Worked closely with the ETL Team to automate the Data load process, create required aggregate tables, indexing and table portioning for performance optimization of Reports.
- Generated comprehensive analytical reports by running SQL queries against current databases to conduct data analysis
- Coordinate with the business users in providing appropriate, effective and efficient way to design the new reporting needs based on the user with the existing functionality.
- Manipulating, cleansing & processing data using Excel and SQL.
- The data generated from the prototype was recorded into a background excel sheets using tweaking to the variables
- Involved in creating and visualizing dashboards using Tableau Desktop.
- The visualizations of data had been carried out in Python and Tableau depending on the requirements.
- Extensive capturing and analysis of the data had been done drawing meaningful conclusions of the prototype.
