Big Data Architect/hadoop Developer Resume
St Louis, MO
SUMMARY:
- Expert in Big Data/NoSQL Architecture, Administration, Development and Data Scientist
- Architecture for Enterprise Application Platform for Confidential
- Analysis using Hive, Datameer, Platfora, Revolution R
- Expert in Hawq/Hbase Administration and Modeling
- Expert in Setting Big Data Infrastructure from Scratch
- Expert in Architecture for new Big Data Projects
- Warehousing experience including Big data, Informatica, Netezza, Exadata, Teradata
- Hive/Impala/Hbase Data Modeling
- Experience with Pivotal/Cloudera Distribution
- NoSQL/Big data/Hadoop/Columnar
TECHNICAL SKILLS:
Administration/Development: Hadoop, HDFS, Map Reduce, Hbase, Pig, Hive, Sqoop
Analysis: Big Data Scientist, Enterprise Architecture for Java Platforms, Architecture for Java Technology
Database Administration: Oracle(11g,10g,9i,8,7), Oracle 10/11g Grid & Infrastructure, Oracle Exadata 11g, DB2, Sql Server, Sybase, SQL Server
Replication: Oracle Golden Gate 10
Warehousing: Oracle Exadata, Netezza, Informatica
Data Modeling: Training Dimension Modeling in Depth by Ralph Kimball
Enterprise Architecture: TOGAF9 (Level 1 & 2),Cloud Computing Architecture(IaaS, SaaS, PaaS)
IT Service Management: ITIL
Service Oriented Architecture: SOA
Project Management: PMP
Technology: Unix, Windows
Development: PL/SQL Programming, Unix Scripting
JOB EXPERIENCE:
Big Data Architect/Hadoop Developer
Confidential, St. Louis, MO
Responsibilities:
- Enterprise Big Data Architecture for Confidential setting Enterprise Application Platform (EAP) for Analytics
- Setting L0, L1, L2, L3, L4, L5 Data Models (Hive Databases)
- L1 Databases using Incremental approach; data ingestion from Oracle, Teradata, Netezza. L0 Avro data ingestion; L1 Rcfile
- Setting L5 impala databases using Parquet format for Accessing Online applications having 500 users
- Use of Talend Framework for data ingestion of Databases from Oracle, Teradata, Netezza for L0, L1 Hive databases. Data profiling with Talend.
- Transformation to L2, L3, L4, L5 from L1 Hive databases using Oracle Financial Services Analytical Application(OFSAA)
- Security Setup for all Hive Databases, and applications including Datameer, Platfora
- Analytics using Datameer, Platfora, Revolution R
- Worked on Big Data Projects/Regulatory Compliance:
- Data Ingestion from Oracle using incremental approach; Generation of Alerts for Anti Money Laundering (AML)
- G2 Athena (Insight), G2 Athena (Viz) for monitoring Cases generated for Fraudulent activity
- Optima Retail
- Optima Wholesale
- AML Cards
- AML Institution
- Financial Retrieval System (FRS)
- Human Resources Card Automation (HRCA)
- Coordination/Supervision between teams: 18 Member Team in India, 2 in Singapore, 5 in US
- Hiring Interviews for Confidential Projects in Big Data
- Oozie, AutoSys Workflows
- Hive/Impala/Hbase Data Modeling
- Conversion of SAS Programs to Hive
- Citi Wide Analysis of all Oracle, Teradata, Netezza Databases for Incremental/Complete load
Technology Environment: 50 Node Cloudera 4.5 Cluster, Redhat 6.4, Data Size 600 TB, Revolution R, Datameer, Platfora, Hive 0.11, Impala, OFSAA IPE
Senior Hadoop Architect/Admin
Confidential, NY
Responsibilities:
- Enterprise Big Data Architecture for Confidential including all businesses covering Data Management, Data Sciences
- RFP for Confidential businesses, Capacity Planning, application and Big Data architecture at Enterprise Level
- Vendor Evaluation for emerging technologies including Pivotal, Cloudera, Horton works, Mapr
- Proof of concept for use Business cases
- Setup 40 Node Pivotal Hadoop Cluster
- Design/Setup/Model of Hawq/Greenplum Database
- Data Sciences using Chorus/Alpine/Rhadoop/Hive/Micro strategy/Mat lab. Help Confidential in achieving ad - sales objectives
- Established Big Data Department from Scratch
- Hadoop DR Setup
- Hadoop High availability setup
- VRP Monitoring
- Complete application architecture for ad-sales
- Warehouse using Type1, Type2, Type3
- Analytics using R, Rhive, Hive, Pig
- OLTP using Impala
- Scala/Spark POC
Technology Environment: 40 Node Pivotal Hadoop 4.1 Cluster, Redhat 6.4, Command Centre, Mahout, Solr, Pig, Hive, Tomcat, R/Hadoop, Hbase, VRP, Signiant, Micro strategy, PgadminIII, Postgres 8.x, Hadoop Cluster Size 216 TB, Data Size 140TB.
Hadoop Engineer
Confidential
Responsibilities:
- Setup of 100 node Cloudera Hadoop Cluster
- Automated installation of Cloudera Hadoop, Hive, Pig, Hbase, Oozie, Sqoop, Hue using Chef/Opscode/Knife, Maven, GIT, Github
- Designed Hbase Schema. Setup with Kepler, Solr
- Hadoop Java Programs to Load Data in HDFS/Hbase, Code Reviews
- Set Chef Server
- Oozie Server Setup with Failover, wrote Oozie Workflows
- Hive Queries Setup
- Analysis of Data using Map reduce Programs
- Analysis of Survey Data using Mahout (Machine Learning), Pig, Hive
- Health Population Distribution analysis using R
- Database Load with Sqoop
- Hue Server Setup and Development
- Designed Population Health Services
- Cassandra Database POC as compared to Hbase
- Agile/scrum Methodology Environment
- Website Analysis for every click/function used
- Kestrel Data Integration and Queuing Service
- Modeled Hbase using Json Model, Mongo dB Model
- Database Search using Elastic Search
- Frontend in Node.js
- Analysis in Pig
- Website integration with Hbase using Kestrel
- Hadoop Infrastructure Automation using Salt
Technology Environment: 15 Node Cloudera Big data/Hadoop 4.2 Cluster, Linux 6.2, Salt, Oozie, Elastic Search, GitHub, JSON, Node.js, Kestrel, Python
Hadoop/Database Architect
Confidential, NJ
Responsibilities:
- Designed Hbase Schema
- NoSQL(Hadoop, HDFS, Map Reduce, Pig, Hive)
- Data Conversion/Analysis Executive Portfolio System Database to Hadoop/Hbase. Architected Hadoop Hardware, Data files layout for Executive Portfolio System
- Data Conversion with Sqoop
- Analysis of Server Audit Log files
- 40 Node Cluster
- Setup of Fraud Detection System (FDS) application consisting of Oracle RAC 11.2, Data guard and Golden Gate
- FDS is a three node cluster, using RMAN backups
- Providing support to Confidential Warehouse Database
- Providing support to Reporting Databases
- Patches/upgrade of databases
- Review of Data Models
- Options to provide better alternate options for database performance
- Database production Support 24x7
- Coordination with offshore production teams
- Coordination with offshore Development teams
- PL/SQL Development and Developers Support for SQL Tuning
- Security/Audit setup for databases
- Tuning using fog light, Oracle Grid, ASH
- Warehouse in Netezza, Exadata. Supported day to day production issues in Netezza like tuning/performance issues. Coordinated Netezza patches.
Technology Environment: Linux, Sun Unix, Veritas Cluster, Veritas Volume Manger, Oracle 11g RAC, Oracle ASM, Oracle 10g Release 2 RAC, PL/SQL, Oracle Grid, Scripting, Perl, Windows Servers, Vault, Apache/Tomcat, ColdFusion, iPlanet, Visio, WebLogic Server, Grid Control Setup, Real Application Cluster, Golden Gate 8TB, Patrol, EMC SRDF, NoSQL(Hadoop, HDFS, Map Reduce, Pig, Hive), Netezza, Python, Guardian 9.0
Database Architect
Confidential, Washington, DC
Responsibilities:
- Expert in selecting enterprise application technology ensuring compliance with corporate business strategy
- Expert in Architecture/Implementing Organization Wise integration of Applications/Databases. Implemented Maximo, Lawson Financials, Open Text, GIS applications/databases integration.
- Expert architecture for designing/implementing applications. Architecture/Implemented n-tier Maximo, Lawson Financials, Open text document management, GIS with transparent application/database failover
- SDLC and solution processes for in-house developed applications using agile
- Setting RAC under 11g Environment using ASM storage
- Setting Data guard 11g Environment
- Setting RAC under 10g Environment
- MySQL for Web Applications
- Architect/n-tier-type2 Setup for Maximo 7.1 Server in Real Cluster 9i/10g/11g Release 2 environment, upgraded Maximo several versions. This covers transparent failover for Maximo Application/Database in production environment. The manual failover for Maximo application to Redundant Data Centre.
- Setup of Actuate Report Server
- Administrator for WebLogic, Apache/Tomcat
- Architecture/Implemented Enterprise Backup Strategy for databases & applications
- Model GIS Database with requirements covering Asset Management
- Installation/Setup of Oracle Real Clusters 9i Release 2
- Transparent Failover setup for Real Application Clusters
- Development PL/SQL Packages/Procedures/Functions
- Tuned Oracle Databases
- Reduced Lawson reports running time from 12 Hours to 2 Hours
- Unix Shell(Korn) Scripting
- Backup & Recovery Setup using Recovery Manager (RMAN)
- Oracle Replication
- Quality Control of 11 Developers (Maxima Development Team & Lawson Team)
- Setting Standards/Operation Manual for WASA
- Maximo Administration/Setup/Installation/Upgrades several versions
- Financial Warehouse Logical/Physical Modeling
- Upgrade Oracle several versions
- Migrated database storage to NAS
- Converting SQL Server Databases/tables to Oracle
- Agile Development Process
- WASA Customers Water Usage Warehouse Logical/Physical Modeling
- Migration/upgrade/Patching Oracle 9i/10g/11g
- UML Modeling
- Oracle Enterprise Pack
- Build Report Server using Golden Gate
- Virtual Private Security for tables
- SDLC Development for applications
Technology Environment: Sun Fire V880 Machines Clustered, Sun Unix, Veritas Cluster, Veritas Volume Manger, Maximo 7.1, MEA, Oracle 11g RAC, MySQL, Oracle 10g Release 2 RAC, PL/SQL, Oracle Enterprise Manger, Cold Fusion Server, SQL*Net, SQL*Loader, Unix Scripting, Lawson Financials, Perl, Windows Servers, Vault, Redhat Linux, Apache/Tomcat, ColdFusion, iPlanet, Perl, ColdFusion, Visio, Actuate Report Writer, WebLogic Server, Grid Control Setup, Real Application Cluster, Red Hat Linux 2TB, Nagios
Oracle DBA
Confidential, NJ
Responsibilities:
- Support for Development Department developing applications
- Production Support for running Databases World Wide
- Setup for Oracle 9iAS (Discoverer, Forms, Reports, Wireless, Portal, Apache)
- Installation, Tuning of Oracle 8i Databases
- Up gradation to Oracle 9i from 8i
- Tuning of Database Servers
- Supporting Web Logic
- Installation/Maintenance of SQL Server-2000.
- Installation, Maintenance/tuning Informatica 5.1 operations
- Clinical Warehouse in Informatica Power Centre 5.1 (Designer, Repository Manager, Server Manager, Business Objects)
- Studied requirements, model for building Clinical Warehouse Model
- Developed Informatica Mapping, Transformations, Business objects, sessions using Informatica tools
- Installation & Maintenance of Oracle Clinical 4.0 (SAS 6.0)
- Unix Shell(Korn) Scripting
- Global Search using Intermedia Text
- Oracle Internet Directory(OID) for Centralized LDAP Server
Technology Environment: Sun E10000 to Sun 450 machines, Sun Unix 2.6/5.6/8.0, NT4.0/Windows 2000, Oracle 8I with Partitioning, Oracle PL/SQL 8.1.7, Oracle 9i, Oracle 9i Application Server, Oracle Discoverer 6I, Oracle Enterprise Manger(DBA Studio), Java Objects, IBM WebSphere 4.1, Erwin 3.0, SQL Server 2000, Oracle 11.5.5 Applications, Veritas, Informatica 5.1, Oracle Clinical 4.0, SAS 6.0, SQL*Net, SQL*Loader, Toad 6.1, SQL*Navigator, Oracle Discoverer, Performance monitoring and Unix Scripting, Oracle Reports Server, Quest-Toad 7.1, Linux 7.0,ESRI/Oracle Spatial Databases 10GB to 60GB
Lead Oracle DBA/Modeler
Confidential, New York
Responsibilities:
- Architect for three running projects: Designed Object Oriented Database Model based on Class Model using Erwin 3.0
- Designed Generalized Interface between Application and Database using PL/SQL
- Project work included conversion from legacy system
- Installation, Maintenance of Oracle 8i, Parallel Server/Replication
- Backup & Recovery Setup using Recovery Manager(RMAN)
- Tuning of Database Servers
- Migration from Sybase 11.5 to Oracle 8i
- Hiring of DBAs
- Solving Technical Problems for 9 DBA’s in the team
- Installation/Maintenance of SQL Server 7.0
- Advance Queuing Setup
Technology Environment: Sun 450 machines clustered, Sun Unix 2.6, NT4.0/Windows 2000, Oracle 8I with Partitioning, Oracle PL/SQL 8.1.7, Oracle Parallel Server, RMAN, Replication 8.1.7, Oracle Enterprise Manger(DBA Studio), Java Objects, IBM WebSphere 4.1, Select 6.1, Forte 3.0, Cordiant 2.0, Sybase 11.5, Erwin 3.0, SQL Server 7.0, SQL*Net including Advanced Networking Options, SQL*Loader, Toad 6.1, ftp, Project Manager, Performance monitoring and Unix Scripting, Embarcadero Performance Center 1.7 Web Based Application 100GB, 52 Tables
