We provide IT Staff Augmentation Services!

Data Architect, Big Data Architect Resume

0/5 (Submit Your Rating)

NY

SUMMARY:

  • 14+ years of experience in Architecture, Consulting, Pre - Sales, Development, Support, Solution Design of Data Warehousing, Business Intelligence and Big Data systems.
  • Designed and implemented very large data stores to the scale of hundreds of TeraBytes for analytics on data of various domains including geo-spatial and tele-communications.
  • Architect of the BI solutions for business transformation of State Electricity Boards and Major Tele-communication giants, India.
  • Designed and showcased winning solutions and proof of concepts for Confidential .(a deal worth $70 million)
  • Created solutions for data Integration, data warehousing, business intelligence and data mining, as part of RFP & RFI Responses.
  • Worked in Business Intelligence & Big Data technologies at Confidential, New York for 6 years.

TECHNICAL SKILLS:

Big Data Technologies: Hadoop, Hive, Pig, HBaseBig Data Platforms: Apache, HortonWorks

Data Technologies: Data Warehousing, Data Mining, Database Partitioning

Databases: IBM DB2 v9.x, PostgreSQL, MS SQL Server, mySQL

Hardware Appliances: IBM Balanced Warehouse, IBM Smart Analytics Server

O. S.: AIX, Linux(Ubuntu, Redhat, CentOS), Windows

Languages: C, C++, Java, SQL, DB2 SQL-PL, Perl, Pro C

Tools: IBM Infosphere Warehouse, Cognos, Datastage, SPSS, Erwin

Virtualization: Oracle VirtualBox

PROFESSIONAL EXPERIENCE:

Data Architect, Big Data Architect

Confidential

Technologies used: Apache Hadoop, Hbase, PostGIS, Linux, Shell Scripting, VirtualBox

Responsibilities:

  • Integrated current and historical geo-spatial data from publicly available agricultural(soil) and weather data sources and designed a hadoop data repository for better crop yield prediction and conservation of water/land/energy.
  • Designed, developed, deployed and managed a scalable 16 node Apache Hadoop+Hbase data store of 350TB capacity on the Ubuntu Linux platform.
  • Created Cognos reporting framework for solar power forecasting module of the project.
  • Developed pig map-reduce programs to process and upload huge data files to the Hbase cluster.
  • Processed Spatial data(ssurgo) using PostGIS spatial capabilities.
  • Used open-source technologies in the technology stack of the solution to minimize the software licensing cost and still build the most resilient and fault tolerant hadoop data store.
  • Created complex shell scripts to create data processing solutions along with NoSQL to optimize the resource usage on the available hardware and reduce dependency on pricey vendor products.
  • Created virtualized solutions on virtualbox for quick testing of big data proof of concepts.

Data Architect

Confidential

Responsibilities:

  • Layout the ETL, data modeling, reporting and data mining architectural framework for Confidential (IBM's Loan Servicing Subsidiary), using the MISMO standards .
  • Analyze the core business capabilities of Confidential, such as Borrower/Portfolio analysis and Loan Modification Fulfillment and design an optimal database to mitigate the constantly changing reporting needs.
  • Using innovative solution methods using the core database technology and reduced dependency on COTS products and thus enabled in achieving a high customer satisfaction.

Pre-Sales Consultant & BI Architect

Confidential

Responsibilities:

  • Created data architecture track of the $70 million deal winning response to RFP for Confidential .
  • Designed and presented POCs on data quality and Data Mining for the client.
  • Architected the technical architecture design for the OLTP and BI databases for the Income tax processing.
  • Implemented the previously designed data model for the processing database and business intelligence database.

Data Warehousing and Business Intelligence Architect

Confidential

Responsibilities:

  • Worked as a DWBI architect and was responsible for laying out the ETL, data modeling, reporting and data mining architectural framework for the client.
  • Created ultra-flexible XML based solutions to address the constantly changing client requirements.
  • Used innovative solution methods using the core database technology and reduced dependency on COTS products and thus enabled in achieving a high customer satisfaction.
  • Iron out showstoppers in the entire DWBI lifecycle to enable smooth go-live of the project.
  • Conducted BI and data mining sessions for non-BI architects as part of project/proposal needs and for ongoing competency building in BI
  • Designed data mining solutions for customer segmentation and energy loss forecasting.

Pre-sales Consultant, Database Subject Matter Expert

Confidential

Responsibilities:

  • Serve as a Solutions Migration Consultant assisting clients in a proof of concept utilizing IBM technology.
  • Provide Data Integration, Business Intelligence Solutions as part of the RFP & RFI Responses or consulting assignments.
  • Work with client teams on product positioning, understanding the competitive landscape and negotiating with clients on IBM products and proof of concepts to replace their current Oracle technology.
  • Accomplish Benchmarks for client applications on IBM stack.
  • Communicate most complex technical decisions to business clients and account teams.

Data Warehousing Architect, Data Modeler

Confidential

Responsibilities:

  • Created the Solution Architecture and Data Model for IDEA BI suite using IBM best practices.
  • Responsible for good health of a 40TB production data warehouse on partitioned environment, handling 200gb of raw data per day.
  • Tuned the SQL and data loading process. Resulting scripts run in 1/10 of the previous time.
  • Implemented Workload Management to monitor database queries.
  • Implemented the security policy of BI Usage for the business users.
  • Provided technical leadership to the multiple project teams by solving problems and issues where the development team is stuck.
  • Provided guidance to the DBA team for data correction to avoid show stopper data storage issues.

Data Warehouse Architect, Data Architect

Confidential, New York

Responsibilities:

  • Consolidated customer(IBM SWG Market Intelligence Group) requirement and implemented backend code based on business logic to in corporate new requirements on the front end(reporting) side..
  • Validated the incoming marketing data and execute scripts to create reports and scorecards.
  • Performance tuned data loading scripts to make sure new changes can be implemented and tested quickly.
  • Managing the Database and Ensuring planned timely backup & restore of entire system and database.
  • Create Jobs and manually run database maintenance jobs to Reorg tables, backup and recovery, Quiesce, and Modify/Recover table spaces. Delete old Arch logs, grant authorities to users, load data to tables.

Data Warehouse Designer and ETL Developer

Confidential, New York

Responsibilities:

  • Consolidated the data coming from disparate data sources for an automotive industry giant(Daimler-Chrysler). Performance optimization for scalability, speed and robustness
  • Designed the Data Warehouse using Rational Data Architect to support the needs for advanced analysis.
  • Extract, cleanse and Load the data into the data warehouse using IBM Ascential Datastage.
  • Rapid development of prototypes and mockups to communicate intended functionality.

DB2 Product Code Developer

Confidential, New York

Responsibilities:

  • Responsible for code implementation of the new bufferpool reuse and IO reduction scheme.
  • Performed code reviews for scan sharing code developed by Toronto and Watson teams.
  • Performed benchmarking for the quantitative proof of performance increase from Scan Sharing.
  • Created testcases using DB2 scripts and perl to test the new enhancements introduced for Scan Sharing
  • Enhancement to the Scan Sharing code keeping performance in view.
  • Help research teams with creating the DB2 environment on AIX for performance tests.

DB2 Product Code Developer

Confidential, New York

Responsibilities:

  • Responsible for code implementation in DB2 engine for OLAP data storage model using MDC.
  • Code reviews for MDC related developed by Toronto and Watson teams.
  • Create and maintain testcases to test the new enhancements introduced for MDC.
  • Fix defects in DB2 v8.x and v9.x caused by the changes done by the optimizer and rewrite teams.
  • Benchmarking for the quantitative proof of the Rollout and other concepts in MDC.
  • Fix the code problems after getting the regression notifications.
  • Develop test cases using Perl to verify the functionality of the mdc in detail and its working in association with other high level DB2 concepts as concurrency, backup, recovery, catalogues etc.
  • Automate the test cases to run in AIX, Windows and Linux.

DB2 Enabler, DBA, Trainer & Technical Staff

Confidential

Responsibilities:

  • Technical staff for the IBM onsite team to help i2 with DB2/i2 benchmarks and handle post sales critical situations.
  • Interaction with Business Users, Application Users, IT Professionals, Management Team and analyzing the Systems at the Business partner site. Deliver pre-release presentations about new versions of DB2
  • Assist in performance tuning, Query Optimization, Physical Database Design for I2 databases.
  • Support for porting applications from Oracle to DB2.

Database Programmer & Technical Staff

Confidential

Responsibilities:

  • Providing Level 3 Technical Support to VisualAge Java / VisualAge C++ Customers world wide.
  • Providing “Basic Enhancements” for Data Access Builder for follow-on releases of VAJ and VAC++ code fixes to the bugs identified during the Regression Testing.
  • Automating the System verification testing process.
  • Interface with other Visual Age tools development and release management group

We'd love your feedback!