We provide IT Staff Augmentation Services!

Big Data Lead Engineer/architect Resume

4.00/5 (Submit Your Rating)

Reston, VA

SUMMARY:

  • Over 25+ years of hands on Experience and author of database books.
  • Over 5+ year of experience of Big Data Hadoop ecosystem architect (Kafka, Hive, Sqoop, pig, datastage, ELK stack etc). Hadoop component integration, project initiation, and project execution.
  • Well rounded experience in Big Data, Hadoop, HBase, Impala, HDP, Ambari and eco - system on HDP
  • Designed highly scalable architectures using spark/Scala, mongoDB, Cassandra and extensible API.
  • Multiple NoSQL databases. Worked hands-on migrating databases from commercial databases to NoSQL databases like Cassandra/MongoDB. Written multiple migration scripts to handle one off cases. Used cloud and open source tools to handle migration tasks.
  • Over 17 years of Enterprise Infrastructure, design, Development Operations implementations and support. Database Administration and Technology lead experience.
  • Chief Information and Chief Data Lead
  • A technology leader in the open source database technology adoption with particular to DaaS, IaaS on cloud.
  • A mentor and in house trainer, excellent communicator, and team player.
  • Database migrations from private cloud to public cloud like AWS. Creation of AWS EC2 instances suitable for Oracle/MySQL and migrate databases from private cloud to AWS. Setting up Oracle data guard, optimizing the total grid is part of this function.
  • Enterprise architecture TOGAF.
  • Wide experience in application development, QA and SDLC cycle including scrum development methodology.
  • Hands on experience in multiple components of enterprise IT architecture including cloud computing, virtualization, network management, application design and development and inter-disciplinary teamwork.
  • Involved in Java and Python programming, and scripting languages like Perl, PHP and Pig Latin.
  • Worked on wide variety of Financial, Government and Commercial applications.
  • Presented many technical papers in international conferences.
  • Public Trust Full BI security cleared. TS screened.
  • Co-Authored two Database Administration books, still being sold on Amazon.

TECHNICAL SKILLS:

  • Oracle 10G
  • MySQL 5.1
  • UDB 8.0 and 8.
  • Sybase ASE 12.5
  • Sybase ASE 15.X
  • Sybase Replication 15.0
  • SQL Server 2008, Erwin 4.1, ERStudio, DBArtisan., Informatica.

PROFESSIONAL EXPERIENCE:

Big Data Lead Engineer/Architect

Confidential, Reston VA

Responsibilities:

  • Hands on Lead Big Data Engineering and Big Data architecture. Application design, solution definition, and implementation using Scala on Spark. Other components used include but not limited to Kafka, Flume, Sqoop and data streaming.
  • Applications involve high volume low latency response sensitive, high customer experience demanding. Providing leadership and working in team spirit. Studying new technologies, innovation and presenting recommendations are part of the job.
  • More than 25 years of Data Governance, Data Management, MDM, Data Security help me design applications and direct application development with cost optimization and customer experience as high prioritizes.
  • Next decade of applications will demand customer experience according to Gartner research. My focus is also customer experience centered.

Independent/ C2C Consulting- Senior Oracle DBA, database architect

Confidential

Responsibilities:

  • I am working on IBM contract at Fannie Mae, in Reston VA. Regular duties include every aspect of managing 1000s of Oracle databases, including but not limited to upgrading, patching, writing PL/SQL triggers, stored procedures, optimizing the procedures and providing day to day support.
  • Automating the operations with cost optimization in goal with shell scripts, perl and PHP scripts.
  • Database design, database systems design, architect the full SDLC complaint database infrastructure.
  • Define AWS EC2 and EMR infrastructure; Network definitions VPN, VPC, Route 53; Migrate On-Premise databases/applications to public cloud; define public cloud security policies
  • As Big Data Architect, some of the big data implementations that I defined, designed and architected include:
  • Log aggregator and log analytical system where various heterogeneous log data are accumulated, in a data lake concept. Log data is structured and synced with cloud repository using both Kinesis and Kafka depending on the app nature. Data finally stored at rest to NoSQL databases like DynamoDB or HBase or Cassandra.
  • Scala or Python is used to perform application logic before saving data at rest.
  • A POC performed to evaluate Microsoft Azure, with ETL operations, data science operations to figure out data relations, coefficient analysis, and classifications.
  • Additional Big Data projects included traditional RDBMS migration to Hadoop using many hadoop ecosystem components in particular the flume, sqoop and HIVE.
  • Working on setting up, configuring Ambari and Hadoop processing system based on Ambari. Capacity planning, system monitoring, cost optimization, multi-threading analytic engines on HDP.

Contractor- overlapping assignment

Confidential, FL

Responsibilities:

  • As spark/Cassandra/MongoDB architect, I played an important role to enhance sales enablement. The use case is harnessing multiple data sources to find hidden sales gaps, score the opportunities, develop machine learning tools to continuously enhance the tool, provide unconditional security, high availability, scalability at the least TCO and highest ROI.
  • A scrum based development environment with Python, and scala. Additional Hadoop eco-system components like flume, hive, sqoop, ligui are used for data ingest purposes. Kafka is used for data cleansing, queuing and programmatic control. Yarn is used for resource management.Complete product being deployed in Microsoft Azure, with some in MS HDInsight, and some custom managed platform.
  • Wide experience in PaaS and IaaS. This particular application is also an AaaS. MS Azure Analytical platforms and MS Azure Data Science/ML Library packages for Machine Learning are extensively used. Extensive use of Python, R, JSON in Machine Learning context are part of this project.

Principal DBA Oracle/Sybase/MySQL

Confidential, Dulles, VA

Responsibilities:

  • Operations team leader.
  • Administered very large databases: MySQL, Oracle, Sybase, SQL Server.
  • Database Replication and Disaster recovery.
  • Team coaching and development (All databases).
  • Company designated subject matter specialist (MySQL specialist).
  • Cross company organization team member for integrating technologies and systems.
  • Developing monitoring tools and standards.
  • Extensive Data Management, Data Governance and Master Data Management, Data Lead in:
  • Oracle
  • Sybase
  • SQL Server
  • MySQL

We'd love your feedback!