Big Data Lead Engineer/Architect Resume

Reston, VA

SUMMARY:

  • 25+ years of hands-on experience; author of database reference books.
  • 5+ years of experience as a Big Data Hadoop ecosystem architect (Kafka, Hive, Sqoop, Pig, DataStage, ELK stack, etc.), covering Hadoop component integration, project initiation, and project execution.
  • Well-rounded experience in Big Data, Hadoop, HBase, Impala, HDP, Ambari, and the ecosystem on HDP.
  • Designed highly scalable architectures using Spark/Scala, MongoDB, Cassandra, and extensible APIs.
  • Experience with multiple NoSQL databases. Worked hands-on migrating commercial databases to NoSQL databases such as Cassandra and MongoDB. Wrote multiple migration scripts to handle one-off cases. Used cloud and open-source tools for migration tasks.
  • Over 17 years of enterprise infrastructure, design, and development operations implementation and support. Database administration and technology lead experience.
  • Chief Information and Chief Data Lead
  • A technology leader in open-source database technology adoption, with particular reference to DaaS and IaaS on cloud.
  • A mentor and in-house trainer, excellent communicator, and team player.
  • Database migrations from private cloud to public clouds such as AWS: creating AWS EC2 instances suitable for Oracle/MySQL and migrating databases from private cloud to AWS. Setting up Oracle Data Guard and optimizing the total grid are part of this function.
  • Enterprise architecture (TOGAF).
  • Wide experience in application development, QA, and the SDLC, including the Scrum development methodology.
  • Hands-on experience in multiple components of enterprise IT architecture, including cloud computing, virtualization, network management, application design and development, and interdisciplinary teamwork.
  • Involved in Java and Python programming, and scripting languages like Perl, PHP and Pig Latin.
  • Worked on a wide variety of financial, government, and commercial applications.
  • Presented many technical papers at international conferences.
  • Public Trust Full BI security cleared. TS screened.
  • Co-authored two database administration reference books, still sold on Amazon.

PROFESSIONAL EXPERIENCE:

Big Data Lead Engineer/Architect

Confidential, Reston, VA

Responsibilities:
  • Hands-on lead for Big Data engineering and Big Data architecture: application design, solution definition, and implementation using Scala on Spark. Other components used include, but are not limited to, Kafka, Flume, Sqoop, and data streaming. The applications are high-volume, low-latency, and response-sensitive, with demanding customer-experience requirements. Provide leadership and work in a team spirit. Studying new technologies, innovating, and presenting recommendations are part of the job.
  • More than 25 years of experience in data governance, data management, MDM, and data security help me design applications and direct application development with cost optimization and customer experience as high priorities. According to Gartner research, the next decade of applications will demand customer experience; my focus is likewise centered on customer experience.

Confidential, Reston, VA

Senior Oracle DBA

Responsibilities:
  • Working on an IBM contract at Confidential in Reston, VA. Regular duties include every aspect of managing thousands of Oracle databases, including but not limited to upgrading, patching, writing PL/SQL triggers and stored procedures, optimizing the procedures, and providing day-to-day support. Automating operations with shell, Perl, and PHP scripts, with cost optimization as the goal.
  • Database design, database systems design, and architecture of the full SDLC-compliant database infrastructure.
  • Define AWS EC2 and EMR infrastructure; define networking (VPN, VPC, Route 53); migrate on-premises databases and applications to public cloud; define public cloud security policies.
  • As Big Data Architect, defined, designed, and architected big data implementations including:
  • A log aggregation and log analytics system in which heterogeneous log data is accumulated following a data lake concept. Log data is structured and synced with a cloud repository using either Kinesis or Kafka, depending on the nature of the application. Data is finally stored at rest in NoSQL databases such as DynamoDB, HBase, or Cassandra.
  • Scala or Python is used to perform application logic before saving data at rest.
  • A POC was performed to evaluate Microsoft Azure, covering ETL operations and data science operations to determine data relations, coefficient analysis, and classifications.
  • Additional Big Data projects included traditional RDBMS migration to Hadoop using many Hadoop ecosystem components, in particular Flume, Sqoop, and Hive.
  • Setting up and configuring Ambari and an Ambari-based Hadoop processing system. Capacity planning, system monitoring, cost optimization, and multi-threading analytic engines on HDP.

Confidential, FL

Spark/Cassandra/MongoDB Architect

Responsibilities:
  • As Spark/Cassandra/MongoDB architect, I played an important role in enhancing sales enablement. The use case is harnessing multiple data sources to find hidden sales gaps, score the opportunities, develop machine learning tools to continuously enhance the tool, and provide unconditional security, high availability, and scalability at the lowest TCO and highest ROI. The development environment was Scrum-based, using Python and Scala. Additional Hadoop ecosystem components such as Flume, Hive, Sqoop, and ligui are used for data ingest. Kafka is used for data cleansing, queuing, and programmatic control. YARN is used for resource management.
  • The complete product is being deployed in Microsoft Azure, partly on MS HDInsight and partly on a custom-managed platform. Wide experience in PaaS and IaaS. This particular application is also an AaaS. MS Azure analytical platforms and MS Azure Data Science/ML library packages for machine learning are used extensively. Extensive use of Python, R, and JSON in a machine learning context is part of this project.

Confidential

Senior Oracle DBA

Responsibilities:
  • Provided full-scale, round-the-clock level one and level Oracle Database Administration support, including database upgrades, patching, and on-call support.
  • Extensive scripting using shell, Perl, and PHP. Automated major DBA activities to reduce cost and optimize total cost of ownership (TCO).
  • Independently presented case studies on Hadoop to an independent database user group that I manage.