We provide IT Staff Augmentation Services!

Principal Data Architect Resume

5.00/5 (Submit Your Rating)

Suwanee, GA

SUMMARY:

  • Strong hands - on skills in developing parallel distributed data processing and analytics applications in Java and Python, hosted on Amazon Web Services S3 EC2 and EMR clusters using PySpark,Hadoop, MapReduce, Hive, Cassandra, Hbase, Spark, MongoDB.
  • Developed Map Reduce jobs for deployment on large clusters of 300 nodes, configured and managed hadoop clusters in production.
  • Self-starter with strong written and oral communication skills. Demonstrated success as a team player possessing the commitment and discipline to work well independently.

TECHNICAL SKILLS

Operating System: Ubuntu, Red Hat Linux, Windows.

Cloud Infrastructure Environment: Amazon Web Services EC2 EMR & S3 instances, IAM.

NoSQL databases: Cassandra, MongoDB, ElasticSearch

Language/Software: Python,Java, Struts, Hibernate, ANT, Maven, JUnit, JDBC, Java Server Faces, Ruby on Rails,GWT, C++, C, Shell scripting, PHP, XML, XSLT, JavaScript, JQuery, DHTML, PL/SQL, Stored Procedures, XML Beans, MXML, Flex Framework, AJAX, Axis, LUCENE, SOLR,DROOLS, jBPM, ILOG JRules, FileNet, ERLANG,Oracle ADF

Web/Application Servers: Apache Tomcat, JBoss, Confidential Web Sphere 5 and Bea Web Logic 8.1, LDAP iPlanet server, MTS, IIS, WebSphere Portal Server, LifeRay Portal Server, DotNetNuke, Escenic CMS

Tools: PyCharm,Eclipse, Eclipse RCP, WebSphere Studio Application Developer, PL/SQL Developer

Distributed/Parallel Processing: Cloudera Hadoop,Mahout,HDFS,Cassandra,Hive

Statistical Modeling: XLStat, R, SAS

Cloud Virtualization Frameworks: OpenNebula,VMWare vCloud,OpenStack,Chef,Puppet,Docker

Source Control: Git, Confidential

PROFESSIONAL EXPERIENCE:

Confidential, Suwanee, GA

Principal data architect

Responsibilities:

  • Working as part of implementation team of Python based data scraping, archival storage and indexing of FAA data.
  • Developed automated scalable data ingestion framework using Python (iPOPO, NLTK, Scrapy, Beautiful soup, PySpark), Kafka, Docker, MongoDB, ElasticSearch.
  • Used Git, Jira and Amazon web services cloud environment to spin EC2 instances and scale docker swarm clusters.

Confidential, Suwanee, GA

Data Architect-Big Data (consultant)

Responsibilities:

  • Working as part of implementation team of Big data, Hadoop MapReduce ETL project market place transactions
  • Responsible for porting the ETL process from in-house cloudera hadoop environment to Amazon Cloud EC2 & EMR Cluster environment, instance setup and data load in Amazon NoSQL distribution ( DynamoDB ),Cassandra
  • Developed ETL pipeline for real time analytics using Apache Spark, Storm, Kafka, Zookeeper, and Cassandra with production deployment on AWS in Java & Scala.
  • Developed custom Cassandra ( DataStax Enterprise Edition ) applications to run Spark nodes for Machine learning and sentiment analysis modeling using Mahout and R libraries
  • Created Archival Storage & Indexing previous questions & answers for Full Text Search utilizing features like Tokenization, Stemming, Filters, Analyzers, Fuzzy Matching.

Environment: Java, Scala, Python (NLTK, Scrapy, Beautiful soup, PySpark), Ruby on Rails, Amazon web Services EC2, EMR & S3, AWS Beanstalk, Amazon NoSQL DynamoDB, MongoDB, ElasticSearch, JSON, Java, Hadoop Map Reduce, Hive, Cassandra, Zookeeper, Apache Spark, Apache Storm, Kafka, Docker

Confidential, Smyrna, GA

Architect-Big Data

Responsibilities:

  • Leading implementation team of Big data, Hadoop MapReduce project for revenue optimizer and pricing mark down engine, business analytics using R, Java, Apache Hadoop, Cassandra, Hive HQL and predictive analytical modeling.
  • Developed custom Cassandra applications for real time retail data segmentation and price optimization algorithms.
  • Responsible and accountable for the coordinated management of multiple clients related analytics projects directed toward strategic business and other organizational objectives.
  • Build credibility, establish rapport, and maintain communication with stakeholders at multiple levels.
  • Maintain continuous alignment of program scope with strategic business objectives, and make recommendations to modify the program to enhance effectiveness toward the business result or strategic intent.
  • Configured and managed hadoop clusters using scalable open source cloud frameworks for optimal ultilization of available nodes.
  • Coach, mentor and lead personnel within a technical team environment.

Environment: Java, Python, Erlang, Ruby on rails, Cloudera Hadoop, Map Reduce, Cassandra, Hive, SAS, R, MongoDB, OpenStack, Amazon web Services, Chef, Puppet, Nagios, Storm, Zookeeper

Confidential, Peachtree, GA

End-to-End Architect-Mobile Gaming API Platform

Responsibilities:

  • Drive Solution Architectures for high performance and very highly scalable platforms.
  • Collaborate with key Telecommunication Architecture on definition of delivery of solution.
  • Developed and deployed mapreduce jobs in 6 hadoop clusters across 3 data centers. Total data nodes in hadoop cluster: ~133. Total capacity in terabytes: ~750TB. Total distinct business jobs running on these clusters: ~ 125. Total hadoop map-reduce jobs running daily across all hadoop clusters: ~1750.Configured Clusters on top of scalable OpenStack cloud infrastructure.
  • Develop RESTful Java Web services using JAX-RS, EJB3, JPA Hibernate
  • iOS and Android utility apps and games using Java Android SDK, HTML5, PhoneGap and native gaming engine Unity to call Confidential ’s RESTful API’s such as speech, Tropo, Phono etc.
  • Participate in Joint Application Design/Requirements sessions
  • Documentation: Architectural solutions to all phases of delivery including:-Initial end-to-end solution (based on Marketing Business Requirements), High Level Architecture Design, Content and Diagrams: -Activity -Context -Use Cases -Sequence -Deployment -Logical Architecture -Physical Architecture -Architectural budgets and costs based on hardware, software, licenses, maintenance, and development resources.

Environment: Java,Oracle ADF,Hadoop.Map Reduce,Lucene,Solr,JAX-RS, EJB3, JPA, Hibernate, iOS, TCP/IP, HTTP, SOAP, REST, XCAP, XDMS, XML, XSD, Cloud computing, Eclipse, Enterprise Architect, HTML5, PhoneGap,OpenStack, Chef,Puppet,Nagios

Confidential, Suwanee, GA

Game Designer,Infrastrcure Lead Architect/Team Lead

Responsibilities:

  • Confidential provides adults with lighthearted, engaging social media games with socially responsible messages. Game player base is monetized by placing advertisements as part of the storylines as well as selling virtual goods.
  • Provisioned and managed Amazon Cloud servers for initial launch.
  • Setup private cloud infrastructure using OpenStack to provide game developers a scalable, elastic shared data center to host gaming applications with optimal and cost effective hosting environment.
  • Designed and developed Custom Rule Based Gaming Engine portal using Liferay portal framework, Java Server Faces, Spring and Hibernate.
  • Developed custom pluggable architecture for in-game advertisement display and payment portal to accept credit and debit card transactions as well as Confidential credits.
  • Designed predictive models using in-game player activity, Google analytics and Confidential insight data, Confidential fan pages and twitter followership to maximize key performance indicators for brands and achieve monetization in social media space.

Environment: Windows, AIX, PHP, core Java 5, JSF,Lucene,Solr,Mahout. LifeRay, Javascript, JQuery, Maven, Tomcat, Spring, Hibernate, XML, Confidential Graph API,Mahout,Amazon Web Services EC2,OpenStack,Chef,Puppet,Nagios,Apache Hadoop MapReduce

Confidential, NJ

Lead Architect/Team Lead

Responsibilities:

  • Led the technical architecture solution definition to automate ETL process of members (insured) and client (heath insurance companies) data provided to catalyst by client health insurance companies.
  • Designed Web application architecture using JSF2, Spring MVC, Spring batch and Hibernate.
  • Designed multi layered SOA enabled enterprise grade application using design patterns.
  • Delivered all the design documents including UML diagrams, ERD & High Level & Low Level Application architecture.
  • Developed Maven build script for automated build and deployment process.

Environment: Windows 2000,UNIX, core Java 5, Javascript, JQuery, Maven, WebSphere, Weblogic, Tomcat, JBoss, Spring, Hibernate, Terracota, XML, JAXB, Xerces

Confidential, CA

Lead Architect/Team Lead

Responsibilities:

  • Led the design and development of business rules management system for iPractice at Confidential and Confidential at Confidential .
  • Used the rules to frame business object model for Confidential ’s ILOG BRE system and implement custom rule application in Java, J2EE exposed as web service for iPractice to help decide the amount of medicine samples for a logged-in HCP user.
  • Developed rules application to develop web services for Credit Application Adjudication system

Environment: Windows 2000,UNIX, core Java 5, JavaScript, JQuery, Maven, WebSphere, Tomcat, JBoss, Spring, ILOG JRules, jBPM, DROOLS, XML, JAXB, Xerces,Mule ESB

We'd love your feedback!