Big Data Architect Resume
Dallas, TX
SUMMARY:
- Hands - on software architect with comprehensive experience in enterprise integration, high-performance parallel and distributed applications, large-scale data warehousing and processing, relational and NoSQL database modeling and design, modularization technologies
- Excel at enterprise application design and development, including ground-up planning and implementation of core functional and nonfunctional requirements. Possess strong analytical and problem-solving skills, strong communication skills.
- Innovator of technology solutions to drive architecture of the next-generation applications and bottom-line gains.
TECHNICAL SKILLS:
Architecture and Design methodologies and frameworks: The twelve-factor methodology, Cloud Architecture, Microservices, SOA, OO Design, DDD, BDD, TDD, Functional Design, UML, Design Patterns, REST, n-tier, component-based, event-driven, TOGAF.
Languages: Java, Python, R, JavaScript, SQL, PL/SQL, XML, JSON, YAML, Groovy, C++, Bash/Korn/C Shell
Big Data: Kafka, Nifi, Hadoop, Yarn, Spark, Storm, HDFS, Hive, HBASE, Oozie, Flume, Sqoop, Cloudera Manager, Ambari, Ranger, Knox, Elasticsearch, Logstash, Kibana, Grafana
Java/JEE: Microservices, JPA, EJB, JDBC, JMS, JTA, JNDI, JMX, Servlet, JSP, JSF, Web Services (SOAP/REST), RMI, JAXB, collections, NIO, reflection, multi threading, sockets, JVM monitoring and tuning
Frameworks/Libraries: Spring, Hibernate, Jakarta Commons, JUnit, Kafka Streams, Flusk, ZeroMQ, Camel
Application Servers/Middleware/Technology: Kubernetes, Service Mix, OSGI, Mule ESB, Tomcat, Jetty, AciveMQ, Memcached, Ehcache, Oracle Coherence, LDAP, PKE, XACML, OATH, Kerberos, Docker, Docker Compose, Docker Swarm, Docker Registry
Relational Databases: Oracle, MS SQL Server, MySQL, PostgreSQL, Derby, Redis
Agile and Development methodologies: Scrum, Kanban, Extreme Programming, Water Fall, Test Driven Development, Data Driven Development
PROFESSIONAL EXPERIENCE:
Big Data Architect
Confidential, Dallas, TX
Responsibilities:
- Led vetting effort of Kafka’s new versions, developed number of POCs and examples. Prepared guidelines for developers for Kafka, Kafka Connect, Kafka Streams. Produce internal AT&T DevOp documentation and packages for Kafka distributions (Apache Kafka, Confluent Kafka). Guided and advised architecture effort for Kafka implementation for about 20 projects.
- Facilitated implementation of Ranger for kerberized clusters. Developed Ranger’s POCs for HDFS, HBase, Hive, Kafka, Storm, and Knox. Together with Hortonworks fixed and provided workarounds for all problem discovered during working on POCs. Prepared documentation for Hadoop administrators for installation of Ranger inside AT&T. Prepare documentation for developers.
- Explored Knox implementation and integration with AT&T single sign-on. Conducted a number of POCs and wrote guidelines for the process of mapping of SSL REST client to Kerberos TEMPprincipal and Kerberos TEMPprincipal to Hadoop user. Researched processes of impersonation and token delegation for Knox in Hadoop cluster. Developed number of topologies for WebHDFS and WebHCat. Proposed common approach for development and management of Knox topologies inside AT&T.
- Examined Storm implementation as part of kerberized Hadoop clusters. Together with Hortonworks fixed and provided workarounds for the problem discovered during working on POCs. Proposed numbers of solutions and fixes to Hortonworks. Developed examples for Storm implementation.
- Conducted Capacity Planning and Performance Tuning for Kafka, Spark, Storm, Elasticsearch, and Logstash. Wrote a number of AT&T Wiki articles related to capacity planning and performance tuning of Kafka, Storm, and Spark. Provided advice and examples which were implemented and drastically improved performance.
Senior Computer System Analyst
Confidential, Plano, TX
Responsibilities:
- Created architecture and implemented proof of concept (Python) of ingestion pipeline for data collected by probes from core telecoms network using Apache Spark, Kafka, HDFS, Hive.
- Productized outliers detection (data mining and computer learning) algorithms for cluster environment with Apache Spark, Spark ML, Spark SQL, HDFS, Parquet.
- Optimized data model for effective partitioning and fast parallel processing on Apache Spark cluster.
- Architected and implemented complete deployment infrastructure (Zookeeper, Kafka, Elasticsearch, Jetty, set of Confidential applications) for exploration of data produced by probes from core telecoms network using Docker, Docker Compose, and Docker Swarm.
Software Architect SME
Confidential, Dallas, TX
Responsibilities:
- Created architecture and design of eBiz Data Access layer, API layer, Service Mix and OSGi’ part of Messaging framework, Logging framework, Configuration framework, Service Mix and OSGi module for Caching framework (base on Coherence).
- Participated in code reviews and problem resolution sessions. Took part in POCs preparation.
- Wrote skeletons for final product.
Software Architect
Confidential
Responsibilities:
- Created architecture and design of One Call Resolution (OCR) Customer Support Enterprise application (Service Mix, Fuse, Camel, CXF, ActiveMQ, REST, SOAP, Redis, Hadoop HDFS, Mule ESB)
- Designed and implemented integration of OCR with Single Sign-On infrastructure (ACL base access to activities and resources integrated with LDAP and Directory Services)
Senior Software Consultant
Confidential
Responsibilities:
- Architected and prepared POC of client site applications integration container, server side applications integration container, container message bus, container history manager, container navigation support, container-applications communication mechanisms and protocols (GWT, Service Mix, OSGI, Vaadin)
- Designed, prepared requirements and POC of GWT and Vaadin based Address Book, E-mail, Calendar and Video on Demand Web applications
