
Senior Big Data Developer And Architect Resume


Dallas, TX

SUMMARY:

  • 12 years of IT (software engineering) experience in application design, development, migration, integration, and maintenance on Hadoop, Spark, and Java platforms in the financial domain
  • 4 years of Big Data Engineering experience in Hadoop & Spark
  • Strong knowledge of Spark architecture, including the Spark Core, Spark SQL, DataFrames, Spark Streaming, Spark MLlib, and Spark GraphX APIs
  • In-depth understanding of Hadoop architecture, including YARN and components such as HDFS, ResourceManager, NodeManager, NameNode, and DataNode, as well as MR v1 & v2 concepts
  • Worked extensively with CDH (Cloudera Distribution Including Apache Hadoop)
  • Experience in installing and configuring Hadoop components like MapReduce, HDFS, HBase, ZooKeeper, Oozie, Hive, Sqoop, Pig, and Flume on the Cloudera distribution platform
  • Worked in Hadoop and Spark system engineering teams to define design and implementation standards
  • Experienced in writing Spark programs/applications in Scala using Spark APIs for data extraction, transformation, and aggregation
  • Expertise in processing large sets of structured and semi-structured data in Spark and Hadoop and storing them in HDFS
  • Experience in converting SQL queries into Spark transformations using Spark RDDs, DataFrames, and Scala, and in performing map-side joins on RDDs (see the sketch following this list)
  • Experienced in Spark SQL and Spark DataFrames using Scala
  • Experience in creating real-time data streaming solutions using Apache Spark Streaming
  • Experience in creating DStreams from sources like Flume and Kafka and performing different Spark transformations and actions on them
  • Experienced with the Spark framework for both batch and real-time data processing
  • Experience in developing Kafka consumer APIs in Spark/Scala applications
  • Experienced in using Sqoop to import and export data between HDFS/Hive and different RDBMS servers like MySQL, Oracle, and Teradata
  • Developed MapReduce programs in Java for data cleansing, data filtering, and data aggregation
  • Expertise in working with Hive: creating tables, distributing data through partitioning and bucketing, and developing, tuning, and optimizing HQL queries
  • Worked on loading data into Hive tables and writing ad hoc Hive queries that run internally on MapReduce and on other execution engines like Spark (Hive on Spark)
  • Experienced in analyzing data using Pig Latin scripts
  • Experienced in designing tables and views for reporting using Impala
  • Experience in designing and developing POCs on Spark clusters and comparing the performance of Spark with Hive, Pig, and MapReduce
  • Performed performance tuning of Spark jobs by adjusting configuration properties and using broadcast variables
  • Experienced in optimizing existing Hadoop algorithms with Spark using SparkContext, Spark SQL, DataFrames, pair RDDs, and YARN
  • Experienced in writing Unix/shell scripts for various functionalities
  • Experienced in automating Sqoop, Hive, Java, MapReduce, shell scripting, etc. using Oozie workflows
  • Worked on platform migration: Mainframe and Teradata system decommissioning, bringing the entire data set to HDFS
  • Experienced with Maven, Jenkins, and continuous build environments
  • Hands-on experience with Amazon Web Services (AWS) components like Amazon EC2 instances and S3 buckets
  • Experienced in working with different file formats: Avro, Parquet, fixed-length, EBCDIC, text, XML, JSON, and CSV
  • Experience with different RDBMS databases: DB2, Oracle, Teradata, MySQL, and Exadata
  • Good understanding of algorithms, data structures, performance optimization techniques and object-oriented programming
  • Experience with web service APIs (REST, SOAP)
  • Worked with different compression techniques like Gzip, LZO, Snappy, and Bzip2
  • Strong knowledge of wealth management platforms and of financial and brokerage firms
  • Good working experience with various SDLC methodologies, using both Agile and Waterfall models
  • Excellent team-building, analytical, interpersonal, and communication skills
  • Ability to work on multiple software systems, quickly learn new technologies, and adapt to new environments; self-motivated team player
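
The SQL-to-Spark conversion and broadcast (map-side) join bullets above can be illustrated with a minimal Scala sketch; the table names, column names, and output path below are hypothetical placeholders, not project code.

```scala
// Sketch: a SQL join + aggregation rewritten as DataFrame transformations,
// broadcasting the smaller table so the join happens map-side on each executor.
// Equivalent SQL:
//   SELECT t.account_id, SUM(t.amount) AS total
//   FROM transactions t JOIN accounts a ON t.account_id = a.account_id
//   WHERE a.region = 'US'
//   GROUP BY t.account_id
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{broadcast, col, sum}

object SqlToDataFrameSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("sql-to-dataframe-sketch")
      .enableHiveSupport()
      .getOrCreate()

    val transactions = spark.table("transactions")                            // large fact table
    val accounts     = spark.table("accounts").filter(col("region") === "US") // small dimension

    val totals = transactions
      .join(broadcast(accounts), Seq("account_id"))   // broadcast hint -> map-side join
      .groupBy("account_id")
      .agg(sum("amount").alias("total"))

    totals.write.mode("overwrite").parquet("/tmp/sketch/account_totals")
    spark.stop()
  }
}
```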

TECHNICAL SKILLS:

Primary skills: Spark Core, RDDs, DataFrames, Spark SQL, Spark Streaming, Hadoop, HDFS, YARN, MapReduce, Pig, Hive, Impala, Sqoop, Oozie, NoSQL, HBase, Java, Scala, Unix/shell scripting

Additional skills: Teradata, Oracle, SQL Server, MySQL, Netezza, DB2, MS Access, Python, Hue, Avro, Parquet, Docker, BlinkDB, Cassandra, MongoDB, Splunk, Kafka, SQL, Unix, Linux, Windows 2007/2000/XP, MS DOS, IBM OS/390 O/S MVS/ESA, z/OS, Autosys, IBM TWS (Tivoli Workload Scheduler), OOP (object-oriented programming), Kerberos, Jenkins, Bash, COBOL, JCL, REXX, CICS, J2EE, AWS, EC2, S3, AMI, Web Services, REST, HTML, XML, SOLA (Service Oriented Legacy Architecture), JavaScript, Stored Procedures, MQ, NDM, sFTP, SVN, Visual SourceSafe (VSS), Git, Eclipse, Teradata SQL Assistant, TOAD, Tectia, CA workload automation iXP, File-aid, DFSORT, Endevor, CA Panvalet, VISIO, SPUFI, INTERTEST, Via-Soft, IBM Debug tool, Xpeditor, PLATINUM, MS Office, Lotus Notes, Easytrieve, File Manager, Abend-aid, MS Visual Basic, OPC & CA-7 Scheduler, IBM RDz (Rational Developer for System z), Mainframe Express (MFE), Spring Tool Suite, Quality Center

PROFESSIONAL EXPERIENCE:

Confidential, Dallas, TX

Senior Big Data Developer and Architect

Roles and Responsibilities:

  • Involved in the complete big data flow of the application, from ingesting upstream data into HDFS through analyzing, processing, and publishing the data for downstream consumers.
  • Created HLD documents for enhancements and new development.
  • Developed Spark RDD transformations, actions, DataFrames, and case classes for the required input data and performed the data transformations using Spark Core.
  • Also used Scala to perform transformations and apply business logic.
  • Used Hive queries in Spark SQL for analyzing and processing the data.
  • Created Hive tables, loaded data, and wrote Hive queries, which invoke and run MapReduce tasks in the background.
  • Implemented partitioning and dynamic partitioning in Hive (see the sketch after this list).
  • Worked with ORC, Parquet, and JSON Hive tables.
  • Imported data into HDFS using Sqoop.
  • Worked with SparkContext, Spark SQL, DataFrames, pair RDDs, and Spark Streaming.
  • Created UDFs for specific functionalities.
  • Developed shell scripts for running Hive scripts.
  • Used IBM Tivoli Workload Scheduler (TWS) for scheduling the jobs.
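
The Hive partitioning work referenced above can be sketched as follows; the table, columns, and dates are hypothetical, and the DDL/DML is shown through Spark SQL only as one possible way to run it.

```scala
// Sketch: create an ORC Hive table, load it with dynamic partitioning,
// and query it with partition pruning. All names and values are placeholders.
import org.apache.spark.sql.SparkSession

object HiveDynamicPartitionSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("hive-dynamic-partition-sketch")
      .enableHiveSupport()
      .getOrCreate()

    spark.sql("SET hive.exec.dynamic.partition=true")
    spark.sql("SET hive.exec.dynamic.partition.mode=nonstrict")

    spark.sql(
      """CREATE TABLE IF NOT EXISTS trades_orc (
        |  trade_id STRING, symbol STRING, quantity INT, price DOUBLE)
        |PARTITIONED BY (trade_dt STRING)
        |STORED AS ORC""".stripMargin)

    // Dynamic partition insert: trade_dt values in the source rows decide the target partitions.
    spark.sql(
      """INSERT OVERWRITE TABLE trades_orc PARTITION (trade_dt)
        |SELECT trade_id, symbol, quantity, price, trade_dt FROM staging_trades""".stripMargin)

    // Downstream query benefits from partition pruning on trade_dt.
    spark.sql(
      """SELECT symbol, SUM(quantity * price) AS notional
        |FROM trades_orc
        |WHERE trade_dt = '2017-06-30'
        |GROUP BY symbol""".stripMargin).show()

    spark.stop()
  }
}
```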

Environment: Hortonworks, HDFS, MapReduce, Hive, Sqoop, core Java, Scala, Spark, Unix shell scripting, ORC, Parquet, JSON, SQL, IBM TWS, Oracle, Unix, Linux, SVN, Eclipse, WinSCP

Confidential, Charlotte, NC

Senior Spark and Hadoop Developer

Roles and Responsibilities:

  • Involved in the complete big data flow of the application, from ingesting upstream data into HDFS through analyzing, processing, and publishing the data for downstream consumers.
  • Developed Spark RDD transformations, actions, DataFrames, and case classes for the required input data and performed the data transformations using Spark Core.
  • Also used Scala to perform transformations and apply business logic.
  • Used Hive queries in Spark SQL for analyzing and processing the data.
  • Created Hive tables, loaded data, and wrote Hive queries, which invoke and run MapReduce tasks in the backend.
  • Implemented partitioning, dynamic partitioning, indexing, and bucketing in Hive.
  • Used custom SerDes in Hive.
  • Worked with Parquet and Avro Hive tables with Snappy compression.
  • Imported and exported data into and from HDFS using Sqoop.
  • Converted Hive/SQL queries into Spark transformations using Spark RDDs and Scala.
  • Worked with SparkContext, Spark SQL, DataFrames, pair RDDs, and Spark Streaming.
  • Involved in designing and developing HBase tables and storing aggregated data from Hive tables.
  • Developed MapReduce programs and Pig and Hive scripts for large-scale data processing.
  • Created UDFs, UDAFs, and UDTFs for specific functionalities to be used across teams.
  • Worked with NDM and sFTP connection set-ups (passwordless SSH using RSA keys) and file transfers over them.
  • Developed shell scripts for running scripts in Hive and Impala.
  • Created Oozie workflows to run Spark, Hive, Pig, Unix shell script, MapReduce, and Java programs.
  • Used Autosys instances for scheduling the Oozie workflows.
  • Worked on platform migration: moving code and data to another big data platform and adjusting the code and properties to the new platform/environment.
  • Worked on Mainframe decommissioning: wrote COBOL-equivalent programs in Java, Scala, Hive, Pig, Spark Core, Hive on Spark SQL, Unix shell scripting, etc.
  • Actively helped other team members understand the Mainframe logic and convert it to Java, Hadoop, Scala, and Spark.
  • Worked on performance tuning of various processes/components in Hive, Pig, MapReduce, RDDs, DataFrames, etc.
  • Developed a Flume ETL job handling data from an HTTP source with HDFS as the sink.
  • Collected the JSON data from the HTTP source and developed Spark APIs that perform inserts and updates in Hive/HBase tables.
  • Developed Kafka consumer APIs in Scala for consuming data from Kafka topics (see the sketch after this list).
  • Created reusable components and handled environment management, code migration, and maintenance using SVN as part of the System Engineering team.
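
The Kafka consumer bullet above can be illustrated with a minimal Scala sketch using the Spark Streaming Kafka direct stream; broker addresses, topic, group id, and landing path are hypothetical.

```scala
// Sketch: direct-stream Kafka consumer that lands each micro-batch on HDFS.
// Broker list, topic, group id, and output path are placeholders.
import org.apache.kafka.common.serialization.StringDeserializer
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}
import org.apache.spark.streaming.kafka010.KafkaUtils
import org.apache.spark.streaming.kafka010.LocationStrategies.PreferConsistent
import org.apache.spark.streaming.kafka010.ConsumerStrategies.Subscribe

object KafkaConsumerSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("kafka-consumer-sketch")
    val ssc  = new StreamingContext(conf, Seconds(30))   // 30-second micro-batches

    val kafkaParams = Map[String, Object](
      "bootstrap.servers"  -> "broker1:9092,broker2:9092",
      "key.deserializer"   -> classOf[StringDeserializer],
      "value.deserializer" -> classOf[StringDeserializer],
      "group.id"           -> "trade-feed-consumer",
      "auto.offset.reset"  -> "latest",
      "enable.auto.commit" -> (false: java.lang.Boolean))

    val stream = KafkaUtils.createDirectStream[String, String](
      ssc, PreferConsistent, Subscribe[String, String](Seq("trade-events"), kafkaParams))

    // Keep only the record values and persist each non-empty batch to HDFS.
    stream.map(_.value).foreachRDD { (rdd, batchTime) =>
      if (!rdd.isEmpty())
        rdd.saveAsTextFile(s"/data/landing/trade_events/${batchTime.milliseconds}")
    }

    ssc.start()
    ssc.awaitTermination()
  }
}
```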

Environment: Cloudera CDH 5.x, HDFS, MapReduce, Pig, Hive, Impala, Sqoop, Oozie, core Java, Scala, Spark, Unix shell scripting, Python, Hue, Avro, Parquet, Docker, SQL, Autosys, COBOL, JCL, PROC, Teradata, DB2, VSAM, SQL Server, MySQL, Netezza, Unix, Linux, Windows 2007/2000/XP, MS DOS, IBM OS/390 O/S MVS/ESA, z/OS, SVN, Eclipse, Teradata SQL Assistant, TOAD, Tectia, CA workload automation iXP

Confidential, Malvern, PA

Hadoop Developer

Roles and Responsibilities:

  • Extracted data into and updated data in HDFS using Sqoop import and export, including for historical data.
  • Created Hive tables, loaded data, and wrote Hive queries.
  • Created Avro schemas for Hive Avro tables and worked with Hive Parquet tables.
  • Used Hive to analyze the partitioned and bucketed data and compute various metrics for reporting (see the sketch after this list).
  • Created Pig scripts to do transformations, joins, and some pre-aggregations before storing the data in HDFS.
  • Created MapReduce programs for various needs.
  • Exported processed data from Hadoop to relational databases and external file systems using Sqoop.
  • Proficient in AWS services like EC2, S3, RDS, IAM, and CloudFormation.
  • Possess good knowledge of creating and launching EC2 instances using AMIs of Linux, RHEL, and Windows, and wrote shell scripts to bootstrap instances.
  • Worked on a migration project moving existing applications from traditional data centers to AWS.
  • Worked with Impala to support the portal team and users in querying data from HDFS.
  • Orchestrated many Sqoop scripts, Pig scripts, Hive queries, and MapReduce programs using Oozie workflows and sub-workflows.
  • Converted some modules written in Java, COBOL, etc. into Pig, Hive, and MapReduce.
  • Monitored workload, job performance, and capacity planning using Cloudera Manager.
  • Good knowledge of YARN and its architecture.
  • Ingested the data from external and internal flow organizations.
  • Participated in daily scrum meetings and iterative development.
  • Experienced with IDEs like Eclipse.
  • Ability to perform at a high level, meet deadlines, and adapt to ever-changing priorities.
  • Ensured that all standards and best practices were followed in all code.
  • Prepared JUnit test cases for test-driven development.
  • Used Maven dependency management for continuous integration.
  • Involved in the entire project development life cycle, following Agile methodology.
  • Generated statistical reports for the users and management.
  • Documented applications in the system and updated the documentation upon enhancements.
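
The Hive reporting analysis described above can be sketched in Scala through the HiveServer2 JDBC driver; the JDBC URL, credentials, table, and column names are hypothetical placeholders.

```scala
// Sketch: reporting metrics over a partitioned, bucketed Hive table via Hive JDBC.
// URL, credentials, table, and columns are placeholders.
import java.sql.DriverManager

object HiveMetricSketch {
  def main(args: Array[String]): Unit = {
    Class.forName("org.apache.hive.jdbc.HiveDriver")
    val conn = DriverManager.getConnection("jdbc:hive2://hive-gateway:10000/default", "etl_user", "")
    try {
      val stmt = conn.createStatement()
      // load_dt is the partition column (pruned by the WHERE clause);
      // account_id is the bucketing column on the underlying table.
      val rs = stmt.executeQuery(
        """SELECT region,
          |       COUNT(DISTINCT account_id) AS active_accounts,
          |       SUM(balance)               AS total_balance
          |FROM account_snapshot
          |WHERE load_dt = '2015-03-31'
          |GROUP BY region""".stripMargin)
      while (rs.next())
        println(s"${rs.getString("region")}\t${rs.getLong("active_accounts")}\t${rs.getDouble("total_balance")}")
    } finally {
      conn.close()
    }
  }
}
```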

Environment: HDFS, MapReduce, Pig, Hive, Impala, Sqoop, Oozie, core Java, Unix shell scripting, Avro, Parquet, SQL, Autosys, SQL Server, Oracle, AWS, S3, EC2, SVN, Eclipse, Teradata SQL Assistant, TOAD, Tectia, CA workload automation iXP, NDM, sFTP, Unix, Linux, Windows 2007/2000/XP

Confidential, Pennington, NJ

Java Developer

Roles and Responsibilities:

  • Project management activities, including timelines, cost, change requests, status gathering from all impacted teams, etc.
  • Monitoring a team of 10 and assigning work
  • Requirement gathering directly from business partners and users
  • Impact analysis, effort estimation, timeline calculation, and design of the Java and Mainframe applications for the project
  • BRS, SRS, HLD, LLD, and SAD document creation for the Java and Mainframe applications of the project
  • Development of programs for the Mainframe application
  • Performed coding, unit testing, system testing, and intersystem testing thoroughly
  • POC for a Hadoop application using HDFS and related ecosystem components like Pig, Hive, etc.
  • Supported user acceptance testing and created test cases for the users
  • Implementation using a release plan and post-production support
  • Code version maintenance of Mainframe elements using Endevor, Java code using SVN, and .NET code using Visual SourceSafe (VSS)
  • Autowired Java objects using Spring dependency injection
  • Prepared JUnit test cases for test-driven development
  • Configured the Spring framework in the application
  • Used Maven dependency management for continuous integration
  • Involved in the entire project development life cycle / Software Development Life Cycle (SDLC)
  • Delegated and coordinated work with the offshore team
  • Prepared meeting minutes and followed up on issues raised during the meetings
  • Prepared run charts and operation instructions to schedule the jobs, and used the OPC scheduler for testing in lower environments
  • Created process flow diagrams of the applications
  • Generated statistical reports for the users and management
  • Documented applications in the system and updated the documentation upon enhancements

Environment: Java, Spring 3.0, Hibernate 3.0, COBOL, JCL, PROC, EZYTRIEVE, REXX, CICS, SQL, HTML, XML, SOLA (Service Oriented Legacy Architecture), Stored Procedures, Quality Center, VSAM, TSO/ISPF, MQ, web services, BMC, HDFS, Pig, Hive, Autosys, File-aid, DFSORT, Endevor, VISIO, SPUFI, INTERTEST, IBM Debug tool, Xpeditor, NDM, PLATINUM, MS Office, Easytrieve, File Manager, Abend-aid, MS Visual Basic, OPC Scheduler, IBM RDz, Eclipse, DB2, SQL Server, MS Access, IBM OS/390 O/S MVS/ESA, z/OS, Windows 2007/2000/XP, MS DOS, Unix

Confidential, Pennington, NJ

Software Developer

Roles and Responsibilities:

  • Delegation of work to and coordination with the offshore team when onshore
  • Coordination with the onshore and offshore teams when offshore
  • Monitoring a team of 6 and assigning work
  • Monitoring and resolving abends in the production batch job runs
  • Impact analysis and design of the Mainframe application for the project
  • HLD, LLD, and SAD document creation for the Mainframe application of the project
  • Development of programs
  • Performed coding, unit testing, system testing, and intersystem testing thoroughly
  • Supported user acceptance testing and created test cases for the users
  • Implementation using a release plan and post-production support
  • Code version maintenance using Endevor
  • Prepared run charts and operation instructions to schedule the jobs
  • Tested all functionality in the QA-Plex region (similar to the production environment) using the OPC scheduler for batch jobs
  • Analyzed programs/jobs for enhancements and modifications toward permanent resolutions of issues raised through tickets and user requests
  • Created process flow diagrams of the applications
  • Generated statistical reports for the users and management
  • Documented applications in the system and updated the documentation upon enhancements

Environment: DB2, VSAM, MS Access, COBOL, JCL, PROC, EZYTRIEVE, REXX, CICS, SQL, SOLA (Service Oriented Legacy Architecture), Java, Stored Procedures, Quality Center, TSO/ISPF, MQ, BMC, File-aid, DFSORT, Endevor, CA Panvalet, VISIO, SPUFI, INTERTEST, Via-Soft, Xpeditor, NDM, PLATINUM, MS Office, Easytrieve, File Manager, Abend-aid, OPC & CA-7 Scheduler, IBM OS/390 O/S MVS/ESA, z/OS, Windows XP
