Sr. Big Data Engineer Resume
San Jose, CA
SUMMARY:
- 10+ years of experience supporting EDW, RDBMS, ETL and Analytical platforms.
- 4+ years of experience in Hadoop Ecosystem providing solutions for Big Data Applications.
PROFESSIONAL EXPERIENCE
Sr. Big Data Engineer
Confidential, San Jose, CA
Responsibilities:
- Implement Big Data Solutions for different customers with Hadoop distributions like Cloudera, Hortonworks, Pivotal and MapR.
- Proficient with entire Hadoop Ecosystem. Hands on experience with Hadoop Hive.
- Experience with Test Driven Development.
- Familiar with NoSQL databases like Cassandra, HBase and MongoDB.
- Experience with different Security solutions, Access Control Tools, Data Encryption, Tokenization, Masking, Obfuscation, Redaction etc.
Big Data Engineer
Confidential, San Jose, CA
Responsibilities:
- Participated in building a large production - level Hadoop Cluster that can store up to 2PB of data with High Availability.
- Installed Cloudera CDH 5.3 Enterprise edition with YARN, Spark and Impala.
- Solid experience in Linux Administration using RHEL, Centos, SLES and Ubuntu.
- Assisted in creation of ETL processes for transformation of data sources from existing data processing systems.
- Developed Hadoop test framework. Performed functional and regression testing.
- Managed and reviewed Hadoop log files.
- Core Java Development work for analyzing text files, xml files, Unit testing and JDBC connectivity to MySQL and other relational databases.
- Participated in assessing business rules, collaborate with stakeholders and perform source-to-target data mapping, design and review.
- Performed industry standard Performance, Benchmarking and Reliability tests.
- Bash shell scripting, awk and sed.
- Evaluating new technologies in NoSQL databases, Scala, Cloud computing, R, BI tools and open source tools.
Senior Hadoop Engineer
Confidential, San Jose, CA
Responsibilities:
- Participated in coding of application programs with Java for mapping business rules.
- These included direct moves, some simple and also complex transformations.
- Knowledge of collecting logs and storing them in HDFS in Hadoop Cluster.
- Integrated Hadoop with existing Enterprise Data Warehouse system.
- Assisted in designing and development of ETL procedures as per business requirements.
- Created reports for the BI team using Sqoop to export data into HDFS and Hive. Basic Knowledge of Tableau software.
Hadoop Developer/Administrator
Confidential, San Jose, CA
Responsibilities:
- Architected, maintained and monitored large multimode Hadoop Clusters for performance and scalability with best practices (several TB of data).
- Gathered Big Data requirements (both Infrastructure and Development).
- Participated in developing Java programs for processing structured and semi-structured data.
- Installed Cloudera tools, Ganglia and Nagios tools for production environment.
- Knowledge of security tools like Trend Micro, Nessus and Kerberos.
- Integrated Kerberos with Enterprise AD (Active Directory) for user authentication.
- Participated in development and execution of system and disaster recovery.
- Formulated procedures for installation of Hadoop patches, updates and version upgrades.
- Automated processes for troubleshooting, resolution and tuning of Hadoop clusters.
- Worked in 24x7 on call rotation production support environment. Experience with BMC remedy ticket creation for service/change requests resolving and closing.
Senior Data Analysis Engineer
Confidential, San Jose, CA
Responsibilities:
- Statistical Analysis with ‘R’ and Matlab for analyzing big chunks of data. Generated specifications based on 3-Sigma measurements. Performed several types of statistical distributions, Variance and Standard deviation calculations to facilitate yield improvement analysis from millions of datalog files.
- Knowledge of algorithms and machine learning.
- SQL and RDBMS experience.
Senior Systems Engineer
Confidential, San Jose, CA
Responsibilities:
- Evaluated different technologies for modernization of existing processes.
- Used Cloudera Enterprise edition CDH 4 using Cloudera Manager.
- Supported technical team members for automation, installation & configuration tasks.
- Suggested improvement processes for all process automation scripts and tasks.
- RedHat Enterprise Linux Satellite server installation for OS maintenance, upgrades, patches and administrative activities. Familiar with Centos, yum utilities etc.,
- Set-up RAID for critical systems, Logical Volume Management
- Exposure to Amazon S3, EC2 and EMR for big data analysis. Installed Hadoop cluster both manually and using Apache Whirr.
Senior Software Engineer
Confidential, San Jose, CA
Responsibilities:
- Maven for builds and deployment, Git for version control and Artifactory for repository.
- Working knowledge of Switches and Networking equipment for large production clusters. TCP/IP, HTTP, DNS and sub-netting.
- MySQL database development and administration at Enterprise level.
- Participated in High performance Server design, testing and system validation. Customers included DELL, HP and IBM. Also, implemented some low power solutions where power consumption was a criterion.
- Extensive engineering knowledge of various systems like desktops, laptops and portable computing systems.
- Software EDA tools development.
- Served as an Applications Engineering role providing full technical support to Field Engineers and technical sales teams.
TECHNICAL SKILLS
Databases: Oracle, MySQL, ACCESS, HBASE, Cassandra, MongoDB.
Eclipse: IDE, Python IDLE, Cloud Computing, Git, Hortonworks and MapR.
M.S Office tools: Word, Excel, Powerpoint, Visio, Sharepoint and Project.