We provide IT Staff Augmentation Services!

Senior Big Data Engineer Resume

2.00/5 (Submit Your Rating)

Palo, AltO

SUMMARY:

  • Confidential is a seasoned Big Data Engineer from ETL to analytics to visualization.
  • His focus is Tableau, Python, SQL, Scala, Spark and R.
  • His mantra is Work Hard, Work Smart and Be Honest. He believes in compassion, mutual respect, continuous daily improvements, laser focus and be persistent never ever give up He is a dog person and he has a rat terrier

TECHNICAL SKILLS:

BIG DATA . MACHINE LEARNING . ETL . ANALYTICS . VISUALIZATIONTableau. Python . R . Spark . Scala . BigQuery . Hive . Sql . Hadoop

PROFESSIONAL EXPERIENCE:

Confidential, Palo Alto

Senior Big Data Engineer

Responsibilities:

  • Implemented Data Matching (or Record Linkage) using an ensemble of deterministic, fuzzy and machine learning algorithms for millions/billions of rows of data.

Tools: I used: Python/Anaconda scikit - learn, Oracle Databae 11g, Oracle SQL Developer, MariaDB, performance profiling, code optimization, integration with Ruby on Rails.

Confidential, San Francisco

Senior Big Data Engineer

Responsibilities:

  • Data acquisition from REST API / json; data wrangling with iPython and unix tools; segment and organize data from disparate sources and data loading to Google BigQuery
  • Architect, build and launch efficient and reliable data pipelines. Automated ETL. Minimum human supervision.
  • Develop analytics in Google BigQuery to enable other teams to consume and understand data faster.

Tools: Google BigQuery/DataFlow/Analytics, Google Compute Engine, Anaconda, iPython, Python, Bash Scripting, Tableau, Swagger.io API-centric data collections and publishing, Gigya (Customer Identity Management), PostgreSQL, Open Dining, HockeyApp, Atlassian JIRA and Confluence, Unix tools (grep, awk, sed, cut, sort, join, wc, find, uniq, jq, curl).

Confidential

Lead Tableau Developer

Responsibilities:

  • Image extraction from video frames every second.
  • Use TensorFlow's Inception to associate the images with 1,000 classifications
  • Find the best matching video clip of these images based on cosine similiary matrix
  • Optionally mash-up/interleave the best matched video clip with the user-provided clip
  • Business Use Case: enhance user-provided clips with mash-up; more targeted advertisements within the video

Confidential

Lead Tableau Developer

Responsibilities:

  • Segment and organize data from disparate sources (Oracle BlueKai, Experian, Datalogix)
  • Exploratary Data Analysis and Data wrangling with R and Python.
  • Implemented business use case in Hadoop/Hive and visualized in Tableau
  • Aggregated campaign measurements from impressions to store purchases by combining Oracle BlueKai and Datalogix data.
  • Machine Learning algorithms implemented: Clustering on consumer segments using k-means; Built predictive models on Classification using demographic and behavioral data on new and existing customers; Product recommendations using Collaborative Filter.
  • Agile and iterative development; Work directly with key business users: statisticians and data scientists.
  • Exceed expectation by delivering first business use case in Big Data Marketing Analytics to Clorox.

Tools: I used: Hive, Impala, R, Python, Tableau, Tableau Javascript API, SparkSQL, Oracle.

Confidential, San Francisco

Lead Tableau Developer

Responsibilities:

  • Built predictive models on LTV with R using Regression
  • Visualize in Tableau on LTV for various customer dimensions: by country, device, OS, games etc.

Technology Stack: R, SQL, Tableau, TreasureData, SWRVE Mobile Marketing, Spark/R DataBricks.

Confidential

Senior Developer / Project Manager

Responsibilities:

  • Implemented classification and regressions (SVM, decision trees, random forest) and clustering on large cap and tech stock.
  • Implemented Portfolio Optimizations using Sharpe Ratio. Concept to product by working iteratively with trade analysts.
  • Interactive visualization in Tableau on portfolio management, fund management, pre- and post-trade analysis, and various macro-economics.
  • Implemented sentiment analysis and text analytics on Twitter social media feeds and market news using Scala and Python.

Technology Stack: Python (numpy, pandas, matplotlib, scikit-learn, cProfile), Quandl, Quantopian / Zipline, Scala, Spark, Hive, Pig, MySQL, Microsoft SQL Server, Amazon Redshift, MetaTrader MQL 4 and 5, AmiBroker.

Project management tools: KanbanFlow, Trello, EverNote, Asana, Todo-ist, Git.

Confidential

Senior Developer / BI Lead / Project Manager

Responsibilities:

  • Confidential develops cloud-based business intellgience solution for public utilities.
  • Gather and document functional requirements for upcoming releases. Prioritize roadmap features and product enhancement requests. Crafted internal and external communications around upcoming features, including release notes & user manuals.
  • Analyze business needs and produced detailed wireframes using Balsamiq tool. Iterative UX feedback and coontinuous improvement with pilot clients.
  • Entity-Relationship Modeling to dimensional data modeling and schema design in data warehouse for public utilities dataset.
  • Data wrangling and data integration with various utilities; ETL automation.
  • Design and co-develop interactive dashboards and reports for public utilities using Google Charts, Tableau and Jaspersoft.
  • Manage the agile team process. Lead 20+ off-shore analysts and developers in a matrix environment.
  • Developed and maintained project plans, issue log and project status reports. Conducted daily scrums and sprint planning. Organized daily stand-up meetings, brainstorming sessions, product demos, integration tests and user acceptance tests.

Technology Stack: LAMP Stack (Linux, Apache, MySQL, Python, Java), Amazon EC2, Tableau, Tibco Jaspersoft, R, Python (numpy, pandas, matplotlib, scikit-learn), MS-SQL Server SSIS/SSRS, Oracle, HDF5, AWS, Balsamiq.

Confidential

Software Engineer / IT Manager / Knowledge Architect

Responsibilities:

  • Achieved efficient and paperless environment by streamlining more than 50+ approval and requisition workflow for HR, Sales and Consulting processes.
  • Successfully implemented and executed knowledge retention strategy for all SAP functional modules and third-party tools deployed in projects. Facilitated the SAP ASAP methodology to include a template deployment.
  • Implemented an electronic resource management system and achieved a profitable Professional Services model with a team of 40+ SAP consultants.

Technology Stack: IBM Lotus Domino/Notes, IBM DB2, SAP, Tableau, Crystal Report, MySQL, Microsoft SQL Server, Balsamiq.

We'd love your feedback!