We provide IT Staff Augmentation Services!

Data Scientist Resume

0/5 (Submit Your Rating)

VirginiA

SUMMARY

  • Above 8 years professional experience building and operating scalable distributed systems across the full software lifecycle including design, implementation, testing, operations, and maintenance.
  • Fluency in one or more modern programming languages such as Java, C# or C++.
  • Experience across front - end user interfaces, business logic, and data tiers.
  • High proficiency in Python / Java or similar programming language
  • Expertise related to applying deep learning, convolutional neural networks (CNNs) and related techniques for computer vision, object detection, object classifications and related problems.
  • Experience with TensorFlow, Caffe or other Deep Learning frameworks.
  • Experience in building scalable systems using distributed processing tools.
  • Experience related to algorithm design, software development, architecture of large, complex software systems
  • A Master’s degree in computer science or a related field. Bachelor’s candidates with strong experience.
  • Technology savvy and adaptable, so you can develop new solutions that match the evolving nature AI / BigData solutions.
  • Personal drive and intellectual curiosity to do what hasn’t been done before, coupled with an appreciation for overcoming challenges
  • Good communication skills
  • Some experience designing internet-scale public APIs.
  • Experience building solutions for enterprises, context-awareness, pervasive computing, and/or application of machine learning
  • Experience working with modern tools for big data storage and analysis (e.g., AWS, Apache Spark, Hadoop, SQL, NoSQL)
  • Experience or strong interest in foundational machine learning models and concepts: regression, random forest, boosting, GBM, NNs, HMMs, CRFs, MRFs, deep learning.
  • Experience defining and championing best practices across a software team.
  • Comfortable presenting to senior management, business stakeholders, and external partners.
  • BS in Computer Science, or equivalent background in data structures, algorithms, object-oriented design and systems architecture.
  • 3+ years professional experience building and operating scalable distributed systems across the full software lifecycle including design, implementation, testing, operations, and maintenance.
  • Fluency in one or more modern programming languages such as Java, C# or C++.
  • Experience across front-end user interfaces, business logic, and data tiers.
  • Experience serving as technical lead, including mentorship of more junior software developers.
  • Bachelor's degree in Computer Science, Computer Engineering or related technical discipline
  • 12+ years of relevant software engineering experience
  • 4+ years of technical project management experience
  • 4+ years of experience managing people
  • Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, etc.
  • Experience with common data science toolkits, such as R, Weka, NumPy, MatLab, etc
  • Great communication skills
  • Experience with data visualisation tools, such as D3.js, GGplot, etc.
  • Proficiency in using query languages such as SQL, Hive, Pig
  • Good applied statistics skills, such as distributions, statistical testing, regression, etc
  • 4+ years’ experience (hands-on) working with Data Science projects involving problems related with supervised and unsupervised learning.
  • Good with analyzing large, complex, multi-dimensional datasets with a variety of tools such as R, Python etc.
  • Hands-on experience with one or more of statistical analysis tools such as R, Python, Matlab, Spark MLLib, Apache Mahout
  • Mentored sophisticated organizations on large scale data and analytics using advanced statistical and machine learning models.
  • Architected and implemented analytics and visualization components for device data analysis platform to predict hardware
  • Optimized factors for sales conversions and designed an algorithm for deal recommendations for a large daily deals website.
  • Developed audience extension models relying on decision trees, random forest, logistic regression, and other categorical data.

TECHNICAL SKILLS

Client side Technologies: Java or Python, Bash, HTML5, PERL, Processing, Python and R, R, Hadoop, Python, Hive, C/C++, C#

Frameworks: Microsoft .Net 4.5/ 4.0/ 3.5/3.0 , Entity Framework, Bootstrap, Microsoft Azure, Swagger.

Databases: MS-Access, Oracle 11g/10g/9i and Teradata, big data, hadoop, SQL Server 2014/2012/2008/2005/2000

BI Tools: C x 4,HBase x 4,Bash x 3,Spark x 3,ElasticSearch x 2

Version Controller: TFS, Microsoft Visual SourceSafe, GIT, NUNIT, MSUNIT

Database Tools: SQL Server Query Analyzer.

Software Packages: MS-Office 2003/ 07/10/13 , MS Access, Messaging Architectures.

Operating Systems: Windows Win8/XP/NT/ 95/98/2000/2008/2012 , Android SDK.

Microsoft Technologies: JavaScript,JMP,Mahout,objectiveC,QlickView,Redis,Redshifed, PHP,Scala2,Shark2,Awk,Cascading,Cassandra,Clojure,Fortran

Web Technologies: Windows API, Web Services, Web API (RESTFUL) HTML5, XHTML, CSS3, AJAX, XML, XAML,MSMQ, Silverlight, Kendo UI.

Web Servers: IIS 5.0, IIS 6.0, IIS 7.5, IIS ADMIN.

Programming Languages: C#, VB.NET (VB6), VBScript, OOPS, Data structures, Algorithms

Development Tools: R x 30, SQL x 27, Python x 22, Hadoop x 19, SAS x 18, Java x 15, Hive x 13, Matlab x 12.

PROFESSIONAL EXPERIENCE

Confidential, Virginia

Data Scientist

Responsibilities:

  • Strong mathematics, statistics, and data analytics
  • Solid coding and engineering skills preferably in Machine Learning .
  • Proficient in Java, Python, and Scala
  • Industry experience building and productionizing end-to-end systems
  • Knowledge of Information Extraction, NLP algorithms coupled with Deep Learning
  • Experience with data processing and storage frameworks like Hadoop, Spark, Kafka etc.
  • Perform analysis of large sets of financial data using machine learning and deep learning methods.
  • Clean and process data,perform data modeling, evaluations and simulations.
  • Help with data analysis, testing of models and creating profitable trading strategies. providing offline training on Machine Learning.
  • Expertise in particular field..
  • Excellent communication skills.
  • Ability to propose hypothesis and design experiments in the context of specific problems.
  • Should come from a strong engineering background
  • Good overlap with Indix Data tech stack such as Hadoop, MapReduce, HDFS, Spark, Scalding, Scala/Python/C++
  • Dedication and diligence in understanding the application domain, collecting/cleaning data and conducting experiments.
  • Creativity in model and algorithm development.
  • An obsession to develop algorithms/models that directly impact business.
  • Master’s/Phd. in Computer Science/Statistics is a plus
  • Experience working in text mining and python libraries like scikit-learn, numpy, etc
  • Collect relevant data from production systems/Use crawling and parsing infrastructure to put together data sets.
  • Survey academic literature and identify potential approaches for exploration.
  • Craft, conduct and analyze experiments to evaluate models/algorithms.
  • Communicate findings and take algorithms/models to production with end to end ownership.

Environment: Python, Hive, C/C++, C#, Java or Python, Bash, HTML5, PERL, Processing, Python and R., Logistic Regression SQL, Python Data Science Stack (strongly preferred), Machine Learning and Statistics, Data Visualization, A/B Testing, Bandit problems.

Confidential, California

Data Scientist

Responsibilities:

  • Exceptional customer relationship skills including the ability to discover the true requirements underlying feature requests, recommend alternative technical and business approaches, and lead engineering efforts to meet aggressive timelines with optimal solutions
  • Master's degree in Computer Science or related field.
  • Design and develop state-of-the-art deep-learning / machine-learning algorithms for analyzing image and video data among others.
  • Create and support a data management workflow from data collection, storage, analysis to training and validation.
  • Design and build a scalable software architecture to enable real-time / big-data processing.
  • Ensure high quality crowdsourcing byBuilding machine learning models for user profiling, quality assurance & incentive design.
  • Building spam detectors to catch common abuse patterns
  • Demonstrate thought leadership by publishing case studies, open sourcing datasets, speaking at conferences and writing blog post
  • Will apply computer vision and image processing techniques to solve new problems for Automation Anywhere. willcollaborate closely with onsite research scientist(s) and UX researchers and Software Development Engineers to help define the scope of a product. will take responsibility for technical problem solving, creatively meeting product objectives and developing best practices

Environment:, SOAPUI, WCF, WPF, VSO, TFS, GIT,XML, XSD, SQL Server 2008Python, Hive, C/C++, C#, Java or Python, Bash, HTML5, PERL, Processing, Python and J Query, Oracle 10/11g, ANGULAR JS.

Confidential, Birmingham

Data Scientist

Responsibilities:

  • Be able to work independently on a project-by-project basis and also work in a collaborative and fast-paced team environment
  • Be able to provide technical and analytical solutions to evaluate the merits and challenges of a product idea
  • Create applications on both the server-side and on the web interface
  • Perform high complexity integration testing and validate all services integrate according to specifications
  • Responsible for prevention and early detection of defects through verification and validation activities ensuring the integrity and quality of all work products
  • Deep hands-on technical expertise - machine learning and AI expertise would be preferred
  • Strong verbal and written communication skills and demonstrated technical leadership
  • Strong business and technical vision
  • Ability to handle multiple competing priorities in a fast-paced environment
  • A deep understanding of software development in a team, and a proven track record of shipping software on time
  • Exceptional customer relationship skills including the ability to discover the true requirements underlying feature requests, recommend alternative technical and business approaches, and lead engineering efforts to meet aggressive timelines with optimal solutions
  • Master's degree in Computer Science or related field.
  • Work on the bridging the gap between the latest in deep learning research and its application in real world products. help designing, innovating and building our next generationML architecture
  • Working with a group (Data Science & Machine Learning Group) of ML engineers, Data Scientists and Product Analysts
  • Define and drive API oriented solutions for data and machine learning services demonstrate cross-functional resource interaction to accomplish your goals. identify and initiate investigations of new technologies, prototype and test solutions for product features, and design and validate designs that deliver an exceptional user experience.
  • Ability to propose hypothesis and design experiments in the context of specific problems.
  • Should come from a strong engineering background
  • Good overlap with Indix Data tech stack such as Hadoop, MapReduce, HDFS, Spark, Scalding, Scala/Python/C++
  • Dedication and diligence in understanding the application domain, collecting/cleaning data and conducting experiments.
  • Creativity in model and algorithm development.
  • An obsession to develop algorithms/models that directly impact business.

Environment: Hadoop, MySQL, Big Table, MapReduce, SAS, Large-scale SSRS, IIS, SQL Server 2012, WCF, Web API, HTML5, CSS3, JQuery.

Confidential,California

Data Scientist

Responsibilities:

  • Highly motivated with excellent verbal and written communication skills
  • Ability to work successfully with multi-functional teams, principles and architects. Coordinates effectively across organizational boundaries and geographies
  • Develop the core product of our company with the team.
  • Collaborate with other teams to understand the requirements and implement them in the product.
  • Conduct design and code reviews.
  • Analyze and improve efficiency, scalability, and stability of various components.
  • Strong mathematics, statistics, and data analytics
  • Solid coding and engineering skills preferably in Machine Learning (not mandatory)
  • Proficient in Java, Python, and Scala.
  • Industry experience building and productionizing end-to-end systems
  • Knowledge of Information Extraction, NLP algorithms coupled with Deep Learning
  • Experience with data processing and storage frameworks like Hadoop, Spark, Kafka etc.
  • Ability to propose hypothesis and design experiments in the context of specific problems.
  • Should come from a strong engineering background
  • Good overlap with Indix Data tech stack such as Hadoop, MapReduce, HDFS, Spark.
  • Deep hands-on technical expertise - machine learning and AI expertise would be preferred
  • Strong verbal and written communication skills and demonstrated technical leadership
  • Strong business and technical vision
  • Ability to handle multiple competing priorities in a fast-paced environment
  • A deep understanding of software development in a team, and a proven track record of shipping software on time
  • Exceptional customer relationship skills including the ability to discover the true requirements underlying feature requests, recommend alternative technical and business approaches, and lead engineering efforts to meet aggressive timelines with optimal solutions
  • Master's degree in Computer Science or related field.
  • Some experience designing internet-scale public APIs.
  • Experience building solutions for enterprises, context-awareness, pervasive computing, and/or application of machine learning
  • Experience working with modern tools for big data storage and analysis (e.g., AWS, Apache Spark, Hadoop, SQL, NoSQL.

Environment:, HTML, XML, XSLT, SQL Server 2008R2, SSRS, CSS, MS-Office Scala2,Shark2,Awk,Cascading,Cassandra,Clojure,Fortran,JavaScript,JMP,Mahout,objectiveC,QlickView, Redis, Redshifed.

Confidential

Data Scientist

Responsibilities:

  • Responsible for prevention and early detection of defects through verification and validation activities ensuring the integrity and quality of all work products
  • Deep hands-on technical expertise - machine learning and AI expertise would be preferred
  • Strong verbal and written communication skills and demonstrated technical leadership
  • Strong business and technical vision
  • Ability to handle multiple competing priorities in a fast-paced environment
  • A deep understanding of software development in a team, and a proven track record of shipping software on time
  • Exceptional customer relationship skills including the ability to discover the true requirements underlying feature requests, recommend alternative technical and business approaches, and lead engineering efforts to meet aggressive timelines with optimal solutions
  • Master's degree in Computer Science or related field.
  • Define and drive API oriented solutions for data and machine learning services
  • 6years of full-time programming experience within an operations or technical department.
  • 3+ years of direct experience with multiple Agile teams.
  • Be able to distill business objectives into technical solutions through effective system design and architecture
  • Be able to work independently on a project-by-project basis and also work in a collaborative and fast-paced team environment
  • Be able to provide technical and analytical solutions to evaluate the merits and challenges of a product idea
  • Design and develop state-of-the-art deep-learning / machine-learning algorithms for analyzing image and video data among others.
  • You will demonstrate cross-functional resource interaction to accomplish your goals.
  • You will identify and initiate investigations of new technologies, prototype and test solution
  • Strong mathematics, statistics, and data analytics
  • Solid coding and engineering skills preferably in Machine Learning.
  • Proficient in Java, Python, and Scala.

Environment: Scala2,Shark2,Awk,Cascading,Cassandra,Clojure,Fortran,JavaScript,JMP,Mahout,objectiveC,QlickView,Redis,Redshifed

Confidential

Data Scientist

Responsibilities:

  • Creativity in model and algorithm development.
  • An obsession to develop algorithms/models that directly impact business.
  • Master’s/Phd. in Computer Science/Statistics is a plus
  • Job Expectations
  • Experience working in text mining and python libraries like scikit-learn, numpy, etc
  • Collect relevant data from production systems/Use crawling and parsing infrastructure to put together data sets.
  • Survey academic literature and identify potential approaches for exploration.
  • Craft, conduct and analyze experiments to evaluate models/algorithms.
  • Communicate findings and take algorithms/models to production with end to end ownership.
  • Responsible for prevention and early detection of defects through verification and validation activities ensuring the integrity and quality of all work products
  • Deep hands-on technical expertise - machine learning and AI expertise would be preferred
  • Strong verbal and written communication skills and demonstrated technical leadership
  • Strong business and technical vision
  • Ability to handle multiple competing priorities in a fast-paced environment
  • A deep understanding of software development in a team, and a proven track record of shipping software on time
  • Exceptional customer relationship skills including the ability to discover the true requirements underlying feature requests, recommend alternative technical and business approaches, and lead engineering efforts to meet aggressive timelines with optimal solutions
  • Master's degree in Computer Science or related field
  • Fluency in one or more modern programming languages such as Java, C# or C++.
  • Personal drive and intellectual curiosity to do what hasn’t been done before, coupled with an appreciation for overcoming challenges
  • Good communication skills.

Environment: Python x 22,Hadoop x 19,SAS x 18,Java x 15,Hive x 13,Matlab x 12,Pig x 11,C++ x 9,Ruby x 9,SPSS x 9,Perl x 8,Tableau x 8,Excel x 6,NoSQL x 5,AWS x 4

We'd love your feedback!