Lead Data Scientist

Vancouver, Canada

Ranked as #12 on Forbes’ List of 25 Fastest Growing Public Tech Companies for 2017, EPAM is committed to providing our global team of 25,900+ EPAMers with inspiring careers from day one. EPAMers lead with passion and honesty and think creatively. Our people are the source of our success and we value collaboration, try to always understand our customers’ business, and strive for the highest standards of excellence. No matter where you are located, you’ll join a dedicated, diverse community that will help you discover your fullest potential.


You are curious, persistent, logical and clever a true techie at heart. You enjoy living by the code of your craft and developing elegant solutions for complex problems. If this sounds like you, this could be the perfect opportunity to join EPAM as a Lead Data Scientist. Scroll down to learn more about the position’s responsibilities and requirements.

What You’ll Do

  • Lead the strategic planning, development and implementation of medium-to-large data science solutions or a component of a larger solution, including predictive modeling, unsupervised and supervised learning, and machine learning techniques;
  • Lead on all stages of presales activities for such projects, owning the whole presale process from the Competency Center perspective when required. Manage the delivery of architectural POCs, where required;
  • Interact with clients, advise and drive the translation of business requirements and models into appropriate architectural designs to ensure that business needs are met;
  • Work directly and collaboratively with clients, external data providers, and other key stakeholders to ensure that the solution’s concept/vision is understood and agreed upon;
  • Actively participate in project review and planning sessions. As needed, lead the solution development, drive and supervise end-to-end development cycle (SDLC) or participate in the projects start-up;
  • Manage the knowledge transfer from the project development team to the post-go-live support team. Oversea or effect the creation of architectural case study for EPAM’s repository of reusable assets;
  • Drive the strategic visioning activities for the practice and competency center. Develop reusable assets, development methods, processes, best practices to accelerate delivery;
  • Drive the program of evaluating the hardware and software platforms, benchmarking of alternative solution architectures, supervise a defined process for provision of structured, reusable results;
  • Keep pace with the innovative technologies and consider possibilities of creating relevant solution offerings;
  • Write broad topic and strategic white papers in the course of industry and technology research. Maintain high competency visibility by regular posting in internal newsletters, blogs as well as speaking at internal and external conferences and other events;
  • Analyze large data sets to discover trends, identify performance metrics, and uncover optimization opportunities;
  • Apply machine learning algorithms and statistical methods to large sets of raw data;
  • Continuously improve algorithms and develop best practice for instrumentation.

What You Have

  • Capable of leadership (influence management) and pragmatism;
  • 5+ years experience prototyping classification and regression models using machine software such as scikit-learn, R, Matlab, Octwave, Weka, Mahout, etc;
  • Experience with large-scale log processing or big data including but not limited to Elastic MapReduce, Hadoop Streaming, Pig, Hive, Spark, etc;
  • Fluency in a range of scripting languages, operating systems, and software platforms including but not limited to Linux, Redis, Java, MySQL, Python, Pandas, AWS;
  • Experience in Data Architecture, Data modeling, and Database manipulation;
  • Experience working with relational and non-relational databases to retrieve structured, unstructured, and semi-structured data sets;
  • Knowledge of business domains (Travel and Hospitality, Finance, Retail, etc.);
  • Track record (4+ projects) of working directly with the customer (i.e. regularly facing customer directly) both remotely and on-site;
  • Presale activities with exposure to customers on billable accounts;
  • Demonstrated experience with full life cycle of software development processes like RUP or "Waterfall", Agile (SCRUM, XP, etc.). Experience advocating and establishing an appropriate implementation methodology to successfully develop and deploy the solution;
  • Demonstrated experience in solution cost estimation (including tools, tasks, complexity, labor and time) at coarse grain and fine grain levels, with supporting material evidence;
  • Demonstrated experience in validating the overall solution from the perspective of performance, scalability, security and capacity.

Nice to have

  • Visualization: Matplotlib, Seaborn, Bokeh;
  • NLP: NLTK, SpaCy, Gensim;
  • Deep Learnring/Neural Networks: Tensorflow, Keras, Apache MXNet, Pytorch.

What We Offer

  • Extended Healthcare with Prescription Drugs, Dental and Vision Insurance (Company Paid);
  • Life and AD&D Insurance (Company Paid);
  • Employee Assistance Program (Company Paid);
  • Long-Term Disability;
  • Registered Retirement Savings Plan (RRSP) with company match;
  • Paid Time Off;
  • Critical Illness Insurance;
  • Employee Discounts.