
Data Science Engineering
We enable easy access to insights to make the lives of our merchants, partners and all of Shopify better.


ShopifyQL Notebooks: Simplifying Querying with Commerce Data Models

The Complex Data Models Behind Shopify's Tax Insights Feature

Monte Carlo Simulations: Separating Signal from Noise in Sampled Success Metrics

3 (More) Tips for Optimizing Apache Flink Applications

Using Server Sent Events to Simplify Real-time Streaming at Scale

How to Export Datadog Metrics for Exploration in Jupyter Notebooks

Reducing BigQuery Costs: How We Fixed A $1 Million Query

How to Structure Your Data Team for Maximum Influence

What is a Full Stack Data Scientist?

Shopify Data’s Guide To Opportunity Sizing

Data-Centric Machine Learning: Building Shopify Inbox’s Message Classification Model

Introducing ShopifyQL: Our New Commerce Data Querying Language

8 Data Conferences Shopify Data Thinks You Should Attend

Lessons Learned From Running Apache Airflow at Scale

Double Entry Transition Tables: How We Track State Changes At Shopify

Data Is An Art, Not Just A Science—And Storytelling Is The Key

The Magic of Merlin: Shopify's New Machine Learning Platform

A Data Scientist’s Guide To Measuring Product Success

7 Tips For Optimizing Apache Flink Applications

Shopify's Playbook for Scaling Machine Learning

Shopify’s Unique Data Science Hierarchy Of Needs

Building a Real-time Buyer Signal Data Pipeline for Shopify Inbox

Scaling Shopify's BFCM Live Map: An Apache Flink Redesign

Using Propensity Score Matching to Uncover Shopify Capital’s Effect on Business Growth

Shopify's Path to a Faster Trino Query Execution: Custom Verification, Benchmarking, and Profiling Tooling

Winning AI4TSP: Solving the Travelling Salesperson Problem with Self-programming Machines

Using Rich Image and Text Data to Categorize Products at Scale

5 Steps for Building Machine Learning Models for Business

Shopify's Path to a Faster Trino Query Execution: Infrastructure

10 Lessons Learned From Online Experiments

How Shopify Built An In-Context Analytics Experience

A Five-Step Guide for Conducting Exploratory Data Analysis

Building Smarter Search Products: 3 Steps for Evaluating Search Algorithms

Capturing Every Change From Shopify’s Sharded Monolith

4 Tips for Shipping Data Products Fast

How to Make Dashboards Using a Product Thinking Approach

How to Reliably Scale Your Data Platform for High Volumes

How to Build a Production Grade Workflow with SQL Modelling

How to Build an Experiment Pipeline from Scratch

How to Use Quasi-experiments and Counterfactuals to Build Great Products

How to Track State with Type 2 Dimensional Models

How We’re Solving Data Discovery Challenges at Shopify

Shopify's Data Science & Engineering Foundations

7 Ways to Make Your SQL Workshop Beginner-friendly

Categorizing Products at Scale

The Evolution of Kit: Automating Marketing Using Machine Learning

Great Code Reviews—The Superpower Your Team Needs

How Shopify Uses Recommender Systems to Empower Entrepreneurs
