Data Scientist Interview Prep
An interview prep path for data science loops. Built on statistics and probability, machine-learning concepts, and experimentation (A/B testing), grounded in SQL and Python, with the system-design and behavioral rounds product data scientists face. Lighter on algorithmic coding than a SWE path, heavier on inference and experiment design.
Statistics and probability
The backbone of every data science loop: distributions, hypothesis testing, p-values, confidence intervals, and the Bayesian reasoning interviewers probe. Anchor these first.
Machine learning concepts
Bias-variance, regularization, the right evaluation metric for the problem, and the intuition behind the core algorithms - the conceptual ML screen most DS loops include.
Experimentation and A/B testing
Product data science runs on experiments. Master experiment design, sizing and power, and the pitfalls (peeking, novelty, SRM, network effects) interviewers love to test.
SQL and Python foundations
The daily tools. SQL for pulling and shaping data, Python for analysis. Pair these MCQs with the SQL Playground for hands-on query practice.
ML system design and behavioral
Senior DS loops add applied design (serving models, analytics pipelines) and behavioral rounds that screen for going deep and acting under ambiguity.
- 01DesignDesign an ML Model Serving Platform (TorchServe / Triton)System Design · Hard
- 02DesignDesign an Analytics Pipeline (Kafka / Spark / Warehouse)System Design · Hard
- 03BehavioralDive Deep (Amazon Leadership Principle)Behavioral · Amazon LP
- 04BehavioralDealing with AmbiguityBehavioral · General
- 05BehavioralOwnership (Amazon Leadership Principle)Behavioral · Amazon LP
Browse other learning paths
Three role-targeted paths are live: Backend, SRE / DevOps, and ML Engineer. More are on the way - if you have a role you want covered, let us know.
View all paths →