Data Scientist Interview Questions
Prepare for your data scientist interview with 10 expert-curated questions and sample answers covering ML, statistics, experimentation, and behavioral topics.
behavioral Questions
Tell me about a time your analysis contradicted what a senior stakeholder believed.
behavioralintermediate
Tell me about a time your analysis contradicted what a senior stakeholder believed.
Sample Answer
Our CMO believed a loyalty program drove retention; my cohort analysis showed members were simply higher-intent customers — selection, not causation. I presented it as a question rather than a verdict, proposed a holdout experiment, and let the data settle it. The program was restructured, saving $800K, and the CMO became my strongest advocate because I'd brought rigor without ego.
Tip: The interviewer is testing diplomacy as much as statistics. Show both.
Why do you want to work here, and what would you tackle first?
behavioralbeginner
Why do you want to work here, and what would you tackle first?
Sample Answer
I've used your product and noticed the onboarding asks questions a model could infer — that suggests a personalization opportunity. In the first 90 days I'd map your data assets, ship one small win like a propensity score for an existing campaign, and earn the credibility to tackle the bigger modeling roadmap. Quick value first, then the ambitious work.
Tip: Research one specific, plausible opportunity in their product before the interview. It always lands.
technical Questions
Walk me through a model you built that shipped to production. What was the business impact?
technicalintermediate
Walk me through a model you built that shipped to production. What was the business impact?
Sample Answer
I built a churn prediction model — XGBoost on 18 months of behavioral features — scoring 2M subscribers weekly with 0.89 AUC. The retention team targeted the top decile with tailored offers, recovering 12% of predicted churners, worth about $3.2M annually. The key wasn't the algorithm; it was working with the retention team to design interventions the scores could actually trigger.
Tip: Structure as problem → approach → metric → business outcome. The last part is what separates candidates.
How do you decide whether an A/B test result is trustworthy?
technicalintermediate
How do you decide whether an A/B test result is trustworthy?
Sample Answer
Before looking at the lift, I check the experiment's health: sample ratio mismatch, pre-experiment bias between groups, novelty effects, and whether we hit the pre-registered sample size rather than peeking. Then I look at the metric movement alongside guardrails. A 'significant' result with a sample ratio mismatch is noise wearing a p-value.
Tip: Mentioning sample ratio mismatch and peeking instantly signals real experimentation experience.
Explain the bias-variance tradeoff to a non-technical stakeholder.
technicalbeginner
Explain the bias-variance tradeoff to a non-technical stakeholder.
Sample Answer
Imagine training a new employee. If they only memorize past examples, they'll fail on anything new — that's overfitting, or high variance. If they oversimplify with one rule for everything, they'll miss important patterns — that's underfitting, or high bias. Good models, like good training, balance learning the real patterns without memorizing the noise.
Tip: Analogies win here. Practice explaining three core concepts in plain language before any interview.
When would you choose a simple model over a complex one?
technicalbeginner
When would you choose a simple model over a complex one?
Sample Answer
Almost always to start. Logistic regression or gradient boosting baselines establish whether the signal exists, ship faster, and are explainable to stakeholders and regulators. I escalate complexity only when the baseline demonstrably leaves business value on the table — and the gain exceeds the added serving, monitoring, and explainability costs.
Tip: Interviewers use this to filter résumé-driven-development. Default to simple, justify complexity.
How do you handle missing data?
technicalintermediate
How do you handle missing data?
Sample Answer
First I diagnose why it's missing — missing completely at random, at random, or not at random — because the mechanism dictates the remedy. Then options range from deletion when safe, simple or model-based imputation, to missingness indicator features, which often capture signal since the absence itself can be predictive. What I avoid is silent mean-imputation as a default.
Tip: Naming the missingness taxonomy (MCAR/MAR/MNAR) is the expected depth marker.
How would you measure the success of a recommendation system?
technicaladvanced
How would you measure the success of a recommendation system?
Sample Answer
Offline: ranking metrics like NDCG or recall@k against held-out interactions — useful for iteration but weakly correlated with real impact. Online: A/B tested engagement and revenue per session, with guardrails for diversity and long-term retention, since recommenders can win clicks while narrowing catalogs and hurting the long run. I'd insist the launch criterion be the online metric.
Tip: Distinguish offline from online evaluation and mention a feedback-loop risk — that's the senior answer.
situational Questions
Your model performs well offline but poorly in production. What do you investigate?
situationaladvanced
Your model performs well offline but poorly in production. What do you investigate?
Sample Answer
First, training-serving skew: are features computed identically in both environments? Second, data drift: has the live population shifted from training data? Third, feedback delay: am I measuring the same label at the same horizon? In my experience skew is the culprit most often — a feature computed from a 'current' value in serving that was a historical snapshot in training.
Tip: Enumerate hypotheses in priority order — this question tests debugging discipline, not luck.
A product manager asks for 'a quick model' by Friday. The data is messy. What do you do?
situationalbeginner
A product manager asks for 'a quick model' by Friday. The data is messy. What do you do?
Sample Answer
I'd negotiate scope, not quality: what decision does the model inform, and what accuracy makes it useful? Often a rules-based heuristic or a simple regression on the clean subset answers Friday's question, with a roadmap for the real model. I'd deliver something honest with documented caveats over something impressive with hidden ones.
Tip: Show you scope to the decision, communicate uncertainty, and never silently ship garbage.
Preparation Tips
Refresh statistics fundamentals — p-values, confidence intervals, and power come up in nearly every interview.
Prepare three model stories: one success with business impact, one failure with lessons, one explainability challenge.
Practice SQL — many data science interviews start with a SQL screen before any ML discussion.
Be ready to design an A/B test live: metric choice, sample size, duration, and pitfalls.
Practice explaining one technical concept (regularization, AUC) to a non-technical listener out loud.
Practice Data Scientist Interview Questions
Get AI-powered feedback on your answers and ace your next interview.
Start Interview PrepRelated Interview Questions
Data Engineer
Prepare for your data engineer interview with 10 expert-curated questions and sample answers covering pipelines, SQL, data modeling, and behavioral topics.
Data Analyst
Master your data analyst interview with 10 real-world questions and answers on SQL, data visualization, statistical analysis, and business insights.
Machine Learning Engineer
Prepare for your ML engineer interview with 10 expert-curated questions and sample answers covering MLOps, model serving, LLMs, and system design.