10 Netflix SQL Interview Questions (Updated 2025)

(Ex-Facebook & Best-Selling Data Science Author)

Updated on

March 20, 2025

At Netflix, SQL crucial for analyzing streaming statistics to identify viewing trends and to analyze data from A/B testing. Unsurprisingly this is why Netflix frequently asks advanced SQL problems during interviews for Data Science and Data Engineering positions.

To help you prepare, this blog covers 10 Netflix SQL interview questions to practice, which are similar to recently asked questions at Netflix – able to answer them all?

Netflix SQL Interview Questions

10 Netflix SQL Interview Questions

SQL Question 1: Identify VIP Users for Netflix

To better cater to its most dedicated users, Netflix would like to identify its "VIP users" - those who are most active in terms of the number of hours of content they watch. Write a SQL query that will retrieve the top 10 users with the most watched hours in the last month.

Example Input Table

user_id	sign_up_date	subscription_type
435	08/20/2020	Standard
278	01/01/2021	Premium
529	09/15/2021	Basic
692	12/28/2021	Standard
729	01/06/2022	Premium

Example Input Table

activity_id	user_id	date_time	show_id	hours_watched
10355	435	02/09/2022 12:30:00	12001	2.5
14872	278	02/10/2022 14:15:00	17285	1.2
12293	529	02/18/2022 21:10:00	12001	4.3
16352	692	02/20/2022 19:00:00	17285	3.7
17485	729	02/25/2022 16:45:00	17285	1.9

Answer:

The given query joins the 'users' and 'watching_activity' tables and calculates the total hours watched by each user within the last month. It then sorts these users in order of most hours watched and takes the top 10 results.

To practice a similar power-user data analysis problem question on DataLemur's free online SQL coding environment, try this recently asked Microsoft SQL interview question:

SQL Question 2: Analyzing Ratings For Netflix Shows

Given a table of user ratings for Netflix shows, calculate the average rating for each show within a given month. Assume that there is a column for user id, show id, rating (out of 5 stars), and date of review. Order the results by month and then by average rating (descending order).

This is will provide an interesting insights into how show ratings fluctuate over time and which shows have garnered the most positive feedback.

Sample Tables:

Example Input:

review_id	user_id	review_date	show_id	stars
6171	123	06/08/2022 00:00:00	50001	4
7802	265	06/10/2022 00:00:00	69852	4
5293	362	06/18/2022 00:00:00	50001	3
6352	192	07/26/2022 00:00:00	69852	3
4517	981	07/05/2022 00:00:00	69852	2

Example Output:

mth	show_id	avg_stars
6	50001	3.50
6	69852	4.00
7	69852	2.50

Answer:

Your SQL query could look like this:

This query pulls out the month from the review_date, groups the data by month and show_id, and for each group, calculates the average stars. The results are then ordered first by month, then by average stars in a descending order (highest rated shows first).

p.s. Window functions show up super frequently during SQL interviews, so practice the 27+ window function questions on DataLemur

SQL Question 3: What does / SQL commands do?

The / operator is used to remove to return all rows from the first SELECT statement that are not returned by the second SELECT statement.

Note that is available in PostgreSQL and SQL Server, while MINUS is available in MySQL and Oracle (but don't stress about knowing which DBMS supports what exact commands since the interviewers at Netflix should be lenient!).

Here's a PostgreSQL example of using EXCEPT to find all of Netflix's Facebook video ads with more than 10k views that aren't also being run on YouTube:

If you want to retain duplicates, you can use the EXCEPT ALL operator instead of EXCEPT. The EXCEPT ALL operator will return all rows, including duplicates.

SQL Question 4: Filter Netflix Users Based on Viewing History and Subscription Status

You are given a database of Netflix's user viewing history and their current subscription status. Write a SQL query to find all active customers who watched more than 10 episodes of a show called "Stranger Things" in the last 30 days.

For this question, consider the following tables:

Example Input:

user_id	active
1001	true
1002	false
1003	true
1004	true
1005	false

Example Input:

user_id	show_id	episode_id	watch_date
1001	2001	3001	2022-10-01
1001	2001	3002	2022-10-02
1001	2001	3003	2022-10-03
1002	2001	3001	2022-10-01
1002	2001	3002	2022-10-02
1003	2001	3001	2022-10-01
1003	2001	3002	2022-11-01
1003	2001	3003	2022-11-02
1004	2002	3004	2022-11-03

Example Input:

show_id	show_name
2001	"Stranger Things"
2002	"Money Heist"

Answer:

In PostgreSQL, the SQL query to solve the above problem would be:

The SQL query starts by joining the table with on the column and then joins with the table on .

It then uses a WHERE clause to filter for active users and the show "Stranger Things", and limits the to the last 30 days.

The GROUP BY statement groups the result by , and the HAVING clause filters the groups that have more than 10 unique 's, or in other words, customers that have viewed more than 10 episodes. undefined

SQL Question 5: What does it mean to denormalize a database?

Denormalization is the process of modifying a database schema in a way that deviates from the typical rules of normalization (1NF, 2NF, 3NF, etc.).

Denormalization is often used to improve the performance of a database, particularly when it is being used for reporting and analytical purposes (rather than in an Online Transaction Processing (OLTP) manager).

By duplicating data, denormalization can reduce the number of expensive joins required to retrieve data, which can improve query performance. However, denormalization can also cause problems such as increased data redundancy and the need for more complex update and delete operations.

SQL Question 6: Filter and Match Customer's Viewing Records

As a data analyst at Netflix, you are asked to analyze the customer's viewing records. You confirmed that Netflix is especially interested in customers who have been continuously watching a particular genre - 'Documentary' over the last month.

The task is to find the name and email of those customers who have viewed more than five 'Documentary' movies within the last month.

'Documentary' could be a part of a broader genre category in the genre field (for example, 'Documentary, History'). Therefore, the matching pattern could occur anywhere within the string.

Use the and databases provided below:

Example Input

movie_id	title	genre	release_year
1265	'The Last Dance'	'Documentary, Sport'	2020
7821	'Tiger King'	'Documentary, Crime'	2020
3402	'Becoming'	'Documentary'	2020
5698	'Swift	'Documentary, Music'	2020
4169	'The Irishman'	'Biography, Crime, Drama'	2019
2698	'Extraction'	'Action, Thriller'	2020

Example Input:

user_id	name	email	last_movie_watched	date_watched
361	'John Doe'	'johndoe@gmail.com'	1265	'2021-08-03'
124	'Jane Smith'	'janesmith@yahoo.com'	1265	'2021-08-03'
815	'Emily Clark'	'emilyclark@hotmail.com'	7821	'2021-08-03'
634	'Robert Brown'	'robertbrown@gmail.com'	3402	'2021-08-03'
962	'Sarah Johnson'	'sarahjohnson@gmail.com'	5698	'2021-08-03'

Answer:

In the SQL query above, we are joining and tables on the and fields. We retrieve the and from the table who have watched more than 5 movies in the genre 'Documentary' in the last month. We use the PostgreSQL operator combined with the wildcard to match customers who have the genre 'Documentary' listed anywhere in the field. We use GROUP BY and HAVING to consider only those customers who have viewed more than five documentaries within the last month. This query will return the name and email of those customers. undefined

SQL Question 7: Can you explain the concept of database normalization?

Database normalization is the process of breaking down a table into smaller and more specific tables and defining relationships between them via foreign keys. This minimizes redundancy, and creates a database that's more flexible, scalable, and easier to maintain. It also helps to ensure the integrity of the data by minimizing the risk of data inconsistencies and anomalies.

SQL Question 8: Analyzing Netflix User Behavior and Content Ratings

You're a Data Analyst at Netflix, you have been asked to analyze customer behavior and their content ratings. You have the following two tables:

A table that has information about users, their ID and their subscription_starts.
A table that has information about what content each user has reviewed and the score they gave.

Write a SQL query to analyze the data and find the average rating for each content, sorted by the average rating in descending order.

Example tables:

Example Input:

user_id	subscription_starts
1	2018-01-01
2	2019-02-20
3	2017-07-14
4	2020-11-28
5	2018-04-24

Example Input:

review_id	user_id	date	content_id	rating
101	1	2022-06-08	1	4
202	2	2022-06-10	2	4
303	3	2022-06-18	1	3
404	1	2022-07-26	2	3
505	5	2022-07-05	1	2

Answer:

This query first joins users table with reviews table on user_id. Then, it groups by in reviews table and calculates the average rating for each content. Finally, it sorts the resulting data by in descending order, so that the content with the highest average rating will be at the top.

Since joins come up so often during SQL interviews, try this interactive Snapchat SQL Interview question using JOINS:

SQL Question 9: Calculate the Standard Deviation of Movie Ratings

Netflix wants to have a better understanding of how their ratings vary for each movie department. They want to calculate the standard deviation of ratings given to the movies of various departments.

The standard deviation is a measure of how spread out the ratings are. If the ratings are all close to the mean, the standard deviation is close to zero. If the ratings are spread out over a wider range, the standard deviation is larger.

The standard deviation is calculated by taking the square root of the variance. Variance is calculated as follows: the difference between each rating and the mean rating is squared, these squared differences are averaged, this produces the variance.

Utilize the ratings provided by users and apply SQL functions like AVG(), POWER(), SQRT() to compute the standard deviation.

Assuming you have a table named like below:

Example Input:

movie_id	department_id	rating
1001	200	4
1002	200	5
1003	201	3
1004	202	2
1005	200	2
1006	202	5
1007	201	1
1008	202	4

Answer:

In the query above, the inner query computes the average rating per each department. This average rating (the "mean") is needed to compute the variance.

The outer query calculates the variance by subtracting the mean from each rating, squaring the result and taking the average of these squares. The square root of this variance gives us the standard deviation. This query provides the standard deviation of movie ratings for each department.

To practice a very similar question try this interactive Amazon Average Review Ratings Question which is similar for calculating statistical measures or this Alibaba Compressed Mean Question which is similar for understanding spread of data.

SQL Question 10: What is a foreign key?

A foreign key is a column or group of columns in a table that refers to the primary key in another table. The foreign key constraint helps maintain referential integrity between the two tables.

For example, let's look at the Netflix sales database:

netflix_sales:

+------------+------------+------------+------------+ | order_id | product_id | customer_id| quantity | +------------+------------+------------+------------+ | 1 | 222 | 1 | 2 | | 2 | 333 | 1 | 1 | | 3 | 444 | 2 | 3 | | 4 | 555 | 3 | 1 | +------------+------------+------------+------------+

In this table, and could both be foreign keys. They reference the primary keys of other tables, such as a Products table and a Customers table, respectively. This establishes a relationship between the table and the other tables, such that each row in the sales database corresponds to a specific product and a specific customer.

Netflix SQL Interview Tips

The key to acing a Netflix SQL interview is to practice, practice, and then practice some more! Besides solving the above Netflix SQL interview questions, you should also solve the 200+ SQL coding questions which come from companies like Amazon, Microsoft, Meta, and smaller tech companies.

Each problem on DataLemur has hints to guide you, fully explained answers along with a discussion board to see how others solved it and crucially, there is an interactive coding environment so you can easily right in the browser your SQL query answer and have it checked.

To prep for the Netflix SQL interview you can also be useful to practice SQL problems from other tech companies like:

However, if your SQL query skills are weak, forget about jumping right into solving questions – improve your SQL foundations with this SQL tutorial for Data Analytics.

This tutorial covers SQL concepts such as window functions and finding NULLs – both of these show up often in Netflix SQL interviews.

Netflix Data Science Interview Tips

What Do Netflix Data Science Interviews Cover?

For the Netflix Data Science Interview, besides SQL questions, the other types of questions to prepare for are:

Probability & Stats Questions
Python or R Coding Questions
Business Sense and Product-Analytics Questions
ML Modelling Questions
Behavioral & Resume-Based Questions

Netflix Data Scientist

How To Prepare for Netflix Data Science Interviews?

To prepare for Netflix Data Science interviews read the book Ace the Data Science Interview because it's got:

201 interview questions sourced from Microsoft, Amazon & startups
a crash course on SQL, Product-Sense & ML
over 900+ reviews on Amazon & 4.5-star rating

Also refer to this SQL Interview Cheat sheet as a resource! It covers everything you need to know from Window functions to CTEs.

10 Netflix SQL Interview Questions (Updated 2025)

10 Netflix SQL Interview Questions

SQL Question 1: Identify VIP Users for Netflix

Example Input Table

Example Input Table

Answer:

SQL Question 2: Analyzing Ratings For Netflix Shows

Example Input:

Example Output:

Answer:

SQL Question 3: What does / SQL commands do?

SQL Question 4: Filter Netflix Users Based on Viewing History and Subscription Status

Example Input:

Example Input:

Example Input:

Answer:

SQL Question 5: What does it mean to denormalize a database?

SQL Question 6: Filter and Match Customer's Viewing Records

Example Input

Answer:

SQL Question 7: Can you explain the concept of database normalization?

SQL Question 8: Analyzing Netflix User Behavior and Content Ratings

Example tables:

Example Input:

Example Input:

Answer:

SQL Question 9: Calculate the Standard Deviation of Movie Ratings

Example Input:

Answer:

SQL Question 10: What is a foreign key?

Netflix SQL Interview Tips

Netflix Data Science Interview Tips

What Do Netflix Data Science Interviews Cover?

How To Prepare for Netflix Data Science Interviews?

Career Resources

Support

Interview Questions