9 Spotify SQL Interview Questions (Updated 2025)

(Ex-Facebook & Best-Selling Data Science Author)

Updated on

January 25, 2025

At Spotify, SQL is used across the company for analyzing user listening behaviors for custom playlist creation and managing databases to optimize server performance during peak streaming hours. Unsurprisingly this is why Spotify asks SQL query questions during interviews for Data Analytics, Data Science, and Data Engineering jobs.

In case you're preparing for a SQL Assessment, here’s 9 Spotify SQL interview questions to practice, which are similar to recently asked questions at Spotify – can you solve them?

Spotify SQL Interview

9 Spotify SQL Interview Questions

Sure, here is a SQL interview question potentially suitable for Spotify:

SQL Question 1: Top 5 Artists

Assume there are three Spotify tables: , , and , which contain information about the artists, songs, and music charts, respectively.

Write a query to find the top 5 artists whose songs appear most frequently in the Top 10 of the table. Display the top 5 artist names in ascending order, along with their song appearance ranking.

If two or more artists have the same number of song appearances, they should be assigned the same ranking, and the rank numbers should be continuous (i.e. 1, 2, 2, 3, 4, 5). If you've never seen a rank order like this before, do the rank window function tutorial.

Table:

Column Name	Type
artist_id	integer
artist_name	varchar
label_owner	varchar

Example Input:

artist_id	artist_name	label_owner
101	Ed Sheeran	Warner Music Group
120	Drake	Warner Music Group
125	Bad Bunny	Rimas Entertainment

Table:

Column Name	Type
song_id	integer
artist_id	integer
name	varchar

Example Input:

song_id	artist_id	name
55511	101	Perfect
45202	101	Shape of You
22222	120	One Dance
19960	120	Hotline Bling

Table:

Column Name	Type
day	integer (1-52)
song_id	integer
rank	integer (1-1,000,000)

Example Input:

day	song_id	rank
1	45202	5
3	45202	2
1	19960	3
9	19960	15

Example Output:

artist_name	artist_rank
Ed Sheeran	1
Drake	2

Answer:

To try this question out for yourself for FREE go here, Spotify SQL Interview Question.

SQL Question 2: Analyze Artist Popularity Over Time

Question Description: Let's assume you are a Data Analyst at Spotify. You are given a data table named containing daily listening counts for different artists. The table has three columns: , , and .

You are required to write a SQL query to calculate the 7-day rolling average of daily listens for each artist. The rolling average should be calculated for each day for each artist based on the previous 7 days (including the current day).

Example Input:

artist_id	listen_date	daily_listens
1	2022-06-01	15000
1	2022-06-02	21000
1	2022-06-03	17000
2	2022-06-01	25000
2	2022-06-02	27000
2	2022-06-03	29000

Notice that the column is date formatted.

Example Output:

artist_id	listen_date	rolling_avg_listens
1	2022-06-01	15000.00
1	2022-06-02	18000.00
1	2022-06-03	17666.67
2	2022-06-01	25000.00
2	2022-06-02	26000.00
2	2022-06-03	27000.00

Please pay attention to the rounding in the result.

Answer:

Explanation:

This SQL query computes the rolling average by using a window function AVG with a window frame defined as "the previous 6 days plus the current row".

The clause ensures the rolling average is calculated separately for each artist.

The clause is used to order the rows in each partition by the .

The clause defines the window frame for the window function. It states that for a given row, consider all rows from 6 days before to the current row.

Finally, the function calculates the average of daily_listens over the defined window frame.

The at the end just keeps the result set ordered by artist and date.

To solve another question about calculating rates, try this TikTok SQL question on DataLemur's interactive coding environment:

SQL Question 3: Spotify Streaming History

You're given two tables containing data on Spotify users' streaming activity: which has historical streaming data, and which has data from the current week.

Write a query that outputs the user ID, song ID, and cumulative count of song plays up to August 4th, 2022, sorted in descending order.

Assume that there may be new users or songs in the table that are not present in the table.

Table:

Column Name	Type
history_id	integer
user_id	integer
song_id	integer
song_plays	integer

Example Input:

history_id	user_id	song_id	song_plays
10011	777	1238	11
12452	695	4520	1

Table:

Column Name	Type
user_id	integer
song_id	integer
listen_time	datetime

Example Input:

user_id	song_id	listen_time
777	1238	08/01/2022 12:00:00
695	4520	08/04/2022 08:00:00
125	9630	08/04/2022 16:00:00
695	9852	08/07/2022 12:00:00

Example Output:

user_id	song_id	song_plays
777	1238	12
695	4520	2
125	9630	1

Answer:

To see an additional solution go to DataLemur and solve this Spotify SQL Interview Question.

SQL Question 4: Music Streaming Statistics

As a Data Analyst of Spotify, suppose your team is interested in understanding the listening habits of the users. You're provided with the following tables:

table contains information about users.
table contains information about songs.
table contains information about song artists.
table logs every song listened to by each user.

The following relationships hold:

Every song has one and only one artist, but an artist can have multiple songs.
Every song can be listened to by multiple users, and every user can listen to multiple songs.

Your goal is to write a SQL query that returns each user's favourite artist, based on the number of songs they've listened to by the artist.

Example Input:

user_id	username	country
1	user101	USA
2	user202	UK
3	user303	Brazil

Example Input:

song_id	song_name	artist_id
101	song101	1001
102	song102	1002
103	song103	1001

Example Input:

artist_id	artist_name
1001	artist1001
1002	artist1002

Example Input:

user_id	song_id	stream_time
1	101	5:00
1	102	5:30
1	103	6:00
2	101	8:00
2	103	9:00
3	102	10:00

Answer:

The above query will find the count of songs each user has listened to by each artist, grouping by both user_id and artist_id. This result is sorted in descending order of the count, and the top record for each user represents their favorite artist. The outer query then joins this result back to the artists and users tables to get the respective names.

To practice a related SQL problem on DataLemur's free online SQL coding environment, attempt this Facebook SQL Interview question:

SQL Question 5: What is denormalization?

Denormalization is a technique used to improve the read performance of a database, typically at the expense of some write performance.

By adding redundant copies of data or grouping data together in a way that does not follow normalization rules, denormalization improves the performance and scalability of a database by eliminating costly join operations, which is important for OLAP use cases that are read-heavy and have minimal updates/inserts.

SQL Question 6: Filter Spotify Users Based on Subscription and Activity

As a data analyst at Spotify, you are tasked with extracting a list of active Premium subscribers who have listened to at least 15 different artists in the current month. Active users are those who have logged in within the last 30 days.

Assuming you have two tables:

Example Input:

Answer:

This query first groups the table by and calculates the count of distinct artists each user interacted with in August. It then joins this table with the table on . The WHERE clause filters for Premium users who have logged in within the last 30 days and have interacted with at least 15 different artists.

To solve a similar problem about calculating rates, solve this SQL interview question from TikTok within DataLemur's interactive SQL code editor:

SQL Question 7: What distinguishes a left join from a right join?

"In SQL, a join generally retrieves rows from multiple tables and combines them into a single result set. For an example of the difference between a left vs. right join, suppose you had a table of Spotify orders and Spotify customers.

A retrieves all rows from the left table (in this case, the Orders table) and any matching rows from the right table (the Customers table). If there is no match in the right table, NULL values will be returned for the right table's columns.

A combines all rows from the right table (in this case, the Customers table) and any matching rows from the left table (the Orders table). If there is no match in the left table, NULL values will be displayed for the left table's columns.

SQL Question 8: Calculate the average listening duration for each music genre on Spotify

Suppose that Spotify would like to understand better the average listening duration for each genre of music on their platform. As a data scientist, your task is to write a SQL query that calculates the average listening duration per genre for every month.

Assume you have access to a user_activity table and a songs table with the following schema:

user_activity Example Input:

activity_id	user_id	song_id	timestamp	listening_duration_sec
1	101	5001	2022-03-01 09:00:00	210
2	102	6985	2022-03-01 11:30:00	120
3	103	5001	2022-03-01 15:45:00	300
4	101	6985	2022-04-01 08:45:00	180
5	102	5001	2022-04-01 10:00:00	240

songs Example Input:

song_id	genre
5001	Rock
6985	Pop

Your aim is to produce a table like:

Example Output:

mth	genre	avg_listening_duration_sec
3	Rock	255
3	Pop	120
4	Rock	240
4	Pop	180

Answer:

In order to get to the answer, we need to join the two tables on song_id and then use the GROUP BY clause. The AVG() function can be used with GROUP BY to find the average listening duration for each music genre.

This SQL query does the following:

Joins the user_activity table (as ua) and the songs table (as s) on the song_id field.
Uses the EXTRACT function to get the month from the timestamp in user_activity.
Groups by month and genre.
Uses AVG to calculate the average listening_duration_sec for each genre in each month. The result will be average listening duration per genre for every month.

To solve another question about calculating rates, solve this SQL interview question from TikTok within DataLemur's interactive coding environment:

SQL Question 9: Find Users Who've Listened to All Albums of a Specific Artist

As an analyst at Spotify, you're tasked to identify all users who have listened to all albums of the artist "Adele". Assume you have access to a table that keeps track of user information and an table that keeps track of all instances where a user listened to an album. Here are the table structures:

user_id	user_name
1	John
2	Jane
3	Alice

listen_id	user_id	artist_name	album_name
101	1	Adele	19
102	1	Adele	21
103	1	Adele	25
104	2	Adele	21
105	3	Adele	21
106	3	Adele	25

Using SQL, write a query to find all users who have listened to all Adele's albums (19, 21, 25).

Answer:

This query works by first identifying all distinct Adele's albums and then checking whether there are any of these albums that a certain user has not listened to. If a user has listened to all Adele's albums, they will not have any of Adele's albums that they have not listened to. Such users are selected by the query.

To solve a related SQL interview question on DataLemur's free interactive SQL code editor, solve this Meta SQL interview question:

How To Prepare for the Spotify SQL Interview

The key to acing a Spotify SQL interview is to practice, practice, and then practice some more! In addition to solving the above Spotify SQL interview questions, you should also solve the 200+ tricky sql questions on DataLemur which come from companies like Facebook, Google and unicorn tech startups.

Each exercise has multiple hints, detailed solutions and best of all, there is an interactive coding environment so you can instantly run your SQL query and have it executed.

To prep for the Spotify SQL interview you can also be useful to practice SQL problems from other tech companies like:

But if your SQL query skills are weak, forget about going right into solving questions – improve your SQL foundations with this DataLemur SQL Tutorial.

This tutorial covers SQL topics like joining a table to itself and window functions – both of which pop up routinely in Spotify interviews.

Spotify Data Science Interview Tips

What Do Spotify Data Science Interviews Cover?

For the Spotify Data Science Interview, besides SQL questions, the other types of questions to practice:

Probability & Stats Questions
Python or R Programming Questions
Business Sense and Product-Sense Questions
Machine Learning and Predictive Modeling Questions
Behavioral & Resume-Based Questions

Spotify Data Scientist

How To Prepare for Spotify Data Science Interviews?

The best way to prepare for Spotify Data Science interviews is by reading Ace the Data Science Interview. The book's got:

201 Interview Questions from companies like Microsoft, Google & Amazon
A Crash Course on SQL, Product-Sense & ML
Amazing Reviews (900+ reviews, 4.5-star rating)

9 Spotify SQL Interview Questions (Updated 2025)

9 Spotify SQL Interview Questions

SQL Question 1: Top 5 Artists

Table:

Example Input:

Table:

Example Input:

Table:

Example Input:

Example Output:

Answer:

SQL Question 2: Analyze Artist Popularity Over Time

Example Input:

Example Output:

SQL Question 3: Spotify Streaming History

Table:

Example Input:

Table:

Example Input:

Example Output:

Answer:

SQL Question 4: Music Streaming Statistics

Example Input:

Example Input:

Example Input:

Example Input:

Answer:

SQL Question 5: What is denormalization?

SQL Question 6: Filter Spotify Users Based on Subscription and Activity

Example Input:

Example Input:

Answer:

SQL Question 7: What distinguishes a left join from a right join?

SQL Question 8: Calculate the average listening duration for each music genre on Spotify

Example Output:

Answer:

SQL Question 9: Find Users Who've Listened to All Albums of a Specific Artist

Answer:

How To Prepare for the Spotify SQL Interview

Spotify Data Science Interview Tips

What Do Spotify Data Science Interviews Cover?

How To Prepare for Spotify Data Science Interviews?

Career Resources

Support

Interview Questions