logo

9 Facebook SQL Interview Questions (Updated 2024)

Updated on

January 22, 2024

Facebook/Meta LOVES to ask SQL coding questions during interviews for Data Analyst, Data Science, and Data Engineering jobs. I know this first-hand, because I used to work at FB, and I also co-authored Ace the Data Science Interview with Kevin Huo (an Ex-Facebook Data Scientist).

Because we know the Meta/Facebook SQL interview process intimately well, we curated 9 real Facebook/Meta SQL interview questions to practice which come from recent Meta interviews.

Facebook/Meta SQL Interview Questions

9 Facebook/Meta SQL Interview Questions

SQL Question 1: Average Post Hiatus

Given a table of Facebook posts, for each user who posted at least twice in 2024, write a SQL query to find the number of days between each user’s first post of the year and last post of the year in the year 2024. Output the user and number of the days between each user's first and last post.

You can solve this SQL problem interactively and run your solution directly on DataLemur:

Average Post Hiatus Facebook/Meta SQL interview question

Table:
Column NameType
user_idinteger
post_idinteger
post_datetimestamp
post_contenttext
Example Input:
user_idpost_idpost_datepost_content
15165259941507/10/2024 12:00:00Need a hug
66109362435607/29/2024 13:00:00Bed. Class 8-12. Work 12-3. Gym 3-5 or 6. Then class 6-10. Another day that's gonna fly by. I miss my girlfriend
00423978425407/04/2024 11:00:00Happy 4th of July!
66109344256007/08/2024 14:00:00Just going to cry myself to sleep after watching Marley and Me.
15165211176607/12/2024 19:00:00I'm so done with covid - need traveling ASAP!
Example Output:
user_iddays_between
1516522
66109321

Answer:

First, we can use the and aggregate functions on the column to retrieve the earliest and latest post dates, and substract one from another accordingly.

To calculate the difference for each user, we the results by , and then filter for posts made in the year 2024. To do so, we use the function to extract the year from the column.

In the final step, to exclude users who have posted only once during the year, we apply the clause with a condition greater than 1.

This yields the final solution:


p.s. DataLemur Premium users can attempt part 2 of the problem which is much harder.

SQL Question 2: Facebook Power Users

A Facebook power user is defined as someone who posts a ton, and gets a lot of reactions on their post. For the purpose of this question, consider a Facebook power user as someone who posts at least twice a day and receives an average of 150 comments and/or reactions per post.

Write a SQL query to return the IDs of all Facebook power users, along with the number of posts, and the average number of reactions per post.

Use the following tables "user_post" and "post_interactions":

Example Input:
user_idpost_idpost_date
110012024-09-01
110022024-09-01
210032024-09-02
210042024-09-03
110052024-09-02
Example Input:
post_idcommentsreactions
100175200
100285250
10046090
1005100150
10035070

Answer:


Grandma Addicted to Social Media & Facebook Power User

SQL QUESTION 3: Can you explain the difference between and ?

The clause works similarly to the clause, but it is used to filter the groups of rows created by the clause rather than the rows of the table themselves.

For example, say you were analyzing Facebook ads data:


This query retrieves the total sales from all ads in each region, and uses the clause to only sales made after January 1, 2024. The rows are then grouped by region and the clause filters the groups to include only those with total sales greater than $500k.

SQL Question 4: Active User Retention

Assume you're given a table containing information on Facebook user actions. Write a SQL query to obtain number of monthly active users (MAUs) in July 2022, including the month in numerical format "1, 2, 3".

Hint: An active user is defined as a user who has performed actions such as 'sign-in', 'like', or 'comment' in both the current month and the previous month.

You can type up + execute your SQL query interactively to this problem on DataLemur:

Active User Retention: Facebook SQL Interview Question

Table:
Column NameType
user_idinteger
event_idinteger
event_typestring ("sign-in, "like", "comment")
event_datedatetime
Example Input:
user_idevent_idevent_typeevent_date
4457765sign-in06/31/2022 12:00:00
7426458sign-in07/03/2022 12:00:00
4453634like07/05/2022 12:00:00
7421374comment07/05/2022 12:00:00
6483124like07/18/2022 12:00:00
Example Output for June 2022:
monthmonthly_active_users
61

In July 2022, there was only one monthly active user (MAU) with the 445.

Answer

For a full step-by-step explanation of the problem, click here.

SQL QUESTION 5: What's the difference between a left and right join?

In SQL, a join generally retrieves rows from multiple tables and combines them into a single result set. For an example of the difference between a left vs. right join, suppose you had a table of Facebook users and Facebook posts.

A retrieves all rows from the left table (in this case, the users table) and any matching rows from the right table (the posts table). If there is no match in the right table, NULL values will be returned for the right table's columns.

A combines all rows from the right table (in this case, the posts table) and any matching rows from the left table (the users table). If there is no match in the left table, NULL values will be displayed for the left table's columns.

Left vs. Right Join

SQL Question 6: Facebook Friend Recommendations

Facebook wants to recommend new friends to people who show interest in attending 2 or more of the same private Facebook events.

Notes:

  • A user interested in attending would have either 'going' or 'maybe' as their attendance status.
  • Friend recommendations are unidirectional, meaning if user x and user y should be recommended to each other, the result table should have both user x recommended to user y and user y recommended to user x.
  • The result should not contain duplicates (i.e., user y should not be recommended to user x multiple times).
Table:
Column NameType
user_a_idinteger
user_b_idinteger
statusenum ('friends', 'not_friends')

Each row of this table indicates the status of the friendship between user_a_id and user_b_id.

Example Input:
user_a_iduser_b_idstatus
111333not_friends
222333not_friends
333222not_friends
222111friends
111222friends
333111not_friends
Table:
Column NameType
user_idinteger
event_idinteger
event_typeenum ('public', 'private')
attendance_statusenum ('going', 'maybe', 'not_going')
event_datedate
Example Input:
user_idevent_idevent_typeattendance_statusevent_date
111567publicgoing07/12/2022
222789privategoing07/15/2022
333789privatemaybe07/15/2022
111234privatenot_going07/18/2022
222234privategoing07/18/2022
333234privategoing07/18/2022
Example Output:
user_a_iduser_b_id
222333
333222

Users 222 and 333 who are not friends have shown interest in attending 2 or more of the same private events.

Answer:

To find pairs of friends to be recommended to each other if they're interested in attending 2 or more of the same private events we'll:

  1. Find users who are interested in attending private events.
  2. Join tables to compare the correct data
  3. Find pairs of users who are not friends but are interested in 2 or more of the same private events.

This leads to the following query:


For a full step-by-step solution, and to run the code yourself, subscribe to DataLemur premium to unlock this problem:

Facebook Friends Recommendation SQL Interview Question

Also, if you're a nerd for Machine Learning, you can learn more about Facebook's People-You-May-Know (PYMK) algorithm here.

SQL QUESTION 7: Can you explain the concept of a constraint in SQL?

Constraints are just rules for your DBMS to follow when updating/inserting/deleting data.

Say you had a table of Facebook employees, and their salaries, job titles, and performance review data. Here's some examples of SQL constraints you could implement:

NOT NULL: This constraint could be used to ensure that certain columns in the employee table, such as the employee's first and last name, cannot contain NULL values.

UNIQUE: This constraint could be used to ensure that the employee ID is unique. This would prevent duplicate entries in the employee table.

PRIMARY KEY: This constraint could be used to combine the NOT NULL and UNIQUE constraints to create a primary key for the employee table. The employee ID could serve as the primary key.

FOREIGN KEY: This constraint could be used to establish relationships between the employee table and other tables in the database. For example, you could use a foreign key to link the employee ID to the department ID in a department table to track which department each employee belongs to.

CHECK: This constraint could be used to ensure that certain data meets specific conditions. For example, you could use a CHECK constraint to ensure that salary values are always positive numbers.

DEFAULT: This constraint could be used to specify default values for certain columns. For example, you could use a DEFAULT constraint to set the employee hire date to the current date if no value is provided when a new employee is added to the database.

SQL Question 8: Average Number of Shares per Post

As a data analyst at Facebook, you are asked to find the average number of shares per post for each user.

In the user_posts table, each row represents a post by a user. Each user may have zero or more posts.

In the post_shares table, each row represents a share of a post. Each post may have zero or more shares.

Please write a SQL query to find the average number of shares per post for each user.

Example Input:

post_iduser_idpost_textpost_date
11Hello world!06/08/2022 00:00:00
22What a beautiful day!06/10/2022 00:00:00
31Hope everyone is having a good day!06/18/2022 00:00:00
43Facebook is amazing!07/26/2022 00:00:00
52Enjoying a great meal!07/05/2022 00:00:00

Example Input:

share_idpost_idshare_date
1106/09/2022 00:00:00
2206/11/2022 00:00:00
3106/19/2022 00:00:00
4106/29/2022 00:00:00
5307/27/2022 00:00:00

Example Output:

user_idavg_shares_per_post
11.67
20.50
30.00

Answer:


This query works by first finding the total number of shares for each post by grouping on in the table. Then, it joins this with on to get the user_id of the user who made each post. It finally averages the total number of shares for each user using the function. Null values are replaced with zero so that users who have no shares do not return null for .

In case this marketing problem was interesting, checkout how this blog on how marketing analysts use SQL.

SQL Question 9: Calculate Facebook Ad Click-Through Rate

Assume you have an events table on Facebook app analytics. Write a SQL query to calculate the click-through rate (CTR) for the app in 2022 and round the results to 2 decimal places.

Definition and note:

  • Percentage of click-through rate (CTR) = 100.0 * Number of clicks / Number of impressions
  • To avoid integer division, multiply the CTR by 100.0, not 100.

Before reading our solution, practice this Meta SQL question interactively:

Meta SQL interview question

Table:
Column NameType
app_idinteger
event_typestring
timestampdatetime
Example Input:
app_idevent_typetimestamp
123impression07/18/2022 11:36:12
123impression07/18/2022 11:37:12
123click07/18/2022 11:37:42
234impression07/18/2022 14:15:12
234click07/18/2022 14:16:12
Example Output:
app_idctr
12350.00
234100.00

Answer:

We first find the number of clicks and impressions using the statement to assign a value of 1 for 'click' events and 0 for other events.

Then, to calculate the percentage of click-through rate (CTR) we divide the number of clicks by the number of impressions and multiplying by 100.0, rounded to 2 decimal places using the function.

This yields the following solution:


How To Prepare for the Facebook SQL Interview

The key to acing a Facebook SQL interview is to practice, practice, and then practice some more! In addition to solving the earlier Facebook SQL interview questions, you should also solve the 200+ DataLemur SQL Interview Questions which come from companies like FAANG (FB, Apple, Amazon, Netflix, Google). DataLemur SQL and Data Science Interview Questions

Each exercise has hints to guide you, full answers and crucially, there is an online SQL coding environment so you can easily right in the browser your SQL query and have it graded.

To prep for the Facebook SQL interview you can also be useful to practice SQL questions from other tech companies like:

In case your SQL coding skills are weak, forget about jumping right into solving questions – improve your SQL foundations with this DataLemur SQL tutorial.

DataLemur SQL Tutorial for Data Science

This tutorial covers SQL topics like aggregate window functions and math functions – both of which show up routinely during Facebook interviews.

Facebook Data Science Interview Tips

What Do Facebook Data Science Interviews Cover?

For the Facebook Data Science Interview, in addition to SQL query questions, the other types of questions to prepare for are:

How To Prepare for Facebook Data Science Interviews?

The best way to prepare for Facebook Data Science interviews is by reading Ace the Data Science Interview. The book's got:

  • 201 Interview Questions from FAANG (FB, Apple, Amazon, Netflix, Google)
  • A Refresher covering SQL, Product-Sense & ML
  • Great Reviews (900+ reviews, 4.5-star rating)

Acing Data Science Interview