11 New York Times SQL Interview Questions (Updated 2025)

(Ex-Facebook & Best-Selling Data Science Author)

Updated on

February 28, 2025

Data Analysts & Data Scientists at New York Times uses SQL to extract insights from readership data to understand consumer behavior, such as which articles are most popular, how users interact with the website, and what topics are trending. They also use it to manage the digital ads based on readers' preferences and engagement, which is why, New York Times asks prospective hires SQL interview questions.

To help you study for the New York Times SQL interview, here's 11 NYT SQL interview questions in this blog.

New Yorkt Times SQL Interview Questions

11 NYT SQL Interview Questions

SQL Question 1: Laptop vs. Mobile Viewership

This is the same question as problem #3 in the SQL Chapter of Ace the Data Science Interview!

Assume you're given the table on user viewership categorised by device type where the three types are laptop, tablet, and phone.

Write a query that calculates the total viewership for laptops and mobile devices where mobile is defined as the sum of tablet and phone viewership. Output the total viewership for laptops as and the total viewership for mobile devices as .

Table

Column Name	Type
user_id	integer
device_type	string ('laptop', 'tablet', 'phone')
view_time	timestamp

Example Input

user_id	device_type	view_time
123	tablet	01/02/2022 00:00:00
125	laptop	01/07/2022 00:00:00
128	laptop	02/09/2022 00:00:00
129	phone	02/09/2022 00:00:00
145	tablet	02/24/2022 00:00:00

Example Output

laptop_views	mobile_views
2	3

Answer:

Practice this NYT SQL Interview Question on our FREE interactive coding platform!

SQL Question 2: Top Department Salaries

Given a table of New York Times employee salary data, write a SQL query to find the top 3 highest earning employees in each department.

New York Times Example Input:

employee_id	name	salary	department_id
1	Emma Thompson	3800	1
2	Daniel Rodriguez	2230	1
3	Olivia Smith	2000	1
4	Noah Johnson	6800	2
5	Sophia Martinez	1750	1
8	William Davis	6800	2
10	James Anderson	4000	1

Example Input:

department_id	department_name
1	Data Analytics
2	Data Science

Example Output:

department_name	name	salary
Data Analytics	James Anderson	4000
Data Analytics	Emma Thompson	3800
Data Analytics	Daniel Rodriguez	2230
Data Science	Noah Johnson	6800
Data Science	William Davis	6800

Code your solution to this question directly within the browser on DataLemur:

Answer:

We use the DENSE_RANK() window function to generate unique ranks for each employee's salary within their department, with higher salaries receiving lower ranks. Then, we wrap this up in a CTE and filter the employees with a ranking of 3 or lower.

f the solution above is confusing, you can find a step-by-step solution here: Top 3 Department Salaries.

SQL Question 3: What are the three different normal forms?

Normalization is the process of dividing a larger table into smaller tables, to eliminate redundancy and dependency. Although there are 5 levels of normalization (normal forms), the 3 most important ones that you should know for the New York Times SQL interview are:

First Normal Form (1NF): Remove a table's duplicate columns, and make sure each value in the column is a singular value (no containers or lists of data). Each row of table should have a unique identifier as well.
Second Normal Form (2NF): A table is in 2NF if it meets all requirements of the 1NF the non-key columns are dependent only on the primary key. You do this by separating subsets of columns subsets, and associating the tables by using primary/foreign keys.
Third Normal Form (3NF): The table should be in 2NF and there shouldn't be any dependency on any non-key attributes (meaning a primary key should be the only thing needed to identify a row).

NYT SQL Interview Questions

SQL Question 4: Analyzing Article Reads by Month

The New York Times wants to analyze the monthly activity of their online articles. They would like to understand how many unique readers they have for each article every month. You are given a table which tracks each time a user opens an article. Every row in the dataset means that a user has read an article. Duplicate entries are possible and indicate that a user read the article multiple times. Write a SQL query to return the month, the article_id and the number of unique readers for that article for that month.

Consider below is the dataset:

Example Input:

read_id	user_id	read_date	article_id
1001	123	01/15/2022	101
1002	234	01/19/2022	101
1003	123	01/28/2022	101
1004	123	01/28/2022	102
1005	345	02/01/2022	103
1006	456	02/02/2022	104
1007	345	02/05/2022	105
1008	345	02/07/2022	103

Expected Output:

month	article	unique_users
1	101	2
1	102	1
2	103	1
2	104	1
2	105	1

Answer:

In PostgreSQL, you can utilize the clause to get the month from a date. Hence, your SQL query might look like:

This query counts the unique users (denoted by unique ) separated by and . The clause allows for unique counts within each grouping.

To solve another window function question on DataLemur's free online SQL code editor, solve this Google SQL Interview Question:

SQL Question 5: What is a database index, and what are the different types of indexes?

A database index is a way to optimize the performance of a database by reducing the amount of data that needs to be searched to retrieve a record.

There are several types of indexes:

unique & non-inuqie indexes
primary & composite indexes
clustered & non-clustered indexes

SQL Question 6: Design and Query a Database for New York Times Subscriptions and Articles

You've been hired as a database designer for the New York Times. Your task is to model a database to keep track of magazine subscriptions, articles, and authors. Each author can write many articles, and multiple authors can collaborate on a single article. An article can belong to multiple magazine issues. A subscription can access multiple magazine issues.

Design a database schema and construct a PostgreSQL query to find all the authors who wrote more than three articles in any magazine issue a subscriber has access to. Assume all articles are in the English language.

Example Input:

subscription_id	subscriber_name	magazine_id
4	John Doe	2
7	Jane Doe	1
8	Jack Doe	3

Example Input:

issue_id	magazine_id	publication_date
100	1	2021-07-10
101	2	2021-08-10
102	3	2021-09-10

Example Input:

article_id	issue_id	author_id
500	100	45
501	101	45
502	101	46
503	100	46
504	102	45
505	102	47

Example Input:

author_id	author_name
45	Author1
46	Author2
47	Author3

Answer:

This SQL query begins by joining the subscriptions, magazine issues, articles, and authors tables. The join is based on the common columns between these tables, constructing a mega table which includes subscription information, articles, magazine issues, and authors all at once. Afterward, it groups the data based on subscription and author, and calculates the count of distinct articles for each author. Finally, the HAVING clause filters out authors who have written more than 3 articles for any magazine issues that a subscriber has access to.

SQL Question 7: What are the various types of joins used in SQL?

Using a join in SQL, you can retrieve data from multiple tables and merge the results into a single table.

In SQL, there are four distinct types of JOINs. To demonstrate each kind, Imagine you had two database tables: an table that contains data on Google Ads keywords and their bid amounts, and a table with information on product sales and the Google Ads keywords that drove those sales.

: An INNER JOIN retrieves rows from both tables where there is a match in the shared key or keys. For example, an INNER JOIN between the table and the table could be performed using the keyword column as the shared key. This would retrieve only the rows where the keyword in the table matches the keyword in the table.
: A LEFT JOIN retrieves all rows from the left table (in this case, the table) and any matching rows from the right table (the Sales table). If there is no match in the right table, values will be returned for the right table's columns.
: A RIGHT JOIN retrieves all rows from the right table (in this case, the Sales table) and any matching rows from the left table (the table). If there is no match in the left table, values will be returned for the left table's columns.
: A FULL OUTER JOIN retrieves all rows from both tables, regardless of whether there is a match in the shared key or keys. If there is no match, values will be returned for the columns of the non-matching table.

SQL Question 8: Average Number of Shares per Article

As a data analyst at the New York Times, you have been asked to determine the average number of shares per article for the last month. The company wants to understand the reach of their articles on social platforms. Assume we have a table with the columns , , and , and another table with the columns , , , and .

Here's a sample of the and tables:

:

article_id	publish_date	title
1	2023-01-01	New Year's Festivities Around the World
2	2023-01-01	Economic Outlook for 2023
3	2023-01-02	The Resurgence of Physical Books

:

share_id	article_id	share_date	social_platform
101	1	2023-01-01	Twitter
102	1	2023-01-01	Facebook
103	1	2023-01-02	Instagram
104	2	2023-01-01	LinkedIn
105	2	2023-01-03	Twitter
106	3	2023-01-02	Facebook

Answer:

You can determine the average number of shares per article for the last month with the following PostgreSQL query:

This query first selects the and from the table, and computes the average number of share IDs () from the table. The operation is used to merge the and tables based on matching s. The clause restricts the data to articles published in the last month. The clause allows calculating the average number of shares per article.

To practice a very similar question try this interactive New York Times Laptop vs. Mobile Viewership Question which is similar for dealing with article analytics or this Facebook Average Post Hiatus (Part 1) Question which is similar for querying date-based statistics.

SQL Question 9: Click-through-rate for NY Times Articles

Assuming that New York Times (NYT) wants to calculate the click-through-rate (CTR) for its articles. Each time an article is served on the homepage, it is counted as an impression. If a user clicks on the article to read it, it is counted as a click.

Calculate the CTR as the total number of clicks on an article divided by the total number of impressions, for articles served in the top slot on the homepage, on an hourly basis.

Example Input:

impression_id	article_id	impression_time
1	101	2022-07-01 08:00:00
2	101	2022-07-01 08:15:00
3	102	2022-07-01 08:30:00
4	102	2022-07-01 09:00:00
5	101	2022-07-01 09:15:00

Example Input:

click_id	article_id	click_time
1	101	2022-07-01 08:05:00
2	101	2022-07-01 08:20:00
3	102	2022-07-01 08:35:00
4	102	2022-07-01 09:05:00
5	101	2022-07-01 09:20:00

Answer:

Here, we use a LEFT JOIN to combine with , because we want to keep all impressions (the served articles) even if there are no corresponding clicks. We use date_trunc('hour') to round down the impression_time and click_time to an hour, so that clicks and impressions within the same hour are counted together. For each combination of hour and article, we calculate the total number of clicks and impressions, and then calculate the CTR by dividing total clicks by total impressions. Finally, we order the results by hour and ctr in descending order.

To solve a similar problem about calculating rates, try this SQL interview question from TikTok within DataLemur's online SQL coding environment:

SQL Question 10: How do you identify duplicated data in a table?

One way to find duplicate records in a table is by using , and then seeing which groups have more than one occurence:

Another way is by using the operator:

SQL Question 11: Joining and Analyzing User Subscription Data

You are given two tables, and . The table records different users' subscription status at the New York Times. The table details the various subscription plans the company offers.

Your task is to write a SQL query to find the total revenue generated by each subscription plan per year.

Here is a sample representation of the and tables:

Example Input:

user_id	subscription_id	start_date	end_date
101	301	2020-01-01	2021-01-01
102	301	2020-05-15	2021-05-15
103	302	2020-07-01	2021-07-01
104	302	2020-08-15	2021-08-15
105	303	2020-10-01	2021-10-01

Example Input:

subscription_id	subscription_plan	price_per_month
301	Basic	15
302	Premium	25
303	Deluxe	35

Answer:

Here's a SQL query to solve this question:

The above SQL query is joining the and tables using the field. We're grouping by the subscription plan and year, and calculating the total revenue by multiplying the number of subscriptions with the monthly price of each plan and the number of months in a year (assuming each subscription lasts exactly a year). The query is filtered to only include records where the is greater than the , to avoid including cancelled subscriptions.

Since joins come up frequently during SQL interviews, try an interactive Spotify JOIN SQL question:

Preparing For The New York Times SQL Interview

The best way to prepare for a SQL interview, besides making sure you have strong SQL fundamentals, is to practice a ton of real SQL questions that were asked in recent job interviews. In addition to solving the earlier New York Times SQL interview questions, you should also solve the 200+ DataLemur SQL Interview Questions which come from companies like Netflix, Airbnb, and Amazon.

Each interview question has multiple hints, step-by-step solutions and best of all, there's an online SQL code editor so you can easily right in the browser your SQL query answer and have it checked.

To prep for the New York Times SQL interview you can also be useful to solve SQL problems from other media companies like:

Stay ahead of the curve with The New York Times' in-depth coverage of Artificial Intelligence trends and breakthroughs!

In case your SQL skills are weak, forget about diving straight into solving questions – refresh your SQL knowledge with this free SQL tutorial.

This tutorial covers things like filtering with LIKE and sorting data with ORDER BY – both of which come up frequently in SQL interviews at New York Times.

NYT Data Science Interview Tips

What Do New York Times Data Science Interviews Cover?

In addition to SQL query questions, the other question categories covered in the New York Times Data Science Interview are:

Stats Interview Questions
Python or R Coding Questions
Product Data Science Interview Questions
ML Modelling Questions
Behavioral Interview Questions centered on New York Times company values

New York Times Data Scientist

How To Prepare for New York Times Data Science Interviews?

To prepare for New York Times Data Science interviews read the book Ace the Data Science Interview because it's got:

201 interview questions sourced from companies like Google, Tesla, & Goldman Sachs
a refresher covering Python, SQL & ML
over 1000+ reviews on Amazon & 4.5-star rating

Don't ignore the behavioral interview – prep for that using this list of behavioral interview questions for Data Scientists.

11 New York Times SQL Interview Questions (Updated 2025)

11 NYT SQL Interview Questions

SQL Question 1: Laptop vs. Mobile Viewership

Table

Example Input

Example Output

Answer:

SQL Question 2: Top Department Salaries

New York Times Example Input:

Example Input:

Example Output:

Answer:

SQL Question 3: What are the three different normal forms?

SQL Question 4: Analyzing Article Reads by Month

Example Input:

Expected Output:

Answer:

SQL Question 5: What is a database index, and what are the different types of indexes?

SQL Question 6: Design and Query a Database for New York Times Subscriptions and Articles

Example Input:

Example Input:

Example Input:

Example Input:

Answer:

SQL Question 7: What are the various types of joins used in SQL?

SQL Question 8: Average Number of Shares per Article

:

:

Answer:

SQL Question 9: Click-through-rate for NY Times Articles

Example Input:

Example Input:

Answer:

SQL Question 10: How do you identify duplicated data in a table?

SQL Question 11: Joining and Analyzing User Subscription Data

Example Input:

Example Input:

Answer:

Preparing For The New York Times SQL Interview

NYT Data Science Interview Tips

What Do New York Times Data Science Interviews Cover?

How To Prepare for New York Times Data Science Interviews?

Career Resources

Support

Interview Questions