8 Expedia SQL Interview Questions (Updated 2024)

Updated on

February 29, 2024

At Expedia Group, PostgreSQL is crucial for querying travel data for customer insights and to study the user acquisition funnel for inefficiences. That's why Expedia often tests SQL coding questions in interviews for Data Analytics, Data Science, and Data Engineering jobs.

To help you study for the Expedia SQL interview, we've collected 8 Expedia Group SQL interview questions – how many can you solve?

Expedia SQL Interview

8 Expedia Group SQL Interview Questions

SQL Question 1: Identify Top Customers Booking Hotels On Expedia

As a Data Analyst at Expedia, assume you need to find out who their most loyal and high spending users are to target them with personalized marketing campaigns. Specifically, Expedia is interested in users who have made a high amount of total booking cost throughout 2022.

Sample table:

Example Input:
booking_iduser_idbooking_datehotel_idtotal_cost
10125401/02/202210014000
10236502/10/202210032100
10325403/15/202210025300
10440004/01/202210031900
10525405/21/202210013400
10650006/12/202210022500
10740007/03/202210014100
10825408/17/202210035100
10936509/05/202210032000
11025410/06/202210015200

Answer:


This query sums up the column for each for bookings made in the year 2022. It returns the and the summed cost (alias ) for users whose total cost is greater than 10,000. The results are ordered by the total cost in descending order, so the highest spending users appear first.

To solve a similar VIP customer analysis question on DataLemur's free interactive coding environment, try this Microsoft SQL Interview problem: Microsoft SQL Interview Question: Super Cloud Customer

SQL Question 2: Calculate Monthly Average Review Rate for Each Product

As an analytics professional at Expedia, one of your responsibilities will be to analyze customer reviews and ratings which are essential for the company's product improvement and customer satisfaction.

Our table consists of customer's reviews data with the following columns:

  • : Unique identifier for each review
  • : ID of the user who submitted the review
  • : The date the review was submitted
  • : ID of the hotel/flight/package for which the review was submitted
  • : Star rating given by the user (0-5, with 5 being the best)

Your task is to write a SQL query to calculate the monthly average ratings(stars) for each product.

Example Input:
review_iduser_idsubmit_dateproduct_idstars
61711232022-08-06500014
78022652022-10-06698524
52933622022-18-06500013
63521922022-26-07698523
45179812022-05-07698522

Answer:


This query first groups the data by month (using the function on ) and . It then calculates the average star rating() per month per product() using the function. The result is ordered by and then .

Example Output:
mthproductavg_stars
6500013.5
6698524.0
7698522.5

For more window function practice, solve this Uber SQL problem within DataLemur's online SQL code editor:

Uber Data Science SQL Interview Question

SQL Question 3: What's an index, and what are the different types?

A database index is a data structure that improves the speed of data retrieval operations on a database table.

There are few different types of indexes that can be used in a database:

  • Primary index: a unique identifier is used to access the row directly.
  • Unique index: used to enforce the uniqueness of the indexed columns in a table.
  • Composite index: created on multiple columns of a table, is used to speed up the search process for multiple columns
  • Clustered index: determines the physical order of the data in a table

For a concrete example, say you had a table of Expedia customer payments with the following columns:

Here's what a clustered index on the column would look like:


A clustered index on the column would determine the physical order of the records in the table based on the . This means that the records with the earliest values would be stored together physically in the table, followed by records with later payment_date values. This speeds up queries that filter or sort the data based on the , as the records are already physically sorted in the table. For example, if you want to retrieve all the payments made in the month of June, the database can use the clustered index to quickly locate and retrieve the desired records without having to search through the entire table.

Expedia Group SQL Interview Questions

SQL Question 4: Find the Average Hotel Rating

Expedia is a travel agency company, which provides hotel bookings among other services. For each booking, customers are asked to give a rating, which ranges from 1 to 5 stars. The higher the rating, the more satisfied the customer was.

As a data analyst at Expedia, you are tasked to find the average rating for each hotel. You have a 'ratings' table with the hotel_id, user_id, rating, and a 'hotels' table with the hotel_id, hotel_name.

Here is some example data:

Example Input:

Example Input:

Answer:


Expected Output:


In the query, the ratings table is joined with the hotels table on the hotel_id. Then the average rating for each hotel is calculated using the AVG function in combination with the GROUP BY clause.

To practice a very similar question try this interactive Amazon Average Review Ratings Question which is similar for requiring average rating computation or this Facebook User's Third Transaction Question which is similar for working with user and transaction related data.

SQL Question 5: What is the difference between a correlated subquery and non-correlated subquery?

hile a correlated subquery relies on columns in the main query's FROM clause and cannot function independently, a non-correlated subquery operates as a standalone query and its results are integrated into the main query.

An example correlated sub-query:


This correlated subquery retrieves the names and salaries of Expedia employees who make more than the average salary for their department. The subquery references the department column in the main query's FROM clause (e1.department) and uses it to filter the rows of the subquery's FROM clause (e2.department).

An example non-correlated sub-query:


This non-correlated subquery retrieves the names and salaries of Expedia employees who make more than the average salary for the Data Science department (which honestly should be very few people since Data Scientists are awesome and deserve to be paid well).The subquery is considered independent of the main query can stand alone. Its output (the average salary for the Data Science department) is then used in the main query to filter the rows of the Expedia employees table.

SQL Question 6: Average Rating Per Hotel Per Month

Given a table which contains customer reviews for various hotels booked via Expedia, calculate the average rating of each hotel per month. Each review is associated with a specific hotel through and has a particular . For this question, you can assume that all dates are in the format 'YYYY-MM-DD'.

Example Input:
review_iduser_idsubmit_datehotel_idrating
112022-05-0114.5
222022-05-0113.5
332022-05-0224.0
442022-06-0115.0
552022-06-0224.5
Example Output:
monthhotelavg_rating
514
524
615
624.5

Answer:


This query extracts the month from the submit date of the review, then groups the results by the extracted month and . The aggregation function is applied to calculate the average rating per hotel per month.

SQL Question 7: Can you provide an example of two entities that have a one-to-one relationship, and another example of a one-to-many relationship?

When designing a database schema, a one-to-one relationship between two entities is characterized by each entity being related to a single instance of the other. An example of this is the relationship between a car and a license plate - each car has one license plate, and each license plate belongs to one car.

On the other hand, a one-to-many relationship is when one entity can be associated with multiple instances of the other entity. For example, a person can have multiple email addresses, but each email address only relates back to one person.

SQL Question 8: Joining and Analyzing Customer and Bookings Data

As a data analyst at Expedia's headquarters, you've been tasked with the job of analyzing the customer database and their bookings. Specifically, your boss wants you to get a list of all customers, their total number of trips booked, and the average rating for each trip. Join the customer and booking tables to get this information.

Table Example:

Table Example:

Answer:

In PostgreSQL, you could use something like the following:


This query entails joining the customer table with the bookings table on the field. We then use the function to count the total number of bookings and the function to calculate the average rating for all trips booked by each customer. Finally, we group by , , and and order by . If any customer did not book a trip, that customer would be excluded from the result.

Since joins come up so often during SQL interviews, try this interactive Snapchat SQL Interview question using JOINS: Snapchat JOIN SQL interview question

Preparing For The Expedia SQL Interview

The key to acing a Expedia SQL interview is to practice, practice, and then practice some more! Beyond just solving the earlier Expedia SQL interview questions, you should also solve the 200+ FAANG SQL Questions on DataLemur which come from companies like Microsoft, Google, Amazon, and tech startups. DataLemur SQL and Data Science Interview Questions

Each DataLemur SQL question has multiple hints, full answers and best of all, there's an interactive coding environment so you can easily right in the browser your SQL query and have it graded.

To prep for the Expedia SQL interview it is also helpful to solve SQL questions from other tech companies like:

But if your SQL skills are weak, forget about jumping right into solving questions – refresh your SQL knowledge with this SQL interview tutorial.

DataLemur SQL Course

This tutorial covers things like math functions and filtering data with WHERE – both of these come up routinely in Expedia SQL assessments.

Expedia Group Data Science Interview Tips

What Do Expedia Data Science Interviews Cover?

In addition to SQL interview questions, the other types of questions to practice for the Expedia Data Science Interview are:

  • Statistics and Probability Questions
  • Python or R Programming Questions
  • Business Sense and Product-Sense Questions
  • Machine Learning and Predictive Modeling Questions
  • Behavioral & Resume-Based Questions

Expedia Data Scientist

How To Prepare for Expedia Data Science Interviews?

To prepare for Expedia Data Science interviews read the book Ace the Data Science Interview because it's got:

  • 201 interview questions sourced from Facebook, Google, & Amazon
  • a refresher covering SQL, Product-Sense & ML
  • over 900+ 5-star reviews on Amazon

Ace the DS Interview

© 2024 DataLemur, Inc

Career Resources

Free 9-Day Data Interview Crash CourseFree SQL Tutorial for Data AnalyticsSQL Interview Cheat Sheet PDFUltimate SQL Interview GuideAce the Data Job Hunt Video CourseAce the Data Science InterviewBest Books for Data Analysts