At Nexon, SQL is used all the damn time for analyzing player behavior patterns in games and optimizing database performance for seamless gaming experiences. Unsurprisingly this is why Nexon almost always evaluates jobseekers on SQL questions in interviews for Data Science, Analytics, and & Data Engineering jobs.
To help you prepare for the Nexon SQL interview, we've collected 11 Nexon SQL interview questions – can you solve them?
Nexon is a global leader in online games, with users across the world playing their games for countless hours each day. As a data analyst at Nexon, you are tasked with finding the 'power users' among their players. 'Power users' are defined as users who have spent the most hours playing Nexon's games in the past month. You have access to a 'user_activity' table that logs each gaming session a user embarks on.
Provide a SQL query that will reveal the top 5 power users (user_ids) in the past month (based on the 'session_end' timestamp). Your result should showcase the user_id and total hours spent in descending order.
session_id | user_id | game_id | session_start | session_end |
---|---|---|---|---|
201 | 111 | 1 | 2023-01-01 09:00:00 | 2023-01-01 12:30:00 |
202 | 111 | 1 | 2023-01-02 10:00:00 | 2023-01-02 11:30:00 |
203 | 112 | 2 | 2023-01-01 08:30:00 | 2023-01-01 10:30:00 |
204 | 113 | 1 | 2023-01-03 15:00:00 | 2023-01-03 16:00:00 |
205 | 112 | 2 | 2023-01-02 12:00:00 | 2023-01-02 13:30:00 |
This query first creates a CTE (Common Table Expression) to calculate total gameplay hours for each user in the last month. Then, it selects the top 5 users with the most gameplay hours. The EXTRACT function is used to get the difference in hours between the session_start and session_end. The CURRENT_DATE - interval '1 month' condition ensures that we only take into account the sessions from the past month.
To practice a related super-user data analysis question on DataLemur's free interactive coding environment, try this recently asked Microsoft SQL interview question:
Using a dataset containing game reviews at Nexon, your task is to write a SQL query in PostgreSQL to calculate the monthly average rating for each product. Given the table which has the following columns: , , , , and (represented as an integer from 1 to 5), write a SQL query to provide an output table with columns for month (), , and average star rating .
review_id | user_id | submit_date | product_id | stars |
---|---|---|---|---|
6171 | 123 | 2022-06-08 | 50001 | 4 |
7802 | 265 | 2022-06-10 | 69852 | 4 |
5293 | 362 | 2022-06-18 | 50001 | 3 |
6352 | 192 | 2022-07-26 | 69852 | 3 |
4517 | 981 | 2022-07-05 | 69852 | 2 |
mth | product | avg_stars |
---|---|---|
6 | 50001 | 3.50 |
6 | 69852 | 4.00 |
7 | 69852 | 2.50 |
You would use a SQL query like this:
In the above PostgreSQL query, the function is used to get the month from the . The function is used as an aggregate function to calculate the average of the for each product. The clause is used to group the result-set by month and product. The results are then ordered by month in ascending order and average stars in descending order.
To solve another window function question on DataLemur's free interactive coding environment, try this Google SQL Interview Question:
Cross join and natural join are like two sides of a coin in the world of SQL.
Cross joins is like the wild and reckless cousin who creates a giant new table by combining every row from table A with every row from table B, no questions asked, no common key needed.
Natural joins are like the more refined and selective cousin who only combines rows from multiple tables if they have something in common (i.e., common columns/keys).
While cross join doesn't discriminate and will create a massive table if given the chance, natural join is more selective and only returns a table with the number of rows equal to the number of matching rows in the input tables. So, choose your JOIN wisely!
Sure, here's an interview question tailored towards a gaming company like Nexon, which could be facing challenges such as understanding player gaming habits so as to improve retention.
As a data analyst at Nexon, you are tasked to analyze the gaming habits of players. You want to find the average duration of daily gameplay for each game Nexon offers during the past month.
You have two tables -
session_id | player_id | game_id | start_time | end_time |
---|---|---|---|---|
1235 | 98123 | 250 | 2022-07-01 10:00:00 | 2022-07-01 13:00:00 |
1236 | 18295 | 251 | 2022-07-01 15:00:00 | 2022-07-01 16:00:00 |
1237 | 98123 | 250 | 2022-07-02 10:30:00 | 2022-07-02 12:30:00 |
1238 | 18295 | 250 | 2022-07-02 20:00:00 | 2022-07-02 22:00:00 |
1239 | 57412 | 252 | 2022-07-03 09:00:00 | 2022-07-03 13:00:00 |
game_id | game_title |
---|---|
250 | Maple Story |
251 | Dungeon Fighter |
252 | Mabinogi |
game_title | average_playtime |
---|---|
Maple Story | 2.50 |
Dungeon Fighter | 1.00 |
Mabinogi | 4.00 |
Please note that the output's average playtime is in 'hours' for ease of understanding.
This query first calculates the duration of each game session in hours by subtracting the start_time from the end_time and converting the interval to hours. It then averages this duration for each game over the past month. The results are grouped by game_name using a join on the game_id between the two tables.
A join in SQL allows you to retrieve data from multiple tables and combine it into a single set of results.
To demonstrate the difference between left vs. right join, imagine you had two database tables, an advertising campaigns table which had information on Google Ads keywords and how much was bid for each keyword, and a sales table, which has data on how many products were sold and which Google Ads keyword drove that sale.
: A LEFT JOIN retrieves all rows from the left table (in this case, the table) and any matching rows from the right table (the Sales table). If there is no match in the right table, values will be returned for the right table's columns.
: A RIGHT JOIN retrieves all rows from the right table (in this case, the Sales table) and any matching rows from the left table (the table). If there is no match in the left table, values will be returned for the left table's columns.
Nexon Corporation is analyzing game purchases and revenues from different customers. The campaign is targeted towards customers who made at least three purchases or have spent over 100,000 in total revenue, regardless of the number of purchases. Given the customer records, write a query to filter and list the customers.
customer_id | name |
---|---|
1 | John Doe |
2 | Jane Smith |
3 | Robert Johnson |
4 | Jennifer Williams |
5 | Linda Brown |
purchase_id | customer_id | purchase_date | product_id | amount_spent |
---|---|---|---|---|
101 | 1 | 06/08/2022 | 201 | 20000 |
102 | 1 | 06/09/2022 | 202 | 05000 |
103 | 1 | 06/09/2022 | 203 | 15000 |
104 | 2 | 06/10/2022 | 204 | 78000 |
105 | 2 | 06/11/2022 | 205 | 25000 |
106 | 3 | 06/10/2022 | 206 | 50000 |
107 | 4 | 06/12/2022 | 207 | 30000 |
108 | 5 | 06/13/2022 | 208 | 120000 |
This query first makes a sub-query on table to find the number of purchases() and total revenue() for each customer. It then performs operation on and the sub-query on , and applies clause to filter out customers who made at least three purchases or have total revenue over 100,000. The result of the query is the list of these customers' names.
In database schema design, a one-to-one relationship is when each entity is associated with only one instance of the other. For instance, a US citizen's relationship with their social-security number (SSN) is one-to-one because each citizen can only have one SSN, and each SSN belongs to one person.
A one-to-many relationship, on the other hand, is when one entity can be associated with multiple instances of the other entity. An example of this is the relationship between a person and their email addresses - one person can have multiple email addresses, but each email address only belongs to one person.
Nexon maintains a customer records database. They are specifically interested in identifying customers who have their email addresses registered with certain internet service providers (ISPs). Write a SQL query to find all Nexon customers whose email addresses end with "@isp.com" or "@hosting.com".
Sample tables:
CustomerID | FirstName | LastName | |
---|---|---|---|
1001 | John | Doe | john.doe@net.com |
1002 | Jane | Smith | jane.smith@isp.com |
1003 | Mike | Johnson | mike.johnson@hosting.com |
1004 | Emma | Brown | emma.brown@misc.com |
1005 | Tom | Anderson | tom.anderson@isp.com |
CustomerID | FirstName | LastName | |
---|---|---|---|
1002 | Jane | Smith | jane.smith@isp.com |
1003 | Mike | Johnson | mike.johnson@hosting.com |
1005 | Tom | Anderson | tom.anderson@isp.com |
This SQL query uses the LIKE keyword to filter the Customers table based on the Email field. The '%' character in the LIKE patterns is a wildcard that matches any sequence of characters. Therefore, '%@isp.com' matches any email ending with "@isp.com" and '%@hosting.com' matches any email ending with "@hosting.com". The query returns all columns for the matching customer records.
Nexon is a global video game company that specializes in building online communities. The company would like to understand the behavior of their customer base by analyzing game purchases and usage. The team is specifically interested in the count of unique customers who have bought any game and logged into the game within the last 30 days.
Two tables are available for this analysis: and .
purchase_id | customer_id | game_id | purchase_date |
---|---|---|---|
1 | 123 | 001 | 08/10/2022 |
2 | 265 | 002 | 08/12/2022 |
3 | 362 | 003 | 08/20/2022 |
4 | 192 | 002 | 09/10/2022 |
5 | 981 | 003 | 09/18/2022 |
login_id | customer_id | game_id | login_date |
---|---|---|---|
1 | 123 | 001 | 09/10/2022 |
2 | 265 | 002 | 09/12/2022 |
3 | 362 | 003 | 09/20/2022 |
4 | 192 | 002 | 10/10/2022 |
5 | 981 | 001 | 10/18/2022 |
The columns of the tables are:
Write a SQL Query to find out the count of unique customers who have purchased any game and logged into that game in the last 30 days.
This query performs an inner join on the and tables with the condition that both the and match across the tables. This ensures we're looking at logins for games that were actually purchased by the customer. It also uses a WHERE clause to filter down to purchases and logins that are in the past 30 days. Finally, it uses to find the count of unique customers who have both purchased a game and logged into it within the last 30 days.
Because joins come up so often during SQL interviews, try an interactive Spotify JOIN SQL question:
No, in 99% of normal cases a and do NOT produce the same result.
You can think of more as set addition when it combines the results of two tables.
You can think of a more as set multiplication, producing all combos of table 1 with combos of table 2.
Nexon wants to evaluate player behavior in one of their games to develop new features and tweaks. Specifically, they need to understand the average, minimum, and maximum time spent by users playing the game each day, and the total time spent across all days. They also want to know the total number of users who logged into the game each day, and the square root of total time spent by the users on any day. For simplicity, assume there is no data error and daily time spent by users is logged properly.
activity_id | user_id | game_date | time_spent |
---|---|---|---|
1 | 100 | 2021-07-01 | 120 |
2 | 101 | 2021-07-01 | 130 |
3 | 100 | 2021-07-02 | 140 |
4 | 102 | 2021-07-02 | 150 |
5 | 103 | 2021-07-03 | 160 |
6 | 104 | 2021-07-03 | 110 |
game_date | num_players | min_time_spent | avg_time_spent | max_time_spent | total_time_spent | sqrt_total_time |
---|---|---|---|---|---|---|
2021-07-01 | 2 | 120 | 125 | 130 | 250 | 15.81 |
2021-07-02 | 2 | 140 | 145 | 150 | 290 | 17.03 |
2021-07-03 | 2 | 110 | 135 | 160 | 270 | 16.43 |
The provided PostgreSQL query first groups the data by . It then calculates the number of distinct players (), minimum time spent (), average (), maximum (), and total time () that the players spent in the game. It also computes the square root of the total time spent (), which is rounded to two decimal places for better readability. The resulting data is an overview of player activity for each game date.
To practice a very similar question try this interactive Snapchat Sending vs. Opening Snaps Question which is similar for calculating and analyzing time spent users or this Amazon Server Utilization Time Question which is similar for calculating total time around a specific action.
Assuming that you've already got basic SQL skills, the next best tip we have to prepare for the Nexon SQL interview is to solve as many practice SQL interview questions as you can! In addition to solving the earlier Nexon SQL interview questions, you should also solve the 200+ SQL coding questions which come from companies like Microsoft, Google, Amazon, and tech startups.
Each exercise has hints to guide you, fully explained answers along with a discussion board to see how others solved it and crucially, there is an online SQL code editor so you can right online code up your SQL query and have it executed.
To prep for the Nexon SQL interview it is also a great idea to solve SQL problems from other tech companies like:
But if your SQL coding skills are weak, forget about going right into solving questions – go learn SQL with this SQL tutorial for Data Scientists & Analysts.
This tutorial covers SQL concepts such as CASE/WHEN/ELSE statements and joins – both of which pop up frequently in SQL job interviews at Nexon.
In addition to SQL interview questions, the other question categories to practice for the Nexon Data Science Interview are:
To prepare for Nexon Data Science interviews read the book Ace the Data Science Interview because it's got: