9 Akamai SQL Interview Questions (Updated 2024)

At Akamai, SQL is used across the company for analyzing traffic patterns across Akamai's distributed network, for querying of their vast web logs dataset for anomaly detection, and in supporting their Managed Databases product. That's why Akamai frequently asks SQL problems during interviews for Data Science and Data Engineering positions.

So, to help you study for the Akamai SQL interview, here’s 9 Akamai SQL interview questions – able to answer them all?

9 Akamai SQL Interview Questions

SQL Question 1: Identify the Most Engaged Customers at Akamai

Akamai is a leading content delivery and cloud services provider. In this exercise, we have access to two tables from Akamai's database. The first table records basic information about customers, and the second table records every single interaction each customer has with Akamai's systems.

Your task is to write a SQL query to identify the top 5 customers who generated the most web traffic in the last month. For this exercise, we'll determine a customer's "activity" level by the number of entries their ID has in the table for the last month.

Positions at Akamai can involve analyzing customer engagement and identifying key players or "power users". Being able to create a list of these important customers can aid in targeted marketing or special customer care programs.

Sample Input:
customer_idnamesign_up_date
123ABC Corp2022-01-01 00:00:00
456DEF Inc2022-03-05 00:00:00
789GHI Ltd2022-05-10 00:00:00
321JKL Corp2022-07-15 00:00:00
654MNO Inc2022-08-20 00:00:00
Sample Input:
traffic_idcustomer_iddate
1111232022-09-01 00:00:00
2221232022-09-05 00:00:00
3334562022-09-10 00:00:00
4441232022-09-15 00:00:00
5557892022-09-20 00:00:00

Answer:


This query first creates a Common Table Expression ( clause) to count the number of interactions each customer has made in the last month. Then it joins this CTE with the table to get the names of the customers, and sorts by to get the top 5 most active customers.

Remember, gives the first day of the current month at 00:00:00, so we subtract an interval of '1 month' to get the same date-time but for the previous month.

To practice a related super-user data analysis question on DataLemur's free online SQL coding environment, try this Microsoft Teams Power User SQL Interview Question: Microsoft SQL Interview Question: Teams Super User

SQL Question 2: Analyze Traffic Volume per Service for Akamai

Akamai Technologies is a Content Delivery Network (CDN) and cloud service provider. They handle huge amounts of web and video traffic for their clients. As a data analyst at Akamai, let's assume that you have been given the task to analyze the traffic volume handled by each service over different months. You need to write a SQL query to find out the total traffic (in GB) handled by each service per month.

Please note, traffic is calculated per request and we need to aggregate it per service per month.

Example Input:
request_idservice_idrequest_datetraffic_volume_gb
101106/08/20221.2
102206/10/20222.5
103106/18/20220.6
104207/26/20221.8
105107/05/20221.4
Example Output:
monthservice_idtotal_traffic_gb
611.80
622.50
711.40
721.80

To keep it simple we are only considering data for 2 months.

Answer:


This SQL query first extracts the month from the request_date and then uses the SUM window function to calculate the total traffic per service per month. The PARTITION BY clause is used to split the data into partitions based on the service_id and month. The results are then ordered by month and service_id.

To practice a related window function SQL problem on DataLemur's free online SQL coding environment, try this Google SQL Interview Question: Google SQL Interview Question

SQL Question 3: Does a typically give the same results as a ?

For all practical purposes, and do NOT produce the same results.

While both are similar in that they combine two tables, you can think of joins as increasing the width of the resulting table (you'll have more columns in the result set for a left/inner/right join), whereas a union is used to combine rows which increases the height of the result set but keeps the column count the same.

If your familiar with set operations, you can think of a as set addition, whereas a is set multiplication.

Akamai SQL Interview Questions

SQL Question 4: Traffic Monitoring and Response Time Management

Akamai is one of the leading Content Delivery Network (CDN) and cloud service providers. Besides providing services to deliver and accelerate content, another critical aspect of their operation is to monitor network traffic and service response times. For this question, we'll consider a simple scenario where Akamai wants to analyse the traffic of various client requests and the response time they are experiencing.

Imagine Akamai has two tables:

  1. 'clients' table that has information about all clients the Akamai network serves.
  2. 'website_data' table that records each request serving details in terms of date, time, file size served, and response time.

Design a SQL query that computes the volume of data served for each client each month, along with the average response time.

Example Input:
client_idclient_name
1ClientA
2ClientB
3ClientC
Example Input:
data_idclient_iddate_timefile_size_MBresponse_time_ms
1102/01/2022 00:00:0035200
2102/01/2022 01:00:0050250
3202/01/2022 00:00:0040150
4302/01/2022 00:00:0045180
5103/01/2022 00:00:0030210
Example Output:
monthclient_nametotal_data_MBavg_response_time_ms
2ClientA85225
2ClientB40150
2ClientC45180
3ClientA30210

Answer:

To achieve this, we first need to join the 'clients' and the 'website_data' tables on the 'client_id'.
Next, group by month and client name, then sum the 'file_size_MB' and take the mean of 'response_time_ms' for each group.


This query provides a monthly summary of the total data served per client, along with the average response time, which can be valuable for performance monitoring and identifying areas for improvement.

SQL Question 5: How is a foreign key different from a primary key in a database?

To explain the difference between a primary key and foreign key, let's start with some data from Akamai's marketing analytics database which stores data from Google Ads campaigns:

:

+------------+------------+------------+------------+ | ad_id | campaign_id| keyword | click_count| +------------+------------+------------+------------+ | 1 | 100 | Akamai pricing | 10 | | 2 | 100 | Akamai reviews | 15 | | 3 | 101 | Akamai alternatives | 7 | | 4 | 101 | buy Akamai | 12 | +------------+------------+------------+------------+

In this table, could be a primary key. It is unique for each row in the table and cannot contain null values.

could be a foreign key. It references the of the Google Ads campaign that each ad belongs to, establishing a relationship between the ads and their campaigns. This foreign key allows you to easily query the table to find out which ads belong to a specific campaign, or to find out which campaigns a specific ad belongs to.

It is also possible for a table to have multiple foreign keys that reference different primary keys in other tables. For example, the table could have additional foreign keys for the of the ad group that each ad belongs to, and the of the Google Ads account that the campaigns belong to.

SQL Question 6: Average Bandwidth Consumption per Customer

"What is the average bandwidth consumed each week by each customer using Akamai's CDN solution?"

Akamai is well known for its globally-distributed content delivery network (CDN), so this type of question could be relevant for a position related to their core business.

Here are some possible sample tables relating to the aforementioned problem:

Example Input:
usage_idcustomer_idweek_end_datetotal_bandwidth_gb
10112306/05/2022 00:00:0050
10212306/12/2022 00:00:0075
10312306/19/2022 00:00:0060
10445606/05/2022 00:00:0025
10545606/12/2022 00:00:0040
10678906/19/2022 00:00:0080
10778906/26/2022 00:00:0065
Example Output:
customer_idaverage_bandwidth_gb
12361.67
45632.50
78972.50

Answer:


This query computes the average bandwidth used per customer over all weeks recorded. To specify a particular week or range of weeks, you could modify the and clauses accordingly. For example, to compute the average over only the week ending June 19th, you could use:


To practice a very similar question try this interactive Amazon Server Utilization Time Question which is similar for calculating total usage or this Microsoft Supercloud Customer Question which is similar for tracking customer usage.

SQL Question 7: Can you explain what / SQL commands do?

The / operator is used to remove to return all rows from the first SELECT statement that are not returned by the second SELECT statement.

Note that is available in PostgreSQL and SQL Server, while MINUS is available in MySQL and Oracle (but don't stress about knowing which DBMS supports what exact commands since the interviewers at Akamai should be lenient!).

Here's a PostgreSQL example of using EXCEPT to find all of Akamai's Facebook video ads with more than 10k views that aren't also being run on YouTube:


If you want to retain duplicates, you can use the EXCEPT ALL operator instead of EXCEPT. The EXCEPT ALL operator will return all rows, including duplicates.

SQL Question 8: Compute average usage of network bandwidth

As a network services provider, Akamai deals with large volumes of data involving network usage patterns. Assume you have a table that captures the usage of network bandwidth every 10 minutes for various servers across the globe. You need to compute the average usage of network bandwidth for each server for each day, rounded to two decimal places. The daily usage is computed as the absolute difference between the maximum and minimum usage for that day.

Also, you need to compute the square root of the maximum daily usage for each server and provide it along with the daily averages.

Example Input:
entry_idserver_identry_timebandwidth_usage_mb
2001S12022-07-06 10:00:00580.26
2002S22022-07-06 10:00:00720.15
2003S12022-07-06 10:10:00635.35
2004S12022-07-06 10:20:00720.58
2005S22022-07-06 10:20:00880.26
2006S12022-07-07 10:00:00700.34
2007S22022-07-07 10:10:00890.57
Expected Output:
dayserver_idavg_usagesqrt_max_usage
2022-07-06S170.1626.85
2022-07-06S280.0529.67
2022-07-07S10.0026.46
2022-07-07S20.0029.84

Answer:


The clause is used here to categorize data by day and server_id. The bandwidth_usage statistics for each server each day are calculated using aggregate functions like MAX, MIN, AVG, ABS and SQRT to perform mathematical operations.

To practice a very similar question try this interactive Amazon Server Utilization Time Question which is similar for calculating total usage of a resource or this Twitter Tweets' Rolling Averages Question which is similar for calculating rolling averages over time.

SQL Question 9: Traffic Data Analysis

As a content delivery network and cloud service provider, Akamai handles a large amount of web traffic data. Imagine you are a data engineer at Akamai and you're working with timestamp data from the table. Your task? Write a SQL query to calculate the average number of daily requests per service for each month.

Example Input:
log_idtimestampservice_idrequests
12022-01-01 00:00:011004000
22022-01-02 12:01:001002500
32022-01-15 10:04:002002000
42022-01-20 08:30:011006000
52022-02-12 14:20:102005000
62022-02-30 03:00:001004500
72022-02-15 06:15:102003000

Expected Output:

monthservice_idavg_daily_requests
11004166.67
12005000
21004500
22002666.67

Answer:

Here's a PostgreSQL query that could be used to answer that question:


This query does the following:

  1. The function is used to pull out the month from the .
  2. The clause allows us to separate the data by month and by .
  3. Finally, calculates the average number of requests in each group.

Since there's no WHERE clause, the query will return data for all months and services in the table.

Preparing For The Akamai SQL Interview

Assuming that you've already got basic SQL skills, the next best tip we have to prepare for the Akamai SQL interview is to solve as many practice SQL interview questions as you can! In addition to solving the earlier Akamai SQL interview questions, you should also solve the 200+ SQL questions from real Data Science & Analytics interviews which come from companies like Google, Uber, and Microsoft. DataLemur Question Bank

Each SQL question has hints to guide you, detailed solutions and crucially, there is an online SQL coding environment so you can right online code up your query and have it checked.

To prep for the Akamai SQL interview you can also be useful to solve SQL problems from other tech companies like:

However, if your SQL foundations are weak, forget about diving straight into solving questions – improve your SQL foundations with this free SQL for Data Analytics course.

SQL interview tutorial

This tutorial covers SQL topics like SQL joins with practice exercises and joining a table to itself – both of which come up frequently during SQL job interviews at Akamai.

Akamai Data Science Interview Tips

What Do Akamai Data Science Interviews Cover?

Besides SQL interview questions, the other types of problems to practice for the Akamai Data Science Interview are:

Akamai Data Scientist

How To Prepare for Akamai Data Science Interviews?

The best way to prepare for Akamai Data Science interviews is by reading Ace the Data Science Interview. The book's got:

  • 201 Interview Questions from tech companies like Google & Microsoft
  • A Refresher on Python, SQL & ML
  • Great Reviews (900+ reviews, 4.5-star rating)

Acing Data Science Interview

© 2024 DataLemur, Inc

Career Resources

Free 9-Day Data Interview Crash CourseFree SQL Tutorial for Data AnalyticsSQL Interview Cheat Sheet PDFUltimate SQL Interview GuideAce the Data Job Hunt Video CourseAce the Data Science InterviewBest Books for Data Analysts