In this Lab session you will write 15 SQL queries for the TPCH database. You will execute the queries for the data loaded in Lab 2 and check the results.

Write SQL statements for the following queries on the TPCH database (1 pt. per query):

  1. How many customers are not from EUROPE or AFRICA?

  1. How many suppliers in each region have more than the average account balance of their own region?

  1. For the lineitems ordered in May 1995 (o orderdate), nd the largest discount that is less than the average discount among all the orders.

  1. How many customers and suppliers are in each country from EUROPE?

  1. For parts of type STEEL, return the name of the supplier from AMERICA that can supply them at minimum cost (ps supplycost) among the American suppliers. Print the supplier name for each part size together with the minimum cost.

  1. Based on the available quantity of items, what is the manufacturer of the most popular item (the more popular, the less available items in ps availqty) from Supplier#000000053?

  1. For each order priority, count the number of parts ordered in 1996 and received earlier (l receiptdate) than the commit date.

  1. Count the number of distinct suppliers that supply items with type MEDIUM POLISHED and size equal to any of 3, 23, 26, and 49.

  1. Count the number of supplied parts that have total value (ps supplycost*ps availqty) in the top 3% value across all the supplied parts and are supplied by suppliers from CANADA.

  1. How many customers from each region have never placed an order and have more than the average account balance?

  1. Return the highest value (l extendedprice*(1-l discount)) items not shipped as of October 2, 1994.

  1. What is the total supply cost for parts less expensive than $1000 (p retailprice) shipped in 1996 (l shipdate) by suppliers who did not ship any part with an extended price less than 1000 in 1995?

  1. Count the number of orders made in the fourth quarter of 1996 in which at least one line item was received by the customer later than its committed date. List the count of such orders for each order priority sorted in ascending priority order.

  1. For any two regions, nd the gross discounted revenue (l extendedprice*(1-l discount)) derived from line items in which parts were shipped from a supplier in either region to a customer in the other region in 1995 and 1996. List the supplier region, the customer region, the year (l shipdate), and the revenue from shipments that took place in that year. Order the answers by supplier region, customer region, and year (all ascending).

  1. The market share for a given nation within a given region is de ned as the fraction of the revenue, sum of l extendedprice*(1-l discount), from the line items in that region that were supplied by suppliers from the given nation. Determine the market share of UNITED STATES in EUROPE in 1996 (l shipdate).

Instructions. You are required to execute the queries on the TPCH database you populated in Lab 2. We provide you the correct answers for this database. We will check the correctness of your queries on a di erent database. Folder Lab5 under Lab contains an archive with the correct answers. There is a le for each query, e.g., 1.out for query 1, 2.out for query 2, etc. You are required to turn in an archive containing a le with the SQL statement for every query. The names of the les have to be 1.sql for query 1, 2.sql for query 2, and so on. There should be exactly 15 les in the archive le. The name of the archive le has to be where firstname and lastname are your rst and last names, respectively. The type of the archive le has to be zip.

Lab 5 2

error: Content is protected !!