An Idea can change your life.....

Friday, December 07, 2007

SQL Server - Crosstab queries using PIVOT

Problem:
SQL Server 2000 had no simple way to create cross-tab queries, but the PIVOT operator introduced in SQL Server 2005 makes this a bit easier.

Solution:
SQL Server 2005 introduced a lot of new features, and one of them is PIVOT. It lets you turn query results on their side: instead of results listed down the page, as in the listing below, you get results listed across.

SalesPerson  Product  SalesAmount
Bob          Pickles  $100.00
Sue          Oranges  $50.00
Bob          Pickles  $25.00
Bob          Oranges  $300.00
Sue          Oranges  $500.00
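
To try the queries in this post yourself, the listing above can be loaded into a table. This is only a sketch: the table and column names match the ones used in the query further down, but the column types are my assumption, since the post does not show the actual table definition.

```sql
-- Assumed schema: only the table and column names come from the post;
-- the data types are a guess.
CREATE TABLE ProductSales (
    SalesPerson VARCHAR(20) NOT NULL,
    Product     VARCHAR(20) NOT NULL,
    SalesAmount MONEY       NOT NULL
);

-- One INSERT per row, because SQL Server 2005 does not accept
-- multi-row VALUES lists.
INSERT INTO ProductSales (SalesPerson, Product, SalesAmount) VALUES ('Bob', 'Pickles', 100.00);
INSERT INTO ProductSales (SalesPerson, Product, SalesAmount) VALUES ('Sue', 'Oranges', 50.00);
INSERT INTO ProductSales (SalesPerson, Product, SalesAmount) VALUES ('Bob', 'Pickles', 25.00);
INSERT INTO ProductSales (SalesPerson, Product, SalesAmount) VALUES ('Bob', 'Oranges', 300.00);
INSERT INTO ProductSales (SalesPerson, Product, SalesAmount) VALUES ('Sue', 'Oranges', 500.00);
```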

A straight query lists these results down the page, but the ideal solution would be to list the Products across the top, with one row for each SalesPerson, such as the following:

SalesPerson  Oranges  Pickles
Bob          $300.00  $125.00
Sue          $550.00  NULL

To use PIVOT you need to understand the data and how you want it displayed. There are three things to decide: which column supplies the rows (here SalesPerson), which column's values become the new columns (here Product), and which value is summarized at each intersection (here SalesAmount). Here is a simple query that returns the cross-tab results.


SELECT SalesPerson,
       [Oranges] AS Oranges,
       [Pickles] AS Pickles
FROM (SELECT SalesPerson,
             Product,
             SalesAmount
      FROM ProductSales) ps
PIVOT
(SUM(SalesAmount)
 FOR Product IN ([Oranges], [Pickles])) AS pvt


So how does this work?

There are three pieces that need to be understood in order to construct the query.
(1) The SELECT statement
SELECT SalesPerson, [Oranges] AS Oranges, [Pickles] AS Pickles
This portion of the query selects the three columns for the final result set (SalesPerson, Oranges, Pickles).

(2) The query that pulls the raw data to be prepared
(SELECT SalesPerson, Product, SalesAmount FROM ProductSales) ps
This query pulls all the rows of data that we need to create the cross-tab results. The ps after the query is an alias for this derived table (it is not a temporary table); the PIVOT expression in part (3) operates on it.

(3) The PIVOT expression
PIVOT (SUM (SalesAmount) FOR Product IN ( [Oranges], [Pickles]) ) AS pvt
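This expression does the actual summarization: it sums SalesAmount for each SalesPerson and turns the listed Product values into columns. The result is given the alias pvt, which is what the SELECT in part (1) reads from.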
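
One way to see what the PIVOT expression is doing is to write the same summarization by hand. The query below is a sketch of the CASE-based approach commonly used before PIVOT existed; it is not from the original post, but it runs against the same ProductSales table and should return the same rows.

```sql
-- Hand-written equivalent of the PIVOT query.
-- Each CASE picks out the SalesAmount for one product and yields NULL
-- for every other row; SUM ignores those NULLs.
SELECT SalesPerson,
       SUM(CASE WHEN Product = 'Oranges' THEN SalesAmount END) AS Oranges,
       SUM(CASE WHEN Product = 'Pickles' THEN SalesAmount END) AS Pickles
FROM ProductSales
GROUP BY SalesPerson;
```

Here the GROUP BY on SalesPerson is spelled out. With PIVOT the grouping is implicit: every column of the source query that is not named in the PIVOT expression becomes a grouping column, which is a good reason for the inner query to select only the columns it needs.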

Another key thing to notice here is the use of the square brackets [ ] around the names in both the SELECT in part (1) and the IN list in part (3). These are key: the names in the IN list are values from the Product column that the pivot operation turns into column names, and the brackets mark them as identifiers rather than string literals such as 'Oranges'. Only the values listed there become columns, and this is how the breaking and grouping is done to display the data.
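
Because the bracketed names are ordinary columns by the time the outer SELECT sees them, they can be used in expressions like any other column. As a small illustration (my addition, not part of the original query), the variant below wraps each pivoted column in ISNULL so that a salesperson with no sales of a product shows 0 instead of NULL.

```sql
-- Same PIVOT as before; only the outer SELECT list changes.
-- [Oranges] and [Pickles] are columns produced by the PIVOT,
-- so they can be passed to ISNULL like any other column.
SELECT SalesPerson,
       ISNULL([Oranges], 0) AS Oranges,
       ISNULL([Pickles], 0) AS Pickles
FROM (SELECT SalesPerson,
             Product,
             SalesAmount
      FROM ProductSales) ps
PIVOT
(SUM(SalesAmount)
 FOR Product IN ([Oranges], [Pickles])) AS pvt;
```

With the sample data, Sue has no Pickles sales, so that is the one cell this changes.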
