Polymorphic Table Functions Example – (NOT) Transposing Rows to Columns

It was not possible for me to write a follow-up to my last post about Transposing Columns To Rows with PTF showing an opposite task of transposing rows to columns right next weekend as I thought. Partly because of our awesome Trivadis TechEvent which took place back then and partly because this kind of the exercise turned out to be much more difficult one as supposed. Actually it is a nice example to see the limitations of the new feature.

I also realized that it was probably not a good idea to start to write about PTF with these two examples going too deep in the detail. That’s why I plan to write some step-by-step posts on PTF.

But for now, let’s stay with the example. What we achieved in the previous post? We transposed the columns of the SCOTT.EMP table into rows, so that the result set looked like this:


Figure 1

The problem

Now we want to query this structure and get the result set in a form like the original SCOTT.EMP had. What is the problem here? Assume that we don’t know the structure of SCOTT.EMP, and we need to reproduce it based on the pictured data set. Yes, we could do this, but only by looking at the data itself. For example, we need to query all rows for the EMPNO=7839 and then we know, there are seven columns in addition to EMPNO and what their data types are (based on NOT NULL values in corresponding columns). If these can vary from EMPNO to EMPNO, then we need to query the whole data set to determine the final output structure.

On the other hand, what degree of flexibility the polymorphic functions are providing? At design time we don’t need to know the shape of the input table and don’t have to determine the shape of the return type. The return type is determined by the PTF invocation arguments list. Now we have two phases: SQL cursor compilation and execution. During the compilation the function DESCRIBE is called and that is the point where the return type must be determined. We cannot change the shape of the result set during the execution.

So we are not flexible enough to do this. Can we work around this? What if we could query the source data set inside the DESCRIBE function and thus would be able to determine the structure? Of course, that means we would dramatically increase the parse time, possibly reading the whole data before the execution even starts. But let’s try.

If I want to know what keys with their respective data types are stored in the table T, I would possibly use some query like this:

SELECT DISTINCT  
       key_name 
,      first_value(datatype) OVER (PARTITION BY key_name 
                                   ORDER BY rnk DESC) datatype 
FROM   (SELECT key_name  
        ,      CASE  
                   WHEN key_val_char IS NOT NULL THEN 'C' 
                   WHEN key_val_date IS NOT NULL THEN 'D' 
                   WHEN key_val_number IS NOT NULL THEN 'N' 
                   ELSE 'C' 
               END datatype 
        ,      CASE  
                   WHEN key_val_char IS NULL 
                        AND key_val_date IS NULL 
                        AND key_val_number IS NULL 
                   THEN 0 
                   ELSE 1 
               END rnk 
        FROM t);

KEY_NAME                       D
------------------------------ -
ENAME                          C
HIREDATE                       D
DEPTNO                         N
JOB                            C
SAL                            N
COMM                           N
MGR                            N

7 rows selected. 

Listing 1

If there are rows where the value for the key is null like the key COMM for EMPNO=7839, you cannot definitely say what datatype the key has. I introduced the column RNK, to address this issue, so that the rows with values for particular key will be considered first. Of course this implementation is by no means complete and doesn’t handle all cases, e.g. what happens if values for the same key are stored differently, sometimes as number, sometimes as string? But I think, it is sufficient for our example.

Try to get the structure with a PTF

Let us first try to develop a PTF which returns exactly this information and postpone the task of pivoting the rows for a while.
We’ll implement the function GET_COLS which can be called as follows:

  

SELECT * FROM get_cols(t);

Listing 2

First create an implementation package for the function. What we need is the function DESCRIBE and the procedure FETCH_ROWS. We don’t need the optional OPEN and CLOSE procedures here. For this simple example we also don’t need the parameter of the type COLUMNS, since we are not going to define some logic based on the specified columns.

  
CREATE OR REPLACE PACKAGE get_cols_pkg AS 

  FUNCTION describe(tab  IN OUT DBMS_TF.TABLE_T) 
                    RETURN DBMS_TF.DESCRIBE_T; 
 
  PROCEDURE fetch_rows; 
 
END get_cols_pkg; 
/

Listing 3

And now the implementation along with some explanations.
Function DESCRIBE:

  • Lines 14-17: We want two columns (COLUMN_NAME and DATATYPE) to be in the result set, so we need to define them using COLUMN_METADATA_T record type. The collection NEW_COLS will be returned out of the DESCRIBE function as a part of DESCRIBE_T record type (see the lines 54 and 57)
  • Lines 20-23: All columns from the original table are not of interest: we just mark both properties PASS_THROUGH and FOR_READ as FALSE
  • Lines 25-46: Define and execute a dynamic SQL statement to assess the existing key names and their data types
  • Lines 48-52: Actually this information is already the answer to be returned from our PTF. We should save it until the execution phase using the “compilation store”

Procedure FETCH_ROWS:

  • Line 71: Get input row set
  • Lines 73-85: Until there are columns we have saved in the compilation store in the DESCRIBE function, fill the collections NAMECOL and TYPECOL if not yet done (check execution store at line 78). Suppress all other rows (line 74). Notice the processed column in the execution store (line 82).
  • Line 87: Set the row replication with prepared collection REPFAC
  • Lines 89-90: Put the collections with processed values in the output result set
 

CREATE OR REPLACE PACKAGE BODY get_cols_pkg AS 
FUNCTION describe(tab  IN OUT DBMS_TF.TABLE_T) 
RETURN DBMS_TF.DESCRIBE_T  
AS 
   new_cols DBMS_TF.COLUMNS_NEW_T; 
   col_id   PLS_INTEGER := 1; 
   v_sql VARCHAR2(2000); 
   type t_colrec is record (column_name dbms_id, datatype varchar2(1)); 
   type t_colrectab is table of t_colrec index by pls_integer; 
   v_colrectab t_colrectab; 
   descr DBMS_TF.describe_t; 
BEGIN 
-- We only want these two columns
  new_cols(1) := dbms_tf.column_metadata_t(name => 'COLUMN_NAME',  
                                         TYPE => dbms_tf.type_varchar2); 
  new_cols(2) := dbms_tf.column_metadata_t(name => 'DATATYPE',  
                                         TYPE => dbms_tf.type_varchar2); 
-- We don't need any other columns, neither for passing them through 
--    nor for reading
  FOR I IN 1 .. tab.COLUMN.COUNT LOOP 
      tab.column(i).pass_through := FALSE; 
      TAB.COLUMN(i).for_read := FALSE; 
  END LOOP; 
   
  v_sql := 
    q'! SELECT DISTINCT  
           key_name 
    ,      first_value(datatype) OVER (PARTITION BY key_name 
                                       ORDER BY rnk DESC) datatype 
    FROM   (SELECT key_name  
            ,      CASE  
                      WHEN key_val_char IS NOT NULL THEN 'C' 
                      WHEN key_val_date IS NOT NULL THEN 'D' 
                      WHEN key_val_number IS NOT NULL THEN 'N' 
                      ELSE 'C' 
                   END datatype 
            ,      CASE  
                       WHEN key_val_char IS NULL 
                            AND key_val_date IS NULL 
                            AND key_val_number IS NULL THEN 0 
                       ELSE 1 
                    END rnk 
            FROM t)
     !'; --'
  
  EXECUTE IMMEDIATE v_sql BULK COLLECT INTO v_colrectab; 

-- Save the results into the "compilation strore"   
  FOR i IN 1..v_colrectab.count LOOP 
    descr.cstore_chr('c'||i) := v_colrectab(i).column_name; 
    descr.cstore_chr('d'||i) := v_colrectab(i).datatype; 
  END LOOP; 
   
  descr.new_columns := new_cols; 
  descr.row_replication:=true; 
   
  RETURN descr; 

END; 
 
PROCEDURE fetch_rows IS 
    inp_rs DBMS_TF.row_set_t; 
    colcnt PLS_INTEGER; 
    rowcnt PLS_INTEGER; 
    repfac dbms_tf.tab_naturaln_t; 
    namecol dbms_tf.tab_varchar2_t; 
    typcol dbms_tf.tab_varchar2_t; 
    colname dbms_id; 
    coltype varchar2(1); 
BEGIN  
    dbms_tf.get_row_set(inp_rs, rowcnt, colcnt); 
  
    FOR i IN 1..rowcnt LOOP
       repfac(i) := 0; 
       IF dbms_tf.cstore_exists ('c'||i) THEN 
	  dbms_tf.cstore_get('c'||i, colname); 
          dbms_tf.cstore_get('d'||i, coltype); 
     	  IF NOT dbms_tf.xstore_exists(colname) THEN   
             namecol(nvl(namecol.last+1,1)) := colname; 
             typcol(nvl(typcol.last+1,1)) := coltype; 
             repfac(i) := 1; 
             dbms_tf.xstore_set(colname,1); 
          END IF; 
       END IF; 
    END LOOP; 
	 
    dbms_tf.row_replication(replication_factor => repfac);  
 
    dbms_tf.put_col(1, namecol); 
    dbms_tf.put_col(2, typcol); 

END; 
 
END get_cols_pkg; 
/

Listing 4: Package Body get_cols_pkg

Now we define and test the function itself.

 

SQL> CREATE OR REPLACE FUNCTION get_cols(tab TABLE) RETURN TABLE   
  2  PIPELINED row POLYMORPHIC USING get_cols_pkg; 
  3  /

Function GET_COLS compiled

SQL> 
SQL> SELECT  *  
  2  FROM    get_cols(t);

COLUMN_NAME                    DATAT
------------------------------ -----
ENAME                          C    
HIREDATE                       D    
DEPTNO                         N    
JOB                            C    
SAL                            N    
COMM                           N    
MGR                            N    

7 rows selected. 

Listing 5

It works! But wait a minute! We have hard coded the name of the table into the dynamic SQL query! It is very easy to fix, isn’t it? NO! The function DESCRIBE has indeed the parameter TAB of the record type DBMS_TF.TABLE_T, but there is no such thing as table name there. In fact, this is reasonable since what we can pass in the PTF doesn’t need to be a table, but can be a CTE (common table expression), e.g. a WITH-subquery. As of now I only see the quick and dirty fix of providing an extra string parameter holding whatever to select from: a table name or a subquery text. Listing 6 shows a solution with an extra parameter to_select_from.

 

CREATE OR REPLACE PACKAGE get_cols_pkg AS 
FUNCTION describe(tab  IN OUT DBMS_TF.TABLE_T
                , to_select_from IN VARCHAR2 ) 
RETURN DBMS_TF.DESCRIBE_T; 
 
PROCEDURE fetch_rows (to_select_from varchar2 ) ; 
 
END get_cols_pkg; 
/

CREATE OR REPLACE PACKAGE BODY get_cols_pkg AS 
FUNCTION describe(tab  IN OUT DBMS_TF.TABLE_T
                , to_select_from IN VARCHAR2  ) 
RETURN DBMS_TF.DESCRIBE_T  
AS 
   new_cols DBMS_TF.COLUMNS_NEW_T; 
   col_id   PLS_INTEGER := 1; 
   v_sql VARCHAR2(2000); 
   type t_colrec is record (column_name dbms_id, datatype varchar2(1)); 
   type t_colrectab is table of t_colrec index by pls_integer; 
   v_colrectab t_colrectab; 
   descr DBMS_TF.describe_t; 
BEGIN 
-- We only want these two columns
  new_cols(1) := dbms_tf.column_metadata_t(name => 'COLUMN_NAME',  
                                         TYPE => dbms_tf.type_varchar2); 
  new_cols(2) := dbms_tf.column_metadata_t(name => 'DATATYPE',  
                                         TYPE => dbms_tf.type_varchar2); 
-- We don't need any other columns, neither for passing them through 
--    nor for reading
  FOR I IN 1 .. tab.COLUMN.COUNT LOOP 
      tab.column(i).pass_through := FALSE; 
      TAB.COLUMN(i).for_read := FALSE; 
  END LOOP; 
   
  v_sql := '
    SELECT DISTINCT  
           key_name 
    ,      first_value(datatype) OVER (PARTITION BY key_name 
                                       ORDER BY rnk DESC) datatype 
    FROM   (SELECT key_name  
            ,      CASE  
                      WHEN key_val_char IS NOT NULL THEN ''C'' 
                      WHEN key_val_date IS NOT NULL THEN ''D'' 
                      WHEN key_val_number IS NOT NULL THEN ''N'' 
                      ELSE ''C'' 
                   END datatype 
            ,      CASE  
                       WHEN key_val_char IS NULL 
                            AND key_val_date IS NULL 
                            AND key_val_number IS NULL THEN 0 
                       ELSE 1 
                    END rnk 
            FROM ('||to_select_from||') t) '; --'
  
  EXECUTE IMMEDIATE v_sql BULK COLLECT INTO v_colrectab; 

-- Save the results into the "compilation strore"   
  FOR i IN 1..v_colrectab.count LOOP 
    descr.cstore_chr('c'||i) := v_colrectab(i).column_name; 
	descr.cstore_chr('d'||i) := v_colrectab(i).datatype; 
  END LOOP; 
   
  descr.new_columns := new_cols; 
  descr.row_replication:=true; 
   
  RETURN descr; 

END; 
 
PROCEDURE fetch_rows ( to_select_from VARCHAR2 ) IS 
    inp_rs DBMS_TF.row_set_t; 
    colcnt PLS_INTEGER; 
    rowcnt PLS_INTEGER; 
    repfac dbms_tf.tab_naturaln_t; 
    namecol dbms_tf.tab_varchar2_t; 
    typcol dbms_tf.tab_varchar2_t; 
	colname dbms_id; 
	coltype varchar2(1); 
BEGIN  
    dbms_tf.get_row_set(inp_rs, rowcnt, colcnt); 
  
	FOR i IN 1..rowcnt LOOP
      repfac(i) := 0; 
	  IF dbms_tf.cstore_exists ('c'||i) THEN 
	    dbms_tf.cstore_get('c'||i, colname); 
		dbms_tf.cstore_get('d'||i, coltype); 
     	IF NOT dbms_tf.xstore_exists(colname) THEN   
          namecol(nvl(namecol.last+1,1)) := colname; 
          typcol(nvl(typcol.last+1,1)) := coltype; 
		  repfac(i) := 1; 
		  dbms_tf.xstore_set(colname,1); 
		END IF; 
	  END IF; 
	END LOOP; 
	 
    dbms_tf.row_replication(replication_factor => repfac);  
 
    dbms_tf.put_col(1, namecol); 
    dbms_tf.put_col(2, typcol); 

END; 
 
END get_cols_pkg; 
/
SQL> CREATE OR REPLACE FUNCTION get_cols(tab TABLE
  2                                     , to_select_from varchar2 ) 
  3  RETURN TABLE   
  4  PIPELINED row POLYMORPHIC USING get_cols_pkg; 
  5  /

Function GET_COLS compiled

SQL> 
SQL> WITH tsub as (SELECT * FROM T WHERE empno != 7839) 
  2  SELECT  *  
  3  FROM    get_cols(tsub, 'SELECT * FROM T WHERE empno != 7839');

COLUMN_NAME                    DATAT
------------------------------ -----
ENAME                          C    
HIREDATE                       D    
DEPTNO                         N    
JOB                            C    
SAL                            N    
COMM                           N    
MGR                            N    

7 rows selected. 

Listing 6

Well, this works, even with a subquery. But from my point of view it is far from an elegant solution… And caution, simply concatenating a parameter into dynamic query is a clear no-go in terms of SQL injection!

PTF for transposing rows to columns

Now it is a diligent but routine piece of work to construct an implementation package for the transposing function based on this approach.
What should the function, let’s name it KEYVAL2TAB, do? Well, it should act as a kind of grouping function: we need to process all rows for particular EMPNO returning only one row with all column values.
It is where table semantic PTF and the ability to partition the input data (like with analytic functions) come into play. The invocation of the function should look like this:

 
SELECT * FROM keyval2tab(t PARTITION BY empno,'T');

Listing 7

I think the package code (Listing 8) needs some explanations too.

Function DESCRIBE:

  • Lines 24-35: All other columns except KEY_NAME and the three “value”-columns are “identifying”. We just want to take them over as is and are therefore adding them to NEW_COLS. Actually I supposed, this should work by specifying the property pass_through=true for them, but I didn’t manage this to work. Maybe it was a bug in my code or it is not supposed to work in this manner with row replication – I don’t know at this moment. It needs further investigation. I’ll write on it later.
    For now we just define such “identifying” columns as new, count them (line 29) and save the count in compilation store for later use within FETCH_ROWS
  • Lines 37-70: It’s time to define “transposed” columns using the approach we just tried with the package GET_COLS_PKG. The difference is that we don’t need to store the column names and data types in compilation store but can directly define then as new columns (filling the collection NEW_COLS)

Procedure FETCH_ROWS:

  • Line 93: Get input row set
  • Lines 95-99: We want to collapse all rows from one partition to just one: set replication factor to 0 for all rows except the first
  • Line 100: Get the saved number of identifying columns from compilation store
  • Lines 102-108: Reassign the values of this columns from input to the output result set. I think it’s time to explain how the result sets (input and output) are constructed. Please refer to Figure 2 for better understanding. The portion of the fetched data flows into the table type ROW_SET_T which is a table of records of the type COLUMN_DATA_T. Which in turn has a DESCRIPTION (record type COLUMN_METADATA_T) and fifteen so called variant fields for all supported types. These variant fields are actually collections (associative arrays or index-by tables in terms of PL/SQL). Only one variant field of the proper data type is active for one column.


    Figure 2

    ENV_T is record type containing the information about current execution state. We use it to get a list of put columns

  • Lines 110-122: Looping through rows and columns we gather the actual values for every output column. Once again, we are filling only one row in the output row set: out_rs(p).tab_varchar2(1) means we reference the first row and P-th column which is of type VARCHAR2

CREATE OR REPLACE PACKAGE kv2tab_pkg AS 
FUNCTION describe(tab  IN OUT DBMS_TF.TABLE_T, to_select_from IN VARCHAR2 ) 
RETURN DBMS_TF.DESCRIBE_T; 
 
PROCEDURE fetch_rows (to_select_from varchar2 ) ; 
 
END kv2tab_pkg; 
/

CREATE OR REPLACE PACKAGE BODY kv2tab_pkg AS 
FUNCTION describe(tab  IN OUT DBMS_TF.TABLE_T, to_select_from IN VARCHAR2  ) 
RETURN DBMS_TF.DESCRIBE_T  
AS 
   new_cols DBMS_TF.COLUMNS_NEW_T; 
   col_id   PLS_INTEGER := 1; 
   idcol_cnt   PLS_INTEGER := 0; 
   v_sql VARCHAR2(2000); 
   type t_colrec is record (column_name dbms_id, datatype varchar2(1)); 
   type t_colrectab is table of t_colrec index by pls_integer; 
   v_colrectab t_colrectab; 
   descr DBMS_TF.describe_t; 
BEGIN 
   -- 
  FOR I IN 1 .. tab.COLUMN.COUNT LOOP  
    IF  tab.COLUMN(i).description.name NOT IN ('"KEY_NAME"','"KEY_VAL_NUMBER"','"KEY_VAL_CHAR"','"KEY_VAL_DATE"') THEN  
       new_cols(col_id) := dbms_tf.column_metadata_t(name => tab.COLUMN(i).description.name  
	                                                      , TYPE => tab.COLUMN(i).description.type);  
        col_id := col_id +1;  
        idcol_cnt := idcol_cnt +1; 
    END IF;  
    tab.column(i).pass_through := FALSE;  
    TAB.COLUMN(i).for_read := TRUE;  
  END LOOP; 
  
  descr.cstore_num('idcols') := idcol_cnt; 
  
  v_sql := '
    SELECT DISTINCT  
           key_name 
    ,      first_value(datatype) OVER (PARTITION BY key_name 
                                       ORDER BY rnk DESC) datatype 
    FROM   (SELECT key_name  
            ,      CASE  
                      WHEN key_val_char IS NOT NULL THEN ''C'' 
                      WHEN key_val_date IS NOT NULL THEN ''D'' 
                      WHEN key_val_number IS NOT NULL THEN ''N'' 
                      ELSE ''C'' 
                   END datatype 
            ,      CASE  
                       WHEN key_val_char IS NULL 
                            AND key_val_date IS NULL 
                            AND key_val_number IS NULL THEN 0 
                       ELSE 1 
                    END rnk 
            FROM ('||to_select_from||') t) '; 
  
  EXECUTE IMMEDIATE v_sql BULK COLLECT INTO v_colrectab; 

  FOR i IN 1..v_colrectab.count LOOP 

    case  v_colrectab(i).datatype  
	  when 'C' then NEW_COLS(col_id) := dbms_tf.column_metadata_t(name => v_colrectab(i).column_name  
	                                                      , TYPE => dbms_tf.type_varchar2);  
      when 'D' then NEW_COLS(col_id) := dbms_tf.column_metadata_t(name => v_colrectab(i).column_name 
	                                                      , TYPE => dbms_tf.type_date);														    
      when 'N' then NEW_COLS(col_id) := dbms_tf.column_metadata_t(name => v_colrectab(i).column_name 
	                                                       , TYPE => dbms_tf.type_number);														    
    end case;	  
    col_id := col_id +1;  
  END LOOP; 
   
  descr.new_columns := new_cols; 
  descr.row_replication:=true; 
   
  RETURN descr; 

END; 
 
PROCEDURE fetch_rows ( to_select_from VARCHAR2 ) IS 
    inp_rs DBMS_TF.row_set_t; 
    out_rs DBMS_TF.row_set_t;  
    env    dbms_tf.env_t := dbms_tf.get_env();  
    colcnt PLS_INTEGER; 
    rowcnt PLS_INTEGER; 
    repfac dbms_tf.tab_naturaln_t; 
    namecol dbms_tf.tab_varchar2_t; 
    typcol dbms_tf.tab_varchar2_t; 
	colname dbms_id; 
	coltype varchar2(1); 
    idcol_cnt pls_integer;    
BEGIN  

    DBMS_TF.get_row_set(inp_rs, rowcnt, colcnt);  
  
    FOR i IN 1 .. rowcnt LOOP    
      repfac(i) := 0;    
    END LOOP;   
      
    repfac(1) := 1;  
    dbms_tf.cstore_get('idcols', idcol_cnt);   

    FOR i IN 1..idcol_cnt LOOP 
      IF env.put_columns(i).type = dbms_tf.type_number THEN 
         out_rs(i).tab_number(1):= inp_rs(i).tab_number(1);   
      ELSIF env.put_columns(i).type = dbms_tf.type_varchar2 THEN 
         out_rs(i).tab_varchar2(1):= inp_rs(i).tab_varchar2(1);   
      END IF;
    END LOOP;
  
	FOR r IN 1 .. rowcnt LOOP   
       FOR p IN 1 .. env.put_columns.count LOOP   
		  IF  inp_rs(idcol_cnt+1).tab_varchar2(r)=replace(env.put_columns(p).name,'"') THEN   
		    IF env.put_columns(p).type = dbms_tf.type_varchar2 then   
		       out_rs(p).tab_varchar2(1):= inp_rs(idcol_cnt+2).tab_varchar2(r);   
   	        ELSIF env.put_columns(p).type = dbms_tf.type_number then   
               out_rs(p).tab_number(1):= inp_rs(idcol_cnt+3).tab_number(r);   
            ELSIF env.put_columns(p).type = dbms_tf.type_date then   
			   out_rs(p).tab_date(1):= inp_rs(idcol_cnt+4).tab_date(r);   
   		    END IF;  
		  END IF;  
	    END LOOP;  
	END LOOP;  

	dbms_tf.put_row_set (out_rs, repfac);  
END; 
 
END kv2tab_pkg; 
/

Listing 8

Now we can create and test our function. And it works. If we were to add some more yet unknown keys with values to the table T, would you expect the PTF will return these as new columns? Well, actually it should, but it would not. This is the price for doing nasty things like dynamic SQL in the DESCRIBE function. We have made the shape of the result set to be dependent on the data. But the database has no clue about it. It has no reason to invalidate the cursor and to hard parse the statement again and thus we cannot see the new column. We have to flush the shared pool or slightly change the query to make hard parse happen again.


SQL> CREATE OR REPLACE FUNCTION keyval2tab(tab TABLE, to_select_from VARCHAR2 ) RETURN TABLE    
  2  PIPELINED table POLYMORPHIC USING kv2tab_pkg; 
  3  /

Function KEYVAL2TAB compiled

SQL> 
SQL> 
SQL> SELECT * FROM keyval2tab(t PARTITION BY empno,'T') FETCH FIRST 10 ROWS ONLY;

     EMPNO ENAME                HIREDATE      DEPTNO JOB               SAL       COMM        MGR
---------- -------------------- --------- ---------- ---------- ---------- ---------- ----------
      7369 SMITH                17-DEC-80         20 CLERK             800                  7902
      7499 ALLEN                20-FEB-81         30 SALESMAN         1600        300       7698
      7521 WARD                 22-FEB-81         30 SALESMAN         1250        500       7698
      7566 JONES                02-APR-81         20 MANAGER          2975                  7839
      7654 MARTIN               28-SEP-81         30 SALESMAN         1250       1400       7698
      7698 BLAKE                01-MAY-81         30 MANAGER          2850                  7839
      7782 CLARK                09-JUN-81         10 MANAGER          2450                  7839
      7788 SCOTT                19-APR-87         20 ANALYST          3000                  7566
      7839 KING                 17-NOV-81         10 PRESIDENT        5000                      
      7844 TURNER               08-SEP-81         30 SALESMAN         1500          0       7698

10 rows selected. 

SQL> INSERT INTO t (empno, key_name, key_val_char) VALUES(7839, 'NEW_COLUMN', 'NEW_VALUE');

1 row inserted.

SQL> 
SQL> SELECT * FROM keyval2tab(t PARTITION BY empno,'T') FETCH FIRST 10 ROWS ONLY;

     EMPNO ENAME                HIREDATE      DEPTNO JOB               SAL       COMM        MGR
---------- -------------------- --------- ---------- ---------- ---------- ---------- ----------
      7369 SMITH                17-DEC-80         20 CLERK             800                  7902
      7499 ALLEN                20-FEB-81         30 SALESMAN         1600        300       7698
      7521 WARD                 22-FEB-81         30 SALESMAN         1250        500       7698
      7566 JONES                02-APR-81         20 MANAGER          2975                  7839
      7654 MARTIN               28-SEP-81         30 SALESMAN         1250       1400       7698
      7698 BLAKE                01-MAY-81         30 MANAGER          2850                  7839
      7782 CLARK                09-JUN-81         10 MANAGER          2450                  7839
      7788 SCOTT                19-APR-87         20 ANALYST          3000                  7566
      7839 KING                 17-NOV-81         10 PRESIDENT        5000                      
      7844 TURNER               08-SEP-81         30 SALESMAN         1500          0       7698

10 rows selected. 

SQL> 
SQL> SELECT * FROM keyval2tab(t PARTITION BY empno,'T') ORDER BY EMPNO FETCH FIRST 10 ROWS ONLY;

     EMPNO ENAME                HIREDATE      DEPTNO JOB               SAL       COMM NEW_COLUMN        MGR
---------- -------------------- --------- ---------- ---------- ---------- ---------- ---------- ----------
      7369 SMITH                17-DEC-80         20 CLERK             800                                 
      7499 ALLEN                20-FEB-81         30 SALESMAN         1600        300                      
      7521 WARD                 22-FEB-81         30 SALESMAN         1250        500                      
      7566 JONES                02-APR-81         20 MANAGER          2975                                 
      7654 MARTIN               28-SEP-81         30 SALESMAN         1250       1400                      
      7698 BLAKE                01-MAY-81         30 MANAGER          2850                                 
      7782 CLARK                09-JUN-81         10 MANAGER          2450                                 
      7788 SCOTT                19-APR-87         20 ANALYST          3000                                 
      7839 KING                 17-NOV-81         10 PRESIDENT        5000            NEW_VALUE            
      7844 TURNER               08-SEP-81         30 SALESMAN         1500          0                      

10 rows selected. 

Listing 8

Conclusion

Well, we created a function which was indeed able to transpose rows to columns dynamically, e.g. we didn’t have to know what columns were actually there. But – let me emphasize that – I don’t believe it is a feasable solution. Maybe for some kind of explorative analysis tasks, if you are looking into unknown data set for the first time. We had to work around one problem after another and ended up with quite strange invocation and our PTF is scanning all the data twice. The bottom line: it seems to me that the cases where we need the result set based on the input data, rather then structure or parameters, are in general not a good area of application for polymorphic table functions.

Do you have another ideas? Your comments are welcome!

Related Posts

4 thoughts on “Polymorphic Table Functions Example – (NOT) Transposing Rows to Columns

  1. Iudith Mentzel

    Hello Andrej,

    I just found your new post about the PTF functions, and I just want to thank you a lot for your tremendous work :):)

    I think that the Oracle documentation of the PTF functions overall functionality, both in the PL/SQL User Guide and in the Packages reference for the DBMS_TF package is lacking a lot of information.

    The technical details of the many record and collection types can be somehow understood,
    but what is missing is some more detailed explanation of how exactly the processing occurs,
    or, more specifically, at what point is each implementation procedure invoked and when are
    the various data structures populated.
    For example, the presentation of GET_ENV and ENV_T do not explain at all where in the processing flow
    do they intervene and how.

    I have the feeling that the only way of learning currently available is to look at the examples and try to understand the functionality by reverse-engineering those examples, instead of gaining first a full understanding of the various processing steps involved.

    Your examples and explanations are extremely welcome for a deeper understanding and
    I will be extremely grateful to you if you will find the time to elaborate the step-by-step posts that
    you mentioned at the start of this post.
    It would be great if you could make these step-by-step posts into a logical series, introducing all the features gradually, in their logical order, like a set of lessons, building each post on the understanding
    and knowledge gained from the previous posts.

    Thanks a lot once again for your great work 🙂

    Best Regards,
    Iudith Mentzel

    Reply
  2. Rafal

    Hello,

    I have a question about the strange behavior of the package after adding a new parameter (new row to the T table). Before adding the parameter, the query result is as follows.

    SQL> SELECT * FROM keyval2tab(t PARTITION BY empno,'T');

    EMPNO DEPTNO SAL HIREDATE COMM JOB MGR ENAME
    ________ _________ _______ ___________ _______ ____________ _______ _________
    7369 20 800 80/12/17 CLERK 7902 SMITH
    7499 30 1600 81/02/20 300 SALESMAN 7698 ALLEN
    7521 30 1250 81/02/22 500 SALESMAN 7698 WARD
    7566 20 2975 81/04/02 MANAGER 7839 JONES
    7654 30 1250 81/09/28 1400 SALESMAN 7698 MARTIN
    7698 30 2850 81/05/01 MANAGER 7839 BLAKE
    7782 10 2450 81/06/09 MANAGER 7839 CLARK
    7788 20 3000 87/04/19 ANALYST 7566 SCOTT
    7839 10 5000 81/11/17 PRESIDENT KING
    7844 30 1500 81/09/08 0 SALESMAN 7698 TURNER
    7876 20 1100 87/05/23 CLERK 7788 ADAMS
    7900 30 950 81/12/03 CLERK 7698 JAMES
    7902 20 3000 81/12/03 ANALYST 7566 FORD
    7934 10 1300 82/01/23 CLERK 7782 MILLER

    14 rows selected.

    SQL> INSERT INTO t (empno, key_name, key_val_char) VALUES(7839, 'NEW_COLUMN', 'NEW_VALUE');

    1 row inserted.

    SQL> commit;

    Commit complete.

    After adding the parameter and flush shared_pool, the query result is incorrect.

    SQL> SELECT * FROM keyval2tab(t PARTITION BY empno,'T');

    EMPNO DEPTNO SAL HIREDATE NEW_COLUMN COMM JOB MGR ENAME
    ________ _________ _______ ___________ _____________ _______ ____________ ______ ________
    7369 20 800 80/12/17
    7499 30 1600 81/02/20
    7521 30 1250 81/02/22
    7566 20 2975 81/04/02
    7654 30 1250 81/09/28
    7698 30 2850 81/05/01
    7782 10 2450 81/06/09
    7788 20 3000 87/04/19
    7839 10 5000 81/11/17 NEW_VALUE PRESIDENT KING
    7844 30 1500 81/09/08
    7876 20 1100 87/05/23
    7900 30 950 81/12/03
    7902 20 3000 81/12/03
    7934 10 1300 82/01/23

    14 rows selected.

    Could you explain why this happened?
    How can this be fixed?

    Best regards,
    Rafal

    Reply
    1. admin Post author

      Hello Rafal,
      thank you for asking. I could reproduce the issue and I know the reason why it is happening. In fact, I missed the bug in my post: the last query returns all NULL’s for the column MGR. I have overlooked this!
      In short, this has to do with the way I’m filling the output result set and with the handling of NO_DATA_FOUND. However, the more detailed explanation may be more complex and longer, so I decided to write a follow-up post on this.
      I’ll try my best to finish it this week.
      Best Regards
      Andrej

      Reply

Leave a Reply

Your email address will not be published. Required fields are marked *