Close

Reguła automatyzacji Jira podczas scalania pull requestu

Posted by: AJ Welch

Amazon Redshift retains a great deal of metadata about the various databases within a cluster and finding a list of tables is no exception to this rule.

The most useful object for this task is the PG_TABLE_DEF table, which as the name implies, contains table definition information. Note: The PG_ prefix is just a holdover from PostgreSQL, the database technology from which Amazon Redshift was developed.

To begin finding information about the tables in the system, you can simply return columns from PG_TABLE_DEF:

SELECT
  *
FROM
  PG_TABLE_DEF;

For better or worse, PG_TABLE_DEF contains information about everything in the system, so the results of such an open query will be massive, but should give you an idea of what PG_TABLE_DEF is capable of:

schemaname  tablename                   column          type            encoding distkey sortkey notnull
--------------------------------------------------------------------------------------------------------
pg_catalog  padb_config_harvest         name            character(136)  none     false   0       true
pg_catalog  padb_config_harvest         harvest         integer         none     false   0       true
pg_catalog  padb_config_harvest         archive         integer         none     false   0       true
pg_catalog  padb_config_harvest         directory       character(500)  none     false   0       true
pg_catalog  pg_aggregate                aggfnoid        regproc         none     false   0       true
pg_catalog  pg_aggregate                aggtransfn      regproc         none     false   0       true
pg_catalog  pg_aggregate                aggfinalfn      regproc         none     false   0       true
pg_catalog  pg_aggregate                aggtranstype    oid             none     false   0       true
pg_catalog  pg_aggregate                agginitval      text            none     false   0       false
pg_catalog  pg_aggregate_fnoid_index    aggfnoid        regproc         none     false   0       false
pg_catalog  pg_am                       amname          name            none     false   0       true
pg_catalog  pg_am                       amowner         integer         none     false   0       true
pg_catalog  pg_am                       amstrategies    smallint        none     false   0       true

To limit the results to user-defined tables, it’s important to specify the schemaname column to return only results which are public:

SELECT
  *
FROM
  PG_TABLE_DEF
WHERE
  schemaname = 'public';
schemaname  tablename   column   type                   encoding    distkey sortkey notnull
-------------------------------------------------------------------------------------------
public      category    catid    smallint               none        true    1       true
public      category    catgroup character varying(10)  none        false   0       false
public      category    catname  character varying(10)  none        false   0       false
public      category    catdesc  character varying(50)  none        false   0       false
public      date        dateid   smallint               none        true    1       true
public      date        caldate  date                   none        false   0       true
public      event       eventid  integer                none        true    0       true
public      event       venueid  smallint               none        false   0       true
public      event       catid    smallint               none        false   0       true

This shows us all the columns (and their associated tables) that exist and that are public (and therefore user-created).

Lastly, if we are solely interested only the names of tables which are user-defined, we’ll need to filter the above results by retrieving DISTINCT items from within the tablename column:

SELECT
  DISTINCT tablename
FROM
  PG_TABLE_DEF
WHERE
  schemaname = 'public';

This returns only the unique public tables within the system:

tablename
---------
category
date
event
listing
sales
users
venue

There we have it! A simple yet effective method for retrieving table information from within your Amazon Redshift cluster.


Następny temat