How to determine which tables are missing indexes

Friday, August 01. 2008

How to determine which tables are missing indexes

Every once in a while - particularly if you are using inherited tables, you forget to put an important index on one of your tables which bogs down critical queries. Its sometimes convenient to inspect the index catalog to see what tables are missing indexes or what tables are missing a critical index. Normally we try to stick with querying the information_schema because queries against that schema work pretty much the same in PostgreSQL as they do in SQL Server and MySQL. For most of the examples below we had to delve into pg_catalog schema territory since there was no view we could find in information_schema that would give us enough detail about indexes.

Problem: Return all non-system tables that are missing primary keys

Solution:

This will actually work equally well on SQL Server, MySQL and any other database that supports the Information_Schema standard. It won't check for unique indexes though.


SELECT c.table_schema, c.table_name, c.table_type
FROM information_schema.tables c
WHERE c.table_type = 'BASE TABLE' AND c.table_schema NOT IN('information_schema', 'pg_catalog') 
AND
NOT EXISTS (SELECT cu.table_name 
				FROM information_schema.key_column_usage cu
				WHERE cu.table_schema = c.table_schema AND
					cu.table_name = c.table_name)
ORDER BY c.table_schema, c.table_name;

Problem: Return all non-system tables that are missing primary keys and have no unique indexes

Solution - this one is not quite as portable. We had to delve into the pg_catalog since we couldn't find a table in information schema that would tell us anything about any indexes but primary keys and foreign keys. Even though in theory primary keys and unique indexes are the same, they are not from a meta data standpoint.


SELECT c.table_schema, c.table_name, c.table_type
FROM information_schema.tables c
WHERE  c.table_schema NOT IN('information_schema', 'pg_catalog') AND c.table_type = 'BASE TABLE' 
AND NOT EXISTS(SELECT i.tablename  
				FROM pg_catalog.pg_indexes i 
			WHERE i.schemaname = c.table_schema 
				AND i.tablename = c.table_name AND indexdef LIKE '%UNIQUE%')
AND
NOT EXISTS (SELECT cu.table_name 
				FROM information_schema.key_column_usage cu
				WHERE cu.table_schema = c.table_schema AND
					cu.table_name = c.table_name)
ORDER BY c.table_schema, c.table_name;

Problem - List all tables with geometry fields that have no index on the geometry field.

Solution -


SELECT c.table_schema, c.table_name, c.column_name
FROM (SELECT * FROM 
	information_schema.tables WHERE table_type = 'BASE TABLE') As t  INNER JOIN
	(SELECT * FROM information_schema.columns WHERE udt_name = 'geometry') c  
		ON (t.table_name = c.table_name AND t.table_schema = c.table_schema)
		LEFT JOIN pg_catalog.pg_indexes i ON 
			(i.tablename = c.table_name AND i.schemaname = c.table_schema 
				AND  indexdef LIKE '%' || c.column_name || '%') 
WHERE i.tablename IS NULL
ORDER BY c.table_schema, c.table_name;

Posted by Leo Hsu and Regina Obe in intermediate, mysql, postgis, q&a, sql server at 20:49 | Comments (6) | Trackbacks (0)

Trackbacks

Trackback specific URI for this entry

No Trackbacks

Comments

Display comments as (Linear | Threaded)

More valuable may be the ability to see which tables are doing full table scans vs index scans. After all, lack of index scans not only tell you there is either no index but rather there may be a bad one. try this.

CREATE OR REPLACE VIEW pg_table_nonindex_x AS
SELECT x1.table_in_trouble, pg_relation_size(x1.table_in_trouble) AS sz_n_byts, x1.seq_scan, x1.idx_scan,
CASE
WHEN pg_relation_size(x1.table_in_trouble) > 500000000 THEN 'Exceeds 500 megs, too large to count in a view. For a count, count individually'::text
ELSE x_count(x1.table_in_trouble)::text
END AS tbl_rec_count, x1.priority
FROM ( SELECT (pg_stat_all_tables.schemaname::text || '.'::text) || pg_stat_all_tables.relname::text AS table_in_trouble, pg_stat_all_tables.seq_scan, pg_stat_all_tables.idx_scan,
CASE
WHEN (pg_stat_all_tables.seq_scan - pg_stat_all_tables.idx_scan) < 500 THEN 'Minor Problem'::text
WHEN (pg_stat_all_tables.seq_scan - pg_stat_all_tables.idx_scan) >= 500 AND (pg_stat_all_tables.seq_scan - pg_stat_all_tables.idx_scan) < 2500 THEN 'Major Problem'::text
WHEN (pg_stat_all_tables.seq_scan - pg_stat_all_tables.idx_scan) >= 2500 THEN 'Extreme Problem'::text
ELSE NULL::text
END AS priority
FROM pg_stat_all_tables
WHERE pg_stat_all_tables.seq_scan > pg_stat_all_tables.idx_scan AND pg_stat_all_tables.schemaname 'pg_catalog'::name AND pg_stat_all_tables.seq_scan > 100) x1
ORDER BY x1.priority DESC, x1.seq_scan;

#1 Kevin Kinchen (Homepage) on 2009-03-04 21:24 (Reply)

Thanks!

#1.1 digicon on 2009-08-25 17:25 (Reply)

I had a few issues with the view, so I changed it below. Hope this helps as well!

CREATE OR REPLACE VIEW pg_table_nonindex_x AS
SELECT
x1.table_in_trouble,
pg_relation_size(x1.table_in_trouble) AS sz_n_byts,
x1.seq_scan, x1.idx_scan,
CASE
WHEN pg_relation_size(x1.table_in_trouble) > 500000000
THEN 'Exceeds 500 megs, too large to count in a view. For a count, count individually'::text
ELSE count(x1.table_in_trouble)::text
END AS tbl_rec_count,
x1.priority
FROM
(
SELECT
(schemaname::text || '.'::text) || relname::text AS table_in_trouble,
seq_scan,
idx_scan,
CASE
WHEN (seq_scan - idx_scan) < 500 THEN 'Minor Problem'::text
WHEN (seq_scan - idx_scan) >= 500 AND (seq_scan - idx_scan) < 2500 THEN 'Major Problem'::text
WHEN (seq_scan - idx_scan) >= 2500 THEN 'Extreme Problem'::text
ELSE NULL::text
END AS priority
FROM
pg_stat_all_tables
WHERE
seq_scan > idx_scan
AND schemaname != 'pg_catalog'::name
AND seq_scan > 100) x1
GROUP BY
x1.table_in_trouble,
x1.seq_scan,
x1.idx_scan,
x1.priority
ORDER BY
x1.priority DESC,
x1.seq_scan
;
SELECT * FROM pg_table_nonindex_x;

#1.2 digicon (Homepage) on 2009-08-26 10:10 (Reply)

Is there anything in Postgres 8.3/8.4 to help identify only outdated/bloated indexex to be rebuilt?

I found the query below in Postgres Documentation posted by Tom Lane as a reply to:“pg_stat_user_indexes view clarification” and is supposed to list all indexes candidates for REINDEX:
select schemaname,relname,indexrelname,idx_tup_read,idx_tup_fetch from pg_stat_user_indexes where idx_tup_read != idx_tup_fetch;
Is this still valid in Postgres 8.3/8.4?

I also found another statement somewhere on the net and can this be also something to rely on or not? Currently I’m running maintenance tasks (vacuum,analyze,reindex,analyze) on all databases all indexes but due to their number and size this takes many hours to complete. I’m sure that by not reindexing all tables.indexes I can save lots of maintenance time but not sure how to identify only outdated/bloated PG indexes easily.

--100 is general rule but I took 10 to be more aggressive on the reindex.
select schemaname,relname,indexrelname,idx_scan,idx_tup_read,idx_tup_fetch from pg_stat_all_user_indexes where idx_tup_fetch > (idx_scan * 10) and idx_scan 0 and schemaname = 'public'

Any hint greatly appreciated.

#2 Nenea Nelu on 2010-01-15 12:11 (Reply)

Regina,

Thank you very much!

Mat

#3 Mateusz Loskot (Homepage) on 2010-08-03 09:13 (Reply)

You are welcome Mat.

#3.1 Regina on 2010-08-03 18:27 (Reply)

Add Comment

Name
Email
Homepage
In reply to
Comment	E-Mail addresses will not be displayed and will only be used for E-Mail notifications. To prevent automated Bots from commentspamming, please enter the string you see in the image below in the appropriate input box. Your comment will only be submitted if the strings match. Please ensure that your browser supports and accepts cookies, or your comment cannot be verified correctly. Enter the string from the spam-prevention image above: Phone* What is seven minus five?
	Remember Information? Subscribe to this entry

How to determine which tables are missing indexes

Postgres OnLine Journal

PostGIS in Action About the Authors Consulting

Friday, August 01. 2008

How to determine which tables are missing indexes

Entry's Links

Quicksearch

Calendar

Categories

Archives

Subscribe

Blog Administration

How to determine which tables are missing indexes

Postgres OnLine Journal PostGIS in Action About the Authors Consulting

Friday, August 01. 2008

How to determine which tables are missing indexes

Entry's Links

Quicksearch

Calendar

Categories

Archives

Subscribe

Blog Administration

Postgres OnLine Journal

PostGIS in Action About the Authors Consulting