Postgres OnLine Journal : January / December 2025

PostgreSQL Q & A

FROM function or SELECT function

Basics

Unpivoting data using JSONB Beginner
Learning PostgreSQL from AI and JSON exploration Beginner
PG 17 new random functions Beginner
Converting JSON documents to relational tables Beginner
Learning PostgreSQL from AI and JSON exploration: Part 2 Beginner

Using PostgreSQL Extensions

http extension for windows updated to include PostgreSQL18 64-bit

A Product of Paragon Corporation
http://www.paragoncorporation.com/
http://www.postgresonline.com/

PostgreSQL Q & A

FROM function or SELECT function

When I have set returning functions and even non-set returning functions, I love putting them in the FROM clause instead of the SELECT part. I often take it for granted that the results are the same, but in some important situations they are different. The results are not guaranteed to be the same when your function is not immutable.

When it doesn't matter in terms of answers

If you expect the same inputs to yield the same outputs, it doesn't matter which approach you choose. This example:


SELECT substring(f.string, i,1)
FROM (SELECT 'hello') AS f(string)
	, generate_series(1, length(f.string)) AS i;

is equivalent to:


SELECT substring(f.string,  generate_series(1, length(f.string)),1)
FROM (SELECT 'hello') AS f(string);

I prefer using the FROM approach, all else being equal, because I can then use that FROM output in a WHERE clause, like so:

SELECT c  
FROM (SELECT 'hello') AS f(string)
	, generate_series(1, length(f.string)) AS i, substring(f.string, i,1) AS c
WHERE mod(ascii(c) ,i ) = 1;

Also, at least in prior versions of PostgreSQL, calls to these functions in the SELECT part, especially ones that returned multiple columns, often caused duplicate function calls, which made things slower. I don't think that is the case these days, though I haven't tested.

The convenience of being able to reuse those outputs in your WHERE is a killer feature when doing PostGIS spatial queries with all the richness of PostGIS set returning and composite returning functions. You'd have to write a lot of gyrations of ugly subqueries if you settled for having your functions all in the SELECT clause.
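To make those gyrations concrete, here is a rough sketch of the earlier WHERE example rewritten with generate_series stuck in the SELECT part. The nesting and the aliases s and x are my own illustration, not the only way to write it.

```sql
-- generate_series lives in a SELECT list, so every later step
-- needs its own subquery layer before WHERE can see its output.
SELECT x.c
FROM (
    SELECT substring(s.string, s.i, 1) AS c, s.i
    FROM (
        SELECT f.string, generate_series(1, length(f.string)) AS i
        FROM (SELECT 'hello') AS f(string)
    ) AS s
) AS x
WHERE mod(ascii(x.c), x.i) = 1;
```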

Whenever you see a function in the FROM clause that is preceded by a comma and uses arguments from prior tables, it's shorthand for writing CROSS JOIN LATERAL. So yes, sometimes I do write the whole thing out when I'm feeling particularly patient and pedantic, like so:

SELECT c  
FROM (SELECT 'hello') AS f(string)
	CROSS JOIN LATERAL generate_series(1, length(f.string)) AS i
	CROSS JOIN LATERAL substring(f.string, i,1) AS c
WHERE mod(ascii(c) ,i ) = 1;

And some people laugh at me for wasting those extra key strokes to make a point. The point being that it reminds me I can change that to a LEFT JOIN LATERAL to achieve a slightly different purpose, and it makes the magic that is happening clearer to viewers.
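As a sketch of that slightly different purpose: with LEFT JOIN LATERAL ... ON true, a row whose function call returns nothing is kept instead of dropped. The two-row VALUES list, including the empty string, is my own addition to show the effect.

```sql
-- generate_series(1, 0) returns no rows for the empty string.
-- CROSS JOIN LATERAL would drop that row; LEFT JOIN LATERAL keeps it with i as NULL.
SELECT f.string, i
FROM (VALUES ('hi'), ('')) AS f(string)
    LEFT JOIN LATERAL generate_series(1, length(f.string)) AS i ON true;
```

This returns two rows for 'hi' and one row for the empty string with a NULL i.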

When it matters

I love that FROM trick so much that I often get blindsided when the results come out different from what my mind thinks they should be. It matters when your function is VOLATILE.

Here is an example using the random() function. One might expect each call to it to give you a different number, right? But here is a query that will make it not do that, because it's evidently only calling the function once. Which is great if that is what you wanted. Not so great when you were trying to create a manifest of 100 passengers for your futuristic voyage, only to discover all the passengers' names are the same.

SELECT r
FROM generate_series(1,5), random() AS r;

Output gives you 5 repeating numbers

          r
---------------------
 0.19868264850983475
 0.19868264850983475
 0.19868264850983475
 0.19868264850983475
 0.19868264850983475
(5 rows)

Now if you write your query like so:

SELECT  random() AS r
FROM generate_series(1,5);

you do get 5 different numbers:

         r
--------------------
 0.9913061113169994
 0.3108248186791389
 0.7519172938054108
 0.7725538387216184
 0.2287233312353214
(5 rows)
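If you want both behaviors, a fresh random() per row and the convenience of reusing it in a WHERE, one option is to call it in the SELECT of a subquery and filter outside. This is a minimal sketch; the alias s and the 0.5 cutoff are just for illustration.

```sql
-- random() is evaluated once per generated row inside the subquery,
-- and the outer WHERE filters on the value that was already computed.
SELECT s.i, s.r
FROM (
    SELECT i, random() AS r
    FROM generate_series(1, 5) AS i
) AS s
WHERE s.r > 0.5;
```

The number of rows returned varies from run to run, since each row gets its own r.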


Basics

Unpivoting data using JSONB Beginner

One of my favorite uses of JSONB functionality is to unpivot data.

Let's say you were handed a dataset with this structure, where each bedroom-type column holds the count of units of that kind.


CREATE TABLE housing_data (
    neighborhood VARCHAR(50),
    studio INTEGER,
    br1 INTEGER,
    br2 INTEGER,
    br3 INTEGER,
    br4 INTEGER
);

INSERT INTO housing_data (neighborhood, studio, br1, br2, br3, br4)
VALUES ('Downtown', 500, 800, 1000, 1200, 1500),
       ('Midtown', 700, 950, 1150, 1400, 1700),
       ('Uptown', 600, 800, 1000, 1200, 1500),
       ('Suburbs', 300, 450, 600, 750, 900);

For your reporting needs, you would like each bed type count to be a separate row. Here is one of my favorite tricks for accomplishing this:

SELECT neighborhood, replace(je.key,'br','') AS bed_type, je.value::integer AS num_units
FROM housing_data AS d 
	CROSS JOIN jsonb_each_text( to_jsonb(d) - 'neighborhood') AS je;

The assumption here is that every column except for neighborhood is a bedroom unit count. The output is:

 neighborhood | bed_type | num_units
--------------+----------+-----------
 Downtown     | 1        |       800
 Downtown     | 2        |      1000
 Downtown     | 3        |      1200
 Downtown     | 4        |      1500
 Downtown     | studio   |       500
 Midtown      | 1        |       950
 Midtown      | 2        |      1150
 Midtown      | 3        |      1400
 Midtown      | 4        |      1700
 Midtown      | studio   |       700
 Uptown       | 1        |       800
 Uptown       | 2        |      1000
 Uptown       | 3        |      1200
 Uptown       | 4        |      1500
 Uptown       | studio   |       600
 Suburbs      | 1        |       450
 Suburbs      | 2        |       600
 Suburbs      | 3        |       750
 Suburbs      | 4        |       900
 Suburbs      | studio   |       300
(20 rows)

If you had more than one column that wasn't a bedroom count, you could revise your subtraction to exclude that too: to_jsonb(d) - 'neighborhood' - 'other_column'. Alternatively, you could use the array construct to_jsonb(d) - ARRAY['neighborhood','other_column']. This trick can also be accomplished with the hstore extension, by converting a row to an hstore and using hstore's each function.
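Spelled out, the array variant might look like the sketch below. Note that other_column is hypothetical; housing_data as defined above has no such column, and subtracting a key that isn't there simply does nothing.

```sql
-- other_column is hypothetical; subtracting a key that does not exist is harmless.
SELECT d.neighborhood,
       replace(je.key, 'br', '') AS bed_type,
       je.value::integer AS num_units
FROM housing_data AS d
    CROSS JOIN LATERAL jsonb_each_text(
        to_jsonb(d) - ARRAY['neighborhood', 'other_column']
    ) AS je;
```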


Basics

Learning PostgreSQL from AI and JSON exploration Beginner

After reading Paul's teasing article on Accessing Large Language Models with AI and testing out his code, I decided I should probably stop fighting this AI smothering. I still have a distrust of AI services, but downloading an AI model and using it on my own local desktop or server is nice, and I can break bread with that. One use I tried is using AI to generate fake data, and it did a pretty decent job. I also learned a couple of things.

For these exercises I'm using phi4, my favorite AI model so far, at least for PostgreSQL work, which you can get here: Phi 4. The 14b 9.1GB version seemed decent enough and answered most of my questions in roughly 5 to 20 seconds on my 20 CPU, 64GB RAM desktop with a 4-year-old graphics card. I learned a bunch of things from my excursion down this AI path.

Lessons Learned

AI can tell you what features your software is missing by the mistakes it makes.
AI is good for generating dummy data for your exercises.
AI sometimes throws in stuff you never asked for.
Ask the same question more than once, and if it has more than one answer, expect to get a different answer each time.
AI makes mistakes, but by following along and feeding it error messages, it can correct itself, and you too can learn something by following its error paths.

I shall, at least for the rest of this article, call my side-kick Aesop, because calling it phi4 feels a bit alien.

Figure out the stretch of your AI's knowledge of PostgreSQL

If you are going to trust advice from anything especially on a topic as important as PostgreSQL, you should get a sense of its breadth of knowledge. Just like how I run:

SELECT version();

on a PostgreSQL server to know what neat tricks are at my disposal, I do the same with Aesop.

Me:

What is the latest version of PostgreSQL

To which I am greeted with

Aesop:

As of my last update in October 2023, the latest stable release of PostgreSQL was version 15. However, please verify this information from an official source such as the PostgreSQL website or other trusted resources, as new versions might have been released since then. PostgreSQL frequently updates its software with improvements and new features, so checking their official announcements is always a good practice to ensure you have the most current information.

Creating fake data

First question I asked Aesop:

Create a PostgreSQL table with a jsonb column that has heavily nested data that has 2 rows covering trips and make sure to use GENERATED BY instead of serial.

I threw in the requirement about GENERATED BY because on the first pass, Aesop threw me back a CREATE TABLE that used the old SERIAL syntax. With this GENERATED BY qualification it gave me this:

Aesop:

To create a PostgreSQL table where the primary key is generated using `GENERATED ALWAYS AS IDENTITY` instead of `SERIAL`, you can follow these steps. This approach, introduced in PostgreSQL 10, allows for more flexibility and control over identity columns.


CREATE TABLE trips (
    trip_id BIGINT GENERATED ALWAYS AS IDENTITY PRIMARY KEY,
    user_id INTEGER NOT NULL,  -- Assuming a reference to a users table
    trip_name VARCHAR(255) NOT NULL,
    trip_data JSONB,           -- Column for storing nested JSON data
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
    updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP
);

It was nice that Aesop warned me this will only work for PostgreSQL 10 and above, because if you are creating software that needs to work with PostgreSQL 9.6, this will not do. So I was excited, glancing at this ready-to-use code that, gasp, DOES NOT WORK. What is wrong with this code exactly? Try running it in PostgreSQL and you'll see the error. I mean, it's even nicely commented and formatted.

Poor Aesop was a little confused and threw some MySQL goodness in there: stuff that MySQL has that PostgreSQL doesn't, but that might be a nice feature for those folks too lazy to create an update trigger for updated_at.

Which begs the question: oh why, oh why does my PostgreSQL not have an ON UPDATE clause for CREATE TABLE?

So, commenting out that piece, we now have a working CREATE TABLE:


CREATE TABLE trips (
    trip_id BIGINT GENERATED ALWAYS AS IDENTITY PRIMARY KEY,
    user_id INTEGER NOT NULL,  -- Assuming a reference to a users table
    trip_name VARCHAR(255) NOT NULL,
    trip_data JSONB,           -- Column for storing nested JSON data
    created_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
    updated_at TIMESTAMP DEFAULT CURRENT_TIMESTAMP -- ON UPDATE CURRENT_TIMESTAMP
);
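For those not too lazy, the update trigger that stands in for the missing ON UPDATE clause could look something like this. It is a minimal sketch: the names set_updated_at and trips_set_updated_at are my own, and EXECUTE FUNCTION assumes PostgreSQL 11 or later (older versions spell it EXECUTE PROCEDURE).

```sql
-- Hypothetical trigger function: stamp updated_at on every row update.
CREATE OR REPLACE FUNCTION set_updated_at() RETURNS trigger
LANGUAGE plpgsql AS
$$
BEGIN
    NEW.updated_at := CURRENT_TIMESTAMP;
    RETURN NEW;
END;
$$;

-- Fire it before each UPDATE on the trips table created above.
CREATE TRIGGER trips_set_updated_at
    BEFORE UPDATE ON trips
    FOR EACH ROW
    EXECUTE FUNCTION set_updated_at();
```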

Then Aesop provided a statement to stuff in some juicy trip data, exactly 2 rows as I had asked for and it worked without any changes.


INSERT INTO trips (user_id, trip_name, trip_data)
VALUES
(
    1,
    'Summer Vacation',
    jsonb_build_object(
        'destination', jsonb_build_object(
            'city', 'Paris',
            'country', 'France'
        ),
        'activities', jsonb_build_array(
            jsonb_build_object('name', 'Eiffel Tower Visit', 'date', '2023-07-15'),
            jsonb_build_object('name', 'Louvre Museum Tour', 'date', '2023-07-16')
        ),
        'start_date', '2023-07-14',
        'end_date', '2023-07-20'
    )
),
(
    2,
    'Winter Getaway',
    jsonb_build_object(
        'destination', jsonb_build_object(
            'city', 'Zurich',
            'country', 'Switzerland'
        ),
        'activities', jsonb_build_array(
            jsonb_build_object('name', 'Lake Zurich Cruise', 'date', '2023-12-10'),
            jsonb_build_object('name', 'Museum Visits', 'date', '2023-12-11')
        ),
        'start_date', '2023-12-09',
        'end_date', '2023-12-15'
    )
);

But let me ask the same question again, because in all good software, when you ask the same question that has several answers, you expect to get a different but correct answer each time.

And voila, what do you have here: a different way of inserting json data into a PostgreSQL table. Furthermore, maybe Aesop got a bit tired of being asked the same question, so it decided I really didn't need those user_id and timestamp columns I NEVER ASKED FOR in the first place.

Me:

Create a PostgreSQL table with a jsonb column that has heavily nested data that has 2 rows covering trips and make sure to use GENERATED BY instead of serial.

Aesop:

Certainly! Below is an example of how you can create a PostgreSQL table using the `GENERATED ALWAYS AS IDENTITY` feature for automatic sequence generation, along with a highly nested `jsonb` column to store detailed trip data.

See, not even a nicely commented column. Very minimalistic. Aesop is clearly tired of my repeated questions.


CREATE TABLE trips (
    id BIGINT GENERATED ALWAYS AS IDENTITY PRIMARY KEY,
    trip_data JSONB NOT NULL
);


-- Inserting two rows into the table with heavily nested JSON data

INSERT INTO trips (trip_data) VALUES
('{
  "name": "Trip to Paris",
  "details": {
    "start_date": "2023-09-15",
    "end_date": "2023-09-20",
    "itinerary": [
      {
        "day": 1,
        "activities": ["Arrival at Charles de Gaulle Airport", "Check-in at Hotel"],
        "accommodation": {
          "hotel_name": "Le Meurice",
          "rating": 5
        }
      },
      {
        "day": 2,
        "activities": ["Visit Eiffel Tower", "Lunch at Café de Flore"],
        "notes": "Remember to book Eiffel Tower tickets in advance."
      },
      {
        "day": 3,
        "activities": ["Explore Louvre Museum"],
        "accommodation": {
          "hotel_name": "Le Meurice",
          "amenities": ["Spa", "Gym"]
        }
      }
    ]
  }
}');

INSERT INTO trips (trip_data) VALUES
('{
  "name": "Trip to Tokyo",
  "details": {
    "start_date": "2023-11-01",
    "end_date": "2023-11-10",
    "itinerary": [
      {
        "day": 1,
        "activities": ["Arrival at Narita Airport", "Check-in at Ryokan"],
        "accommodation": {
          "ryokan_name": "Tawaraya",
          "rating": 5
        }
      },
      {
        "day": 2,
        "activities": ["Visit Tokyo Tower", "Explore Asakusa Temple"],
        "notes": "Try some sushi for lunch."
      },
      {
        "day": 3,
        "activities": ["Shop at Shibuya Crossing", "Dinner at Ramen Street"],
        "accommodation": {
          "ryokan_name": "Tawaraya",
          "amenities": ["Hot Springs", "Private Baths"]
        }
      }
    ]
  }
}');

I got Aesop to create dense data, now to have it query some meaningful bits out of it.

Me:

Write a PostgreSQL SQL statement that returns the day and activity of each trip one row per activity.

And alas, an answer, but this is perhaps one of the uglier ways of doing this. More on that in a different article.


SELECT
    id AS trip_id,
    (jsonb_array_elements_text(trip_data->'details'->'itinerary')::jsonb ->> 'day')::integer AS day,
    jsonb_array_elements_text((jsonb_array_elements_text(trip_data->'details'->'itinerary')::jsonb -> 'activities')) AS activity
FROM
    trips;
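For comparison, here is a sketch of the same question answered with the functions moved into the FROM clause. It assumes the second, minimalistic trips table (id, trip_data) from above; the aliases it, day_item, and act are my own.

```sql
-- One row per itinerary day, then one row per activity within that day.
SELECT t.id AS trip_id,
       (it.day_item->>'day')::integer AS day,
       act.activity
FROM trips AS t
    CROSS JOIN LATERAL jsonb_array_elements(t.trip_data->'details'->'itinerary') AS it(day_item)
    CROSS JOIN LATERAL jsonb_array_elements_text(it.day_item->'activities') AS act(activity);
```

Each itinerary element is expanded only once here, so the day and its activities stay paired by construction.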

I also asked Aesop more complicated JSON query questions, which it failed at initially, but after I fed it the errors returned from PostgreSQL it reworked its answers and gave me code that at least would run.


Basics

PG 17 new random functions Beginner

Have you ever wanted to get a random integer between 1 and 10 and been a little annoyed by the slightly cryptic code you had to write in PostgreSQL? PostgreSQL 17 random functions make that simpler. Sometimes it's the small changes that bring the most joy.

Random over the years

The random() function has largely stayed stagnant for a long time. In the before times, if you needed a random number, all you had to work with was a single function random() that took no arguments and returned a double precision number that is >= 0 and < 1. When have you ever needed a random double precision number that is >= 0 and < 1? Probably never, but it did provide good ammunition to get to your real task. So how do you create a random integer between 1 and 10 in pre-PG17?

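One way to do it in the pre-PG17 era is the following. The floor matters: a bare cast to integer rounds to the nearest whole number, which would occasionally hand you an 11.

SELECT floor(1 + random()*10)::integer;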

In PostgreSQL 16 we were blessed with yet another random family function, random_normal, which you need a degree in statistics to fully appreciate, written by a guy with a degree in statistics who is standing by with Rolling dice with postgres random function to explain this marvelous function, complete with diagrams and pictures, because you really need pictures and diagrams to understand this.

Random in PG 17

Finally in PG 17, we were blessed with a very intuitive and short way to answer the question that 80% of the time is what we wanted a random function for in the first place. Return an integer between 1 and 10:

SELECT random(1,10);

Not only is this code shorter, it is also clearer in intent.

Now if that were not sweet enough, suppose you wanted to create random money amounts. Money, at least in the US, has only 2 decimals. How would you create a random amount between 1 and 10 dollars?

In the pre-PG17 era, I'd do this:

SELECT (1 + random()*9)::numeric(4,2);

In PG 17, I can do this:

SELECT random(1.00,10.00);

Which would return a number such as 5.79 that will always have exactly 2 decimals. If your two numbers had a different number of decimals, the result would have as many decimals as the argument with the most. So:

SELECT random(1.00, 10.000);
would return a number like 9.657
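The new variants are still volatile like plain random(), so calling them in the SELECT part gives a fresh value per row. A small sketch, assuming PostgreSQL 17 or later; the column names are made up.

```sql
-- Five rolls of a six-sided die, each with a random dollar amount with 2 decimals.
SELECT i AS roll_no,
       random(1, 6) AS die,
       random(1.00, 10.00) AS amount
FROM generate_series(1, 5) AS i;
```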


Basics

Converting JSON documents to relational tables Beginner

JSON is one of the most popular ways of disseminating data between systems. It is probably the most common format offered by web services. PostgreSQL is a database perfectly suited for grabbing that data and transforming it into a more structured relational format. All this can be done directly in the database. We'll go over some ways to load and restructure JSON data.

Before starting, create a temporary table to store the json:

	CREATE TEMP TABLE data_json(data jsonb);

Loading Data using pg_read_file

My favorite way of reading text files that I just want to stuff into a single field is pg_read_file. pg_read_file is a server-side function that allows you to read all of a file or a portion of it. There are a couple of caveats for its use.

Requires super user or being member of pg_read_server_files group role
The file you are reading must be readable by the postgres server process
There is a limitation on read size and you are also restricted on how big of a file you can stuff in a column. For most files I've worked with, this has not been an issue.

There is a companion function called pg_read_binary_file for reading data in binary format or for reading text in a specific encoding.

To demonstrate, download: Boston Public Schools json format and put it in the C:/temp folder. If you are on Linux, use a path such as /tmp and change the C:/temp path references in this doc to /tmp.

Note that this is a GeoJSON file, which means if you had PostGIS installed you could do interesting things with this. But for this exercise, I'm going to treat it like any JSON file. I should also note that this approach doesn't work for big files that can't fit into a single column.

INSERT INTO data_json(data)
SELECT pg_read_file('C:/temp/public_schools.geojson.json')::jsonb;
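If the file's encoding differs from the server's, the companion pg_read_binary_file can be paired with convert_from to decode it explicitly. A sketch, assuming the file is UTF-8 encoded and sits at the same path as above:

```sql
-- Read raw bytes, decode them as UTF8 into the database encoding, then cast to jsonb.
INSERT INTO data_json(data)
SELECT convert_from(
           pg_read_binary_file('C:/temp/public_schools.geojson.json'),
           'UTF8'
       )::jsonb;
```

Run this instead of, not in addition to, the pg_read_file version, or you will end up with two copies of the document in data_json.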

Using jsonb_array_elements to expand rows

There are a couple of ways of expanding a JSON dataset into rows. The oldest way is jsonb_array_elements, which has been available since PostgreSQL 9.4 (its json counterpart, json_array_elements, dates back to 9.3). You can combine this with the ->> and -> json operators to select properties. Here is how we do it with the sample dataset, which has a GeoJSON structure:

CREATE TABLE boston_public_schools AS 
SELECT (je->'id')::bigint AS id, 
	(je->'geometry'->'coordinates'->>0)::float AS longitude,
	(je->'geometry'->'coordinates'->>1)::float AS latitude,
	je->'properties'->>'SCH_NAME' AS sch_name,
	je->'properties'->>'ADDRESS' AS address,
	je->'properties'->>'CITY' AS city,
	je->'properties'->>'ZIPCODE' AS zipcode
FROM data_json 
	CROSS JOIN jsonb_array_elements(data_json.data->'features') AS je;

There are 3 features being used in this example. First, we are using the -> operator. When applied to a jsonb or json value, this operator returns the property as a jsonb or json element. Note that you can burrow into a document by chaining these operator calls, as we do with je->'geometry'->'coordinates'->>1.

The companion to -> is the ->> operator, which returns text instead of json. You use this when you are done with your burrowing.

Both -> and ->> can take a text or an integer. The integer version is used only for json arrays and returns the nth element of the array. Counting of arrays in JavaScript and by extension JSON starts at 0.
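A quick way to see the difference between the two operators is to ask pg_typeof what comes back. This sketch uses a tiny made-up feature rather than the real dataset:

```sql
-- A tiny made-up feature to poke at; pg_typeof shows what each operator returns.
SELECT j->'geometry'->'coordinates'->0     AS lon_as_jsonb,
       j->'geometry'->'coordinates'->>0    AS lon_as_text,
       pg_typeof(j->'properties'->'CITY')  AS arrow_type,        -- jsonb
       pg_typeof(j->'properties'->>'CITY') AS double_arrow_type  -- text
FROM (
    SELECT '{"geometry": {"coordinates": [-71.004, 42.388]},
             "properties": {"CITY": "East Boston"}}'::jsonb
) AS t(j);
```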

So with these operators you pick out pieces of a json document, but before we do that, we'll want to expand the json document into its separate rows. A GeoJSON FeatureCollection document, like this one, has a features property, which is an array with each element being a data row.

To break up these rows, you can use jsonb_array_elements, which is a set returning function that only works with jsonb arrays and returns each element of the array as a jsonb value.

Your table should end up looking like this

SELECT * FROM boston_public_schools LIMIT 3;

 id |     longitude      |     latitude      |       sch_name       |       address       |    city     | zipcode
----+--------------------+-------------------+----------------------+---------------------+-------------+---------
  1 | -71.00412000099993 | 42.38879000000003 | Guild Elementary     | 195 Leyden Street   | East Boston | 02128
  2 | -71.03047970999995 | 42.37853662100008 | Kennedy Patrick Elem | 343 Saratoga Street | East Boston | 02128
  3 | -71.03389000099997 | 42.37527000000006 | Otis Elementary      | 218 Marion Street   | East Boston | 02128
(3 rows)

Using JSON_TABLE to expand rows and columns

Introduced in PostgreSQL 17 is the ISO-SQL standard JSON_TABLE function, which, if you are familiar with XML, follows more or less the same pattern as XMLTABLE. It utilizes json path syntax to parse out elements. Here is a repeat of the earlier exercise using JSON_TABLE.


DROP TABLE IF EXISTS boston_public_schools;
CREATE TABLE boston_public_schools AS
SELECT
    je.*
FROM
    data_json
CROSS JOIN
    JSON_TABLE (
        data_json.data,
        '$.features[*]' COLUMNS (
            id integer PATH '$.id',
            longitude float PATH '$.geometry.coordinates[0]',
            latitude float PATH '$.geometry.coordinates[1]',
            NESTED PATH '$.properties' COLUMNS (
                sch_name text PATH '$.SCH_NAME',
                address text PATH '$.ADDRESS',
                city text PATH '$.CITY',
                zipcode text PATH '$.ZIPCODE'
            )
        )
    ) AS je;

This is a pretty rich function, so you should read the docs to appreciate its full breadth. The fact that it's an ISO/SQL standard function means you are more likely to find it in other relational databases. If you have a heavily nested document, the NESTED PATH subclause comes in handy for saving some typing. The main thing I don't like about it is that I find it harder to comprehend and remember than the earlier syntax.


Basics

Learning PostgreSQL from AI and JSON exploration: Part 2 Beginner

This is the second part of the series I started on Learning PostgreSQL from AI and JSON exploration: Part 1. For this 2nd part, I decided to try gpt-oss, the 14GB model, which was just released in the past week. My first impression: "When will this AI shut up about its thinking process?" Phi4, on the other hand, gave a bit of a narrative about how it came up with answers, but it did not write a novel about it, and it also seemed more personable, using "My and you" instead of a distanced "User wants to". However, gpt-oss did become less chatty with a bit of coaxing. So anyway, personality-wise I prefer phi4. That is not to say that gpt-oss doesn't have some wow factor. It created perfect SQL each time; I did not need to correct it. One of the other neat features of this model, which I haven't had time to explore, is the ability to have it do browser searches and other agentic-like stuff.

Again, for this size model, the speed of answering with my crappy graphics card and processor was not bad, but it was noticeably slower than Phi4. Given the impressive accuracy, I didn't mind too much, and I suspect it can be improved by tweaking some of its settings.

Personal growth with AI

Here are a couple of ways I feel an AI like gpt-oss can help with personal growth.

Even as an experienced PostgreSQL user, there are some functions you need to be reminded to use or that you are aghast you've never used. Gpt-oss showed me I was missing out on using jsonb_to_recordset.
The hard part of programming is not learning a language, but learning how to express your questions unambiguously and understanding fully what you are trying to achieve. gpt-oss helped in this regard by pointing out where my questions didn't make sense or were ambiguous given the dataset I was asking it to work with.
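For anyone else who has been missing out, jsonb_to_recordset turns a jsonb array of objects into rows, with columns typed by the definition list you give it. This is a minimal sketch on a made-up array, not gpt-oss's actual answer; the alias s and the chosen columns are my own.

```sql
-- Each object in the array becomes a row; keys are matched to the column names.
SELECT s.stop_id, s.name, s.arrival
FROM jsonb_to_recordset(
    '[{"stop_id": 1, "name": "Boston Airport", "arrival": "2025-09-01T10:00:00-04:00"},
      {"stop_id": 2, "name": "Boston City Center", "arrival": "2025-09-01T12:00:00-04:00"}]'::jsonb
) AS s(stop_id integer, name text, arrival timestamptz);
```

Keys that are not listed in the column definition are simply ignored, which makes it handy for plucking a few fields out of wide objects.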

How up to date is your AI

Let's start as we did before, by assessing how recent gpt-oss's knowledge of PostgreSQL is. I did not try to tweak any of its settings, and out of the box it's painfully chatty. As I used it more, probably because my prior conversation was still in its context, it became less chatty and even stopped with the impersonal "User requests" preface.

To prevent it from writing a novel about PostgreSQL's versioning before giving me an answer, I asked it this way

Me:

What is the latest version of PostgreSQL in one sentence. No narrative.

To which I am greeted with the answer below, which took quite some time because it was probably thinking about PostgreSQL versioning patterns over the years.

gpt-oss:

Thinking... User requests: "What is the latest version of PostgreSQL in one sentence. No narrative". So we answer: "PostgreSQL 17.4".

Creating fake data

Me:

Create a PostgreSQL table with a jsonb column that has heavily nested data that has 2 rows covering trips where the primary key is generated using `GENERATED ALWAYS AS IDENTITY`. No thinking narrative.

I threw in the requirement about GENERATED ALWAYS AS IDENTITY because, just like phi4, gpt-oss wanted to give me an old-school bigserial. Its generated data output freaked me out a bit because many of the locations were places that are a few blocks from me, though the names were slightly different or had the wrong address, which I assume was intentional. Is it because I live at the center of techie civilization, or did it pick up my GPS location and think "I'll show her some stuff she can relate to"? Maybe next time, I will allow it to go on about how it came up with these places.

gpt-oss:

Thinking... We need to provide just the SQL code. No narrative. Provide table creation and insertion of two rows with nested JSONB. Keep concise. ...done thinking.


CREATE TABLE trips (
    id   BIGINT GENERATED ALWAYS AS IDENTITY PRIMARY KEY,
    data jsonb NOT NULL
);

INSERT INTO trips (data) VALUES
(
    '{
        "trip_name": "Weekend Getaway",
        "trip_type": "leisure",
        "origin": {
            "city": "New York",
            "airport": "JFK",
            "coordinates": { "lat": 40.6413, "lon": -73.7781 }
        },
        "destination": {
            "city": "Boston",
            "airport": "BOS",
            "coordinates": { "lat": 42.3656, "lon": -71.0096 }
        },
        "stops": [
            {
                "stop_id": 1,
                "name": "Boston Airport",
                "arrival": "2025-09-01T10:00:00-04:00",
                "departure": "2025-09-01T11:30:00-04:00",
                "location": {
                    "coordinates": { "lat": 42.3656, "lon": -71.0096 },
                    "address": {
                        "street": "S. Station Fwy",
                        "city": "Boston",
                        "state": "MA",
                        "zip": "02128"
                    }
                },
                "weather": {
                    "forecast": [
                        { "date": "2025-09-01", "high": 78, "low": 62, "condition": "Partly Cloudy" },
                        { "date": "2025-09-02", "high": 80, "low": 63, "condition": "Sunny" }
                    ],
                    "precipitation_probability": 0.15
                }
            },
            {
                "stop_id": 2,
                "name": "Boston City Center",
                "arrival": "2025-09-01T12:00:00-04:00",
                "departure": "2025-09-01T15:00:00-04:00",
                "location": {
                    "coordinates": { "lat": 42.3601, "lon": -71.0589 },
                    "address": {
                        "street": "1 First St",
                        "city": "Boston",
                        "state": "MA",
                        "zip": "02108"
                    }
                },
                "weather": {
                    "forecast": [
                        { "date": "2025-09-01", "high": 78, "low": 62, "condition": "Partly Cloudy" },
                        { "date": "2025-09-02", "high": 80, "low": 63, "condition": "Sunny" }
                    ],
                    "precipitation_probability": 0.12
                }
            }
        ],
        "itinerary": {
            "activities": [
                {
                    "time": "2025-09-01T12:30:00-04:00",
                    "activity": "Lunch at Quincy Market",
                    "details": { "restaurant": "Trident", "cuisine": "Seafood", "reservation": true }
                },
                {
                    "time": "2025-09-01T14:00:00-04:00",
                    "activity": "Freedom Trail",
                    "details": { "tour_type": "Self‑guided", "duration_minutes": 120, "cost_usd": 0 }
                }
            ],
            "accommodation": {
                "hotel": {
                    "name": "Boston Marriott Copley Place",
                    "address": "101 State Street, Boston, MA 02108",
                    "room_type": "Deluxe King",
                    "check_in": "2025-09-01T15:30:00-04:00",
                    "check_out": "2025-09-02T11:00:00-04:00"
                }
            }
        }
    }'::jsonb
),
(
    '{
        "trip_name": "Business Trip",
        "trip_type": "work",
        "origin": {
            "city": "San Francisco",
            "airport": "SFO",
            "coordinates": { "lat": 37.6213, "lon": -122.3790 }
        },
        "destination": {
            "city": "Seattle",
            "airport": "SEA",
            "coordinates": { "lat": 47.6038, "lon": -122.3308 }
        },
        "stops": [
            {
                "stop_id": 1,
                "name": "Seattle Airport",
                "arrival": "2025-10-10T09:00:00-07:00",
                "departure": "2025-10-10T09:30:00-07:00",
                "location": {
                    "coordinates": { "lat": 47.6038, "lon": -122.3308 },
                    "address": {
                        "street": "3rd Ave & E Denny Way",
                        "city": "Seattle",
                        "state": "WA",
                        "zip": "98128"
                    }
                },
                "weather": {
                    "forecast": [
                        { "date": "2025-10-10", "high": 68, "low": 54, "condition": "Cloudy" }
                    ],
                    "precipitation_probability": 0.22
                }
            },
            {
                "stop_id": 2,
                "name": "Microsoft Seattle HQ",
                "arrival": "2025-10-10T10:30:00-07:00",
                "departure": "2025-10-10T11:30:00-07:00",
                "location": {
                    "coordinates": { "lat": 47.6694, "lon": -122.3458 },
                    "address": {
                        "street": "1501 2nd Ave W",
                        "city": "Seattle",
                        "state": "WA",
                        "zip": "98121"
                    }
                },
                "weather": {
                    "forecast": [
                        { "date": "2025-10-10", "high": 70, "low": 58, "condition": "Clear" }
                    ],
                    "precipitation_probability": 0.05
                }
            }
        ],
        "itinerary": {
            "meetings": [
                {
                    "time": "2025-10-10T10:30:00-07:00",
                    "location": "Microsoft Seattle HQ",
                    "participants": ["John Doe", "Jane Smith", "Alan Turing"],
                    "agenda": ["Kickoff", "Budget Review", "Road‑map"]
                },
                {
                    "time": "2025-10-10T11:30:00-07:00",
                    "location": "Amazon Seattle HQ",
                    "participants": ["Bob Johnson", "Elon Musk"],
                    "agenda": ["Partnership", "Tech Integration"]
                }
            ],
            "accommodation": {
                "hotel": {
                    "name": "The Edgewater Seattle",
                    "address": "121 2nd Ave, Seattle, WA 98101",
                    "room_type": "Executive Suite",
                    "check_in": "2025-10-10T14:00:00-07:00",
                    "check_out": "2025-10-11T11:00:00-07:00"
                }
            }
        }
    }'::jsonb
);

gpt-oss gave exceptionally beautiful and flawlessly runnable code and it gave me no more and no less than the column structure I asked for.

Now that we have some dense data, let's query it.

Me:

Write a PostgreSQL SQL statement that returns the date and itineraries of each trip one row per itinerary. No thinking narrative.

gpt-oss took a minute or two to come up with an answer for this on my desktop, but it gave a pretty decent answer and a short description of its thinking process.

Thinking... We need to extract date and itineraries per trip. Likely itinerary activities: each row per activity. Provide date and itinerary details. Use jsonb_to_recordset. Let's craft: SELECT id, trip_name, activity_time, activity, details FROM trips, jsonb_to_recordset(data->'itinerary'->'activities') as x(activity_time timestamp, activity text, details jsonb). Or extract date part of time. Provide date (activity_time::date). So output: date, activity, details. Let's craft final answer. ...done thinking.


SELECT
    t.id,
    t.data->>'trip_name'            AS trip_name,
    activity_rec.time::date         AS activity_date,
    activity_rec.activity           AS activity,
    activity_rec.details            AS activity_details
FROM
    trips AS t
CROSS JOIN LATERAL
    jsonb_to_recordset(
        t.data -> 'itinerary' -> 'activities'
    ) AS activity_rec(
        time      TIMESTAMP,
        activity  TEXT,
        details   JSONB
    );
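
One detail worth checking in the query above is the TIMESTAMP type in the column definition list. The JSON values carry a UTC offset, and PostgreSQL silently ignores the offset when a string is read as timestamp without time zone. The following is a minimal, self-contained sketch (the column names ts_plain and ts_tz are made up for illustration) that reads the same string both ways:

```sql
-- Same ISO 8601 string read as TIMESTAMP and as TIMESTAMPTZ.
-- ts_plain and ts_tz are illustrative names, not part of the trips data.
SELECT x.ts_plain,   -- offset dropped: keeps the wall-clock value 14:00
       x.ts_tz       -- offset honored: displayed in the session TimeZone
FROM jsonb_to_recordset(
        '[{"ts_plain": "2025-09-01T14:00:00-04:00",
           "ts_tz":    "2025-09-01T14:00:00-04:00"}]'::jsonb
     ) AS x(ts_plain timestamp, ts_tz timestamptz);
```

For an itinerary, keeping the local wall-clock time is arguably what you want, so TIMESTAMP is a reasonable choice here. If you need to compare times across trips in different time zones, TIMESTAMPTZ is probably the safer type.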

Let's revise a bit, because I don't want to read a blob of JSON in the activity details.

Me:

Write a PostgreSQL SQL statement that returns the date, time, type, name of each activity or meeting, one row per activity or meeting. For the type specify if it is a meeting or an activity. No thinking narrative.

As you can see, it did end up providing a bit of narrative. But if you are learning how to query PostgreSQL, you will probably welcome the explanation.

What I really love about gpt-oss's explanation is that it made me aware of ambiguities in my question and of things I hadn't really thought much about regarding the structure of the data. More importantly, I discovered that I had never used the wonderful jsonb_to_recordset function, though I had heard about it, and yet it has existed since PostgreSQL 9.4.

gpt-oss:

Thinking... We need a query to return date, time, type (meeting or activity), name. Activities have fields: time, activity, details. Meetings have time, location, participants, agenda. So type determined by presence of 'activity' key. Or we can union two selects. We need date, time, type, name. For activity name: activity field. For meeting name: maybe location or maybe some field like 'meeting'. But they didn't specify meeting name. We could use 'location' as name for meeting. The problem: "type, name of each activity or meeting". For meeting, name could be location. Use location. We'll union. Use CROSS JOIN LATERAL. Return: trip id maybe optional. They didn't ask for trip id. Just date, time, type, name. Implement: SELECT date, time, 'activity' AS type, activity AS name FROM activities UNION ALL SELECT date, time, 'meeting' AS type, location AS name FROM meetings But need to parse JSON. We'll use jsonb_to_recordset for activities and meetings. Return date and time from timestamp. Use date(time) and time(time). The 'time' field is TIMESTAMP. Also include type. Let's produce: SELECT (act.time)::date AS date, (act.time)::time AS time, 'activity' AS type, act.activity AS name FROM trips t CROSS JOIN LATERAL jsonb_to_recordset(t.data->'itinerary'->'activities') AS act(time TIMESTAMP, activity TEXT, details JSONB) UNION ALL SELECT (meet.time)::date AS date, (meet.time)::time AS time, 'meeting' AS type, meet.location AS name FROM trips t CROSS JOIN LATERAL jsonb_to_recordset(t.data->'itinerary'->'meetings') AS meet(time TIMESTAMP, location TEXT, participants JSONB, agenda JSONB); Order by date, time. Let's produce final answer. ...done thinking.


/*  Return one row per activity or meeting, showing
    the date, time, type (activity / meeting) and name  */

SELECT
    (act.time)::date  AS date,
    (act.time)::time  AS time,
    'activity'        AS type,
    act.activity      AS name
FROM
    trips t
CROSS JOIN LATERAL
    jsonb_to_recordset(
        t.data -> 'itinerary' -> 'activities'
    ) AS act(
        time      TIMESTAMP,
        activity  TEXT,
        details   JSONB
    )

UNION ALL

SELECT
    (meet.time)::date  AS date,
    (meet.time)::time  AS time,
    'meeting'          AS type,
    meet.location      AS name
FROM
    trips t
CROSS JOIN LATERAL
    jsonb_to_recordset(
        t.data -> 'itinerary' -> 'meetings'
    ) AS meet(
        time        TIMESTAMP,
        location    TEXT,
        participants JSONB,
        agenda      JSONB
    )

ORDER BY date, time;
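
To see what jsonb_to_recordset does in isolation, here is a minimal sketch that runs without the trips table. The inline JSON array is a trimmed-down sample I put together from the data above, so treat it as an illustration only. It shows that keys missing from an element come back as NULL and that keys not named in the column definition list are ignored:

```sql
-- Stand-alone sketch: one meeting-like and one activity-like element.
-- "agenda" is not in the column list, so it is simply ignored.
SELECT r."time", r.activity, r.location
FROM jsonb_to_recordset(
        '[
           {"time": "2025-10-10T10:30:00-07:00", "location": "Microsoft Seattle HQ", "agenda": ["Kickoff"]},
           {"time": "2025-09-01T14:00:00-04:00", "activity": "Freedom Trail"}
         ]'::jsonb
     ) AS r("time" timestamp, activity text, location text);
-- Row 1: activity is NULL, location is 'Microsoft Seattle HQ'
-- Row 2: activity is 'Freedom Trail', location is NULL
```

This is also why the single-query alternative gpt-oss considered (deciding the type by the presence of the activity key) would be workable, though the UNION ALL version is probably easier to read.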


Using PostgreSQL Extensions

http extension for windows updated to include PostgreSQL18 64-bit

Updated October 16th, 2025: 64-bit package for PostgreSQL 18, http extension v1.7.0 release.

For those folks on Windows who want to do http gets and posts directly from their PostgreSQL server, we've made binaries of the http extension for PostgreSQL on Windows.

These are designed to work with the PostgreSQL EDB Windows distributions.

If you already have PostGIS installed, you will already have many of these files, since things like libcurl and PCRE are also packaged with PostGIS.

http extension binaries for PostgreSQL 18, 17, 16, 15, 14, 13, 12, 11, 10, 9.6, 9.5, and 9.4 Windows 64-bit downloads

PostgreSQL 10-14 64-bit are 1.5.0; other versions are older http versions.

The curl library for http is built with SSL support and utilizes the ssleay32.dll packaged with the EDB installs.

http extension binaries for PostgreSQL 10, 9.6, 9.5, and 9.4 Windows 32-bit downloads

http quick primer

To enable the extension in a database after installing the binaries:

CREATE EXTENSION http;
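
Since the packaged http version differs by PostgreSQL version, it can help to confirm what you actually ended up with. A small sketch using the standard pg_available_extensions catalog view:

```sql
-- default_version is what the installed binaries provide;
-- installed_version is what this database currently has (NULL if not created).
SELECT name, default_version, installed_version
FROM pg_available_extensions
WHERE name = 'http';
```

If installed_version lags behind default_version after you drop in newer binaries, running ALTER EXTENSION http UPDATE; should bring the database up to the packaged version, assuming the package ships the needed upgrade scripts.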

Do a basic get

SELECT h.content, h.content_type, hkv.value AS dt
FROM http_get('http://postgis.net/tips/') AS h
    LEFT JOIN LATERAL (
        SELECT *
        FROM unnest(h.headers)
        WHERE field = 'Date'
    ) AS hkv ON true;
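
A post follows the same pattern. The sketch below uses the extension's http_post(uri, content, content_type) function. The URL and the request body are placeholders I made up, so swap in an endpoint you control before running it:

```sql
-- Hypothetical endpoint and payload, for illustration only.
SELECT p.status, p.content_type, p.content
FROM http_post(
        'https://example.com/api/echo',        -- placeholder URL (assumption)
        '{"trip_name": "Business Trip"}',      -- request body sent as-is
        'application/json'                     -- content type of the body
     ) AS p;
```

The result has the same shape as the http_get call above, so you can unnest p.headers in the same way.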

Check out more examples at: https://github.com/pramsey/pgsql-http and https://github.com/pramsey/pgsql-openai.
