Use Postgres' RETURNING capability to optimize `insertMany`. #407

MaxGabriel · 2015-06-02T15:07:58Z

Closes #365.

This PR uses Postgres' RETURNING capability to optimize bulk inserts. Instead of being inserted one at a time, the rows can be inserted all at once, with their IDs returned by the same statement. Based on my comment here, I don't think this is possible for SQLite/MySQL in a reliable way.

I believe test coverage is already pretty good on this, because several tests use insertMany, get back IDs, and then use those IDs to look up the inserted rows.
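For readers who want the idea in concrete form, here is a minimal sketch of the kind of statement involved. `insertManyReturning` is a hypothetical helper written for illustration, not the function added by this PR, and it assumes the table and column names are already escaped.

```haskell
import Data.List (intercalate)

-- Hypothetical helper (not persistent's actual implementation):
-- builds a single multi-row INSERT whose RETURNING clause hands back
-- the key column(s) of every inserted row.
insertManyReturning
  :: String    -- ^ table name, assumed already escaped
  -> [String]  -- ^ columns being inserted, assumed already escaped
  -> [String]  -- ^ key column(s) to return
  -> Int       -- ^ number of rows to insert
  -> String
insertManyReturning table cols keyCols nRows =
  concat
    [ "INSERT INTO ", table
    , "(", intercalate "," cols, ")"
    , " VALUES ", intercalate "," (replicate nRows rowPlaceholders)
    , " RETURNING ", intercalate "," keyCols
    ]
  where
    -- One "(?,?,...)" group per row, one placeholder per column.
    rowPlaceholders = "(" ++ intercalate "," (map (const "?") cols) ++ ")"
```

With zero rows this produces an invalid statement, so a caller would need to short-circuit the empty list before building the query.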

Things I'm unsure about:

* The different InsertSqlResult types. Is ISRManyKeys supposed to be used when a composite primary key is defined?
* Version bumps. Is this considered a breaking change because it adds a new record field to the public API?

gregwebs · 2015-06-02T16:43:32Z

This should re-use functionality from Sql/Util.hs. In particular, this will fail at least for composite keys and should use dbIdColumns for the returning clause. A test case for this should be added, perhaps in CompositeTest or PrimaryTest.

MaxGabriel · 2015-06-05T20:06:16Z

OK, I added a failing test for the composite primary key case, and then fixed it by adding a RawSql instance for Keys and using dbIdColumnsEsc from Database.Persist.Sql.Util. I should probably add more tests for the RawSql instance, but is this generally correct otherwise?
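As a rough illustration of why composite keys need their own handling: each returned row carries one column per key component, so the row has to be parsed as a whole rather than as a single ID. The sketch below uses stand-in types (`Value`, `CitizenKey`) invented for this example; it does not use persistent's real `PersistValue` or `RawSql` machinery.

```haskell
-- Stand-in for a database value; only what this sketch needs.
data Value = VInt Int | VText String
  deriving (Eq, Show)

-- Hypothetical composite key with two components.
data CitizenKey = CitizenKey Int String
  deriving (Eq, Show)

-- Number of columns this key occupies in a RETURNING row.
keyColCount :: Int
keyColCount = 2

-- Parse one returned row into a key, reporting a column-count
-- mismatch separately from a type mismatch.
parseKeyRow :: [Value] -> Either String CitizenKey
parseKeyRow [VInt n, VText c] = Right (CitizenKey n c)
parseKeyRow vals
  | length vals /= keyColCount =
      Left ("expected " ++ show keyColCount ++ " columns, got " ++ show (length vals))
  | otherwise = Left ("unexpected column types: " ++ show vals)

-- Parse every returned row, stopping at the first failure.
parseKeyRows :: [[Value]] -> Either String [CitizenKey]
parseKeyRows = traverse parseKeyRow
```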

gregwebs · 2015-06-06T14:18:47Z

I actually haven't touched RawSql before, but it looks good now. Perhaps @snoyberg can look at the RawSql instance.

gregwebs · 2015-06-06T14:22:42Z

One more thing: the documentation for insertMany_ and insertMany should now be updated to reflect which SQL databases have efficient implementations.

MaxGabriel · 2015-06-07T02:26:49Z

persistent/Database/Persist/Sql/Class.hs

+  rawSqlCols _ key         = (length $ keyToValues key, [])
+  rawSqlColCountReason key = "The primary key is composed of "
+                             ++ (show $ length $ keyToValues key)
+                             ++ " columns"


Is there a better way to write rawSqlColCountReason? Maybe a way to list out the components of the key?
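One possible shape for a message that lists the components, as a sketch only. `keyColCountReason` is a hypothetical standalone function that takes the column names as an argument; whether those names are actually available inside the instance is an assumption that has not been checked here.

```haskell
import Data.List (intercalate)

-- Hypothetical variant of the message: names each key column
-- instead of reporting only how many there are.
keyColCountReason :: [String] -> String
keyColCountReason cols =
  "The primary key is composed of "
    ++ show (length cols)
    ++ " columns: "
    ++ intercalate ", " cols
```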

gregwebs · 2015-06-07T17:00:13Z

@MaxGabriel The changes look good now, thanks for the great patch with the updated ChangeLog and documentation!

@snoyberg this bumps persistent to 2.2 because the SqlBackend gets a connInsertManySql field added. However, this is only really an API breakage for someone implementing a new SQL backend, so we could probably avoid the major version bump if we wanted to. What do you think?

snoyberg · 2015-06-08T13:07:50Z

I'm OK skipping the major version bump; we typically do so for internal-only changes like this.

gregwebs · 2015-06-08T17:44:07Z

Hmm, I am having second thoughts on the minor version bump. If someone pegs their backend version, but not persistent, then re-installing dependencies can cause a breakage. If we do the major bump, then at least we can say persistent should have been constrained to a major version.

@snoyberg are you ok with making this a major bump?

snoyberg · 2015-06-08T18:00:06Z

Yes, I'm OK with it. Just to throw out one other possibility: we can add upper bounds via Hackage revisions to prevent the currently-released backends from using the newer persistent. I slightly prefer that approach, but only slightly.

gregwebs · 2015-06-08T22:28:47Z

I tagged and released them all as version 2.2. That is enough work for me without changing a bunch of revisions on Hackage.

Use Postgres' RETURNING capability to optimize insertMany.

4ba264b

Closes #365

snoyberg added the in progress label Jun 2, 2015

MaxGabriel added the Postgres label Jun 2, 2015

MaxGabriel added 2 commits June 2, 2015 21:54

Add failing test for composite primary keys when using insertMany

832c204

Add RawSql instance for Key, and use that for parsing composite primary keys

0b53bd9

MaxGabriel added 4 commits June 6, 2015 08:51

Document which backends have efficient implementations for insertMany, insertMany_ and insertEntityMany

018be6a

Clean up RawSql instance for Keys

71c7efc

Add test for RawSql instance for Key (for composite keys)

06022dc

Add a test using the RawSql instance for a non-composite primary key

989098f

MaxGabriel reviewed Jun 7, 2015

Update changelogs for #407

96c7cc0

* `RawSql` instance for `Key`s * Optimize `insertMany` for Postgres backend [ci skip]

Change persistent version bump from (2.1.6 -> 2.2) to (2.1.6 -> 2.1.7)

d520346

gregwebs added a commit that referenced this pull request Jun 8, 2015

Merge pull request #407 from yesodweb/postgresInsertManyReturning3

e944857

Use Postgres' RETURNING capability to optimize `insertMany`.

gregwebs merged commit e944857 into master Jun 8, 2015

snoyberg removed the in progress label Jun 8, 2015

gregwebs deleted the postgresInsertManyReturning3 branch June 8, 2015 17:33
