River mainly targets Postgres, but is equipped with experimental support for SQLite. SQLite databases are files on disk rather than active daemons, making them suitable for embedded environments and other non-server contexts. We're hoping that bringing SQLite to River will widen its potential applications.
SQLite support was introduced in River 0.23.0.
The SQLite driver
SQLite is activated by initializing a client with the riversqlite driver instead of the more common riverpgxv5:
import ( "database/sql"
_ "modernc.org/sqlite"
"github.com/riverqueue/river" "github.com/riverqueue/river/riverdriver/riversqlite")
dbPool, err := sql.Open("sqlite", "sqlite://river.sqlite3")if err != nil { panic(err)}defer dbPool.Close()
dbPool.SetMaxOpenConns(1)
workers := river.NewWorkers()river.AddWorker(workers, &SortWorker{})
riverClient, err := river.NewClient(riversqlite.New(dbPool), &river.Config{ Queues: map[string]river.QueueConfig{ river.QueueDefault: {MaxWorkers: 100}, }, Workers: workers,})if err != nil { panic(err)}
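The SortWorker referenced above isn't defined in this snippet. A minimal version, adapted from River's usual getting-started example, looks something like this:

import (
    "context"
    "fmt"
    "sort"

    "github.com/riverqueue/river"
)

type SortArgs struct {
    // Strings is a slice of strings to sort.
    Strings []string `json:"strings"`
}

func (SortArgs) Kind() string { return "sort" }

type SortWorker struct {
    // An embedded WorkerDefaults sets up default methods to fulfill the rest
    // of the Worker interface.
    river.WorkerDefaults[SortArgs]
}

func (w *SortWorker) Work(ctx context.Context, job *river.Job[SortArgs]) error {
    sort.Strings(job.Args.Strings)
    fmt.Printf("Sorted strings: %+v\n", job.Args.Strings)
    return nil
}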
See the sqlite example for complete code.
River's SQLite driver uses primitives from Go's built-in database/sql, so it should theoretically be able to support a varied selection of SQLite drivers. River tests with modernc.org/sqlite, a modern, pure Go option.
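For instance, it may be possible to swap in a different database/sql-compatible driver like mattn/go-sqlite3. This is an unverified sketch, not a configuration River tests; that driver registers under the name "sqlite3", takes a plain file path as its DSN, and requires CGO:

import (
    "database/sql"

    // CGO-based alternative driver; registers as "sqlite3" (untested with River).
    _ "github.com/mattn/go-sqlite3"
)

dbPool, err := sql.Open("sqlite3", "river.sqlite3")
if err != nil {
    panic(err)
}
defer dbPool.Close()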
SQLite support is broadly tested internally, but should still be considered experimental, and its schema may see a few more tweaks before being considered finalized. All changes will be noted in the changelog for future versions.
Due to limitations like SQLite's less expressive SQL (for example, mutations in CTEs aren't possible) and rocky SQLite support in sqlc, the SQLite driver isn't as fast as the Postgres one, benchmarking at about ¼ the speed, though it still pushes 10k jobs/sec. We expect to address these deficiencies over time, and it's reasonably likely that SQLite and Postgres will converge in performance as optimizations for the former are implemented.
Migrating via CLI
Migrations use the normal River CLI. A sqlite:// DSN activates the SQLite driver:
export DATABASE_URL="sqlite://sqlite/river.sqlite3"
river migrate-up --line main --database-url "$DATABASE_URL"
SQLite's DDL differs from Postgres', so even operations that don't act on a database need a DSN to tell River which kind of database to produce SQL for:
river migrate-get --version 6 --up --database-url sqlite:// > river_6.up.sql
Concurrency best practices
An inherent limitation of SQLite is that it allows only one writer at a time. Writers that try to acquire a lock while another process holds one are met with this error:
database is locked (5) (SQLITE_BUSY)
Below are two techniques for avoiding this problem. We recommend the use of both.
WAL journaling
Like most databases, SQLite uses journaling to protect written data in the event of crashes. Its default journaling mode is DELETE, but WAL (write-ahead logging) can be enabled for superior concurrency. Connect to a database and set journal_mode:
PRAGMA journal_mode = WAL
Specifically, use of WAL means that some concurrent access becomes possible: readers do not block writers and a writer does not block readers. We find in our testing that use of WAL doesn't immediately produce a noticeable effect at lower concurrency levels, but in highly concurrent code its use becomes paramount, with SQLITE_BUSY errors inevitable without it.
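As a concrete example, the pragma can be issued from Go right after opening the pool. This is a minimal sketch reusing the pool setup from above; note that journal_mode = WAL is persistent, so once set it stays with the database file:

dbPool, err := sql.Open("sqlite", "sqlite://river.sqlite3")
if err != nil {
    panic(err)
}
defer dbPool.Close()

// Enable write-ahead logging. Unlike most pragmas, journal_mode = WAL is
// persistent: it's recorded in the database file and survives reconnects.
if _, err := dbPool.Exec("PRAGMA journal_mode = WAL"); err != nil {
    panic(err)
}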
Single connection pool
A good way to manage highly concurrent access with a Go connection pool is to configure it to have a maximum of one active connection:
dbPool, err := sql.Open("sqlite", "sqlite://river.sqlite3")
if err != nil {
    panic(err)
}
defer dbPool.Close()

// Set maximum connections to 1 to avoid `SQLITE_BUSY` errors.
dbPool.SetMaxOpenConns(1)
This approach is unconventional by database standards because it means that goroutines will block waiting for database access, but in practice SQLite operations are fast, so it tends to be tolerable even at high throughput.
An alternative is to increase busy_timeout. We find that use of a single active connection is more effective at reducing the incidence of SQLITE_BUSY errors, but your mileage may vary.
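If you'd like to experiment with it anyway, busy_timeout is set like any other pragma. A sketch, reusing the pool from above; the 5000 ms value is only illustrative:

// Wait up to five seconds (an illustrative value) to acquire a lock
// before failing with `SQLITE_BUSY`.
if _, err := dbPool.Exec("PRAGMA busy_timeout = 5000"); err != nil {
    panic(err)
}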
Keep transactions short
When setting a maximum of one active connection, take care with transactions. An open transaction will monopolize the single available connection until it finishes, potentially starving out other callers. Avoid this by keeping transactions as short-lived as possible, ideally lasting only single-digit milliseconds.
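As a rule of thumb, begin a transaction as late as possible and commit it as soon as its statements have run. A minimal database/sql sketch of the pattern:

ctx := context.Background()

tx, err := dbPool.BeginTx(ctx, nil)
if err != nil {
    panic(err)
}
// Rollback is a no-op if the commit below already succeeded.
defer tx.Rollback()

// Run the minimum set of statements needed inside the transaction ...

// Commit immediately so the single connection is freed for other callers.
if err := tx.Commit(); err != nil {
    panic(err)
}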
Benchmark
Here's an informal benchmark of River on SQLite burning through a fixed backlog of one million jobs. On a commodity M4 MacBook Pro, it works about 10k jobs/sec, which is about ¼ the speed of a similar run on Postgres:
$ go run ./cmd/river bench --database-url "sqlite://:memory:" --num-total-jobs 1_000_000
bench: jobs worked [ 0 ], inserted [ 1000000 ], job/sec [ 0.0 ] [0s]
bench: jobs worked [ 16007 ], inserted [ 0 ], job/sec [ 8003.5 ] [2s]
bench: jobs worked [ 22009 ], inserted [ 0 ], job/sec [ 11004.5 ] [2s]
bench: jobs worked [ 20019 ], inserted [ 0 ], job/sec [ 10009.5 ] [2s]
bench: jobs worked [ 20005 ], inserted [ 0 ], job/sec [ 10002.5 ] [2s]
bench: jobs worked [ 19490 ], inserted [ 0 ], job/sec [ 9745.0 ] [2s]
bench: jobs worked [ 20011 ], inserted [ 0 ], job/sec [ 10005.5 ] [2s]
bench: jobs worked [ 18521 ], inserted [ 0 ], job/sec [ 9260.5 ] [2s]
bench: jobs worked [ 20017 ], inserted [ 0 ], job/sec [ 10008.5 ] [2s]
bench: jobs worked [ 20004 ], inserted [ 0 ], job/sec [ 10002.0 ] [2s]
bench: jobs worked [ 19502 ], inserted [ 0 ], job/sec [ 9751.0 ] [2s]
bench: jobs worked [ 18520 ], inserted [ 0 ], job/sec [ 9260.0 ] [2s]
bench: jobs worked [ 19504 ], inserted [ 0 ], job/sec [ 9752.0 ] [2s]
bench: jobs worked [ 18511 ], inserted [ 0 ], job/sec [ 9255.5 ] [2s]
bench: jobs worked [ 19752 ], inserted [ 0 ], job/sec [ 9876.0 ] [2s]
bench: jobs worked [ 20262 ], inserted [ 0 ], job/sec [ 10131.0 ] [2s]
bench: jobs worked [ 19520 ], inserted [ 0 ], job/sec [ 9760.0 ] [2s]
bench: jobs worked [ 18503 ], inserted [ 0 ], job/sec [ 9251.5 ] [2s]
bench: jobs worked [ 20004 ], inserted [ 0 ], job/sec [ 10002.0 ] [2s]
bench: jobs worked [ 16007 ], inserted [ 0 ], job/sec [ 8003.5 ] [2s]
...
bench: total jobs worked [ 1000000 ], total jobs inserted [ 1000000 ], overall job/sec [ 9560.1 ], running 1m44.601231s
The difference in speed has little to do with database performance and more to do with driver implementation. Due to limitations in sqlc, the SQLite driver's batch operations work only one row at a time, and although each round trip is fast because the database is local, operating row by row still has a dramatic impact on performance.
Venturing beyond Postgres
SQLite is River's first foray beyond Postgres, and we're interested in hearing from you if you've been able to use it effectively. This will also help us gauge interest in non-Postgres systems, and whether we should implement support for other common RDBMSes like MySQL.