Is MySQL's auto_increment really monotonic?

Why you shouldn't rely on auto_increment feature in some cases.

Author: Maciej Papież

Mon Apr 09 2018

MySQL

AUTO_INCREMENT

Kafka

transaction

TL;DR: If a race condition between two MySQL transactions appears, the row with ID = N may appear in the database BEFORE another row with ID < N.

The task

Last week, I’ve been asked to build a microservice that will be responsible for polling a table for data and pushing it to a Kafka topic. As the table contains immutable events, it’d wise to use some kind of incremental loading.

The tool

Confluent JDBC Connector (see more) is a ready-made add-on exploiting Kafka Connect possiblities, it simplifies building such a service to few lines of properties files and the data just flows… ;)

However, one needs to set the properties carefully, adjusting each of them to the particular use case. One of them is the mode, which defines how the table should be queried for new rows. By default, it adopts the incrementing strategy, which should be fine for most. At least, the docs don’t mention any kind of risks involved here.

Beware, it’s a trap! (At least with MySQL that I had to use here.)

The trap

We need three terminals here, let’s call them A, B and C.

In the terminal A, we create an empty table foo (a very simple one - autoincrement primary key + createdat timestamp + some text column value):

create table foo (
  id int not null auto_increment,
  value char(30) not null,
  created_at timestamp default current_timestamp,
  primary key (id)
  );

Query OK, 0 rows affected (0,08 sec)

select * from foo;

Empty set (0,00 sec)

OK, done - created and empty. Let’s use the second connection, terminal B, to start a new transaction and insert a row alfa (I’ll use the Alphabet to maintain a readable order).

start transaction;
insert into foo (value) values ('alfa');

Since we’re in the middle on an uncommited transaction (with defualt isolation level), alfa in not yet visible for connection A.

select * from foo;

Empty set (0,00 sec)

Let’s use the last connection, C, to make an insert of bravo row (with autocommit).

insert into foo (value) values ('bravo');

Query OK, 1 row affected (0,01 sec)

This should be visible from now on for all other connections, that’s what terminal A says:

select * from foo;

+----+-------+---------------------+
| id | value | created_at          |
+----+-------+---------------------+
|  2 | bravo | 2018-04-09 13:13:51 |
+----+-------+---------------------+
1 row in set (0,00 sec)

It’s there! As you can notice, it got assigned the ID = 2, because ID = 1 had been previously allocated for the alfa row. But hey, where is it?!

Uncommited! Hanging in the air!

Now we commit the connection B transation:

commit;

Query OK, 0 rows affected (0,02 sec)

…and run a query using connection A:

select * from foo;

+----+-------+---------------------+
| id | value | created_at          |
+----+-------+---------------------+
|  1 | alfa  | 2018-04-09 13:13:21 |
|  2 | bravo | 2018-04-09 13:13:51 |
+----+-------+---------------------+
2 rows in set (0,01 sec)

Now, both alfa and bravo rows are visible.

The issue

During the between-commit phase, the row bravo with ID = 2 was visible for other connections while the row alfa (ID = 1) was still uncommited, thus not visible for others.

This is why I claim that in edge cases, when a race condition between transaction occurs, the auto_increment breaks its monotonic property. This is due to the fact that the value is allocated on insert, not on commit.

For the microservice I’m developing, this is a serious threat. Consider that we query the db during this short between-commit phase:

we retrieve a single row, ID = 2,
we process this batch of one row,
we store the ID = 2 state for the next batch, telling it “you should add where ID > 2 to your query”.

We’ve just ommited the alfa row!

Solution ideas

Here’s a list of proposals that might work (or might not):

Exploit the created_at column to improve the situation? I don’t think it’ll help, but that’s one of the modes that Kafka JDBC connector can operate in.
Wait X seconds until all transactions with ID < N are commited (e.g. innodblockwait_timeout + 5 seconds).
Dump whole table instead of incremental loading.
Mark the already pushed rows with a flag, query with where flag = false.

In case of any questions, comments, don’t hestitate to reach out.

The task

The tool

The trap

The issue

Solution ideas

How to use AWS cognito without any library

My target language doesn't support any aws' library for cognito, what can I do now?

Grzegorz M. (@grzesjam), Maciej Papież (@maciejpapiez)

How to quickly remove merged branches

I've got dozens of merged branches in my bitbucket/github - how to remove them?

Maciej Papież (@maciejpapiez)

Time out-of-sync in AWS EC2

How to let network time protocol do its job

Maciej Papież (@maciejpapiez)

Naming Sass color variables

My approach to name color variables

Krzysztof Grziwok

How to write a Phing target autocomplete bash script

Improved solution with multiple imported XML build files support

Przemek Pawlas

Dev & prod ready Docker setup for SPA app

Simple and light env setup to run SPA apps on various configurations with Docker multi-stage build

Marcin Łesek (@marcinlesek)

.NET in MY browser?

Blazor - a WASM powered front-end framework

Krzysztof Miczkowski

JPA and UUID

And why @PrePersist is bad for your entity

Anna Skawińska

Complex command handler in JavaScript

Comparison of using Promise and async/await

Mariusz Bąk (malef)

Unresolved status check from Travis on Github Pull Request

When your repository couldn't get information from Travis about build status check and you stuck on unresolved PR

Marcin Łesek (@marcinlesek)

Quick import of MySQL database dump

How to import 3 GB database dump in under 30 seconds

Mariusz Bąk (malef)

Elasticsearch error - [script] unknown field [source], parser not found

Caused by Elastica and how to fix Travis?

Przemek Pawlas

Pull Request Templates on GitHub

How to improve the code review process and reduce f**-up rate significantly

Krzysztof Miemiec

PHP with MySQL 8

How to avoid authentication method error

Przemek Pawlas

Minio as S3 replacement in development and beyond

How to configure self-hosted S3 file storage with Docker and setup Symfony Flysystem

Dawid Śpiechowicz

Make JAXB great again!

Setup JAXB to generate a fluent builder API and use java.time classes

Maciej Papież

How to implement Redux in React

What is Redux and how we can combine it with our application?

Michał Rożenek

Async/await in Express routing

How to do it?

Dawid Rożenek