LMDB vs. LevelDB

This document compares the Lightning Memory-mapped Database (LMDB) key-value storage engine to the LevelDB key-value storage engine.

Features

Key Types

LMDB

Keys are arbitrary byte arrays and are stored in lexicographical order.

LevelDB

Keys are arbitrary byte arrays and are stored in lexicographical order by default. Users can also specify a custom comparator that changes the order.

Value Types

LMDB

Values are arbitrary byte arrays.

LevelDB

Values are arbitrary byte arrays.

Iteration

LMDB

LMDB supports iterating key/value pairs in lexicographical order. It also supports seeking to a key (or the first key >= a given key). And it supports iteration in reverse order.

LevelDB

LevelDB supports iterating key/value pairs in lexicographical order. It also supports seeking to a key (or presumably the first key >= a given key). And it supports iteration in reverse order (although “reverse iteration may be somewhat slower” per LevelDB > Iteration).

ACID Properties

Atomicity

LMDB

LMDB is transactional and guarantees the atomicity of transactions.

LevelDB

LevelDB isn’t transactional but does provide a WriteBatch API that writes a set of changes to the store atomically.

Consistency

LMDB

The meaning of consistency for a key-value store is ambiguous, but LMDB: Getting Started notes: “as long as a transaction is open, a consistent view of the database is kept alive.” In this case, the word is used to describe a view that is unaffected by subsequent write transactions.

LevelDB

The meaning of consistency for a key-value store is ambiguous, but LevelDB provides a Snapshot API that it describes as providing “consistent read-only views over the entire state of the key-value store.” In this case, the word is used to describe a view that is unaffected by subsequent write transactions.

Isolation

LMDB

Per its Introduction, LMDB is “fully thread-aware and supports concurrent read/write access from multiple processes and threads.” LMDB implements concurrency via a copy-on-write strategy (MVCC) and permits concurrent access by one writer and unlimited readers.

LevelDB

Per its Concurrency model, LevelDB supports synchronization across threads for some operations, although others require external synchronization. It does not support concurrent access by multiple processes.

Durability

LMDB

LMDB transactions are both durable and performant. LMDB uses shadow paging to improve the performance of write transactions, and Howard Chu notes in this Hacker News comment: “LMDB write performance remains uniform under load.”

LevelDB

Per Synchronous Writes, LevelDB sacrifices durability for performance, persisting changes asynchronously by default, although it’s possible to configure a write to be performed synchronously.

Durability may not be the highest priority for LevelDB, as its primary Google consumer, IndexedDB, is not guaranteed to be durable in browsers.

Quality Attributes

Performance

In a run of this benchmark, LMDB was approximately an order of magnitude faster than LevelDB to open a database and read entries, while being roughly equivalent to 3x faster to write entries (depending on the type of write), in this benchmark run on a macOS.

LMDB

LMDB read performance is excellent, and its write performance is reasonable and consistent.

LMDB reuses the space freed by deleted entries instead of compacting the store, which avoids the complexity and performance impact of compaction (trading off disk space in some situations).

LevelDB

LevelDB read performance is good, with this 2011 benchmark showing improvements over engines like SQLite.

LevelDB “compacts on open by design,” which can slow down opening a database and seeking to a key significantly, per #210.

Software Footprint

LMDB

The LMDB product page claims that LMDB comprises “32KB of object code,” and its Wikipedia entry claims that it’s 64KB in size. The sample Rust program that uses the lmdb crate in the mykmelez/kvbench repo is about 73kB larger than a control program.²

LevelDB

The LevelDB Wikipedia entry claims its “binary size” is 350kB. The sample Rust program that uses the leveldb crate in the mykmelez/kvbench repo is about 207kB larger than a control program.³

Reliability

LMDB

According to its Wikipedia entry, “LMDB was designed from the start to resist data loss in the face of system and application crashes. Its copy-on-write approach never overwrites currently-in-use data. [Which] means the structure on disk/storage is always valid, so application or system crashes can never leave the database in a corrupted state. In its default mode, at worst a crash can lose data from the last not-yet-committed write transaction. Even with all asynchronous modes enabled, it is only an OS catastrophic failure or hardware power-loss event rather than merely an application crash that could potentially result in any data corruption.”

LevelDB

According to its Wikipedia entry, “LevelDB is widely noted for being unreliable and databases it manages are prone to corruption. Academic studies of past versions of LevelDB have found that, under some file systems, the data stored in those versions of LevelDB might become inconsistent after a system crash or power failure. LevelDB corruption is so commonplace that corruption detection has to be built into applications that use it.”

However, a review of the LevelDB Issue 197 Workaround discussion and some of the issues containing the string “corrupt” in LevelDB’s issue tracker suggests that corruption issues are taken seriously and actively investigated.

Storage Footprint

In a run of the disk space “benchmark” in the mykmelez/kvbench repo, LMDB used roughly 1.5–4x more disk space than LevelDB to store equivalent amounts of data. However, that benchmark employs a hack to measure the space taken by the LMDB and LevelDB datastores on disk, and it isn’t clear that the hack is an accurate way to measure disk usage.

LMDB

LMDB doesn’t compress data on disk, and it doesn’t compact datastores, but it does reuse the space freed by deleted entries.

LevelDB

LevelDB compresses data on disk using Google’s Snappy compression library, and it compacts datastores.

References

LMDB: The Leveldb Killer?

Is LMDB a LevelDB Killer?

Understanding LMDB Database File Sizes and Memory Utilization

LevelDB “impl” document

Notes

Footnote 1

¹See Wikipedia’s Reliability of Wikipedia article for much discussion about the reliability of Wikipedia articles generally. This document uses the Wikipedia articles on LMDB and LevelDB as a source—but not the sole source—of information about the two storage engines.

Footnote 2

²490,880 bytes for the LMDB program versus 417,804 bytes for the control program on a macOS system is an 73,076 byte difference. Both programs built using stable Rust v1.30.1 with the “release” profile and then stripped on December 18, 2018.

Footnote 3

³624,900 bytes for the LevelDB program versus 417,804 bytes for the control program on a macOS system is a 207,096 byte difference. Both programs built using stable Rust v1.30.1 with the “release” profile and then stripped on December 18, 2018.

LMDB vs. LevelDB

Meta

Project Structure

LMDB

LevelDB

License

LMDB

LevelDB

Language

LMDB

LevelDB

Rust Bindings

LMDB

LevelDB

Docs

LMDB

LevelDB

Features

Key Types

LMDB

LevelDB

Value Types

LMDB

LevelDB

Iteration

LMDB

LevelDB

ACID Properties

Atomicity

LMDB

LevelDB

Consistency

LMDB

LevelDB

Isolation

LMDB

LevelDB

Durability

LMDB

LevelDB

Quality Attributes

Performance

LMDB

LevelDB

Software Footprint

LMDB

LevelDB

Reliability

LMDB

LevelDB

Storage Footprint

LMDB

LevelDB

References

Notes

Footnote 1

Footnote 2

Footnote 3