the manifesto

sync, done
right.

Name: outl
Author: Avelino

The pitch in one sentence: outl is the only outliner whose sync is provably correct, doesn't need a server, and doesn't pollute your markdown to do it.

a long read · 7 acts want the algorithm with code? →

I

act one

where roam and logseq fail.

Both got the outliner UX right. Both fall apart on sync.

roam research

sync as a service

Roam keeps every workspace in a central database on their servers. Real-time sync is great when it works. The cost:

Your data lives on their machines. Export is JSON; the moment Roam decides to throttle, raise prices, or shut down, your notes are stranded.
No offline merge. Two devices edit the same block while disconnected? The one that connects last wins, the other one's changes silently vanish. No conflict surfaced, no merge prompt, no history of what was lost.
No interop. You can't open a Roam graph in another editor. There's no .md on disk to inspect.

Roam was an inspiration for what an outliner feels like. It is not an example of how to store your thinking.

logseq

files on disk, but the merge is hopeful

Logseq fixed the "where do my files live" problem: it writes markdown. Then it broke the markdown:

- ## My block
  id:: 6601a2c1-4f31-4a45-1c2c-3a5e6b7d8f90
  - child block
    id:: 6601a2c1-...

Every block gets a UUID written into the file. Open it in VS Code, Obsidian, or cat, and it's full of metadata. Worse:

Sync is a paid Pro tier. And it's a file-rsync flavor — there is no merge algorithm. When two devices write the same file, the newer one wins. Same loss as Roam, just with extra steps.
DB version split the community. Logseq's pivot to a database backend left the file-based users behind and shipped half-broken for over a year.
Mobile is a known-bad experience. Years of users asking for parity.

Logseq pointed at the right idea — files on disk — and stopped halfway.

plain git

the merge destroys structure

If files are markdown and you want sync, why not just git?

$ git pull --rebase
CONFLICT (content): Merge conflict in pages/Avelino.md

Git treats the file as a sequence of lines. When two people re-arrange the outline, the lines line up wrong, the merge marker splits a block in half, and you spend an hour resolving conflicts by hand. Every move operation in a tree of nested bullets becomes a textual war.

Try it once. You'll never do it twice.

II

act two

what outl does instead.

The core idea is two layers.

on disk

• pages/foo.md · clean markdown

• pages/foo.outl · block IDs (JSON sidecar)

• ops/ops-<device>.jsonl · op log (JSONL, one file per device)

in memory

● op log · the truth

↓ materialize

• tree CRDT · projection

→ 1.

The op log is the source of truth.

Every change — moving a block, editing its text, setting a property, deleting — is recorded as a LogOp with a Hybrid Logical Clock timestamp. The list of ops, sorted by HLC, deterministically produces the tree.

→ 2.

The materialized tree and the .md are projections.

Both can be thrown away. If your sidecar is lost, outl doctor regenerates it from the op log. If your .md is deleted, the op log still has every block.

→ 3.

Markdown on disk is clean.

No id::, no HTML comments, no YAML frontmatter delimiters. Block IDs live in .foo.outl (a JSON dotfile). When you edit pages/foo.md externally, outl's 3-level matching algorithm reconstructs which block had which ID.

the pieces that make this work

Tree CRDT

Kleppmann et al. 2022

Every device applies ops in HLC order, undoes/replays late arrivals, and provably converges to the same tree.

HLC timestamps

Total order across devices without coordination. Wall clock + logical counter + actor ID.

Yrs (Yjs in Rust)

Character-level CRDT for the text inside a block. Concurrent edits to the same sentence merge cleanly.

Fractional indexing

Sibling order as a sortable string. Inserting between two positions doesn't renumber anyone.

Slugified filenames

[[Avelino]] resolves to pages/avelino.md with title:: Avelino set automatically. Display name stays human; the filename is stable.

III

act three

the hard case
they all lose.

Two devices, offline, both move the same block.

initial · both devices

•ROOT

•X

•A ← we'll move this

•Y

•B ← and this

device 1 moves A under B

•ROOT

•X (empty)

•Y

•B

● A

device 2 moves B under A

•ROOT

•X

•A

● B

•Y (empty)

Both edits are sensible locally. Now they sync.

Roam

has no story — last write wins by wall-clock time.

Logseq sync

rsyncs the files; one device's edit replaces the other's. Information lost.

Git merge

sees two changed .md files, gives you a conflict with <<<<<<< markers across nested bullets. You spend the next hour repairing your outline.

outl

both devices receive both ops via P2P sync. Each sorts the two ops by HLC. The earlier one applies normally. The later would close a cycle — the algorithm detects this as a deterministic no-op on the tree, but the op stays in the log. Both devices end up with the same final tree. Zero data loss.

The op that became a no-op isn't discarded: if a future op breaks the loop (someone moves a third block out), the algorithm can replay history and find that the no-op move is now valid. The system never forgets what you intended.

this worked example is implemented as cycle.rs in outl-core. every change to the algorithm has to pass it.

IV

act four

what "do it right"
actually means.

It's worth being specific. The algorithm in outl provides these five formal guarantees — each backed by a property test in crates/outl-core/tests/.

01

Strong eventual consistency

Two devices that have observed the same set of ops produce exactly the same tree, regardless of delivery order or duplication.

✓ tested in convergence.rs

three replicas apply 100+ ops in three different permutations and the resulting trees are byte-identical.

02

Commutativity after reordering

The order in which a replica receives ops doesn't matter. The algorithm undoes newer ops, applies the late arrival in HLC position, then replays the undone ones.

✓ tested in convergence.rs

user-visible state is the same as if everything had arrived in HLC order from the start.

03

Idempotency

Applying the same op N times is the same as applying it once. You can re-sync a workspace that's already in sync and nothing changes.

✓ tested in idempotency.rs

04

Tree invariant preservation

The materialized tree is always a valid tree. No node ever has two parents. No cycle ever forms. Every node is reachable from ROOT or the soft-delete bucket TRASH_ROOT.

✓ tested in cycle.rs + cycle_chain.rs

05

No silent loss

Every op delivered to apply_op ends up in the log. Including the ones turned into no-ops by cycle detection. Nothing is ever silently dropped.

✓ tested in in every test

this is why outl can offer time-travel later — the entire premise of the ChronDB backend (issue #1).

The first four are properties Roam and Logseq can't even claim. The fifth is why outl can offer time-travel later.

V

act five

why not
automerge?

Automerge is a great general-purpose CRDT. Why didn't we use it?

Tree CRDT specifically.

Automerge has tree support but it's experimental, and we'd need to bolt on the move-with-cycle logic ourselves. Better to implement Kleppmann's algorithm directly — it fits in ~300 lines of Rust and we control the entire on-disk format.

Domain semantics.

Our Op enum talks about Move(node, new_parent, position) and SetProp(node, key, value). Automerge is generic — every operation goes through a JSON-patch-like API. Specialization makes error messages and tests dramatically clearer.

Storage control.

We own the JSONL op log (one append-only file per device), the shape of each op, and the bytes that go on the wire. With Automerge we'd be locked into their binary format forever.

The cost: we're on the hook for correctness. That's why the test battery is huge and the coverage target on the four critical functions (do_op, undo_op, apply_op, creates_cycle) is 100% — no exceptions.

VI

act six

what sync
looks like over the wire.

P2P sync ships today (in beta). The algorithm runs over a real network transport: iroh is the wire, and it's the default. Your devices talk straight to each other — no account, no cloud, no third-party server that holds or reads your notes. (iroh's relays only forward already-encrypted bytes for NAT traversal; they never store or decrypt anything.)

→ QUIC + automatic hole punching. No central server. No STUN/TURN unless your network is genuinely awful.
→ Discovery via shareable ticket. outl peer pair prints a string, the other device runs outl peer pair <ticket>, both are in the same swarm. Paired peers live in .outl/peers.json.
→ Incremental ops over a vector clock. Each replica keeps a last_ts_per_actor. The sync protocol sends only the ops the other peer hasn't seen.
● E2E encrypted by default. Your notes never leave the devices you own.

The algorithm was designed for this from day one — it handles ops arriving in any order, any number of times, with any delay. The network is just plumbing.

VII

act seven

honest
trade-offs.

Be skeptical of any sync story that claims zero compromises. Here are ours.

One move wins per concurrent pair.

If two people move the same block to different parents at the same time, exactly one move materializes. The other goes into the log but doesn't take effect. Pretending both succeed would lose information — that's Logseq's mistake.

Text-level undo through Yrs is partial.

Block text is a Yrs document. Yrs guarantees character-level convergence, but reversing a single Edit op via undo_op may not produce the exact pre-edit string if other edits interleaved. The string still converges; only local undo semantics weaken.

Conflict surfacing is silent.

Today outl resolves and moves on. A future feature could pop up "concurrent edits on this block" the way Notion does. Not yet.

No causal delivery enforcement.

HLC is total order, not causal. In practice this is fine — apply_op handles any delivery order — but we don't promise vector-clock semantics.

going deeper

Tree CRDT walkthrough

→

the algorithm with code, worked examples, and the full invariant list.

Markdown dialect + matching

→

how external edits get reconciled with the sidecar.

Storage trait

→

why Storage is a trait and how the ChronDB backend slots in.

Original paper (Kleppmann 2022)

→

"A highly-available move operation for replicated trees." IEEE TPDS 2022.

← back to outl install outl →

and one more thing

Sync that doesn't lose data is the prerequisite for letting an LLM read and write your notes. outl ships an MCP server so Claude Desktop, Cursor, Zed and ChatGPT use the same workspace as a local-first second brain — same correctness guarantees, same files.

sync, done right.

where roam and logseq fail.

sync as a service

files on disk, but the merge is hopeful

the merge destroys structure

what outl does instead.

The op log is the source of truth.

The materialized tree and the .md are projections.

Markdown on disk is clean.

the hard casethey all lose.

what "do it right" actually means.

Strong eventual consistency

Commutativity after reordering

Idempotency

Tree invariant preservation

No silent loss

why notautomerge?

Tree CRDT specifically.

Domain semantics.

Storage control.

what sync looks like over the wire.

honest trade-offs.

One move wins per concurrent pair.

Text-level undo through Yrs is partial.

Conflict surfacing is silent.

No causal delivery enforcement.

sync, done
right.

the hard case
they all lose.

what "do it right"
actually means.

why not
automerge?

what sync
looks like over the wire.

honest
trade-offs.