Datamule Cloud V3 Reconciliation

Sometimes things break. Reconciliation is the name for datamule's lambda worker that notices issues, then fixes them on a daily trigger.

Schedule

Inventories

How it works:

1. Lists all YYYY-MM-DD prefixes in the bucket 2. We process about ~100 prefixes at a time. For each prefix, begins sequential list operations. 3. Uploads the inventory as a .txt.zst file

1993-02-11/999999999704035713.sgml.zst
1993-02-12/999999999705015654.sgml.zst
1993-02-24/999999999705037760.sgml.zst
1993-02-26/999999999705050433.sgml.zst
1993-04-28/999999999705050434.sgml.zst

> Note: I was able to achieve ~4x throughput through switching from python to rust.

Metadata Updater

Reconciliation