The index is the least of the issues with the SQLite implementation. That version is calling one INSERT per record, so each statement runs in its own implicit transaction, and the benchmark is spending something like 99.8% of its time opening and committing transactions as it sets up the database.
Fixing that on my machine took the sqlite3_file_open benchmark from 16.910 seconds to 1.033 seconds. Adding the index brought it down to 0.040 seconds.
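For anyone curious what "fixing that" looks like, here's a minimal sketch using Python's sqlite3 module (table name and row data are made up for illustration): wrap the whole batch in one explicit transaction so the database does a single commit instead of one per row.

```python
import sqlite3

# Hypothetical schema standing in for the benchmark's table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE files (path TEXT, size INTEGER)")

rows = [(f"/tmp/file{i}", i) for i in range(10_000)]

# One transaction for the entire batch: a single commit/fsync
# instead of one per INSERT, which is where the slow version
# spends almost all of its time.
with conn:  # commits on success, rolls back on error
    conn.executemany("INSERT INTO files VALUES (?, ?)", rows)

count = conn.execute("SELECT COUNT(*) FROM files").fetchone()[0]
print(count)  # 10000
```

Note that Python's sqlite3 already groups an `executemany` into one transaction by default; the pathological case is calling `execute` plus an implicit commit once per row.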
Also, I've never really dug into what's going on, but the dbm implementation is pretty slow on Windows, at least when I've tried to use it.
I was working on a project to insert a billion rows into SQLite in under a minute, and batching the inserts made it crazy fast compared to committing each insert individually.
It uses a text file to store keys and pointers into a binary file for values. It works... most of the time... but yeah, that's not going to win any speed awards.
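You can see that two-file layout directly if you poke at `dbm.dumb` in the standard library; a quick sketch (file names here are just for the demo):

```python
import dbm.dumb
import os
import tempfile

# dbm.dumb keeps keys and offsets in a plain-text ".dir" index file
# and the actual values in a binary ".dat" file.
tmp = tempfile.mkdtemp()
path = os.path.join(tmp, "demo")

db = dbm.dumb.open(path, "c")
db[b"key"] = b"value"
db.close()

# Both halves of the store exist on disk after closing.
print(os.path.exists(path + ".dir"), os.path.exists(path + ".dat"))
# True True
```

Every write updates the index, and there's no clever on-disk structure behind it, which is consistent with it being slow.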
Sure, mutating data sets might be a useful use case. But inserting thousands of items one at a time in a tight loop, then asking for all of them back, is testing an unusual access pattern in my opinion.
My point was that we're comparing apples and oranges. By default, I think, Python's dbm implementation doesn't do any sort of transaction or even a sync after each insert, whereas SQLite makes a pretty hefty atomicity guarantee after every INSERT, so they're quite different operations.