Gmail archive in 70GB mbox file; how to process for FT indexing? [message #1918] |
Fri, 14 March 2025 22:42 |
mma165
Messages: 6 Registered: March 2025
|
Junior Member |
|
|
Hi,
I'm wondering if anyone has tips on how to process a 70GB mbox file (an archive of gmail) for indexing with FT. I thought perhaps conversion to an sqlite db would work but that did not. Should it be split into individual messages via something like
cat mylistserve.mbox | formail -ds sh -c 'cat > msg.$FILENO'
?
I'm trying to avoid this if possible due to the vast number of files that would create. However I don't know what the intermediate approach between a single 70GB mbox and some number of smaller files would be.
-Malcolm
|
|
|