I have aquired several very large files. Specifically, CVSs of 100+ GB.

I want to search for text in these files faster than manually running grep.

To do this, I need to index the files right? Would something like Aleph be good for this? It seems like the right tool…

https://github.com/alephdata/aleph

Any other tools for doing this?

  • TiTeY`@jlai.lu
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    2 months ago

    If CSV entries are similar, you can try Opensearch or Elasticsearch. It’s great for plain text search (with Lucene)