Performance

Funnel has several parameters that can be changed to tweak every bit of performance out of Funnel for the way that you use it.

The first one is --cacheWork. This should be used if you have enough memory available to fit the work areas in core. Without this option the work data will be written to a file on disk.

--cacheInput is the parameter that will read the entire input file into memory and keep it there until the end of the sort. That allows Funnel to perform a tag-sort without re-reading the input from disk. Again, this can take considerable memory to run.