a3nm's blog

nlsplit -- a small tool to split natural language text in natural chunks

— updated

Another one of those standard small tools which should exist but that I couldn't find... nlsplit is a tool to split natural language text in chunks at reasonable language boundaries. The program takes as argument a maximal size for chunks, reads stdin and produces chunks smaller than the maximal size on stdout.

If you want more info, see the README. You can also retrieve the C source directly.

comments welcome at a3nm<REMOVETHIS>@a3nm.net