Split natural language text into chunks at reasonable language boundaries
diff --git a/README b/README
@@ -88,3 +88,7 @@
 character. However, as long as you are using ASCII whitespace regularly
 enough, these splits should be favoured and that bad situation should
 not happen.
+nlsplit keeps whitespace at the beginning or at the end of chunks to
+avoid losing any information. Depending on your application, you might
+prefer to trim it.
+
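
The trimming mentioned in the added README paragraph could look roughly like the sketch below. It is only an illustration: the `chunks` list is made-up sample data standing in for nlsplit's output, and `trim_chunks` is a hypothetical helper, not part of nlsplit.

```python
def trim_chunks(chunks):
    """Strip leading/trailing whitespace and drop chunks that become empty.

    Hypothetical helper for illustration; not part of nlsplit.
    """
    stripped = (chunk.strip() for chunk in chunks)
    return [chunk for chunk in stripped if chunk]


# Made-up sample data standing in for chunks produced by nlsplit.
chunks = ["First sentence. ", "Second sentence.\n", " Third one."]

# Because whitespace is kept, joining the chunks gives back the input text.
original = "".join(chunks)

# Trimmed chunks are tidier to display, but can no longer be joined
# back into the exact original text.
trimmed = trim_chunks(chunks)
```

Trimming is lossy, so an application that needs to reassemble the exact input should keep the untrimmed chunks around.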