nlsplit

split natural language text in chunks at reasonable language boundaries
git clone https://a3nm.net/git/nlsplit/
Log | Files | Refs | README

commit 4a3fa099c3272691f13f096d60de56fa9357da69
parent 27953c203db72f7ddba2d2718668c41b9504f66e
Author: Antoine Amarilli <a3nm@a3nm.net>
Date:   Mon, 10 Oct 2011 00:19:04 +0200

fix readme

Diffstat:
README | 3++-
1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/README b/README @@ -26,7 +26,8 @@ SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. nlsplit is a tool to split natural language text in chunks at reasonable language boundaries. The program takes as argument a maximal size for -chunks, reads stdin and produces chunks smaller than the maximal size. +chunks, reads stdin and produces chunks smaller than the maximal size on +stdout. The general NLP problem of text splitting is AI-hard, and optimizing to keep chunks close to the specified size would be NP-complete. The text