When I parse my UTF-16 or UTF-8 document, it crashes with
ArrayIndexOutOfBoundsException in LineBreakUtils
I tryed to apply the patch attached to the bug page, but I didn't manage to
recompile the whole thing with the latest JDK (installed ant and JDK), the
build fails with compilation errors.
If I convert the input file to ANSI then back to UTF-16 it works, but of
course I loose unicode characters. The strange thing is that I only have
commonly -used characters (like àèù) , not strange japanese-like