Channel: MobileRead Forums - Kindle Formats
Viewing all articles
Browse latest Browse all 862

kindlegen/mobi dictionary inflections limit?

Hey!

I'm trying to create a Kindle dictionary for a language with lots of inflections (~30 inflected forms per headword).

Everything works fine with a subset of the data, but once I get to around 1.9M inflected forms the inflection index fails to build (the headword index still builds fine).
I've also built the whole dataset in separate parts, and the inflection index built successfully for each part, so I believe my data is well-formed.
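
For anyone who wants to reproduce the split test, here is a minimal sketch of one way to cut a dataset into parts that each stay under a chosen number of inflected forms. It is not the exact script used here: the `(headword, forms)` entry layout and the function name `split_by_inflection_count` are assumptions made for illustration, and each part would still have to be written out as its own dictionary source and run through kindlegen separately.

```python
from typing import Iterator, List, Sequence, Tuple

# Hypothetical in-memory layout: one (headword, inflected_forms) pair per entry.
Entry = Tuple[str, List[str]]


def split_by_inflection_count(
    entries: Sequence[Entry], max_forms: int
) -> Iterator[List[Entry]]:
    """Yield consecutive parts whose total inflected-form count is <= max_forms.

    An entry is never split across parts, so a single entry with more than
    max_forms forms ends up alone in an oversized part.
    """
    part: List[Entry] = []
    count = 0
    for headword, forms in entries:
        # Start a new part if adding this entry would exceed the budget.
        if part and count + len(forms) > max_forms:
            yield part
            part, count = [], 0
        part.append((headword, forms))
        count += len(forms)
    if part:
        yield part


if __name__ == "__main__":
    sample = [
        ("a", ["a1", "a2"]),
        ("b", ["b1", "b2", "b3"]),
        ("c", ["c1"]),
    ]
    for i, part in enumerate(split_by_inflection_count(sample, max_forms=5)):
        total = sum(len(forms) for _, forms in part)
        print(f"part {i}: {len(part)} headwords, {total} inflected forms")
```

Halving `max_forms` until a part builds is a quick way to narrow down whether the failure depends on total volume or on specific entries.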

Code:

Info(prcgen/inflections):I1012: Compiling inflections
Info(prcgen/inflections):I3001: Number of new <idx:infl> inflection rules:  0103538
Info(prcgen/inflections):I3002: Of which rules used only once or twice:    0093970
Info(prcgen/inflections):I3003: Number of inflection rule groups:          0065714
Info(prcgen):I1019: Building index into record  0000000 Index name:
Info(prcgen):I1025: Unicode index using optimized single-byte utf-16 encoding
Info(prcgen):I1029: The default lookup index is: ""
Info(prcgen):I1020: Building inflexions into record  0000026
Error(index build):E25002: : single entry exceeds record size (max=64k): aborting index build.
Info(prcgen):I1026: Unicode index using utf-8 encoding
Info(prcgen):I1021: Building naming index into record  0000026

(This is with the kindlegen 2.9 Linux build, but it also failed with the kindlegen bundled with the up-to-date Kindle Previewer.)

I couldn't find any information on this error (E25002) or any discussions of people having similar issues with the inflection index build. Is there a hard cap on the number of inflections, or is there a way to get this to build?

Thank you!
