I bought "the girl on the dock" PDF and want to convert it to Mobi format.
There's a few issues with that. Every page has an image background for a line at the top and bottom, some pages have a separator graphic as part of the background and some pages are just an illustration with a caption.
I want to keep the illustrations and lose the rest.
I tried running it through Calibre and it looks like every single line in the PDF is a paragraph. The output has everything double-spaced and broken sentences.
Then there's the page numbers to get rid of and the author's name and book title alternating at the top of every page. (That at the page tops has always annoyed me, even with dead tree books. I'm not likely to forget what book I'm reading or who wrote it while I'm reading it.)
Finally, no table of contents. That should be the easiest thing to do. There's only 5 chapters. Might not even bother with adding one.
Is there an easy way to delete the first and last lines of every page (to remove the page numbers and the author name and book title) then remove all paragraph marks except where there's indents or a line begins with a " mark, which are standalone lines of dialog? Also need to delete all carriage returns except at the ends of each paragraph so the text can flow with different screen or font sizes.
The PDF could be a case study in "How to format a PDF in order to make it as difficult as possible to convert to another format." I suppose it'd work decently on a large tablet or reader but not on a 4.3" Android phone screen.
There's a few issues with that. Every page has an image background for a line at the top and bottom, some pages have a separator graphic as part of the background and some pages are just an illustration with a caption.
I want to keep the illustrations and lose the rest.
I tried running it through Calibre and it looks like every single line in the PDF is a paragraph. The output has everything double-spaced and broken sentences.
Then there's the page numbers to get rid of and the author's name and book title alternating at the top of every page. (That at the page tops has always annoyed me, even with dead tree books. I'm not likely to forget what book I'm reading or who wrote it while I'm reading it.)
Finally, no table of contents. That should be the easiest thing to do. There's only 5 chapters. Might not even bother with adding one.
Is there an easy way to delete the first and last lines of every page (to remove the page numbers and the author name and book title) then remove all paragraph marks except where there's indents or a line begins with a " mark, which are standalone lines of dialog? Also need to delete all carriage returns except at the ends of each paragraph so the text can flow with different screen or font sizes.
The PDF could be a case study in "How to format a PDF in order to make it as difficult as possible to convert to another format." I suppose it'd work decently on a large tablet or reader but not on a 4.3" Android phone screen.