Skip to content

Whats the best way to process only selected pages from PDF #1283

Answered by JorjMcKie
AnweshGangula asked this question in Q&A
Discussion options

You must be logged in to vote

If you save to a new file, then obviously all page must be written. This is the same for all tools.

There are a few things you can do:

  1. Do not write to a new file, instead use incremental save to update the existing file. Obviously this means the old pages' contents are still physcally there, however no longer accessible. This is the fastest alternative.
  2. Play with the garbage collection level: garbage=4 is very, very time consuming, because it checks if there is any duplicate binary object content. That probably makes no sense at all in your case. Try garbage=3 or even garbage=1.
  3. You probably also need no page cleaning for every page: omit clean=True.

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@AnweshGangula
Comment options

@AnweshGangula
Comment options

@JorjMcKie
Comment options

@JorjMcKie
Comment options

Answer selected by AnweshGangula
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants