Buy me a coffee
PDF Guide

Split a PDF to Meet Upload Limits (While Keeping Pages in the Right Order)

How to split large PDFs into correctly named parts, preserve page order, and avoid common portal validation issues.

How to split large PDFs into correctly named parts, preserve page order, and avoid common portal validation issues. This guide focuses on real-world constraints—upload validators, email gateways, LMS portals, and mobile workflows—so you can reduce size without accidental quality loss. You will also learn why “just re-saving” often makes files bigger, how to choose a safe target size, and how to verify the final output before you send it.

Compression done well is not a mystery feature; it is basic file engineering. The goal is to keep what matters (readable text, sharp lines, correct page order, valid forms) while reducing what does not (excess resolution, duplicated assets, embedded previews, unused resources and unnecessary metadata). If you control those layers, you can consistently pass strict limits (500 KB, 2 MB, 10 MB) without last-minute panic.

This article is intentionally both technical and practical. Where relevant, it explains the “why” (resolution, encoding overhead, vector vs raster) and then gives “do this” steps you can execute quickly using browser-based tools.

Quick outcome

If you are in a hurry, follow this minimal workflow: (1) remove unnecessary pages and duplicates, (2) compress images first, (3) export a clean copy, and (4) run one final optimisation pass. In most cases, you will achieve a 40–80% reduction without visible quality loss. If the limit is extremely low, use a combination of moderate compression and splitting rather than forcing aggressive settings that create blur or destroy signatures.

Minimum checklist:
  • Confirm the platform limit (MB/KB) and accepted formats.
  • Remove unnecessary pages, duplicates, and heavy media first.
  • Optimise images (resolution + quality) before final export.
  • Export once to a clean file, then run one optimisation pass.
  • Preview at 100% zoom and run a test upload or send.

Why files become “too large” (and why compression fails)

The largest contributor is usually embedded raster imagery: photos, scanned pages, screenshots, or exported charts. A single modern phone photo can be several megabytes as JPG; the same image as PNG can be significantly larger. When you place these assets into a PDF or Office file, you are effectively packaging those megabytes into a container. The container itself adds overhead (fonts, object tables, previews), so the final file becomes even bigger.

Compression “fails” when the approach targets the wrong layer. Repeated exports can add incremental-save layers and inflate size even if the visible content is unchanged. Some editors embed multiple versions of the same images. Others store thumbnails for every slide or page. Converting between formats (PDF ⇄ image ⇄ PDF) can also destroy vector text and turn crisp letters into pixels. Once text becomes pixels, you need higher resolution to keep it readable, which increases file size again.

Finally, many platforms enforce rules beyond file size: encryption, unsupported colour spaces, strict PDF validators, malformed forms, or non-standard encodings. In those scenarios, “make it smaller” is not enough—you must also produce a standards-compliant file that the receiving system can parse.

Choose a realistic target size

Pick a target based on the delivery channel. Email gateways often reject attachments in the 20–25 MB range, but your practical limit is lower because base64 encoding adds roughly 30–35% overhead in transit. Portals and LMS systems vary widely: some accept 10–20 MB, others enforce 2 MB, and some government forms require 500 KB–1 MB. If your upload fails, note whether the message is a hard limit (“max size exceeded”) or a validator issue (“file cannot be processed”). That distinction tells you whether to compress further or to re-export cleanly.

For text-heavy PDFs, a well-optimised file often lands around 100–300 KB per page without readability loss. For scanned pages (each page is a photo), expect higher. If the limit is extremely low, plan to reduce scan DPI (often 150–200 DPI is acceptable for screen reading) and use grayscale when colour is not required. Always test: different portals render and validate differently.

Step-by-step workflow

1) Prepare the source (before you compress)

Start with source hygiene. Remove blank pages, duplicates, and unnecessary appendices. Crop screenshots to the relevant area. For scanned content, straighten the page, remove background noise, and ensure text contrast is good. Clean input compresses better because encoders can represent consistent edges more efficiently. If you have access to the original digital file (for example a Word document or a digital PDF), use it instead of a scan whenever possible.

2) Use the right tool for the job

For PDFs, preserve vector text and compress images selectively. If you need to split a document by size, do that after moderate compression so each part remains readable. If you need to merge documents, merge first and then optimise once, rather than optimising each file repeatedly (which can multiply artifacts and sometimes increases size).

Compress PDF tools

Open the relevant tool and apply a controlled optimisation pass.

Open

Split PDF

Open the relevant tool and apply a controlled optimisation pass.

Open

Merge PDF

Open the relevant tool and apply a controlled optimisation pass.

Open

Reorder pages

Open the relevant tool and apply a controlled optimisation pass.

Open

Delete pages

Open the relevant tool and apply a controlled optimisation pass.

Open

3) Apply compression in one controlled pass

Avoid “trial and error” exports across multiple apps. Each export can re-encode images and bloat the file. Choose one path: compress heavy assets first, export a clean file, and verify. If you must iterate, iterate in small steps (for example reducing JPG quality from 85 → 75) and stop as soon as the file passes the limit with acceptable readability. This approach keeps you in control and reduces the risk of irreversible quality loss.

4) Verify like a professional

Do not rely on file size alone. Open the file at 100% zoom and inspect small text, signatures, stamps, line art, and QR codes. Scroll quickly to check for rendering glitches. If you are submitting to a strict portal, ensure the file is not password-protected and does not contain interactive features the portal rejects. When possible, run a final “standard export” to remove exotic features and improve compatibility.

Platform workflows: mobile, Windows, and macOS

Mobile (iPhone / Android)

On mobile, the two biggest mistakes are (1) uploading a high-resolution scan directly and (2) using a messaging app that re-encodes the file unpredictably. If you must work from a phone, prioritise a browser-based compression tool and keep your workflow simple: open the PDF, compress once, and download the optimised version. If your “PDF” is actually a set of photos, convert them into a PDF at A4 size and then compress; that often creates a smaller, more structured document than attaching raw images.

When photographing documents, use good light, keep the camera perpendicular, and avoid shadows. Many scanning apps default to very high resolution; reduce the scan setting if your target is an upload portal, not archival storage. If you need searchability, use OCR, but do it after the image quality is acceptable—OCR accuracy depends heavily on clean input.

Windows

On Windows, the common source of bloat is Office files exported to PDF with “high quality print” settings, plus embedded images that are far larger than needed. Export with standard settings, then run a single optimisation pass. If you are dealing with scanned PDFs, consider a grayscale conversion and a controlled DPI. Avoid repeatedly printing to PDF through multiple virtual printers because those often rasterise pages and inflate size.

macOS

On macOS, Preview is convenient but can add overhead if you re-save repeatedly. If you edit a PDF, try to finish your changes in one session and then create a clean export. If a file becomes larger after editing, it often contains extra resources or incremental updates. A final optimisation pass usually fixes this. For images, use controlled exports rather than dragging and dropping between apps, which can create unnecessary format conversions.

Worked example: reducing a large file without sacrificing quality

Here is a realistic scenario: you have a pdf document that is 18 MB, and the portal limit is 2 MB. The document contains 12 pages, with 6 pages being text and 6 pages being scans or screenshots. If you apply aggressive compression globally, text may become blurry and signatures may fail verification. Instead, apply a layered approach:

  1. Identify the heavy pages. If your PDF viewer shows page thumbnails slowly, those pages likely contain large images. Screenshots pasted at full resolution are common culprits.
  2. Reduce image resolution intentionally. For typical on-screen review, 150–200 DPI is often sufficient. If the page is a photo, downscale it before embedding.
  3. Keep real text as text. Avoid converting digital text to an image layer. Vector text compresses well and stays sharp at zoom.
  4. Export once, then optimise once. Multiple exports can multiply artifacts and sometimes increase size due to duplicate resources.
  5. Verify critical details. Zoom to 100% and check names, ID numbers, signatures, and stamps. If any section is blurred, increase quality slightly for those pages rather than increasing quality across the whole file.

Using this approach, it is common to reduce an 18 MB mixed-content document to 1.5–2.5 MB without visible issues. If the portal requires under 1 MB, splitting into two parts may preserve quality better than forcing an ultra-low quality setting. The key insight is that you are not compressing a single “thing”; you are managing a bundle of elements with different tolerances.

Technical deep dive (clear, but not oversimplified)

Vector vs raster: why text can stay sharp

In PDFs, text and lines are often stored as vector instructions: “draw this glyph here with this font”. That representation is compact and scales cleanly. When you export a document by “printing” it through the wrong pipeline, those instructions can be flattened into raster images. The result is visually similar at first glance, but it becomes blurry when zoomed, and the file size grows because every page is now a photo.

Image compression knobs that actually matter

For raster images, there are three main controls: (1) dimensions (pixel width/height), (2) colour depth (colour vs grayscale), and (3) compression quality. Reducing dimensions is often the most effective because it reduces the raw data before encoding. Grayscale reduces channels. Quality controls trade detail for size; used moderately, it can cut size significantly with minimal visible impact. Used aggressively, it produces artifacts that users describe as “blurry” or “blocky”.

Why email limits feel lower than they should

Email systems encode binary attachments using base64, which increases size in transit. That is why a “25 MB limit” can reject a 20 MB attachment depending on server rules and message headers. If you routinely send large attachments, plan for a smaller working limit and use link-based delivery when appropriate.

Validators and standards

Government and enterprise portals often use strict validators that reject encrypted files, PDFs with unusual features, or documents that contain embedded scripts. When you hit a “cannot process” error, export to a clean standard PDF, remove passwords, and avoid embedded multimedia. If the document includes forms, some portals require a flattened version. Always test a sample upload early, not at the deadline.

Quality control: what to protect and what can be compressed

Different elements tolerate different compression levels. Photographs can often be compressed with a mild-to-moderate JPG setting without noticeable loss at typical viewing sizes. In contrast, screenshots, diagrams, and small text are extremely sensitive: aggressive JPG creates ringing and blur around edges. For those assets, prefer higher quality settings, and use PNG only when transparency or sharp edges are essential and the size budget allows.

For scanned documents, control resolution and colour depth. Colour scans produce larger files because three channels must be encoded; switching to grayscale can reduce size materially. If the document is primarily text, a 150–200 DPI scan is usually adequate for reading on screen. Reserve 300 DPI for documents that must be printed or contain small stamps and microtext. This single decision often determines whether you can meet a strict limit without ruining readability.

Whenever possible, keep text as vector. Vector text scales without blur and compresses efficiently. Converting PDFs to images and back should be a last resort because it turns crisp text into pixels.

Common mistakes that waste hours

  • Over-compressing too early: once you destroy text clarity, you cannot recover it without going back to the source.
  • Using PNG for photos: PNG is excellent for transparency and sharp edges, but it is usually larger for photographs.
  • Multiple exports across apps: repeated re-encoding often increases size and introduces artifacts.
  • Ignoring hidden bloat: metadata, duplicate fonts, embedded previews, and unused resources add weight.
  • No verification step: the file “looks fine” until the portal rejects it or a reviewer cannot read the text.
  • Chasing a round number: targeting exactly 1.00 MB can lead to unnecessary quality loss; target a safe margin below the limit and stop.

Troubleshooting

ProblemLikely causeFix
File is still too bigImages are high-resolution or duplicatedDownscale to minimum effective DPI; remove duplicates; compress in one controlled pass
Text became blurryVector text rasterised or over-compressed imagesExport from the original source; keep text as text; increase quality slightly and retry
Portal rejects the fileEncryption, strict validator, non-standard featuresRemove passwords; re-export to standard PDF; avoid uncommon colour profiles and embedded media
Email bouncesAttachment limit plus base64 overheadTarget a smaller file; send via link; split into parts if permitted
Images look “washed”Colour profile stripped or downsampled too farUse moderate compression; avoid repeated exports; confirm colour settings on export
Search no longer worksText layer removed or flattened to imageUse a workflow that preserves text; consider OCR for scanned documents

Advanced techniques (when you need “as small as possible”)

Optimise images before embedding

The best compression happens before content is wrapped into a PDF or Office file. Resize images to the actual display size. Avoid embedding 4000–6000 pixel originals if the layout shows them at 1200 pixels. This is a frequent cause of “mysterious” file size bloat and is also one of the easiest fixes.

Split strategically instead of destroying quality

If you must hit a very low per-file limit, splitting often preserves quality better than aggressive compression. Keep naming consistent (Part‑1, Part‑2) and preserve page order. If the portal requires a single file, remove optional pages rather than over-compressing essential pages. A clean submission is worth more than an unreadable one.

Privacy and compliance checks

Before submitting official documents, remove hidden data you do not intend to share (author names, software versions, location tags, hidden layers). For sensitive PDFs, use true redaction and verify the underlying text cannot be selected or copied. This is both a privacy best practice and a professional habit for client work.

Glossary (fast definitions)

  • DPI: Dots per inch; practical resolution for scans and print. Higher DPI increases file size.
  • Raster: Pixel-based images (photos, scans). Compression affects sharpness.
  • Vector: Instruction-based graphics (text, lines). Scales cleanly and often compresses well.
  • Base64 overhead: Email encoding that makes attachments larger in transit.
  • Flattening: Converting interactive layers (forms, transparency) into a static output for compatibility.
  • Metadata: Hidden information such as author name, software, timestamps, and sometimes location.
  • Validator: Automated system that checks file structure and compliance, not only size.

Monetisation-ready (and future AdSense-ready) layout

You can monetise outside AdSense while keeping the site suitable for AdSense later by focusing on value-first content, clear disclosures, and non-intrusive placements. Practical options include affiliate links (software, storage, productivity), sponsorship blocks, and service CTAs. Keep the page fast, avoid misleading clicks, and keep promotions clearly separated from your tool buttons.

Example placement:
Sponsor / affiliate placeholder (integrate later).

Disclosure: If you add affiliate links, label them clearly and use rel="sponsored".

FAQ

What file size should I aim for?

Aim for the smallest size that passes the platform check while remaining readable. For most portals, 1–10 MB is typical; for email, 5–20 MB is common; and for forms with strict limits, 500 KB–2 MB may be required.

Will compression make my document blurry?

It can, if the tool downscales images too aggressively or applies heavy JPEG compression. Use moderate settings first, keep text as vector when possible, and preview at 100% zoom before submitting.

Why does my PDF get bigger after I edit it?

Many editors embed new resources (fonts, duplicated images, incremental save layers). A final optimisation pass and removing unused objects often restores a smaller file.

Is it better to split or compress?

Compress first if quality can be preserved. Split when the target limit is extremely low, when a portal rejects certain content types, or when you need separate uploads for different document categories.

Do online tools keep my files?

Policies vary. For sensitive files, avoid uploading personal documents to untrusted services, remove metadata, and consider local workflows when possible.

What’s the difference between DPI and quality?

DPI is resolution for print and scan; quality is the compression strength. A 300 DPI scan can still look poor if quality is too low. Conversely, 150–200 DPI at a reasonable quality can remain very readable.

Can I compress without losing text search?

Yes. If the PDF contains real text (not just images), keep it as text and only compress embedded images. If it is scanned, consider OCR to add searchable text while optimising the images.

What should I do if the portal still rejects my file?

Re-check allowed formats, remove passwords, flatten forms if required, ensure page count limits are met, and try exporting to PDF/A or standard PDF if the portal has strict validators.

If you need a workflow tailored to a specific portal or device, confirm the exact size limit and whether the rejection is size-only or format validation.

Conclusion

Compression is a repeatable workflow: clean the source, compress the right layer, export once, and verify. If you keep text as text and treat images intentionally, you can meet strict limits without sacrificing clarity or credibility.

Cookie Consent Banner Buy me a coffee