Skip to content

Why is my PDF so large, and how do you shrink it?

6 min read June 13, 2026
pdfcompress pdffile sizescanning

A text PDF is tiny, but a scanned one stores a full photo of every page. Here is what really bloats a PDF and how to bring it back down.

Why is my PDF so large, and how do you shrink it? — Hivly

A PDF full of text should be small. A few pages of writing is barely any data at all. So when a short document balloons to twenty or thirty megabytes and an upload form bounces it, something other than the words is taking up the room. Almost always, that something is images.

TL;DR: Text in a PDF is tiny. Images are not. A scanned PDF stores a full photo of every page, and high DPI plus full color multiplies that fast. Shrinking a PDF mostly means downsampling and re-compressing those images.

Why is my PDF so large in the first place?

The size of a PDF comes from what it stores, not how many pages it has. Real text is stored as characters and takes almost no space. Images are stored as pixels, and pixels add up fast. A document that looks like a few pages of writing can be huge if those pages are actually photographs of writing rather than text.

This is the core misunderstanding behind most oversized PDFs. People assume a short document is a light one. But a PDF is a container, and it holds whatever you put in: text, vector drawings, embedded fonts, and images. The text and vectors are cheap. The images are where the weight lives, and a single high-resolution page image can outweigh a whole book’s worth of actual characters.

So the first thing to check is what your pages really are. Can you select and copy the text with your cursor? Then it is text, and the file should be small. If your cursor refuses to grab anything, you are looking at a picture, and that picture is your size problem.

Why do scanned PDFs get so big?

A scanned PDF is the usual culprit, because a scanner does not read text. It takes a photo. Every page becomes a full-resolution image, and the PDF stores all of those images back to back. Ten scanned pages means ten photographs bundled into one file, with none of the efficiency a text document would have.

That alone makes scans heavier than typed documents, but two settings on the scanner make it dramatically worse. The first is resolution, measured in DPI, which stands for dots per inch. The second is color. Both multiply the pixel count, and the pixel count is the file size.

Here is the trap. Scanner defaults are often set high “to be safe,” so people scan everyday paperwork at 600 DPI in full color when none of that detail matters. The file ends up many times larger than it needs to be for something that will only ever be read on a screen or printed on a basic office printer.

How DPI multiplies the size

DPI counts how many dots the scanner records per inch of paper, and the cost grows with the square. Doubling the DPI does not double the file. It roughly quadruples it, because you are doubling the dots in both directions at once. A page scanned at 600 DPI carries about four times the data of the same page at 300 DPI.

For most documents, that extra detail is wasted. A 300 DPI scan is the standard for print and reads cleanly. A 150 DPI scan is fine for anything meant to live on a screen. Scanning a tax form or a signed contract at 600 DPI gives you a giant file and no benefit a human reader will ever see.

Why color scans weigh more than grayscale

A full-color scan stores three color values for every pixel, one each for red, green, and blue. A grayscale scan stores a single brightness value per pixel, so it carries roughly a third of the data for the same page. For black-text-on-white-paper documents, color records nothing useful and triples the weight.

If your scan is a printed letter, a receipt, or a form filled in with a pen, grayscale captures everything that matters. Reserve color for pages where color actually carries meaning, such as a highlighted document, a photo, or a chart that depends on its hues to make sense.

What else bloats a PDF besides scans?

Even PDFs that were never scanned can get heavy, and the reasons are similar: assets that are bigger than they need to be. Embedded fonts add some weight, because the file carries the full typeface so it renders the same everywhere. Uncompressed or lightly compressed images inside an otherwise normal document add a lot more.

The sneakiest source is the “Save as PDF” or “Export to PDF” button in some apps. Those exporters are built for fidelity, not size, so they happily embed full-resolution images, redundant copies of fonts, and extras like page thumbnails or document history. The result looks identical to a lean PDF on screen but can be several times larger on disk.

So a PDF you made yourself from a slide deck, a design tool, or a word processor can be just as bloated as a scan, even though every page is “real.” The cause is the same underneath. Somewhere in the file, images are stored at a higher resolution or lower compression than the document actually needs.

How do you actually shrink a PDF?

Shrinking a PDF means going after the images, since that is where the weight is. The most effective move is downsampling: reducing the images inside the file to a sensible DPI. Dropping a 600 DPI scan to 150 DPI for screen reading, or 300 DPI for print, can cut the file by most of its size while leaving it perfectly readable.

The second move is re-compressing the images, usually as JPG, which throws away detail your eye barely notices in exchange for a much smaller file. (If you want the longer story on image compression, our guide on picking the right image format covers the tradeoffs.) On top of that, converting color scans to grayscale and stripping out extras like embedded page thumbnails trims more.

You can do all of this in your browser with the free PDF compressor at pdf.hivly.net. It downsamples and re-compresses the images inside the file, and because it runs in your browser, nothing is uploaded; the document never leaves your device. That matters when the PDF is a signed contract or a scan of an ID, exactly the kind of file you should not be handing to a random website.

The tradeoff is real and worth stating plainly. More compression means a smaller file and softer images. For a document that will only be read on a screen, you can compress hard and never notice. For something destined for a printer, or a page with fine detail that must stay legible, keep the DPI higher and accept a larger file. Match the setting to the job rather than chasing the smallest possible number.

Why does the size limit only matter at upload time?

Most people never think about PDF size until a form rejects the file. Government portals, job application systems, and document upload boxes set hard caps, often a few megabytes, to keep their storage and bandwidth in check. Your file sat happily on your disk for months, then hit a wall the moment you tried to send it.

This is why “why is my PDF so large” is almost always asked in a hurry, with a deadline and a stubborn upload box. The fix is the same regardless of the form: get the images inside the file down to a sensible size. Re-scan in grayscale at a lower DPI if you still have the paper, or compress the existing file if you do not.

One honest note before you start clicking. If a file is already mostly text, compression will barely move it, because there were no heavy images to shrink. And if you over-compress a scan to squeeze under a cap, the result can turn fuzzy or hard to read. Aim for the lowest DPI that keeps the document legible, check the result, and stop there.

Try the pdf toolsMerge, split, compress, protect, unlock, sign and convert PDFs to and from images.

Frequently asked questions

Why is my PDF so large when it is only a few pages?
Page count rarely drives size. Images do. A scanned PDF stores a full photo of every page, so a three-page scan at high DPI in full color can dwarf a hundred-page text document. Check whether your pages are real text or pictures of text.
What DPI should I use to keep a PDF small?
For files meant to be read on a screen, 150 DPI is usually plenty. For documents you expect to be printed, 300 DPI is the common standard. Anything above 300 mostly adds weight without adding detail a reader will ever notice.
Does compressing a PDF reduce its quality?
It can, depending on what you compress. Downsampling images to a sensible DPI and re-encoding them trades a little sharpness for a much smaller file. Text and vector content stay crisp because they are not images. Pick a DPI that matches how the file will be used.
Why does a form reject my PDF for being too large?
Upload forms set hard size caps, often a few megabytes, to control storage and bandwidth. A color scan at high DPI blows past that limit easily. Shrinking the images inside, or scanning in grayscale at a lower DPI, usually brings the file under the cap.

Keep reading

Building something bigger?

Hivly is made by CodingEagles, a software studio that ships production web apps. If you have a real project, get in touch.

See what CodingEagles does →