How To Organize Legal PDFs Before Review
Legal PDF review becomes painful when the documents are disorganized before the review even starts.
The problem usually begins quietly. Files arrive from clients, opposing parties, internal teams, email attachments, scans, shared drives, or export folders. Some files have useful names. Others are called things like scan001.pdf.
Before long, the team is not reviewing the documents. The team is doing detective work before the real work can begin.
Start by preserving the originals
The first rule is simple:
Do not change the original files.
Keep an untouched incoming folder. If you need to rename, sort, convert, or process files, do it on a working copy.
This matters because you may later need to show what was received, when it was received, and how it moved through the review. If the original set has been renamed, rearranged, or overwritten, that becomes harder than it needed to be.
Create a simple folder structure
Do not start with a heroic folder system that nobody will maintain.
A practical starting point:
01_originals/
02_working_copy/
03_index/
04_review_notes/
05_exports/
The exact names are less important than the rule. Originals stay untouched. Everything else happens in a controlled working area.
Create a document index
A document index is often more useful than a clever folder structure because it lets you see the whole set in one place.
At minimum, track:
- File name
- Folder path
- Document type
- Date, if known
- Source
- Short description
- Review status
- Notes
This can start as a spreadsheet. Later, it can become a database or a small internal tool. At the beginning, the important thing is that someone can open one file and understand what exists.
Standardize names carefully
Renaming can help, especially when the original file names are useless. Just do not rename recklessly.
If you rename files, keep a mapping:
- Original file name
- New file name
- Reason for rename
- Date renamed
That mapping protects you when someone needs to trace a reviewed file back to the original received document.
Identify obvious categories
Before deep review, group obvious document types:
- Contracts
- Invoices
- Correspondence
- Corporate documents
- IDs and certificates
- Court documents
- Property documents
- Unknown or needs review
Do not spend the first day trying to classify every file perfectly. The first pass should make the review easier, not become the review.
Look for useful metadata
Useful metadata may include:
- Dates
- Parties
- Reference numbers
- Page counts
- Document type
- Existing Bates stamps
- Signatures
- Keywords
Some of this can be detected manually. Some can be assisted by software.
Where automation helps
PDF organization is a good candidate for automation because so much of it is mechanical.
Software can help:
- List all files in a folder.
- Generate an index.
- Detect page counts.
- Extract text.
- Flag keywords.
- Identify duplicate names.
- Create review copies.
- Export structured data.
This is one of the reasons I am building PDF Peppersoup, a tool for turning messy PDF folders into review-ready document sets.
Keep humans in the loop
Automation should support review, not pretend that every judgment can be delegated.
For legal PDFs especially, human review still matters. The goal is to spend less time preparing the pile and more time understanding what is inside it.
A simple pre-review checklist
Before review begins, ask:
- Are originals preserved?
- Is there a working copy?
- Is there a basic index?
- Are file paths recorded?
- Are document categories clear enough?
- Are unknown files flagged?
- Is there a record of any renaming?
- Are exports stored separately?
If the answer to those questions is yes, the review will usually be calmer.
Before you start reading
Good document review starts before anyone reads the first PDF.
It starts with structure.
If your team handles this kind of work often, it may be worth turning the process into a repeatable workflow or a small custom tool. That is one of the areas covered by my document workflow automation services, and it is also the reason PDF Peppersoup exists.