Desk Survival

Clean Messy Excel Data With AI — Without Learning a Single Formula

The Juno Desk · 4 min read · Verified guide

Duplicates with three spellings, dates in four formats, one cell doing the job of three columns. An LLM fixes in minutes what used to eat your Friday afternoon. Here is the exact workflow.

Excel sheets are the digital equivalent of a junk drawer: everything is thrown in, nothing is labeled, and someone else's typos are now your problem. Traditional fixes mean mastering VBA scripting or executing hundreds of repetitive keystrokes. Large language models treat data as meaning, not strings — which is why they can see that three differently-spelled companies are the same company.

Ed. note If you genuinely enjoy fixing date formats by hand, please stop reading and seek help.

Why Excel's own tools fail you

"Remove Duplicates" and "Text to Columns" match characters, not intent. They are blind to the fact that "Google", "Google, Inc." and "Google LLC" are one company, or that 12/05/2026 and May 12, 2026 are one date. AI models process language contextually — exactly the skill this mess requires.

Excel keeps "James Miller" and "Jim Miller" as two loyal customers. Your quarterly report is now fiction.

Step 1 — Normalize the formatting

Copy a representative sample of rows, define the exact output you want, and let the model do the tedium.

20–50 rows
The right sample size to paste into an LLM. Paste thousands and it will hit token limits — or start inventing your customers.
You are an expert data analyst. Standardize the raw Excel data below: 1. All dates to YYYY-MM-DD. Ambiguous dates: output "Check Manual". 2. All phone numbers to E.164 format (+1234567890). 3. All names to Title Case. Output a markdown table: Raw Input | Cleaned Value | Status. [paste your rows here]

Validate the output, paste it back, and for large datasets ask the model to generate a reusable formula or script that applies the same pattern to the rest.

Step 2 — Kill the fuzzy duplicates

Ask the model for a two-column mapping table — "Messy Name" to "Master Name" — then apply it with one XLOOKUP. No judgment calls, no squinting at 400 rows.

I have client company names with inconsistent spellings and suffixes. Group names that refer to the same entity. Output a markdown mapping table: "Original Name" | "Standardized Master Name". Rules: strip LLC/Corp/Inc variations; group obvious abbreviations; use the most complete official name as Master. [paste your list here]
✓ Works on day one

Get the full prompt pack.

Every prompt in this guide plus 13 more office situations — copy, paste, done.

Next in Desk Survival: Write emails that don't sound like a robot wrote them.