The backlog problem is a math problem. Treat all 5,000 SKUs equally and you'll still be enriching in 2027. Tier by revenue, set different completeness targets, and clear the backlog in 90 days with 2 people.
The backlog problem is a math problem
A large electrical distributor sat on 800,000 unclassified products. With manual enrichment, they made roughly 1% annual progress. At that rate, clearing the backlog becomes a multi-decade project.
Your backlog is smaller but the math is the same. If you have 5,000 SKUs at 40% completeness and you can enrich 50 SKUs per week manually, you'll be done in two years. By then, you'll have added 3,000 more SKUs and your PIM shows 55% completeness instead of 40%. The backlog grows faster than you can enrich.
The issue isn't effort. It's strategy. Most distributors treat all SKUs equally and try to reach 95% completeness across the board. This is why backlogs never clear.
The 80/20 rule applies to your catalog
Revenue concentration follows the Pareto principle. In most electrical distributor catalogs, roughly 10-20% of SKUs drive 80-90% of revenue. Pull your own sales data and you'll see the same pattern:
| Tier | SKU count | % of catalog | Revenue contribution | Page views/month |
|---|---|---|---|---|
| Tier 1 | 500 | 10% | ~75-80% | 15,000+ |
| Tier 2 | 1,500 | 30% | ~15-20% | 5,000-15,000 |
| Tier 3 | 3,000 | 60% | ~5% | Under 5,000 |
Your top 500 SKUs drive three-quarters of your business. They deserve 95% completeness with hero images, 5-8 key specs, and full technical documentation. The next 1,500 SKUs need 70% completeness: image, short description, 3 key specs. The remaining 3,000 SKUs can sit at 40% until someone actually searches for them.
You clear the backlog by accepting that not all SKUs deserve the same effort.
Pull 12 months of revenue by SKU from your ERP. Export page view counts from Google Analytics or your ecommerce platform. Join them in a spreadsheet with SKU as the key.
Sort descending by annual revenue. The top 500 SKUs become Tier 1. Flag any SKU with revenue over $10,000/year as automatic Tier 1 regardless of rank.
For SKUs not in Tier 1, filter for page views over 50/month. These are your Tier 2 SKUs. They have search demand even if revenue is low, which often means a data or pricing problem.
Remaining SKUs get baseline enrichment only until traffic justifies promotion. Set a rule: if page views exceed 5 in a month, move to the enrichment queue.
Revenue over $10K/year: Tier 1, regardless of traffic. Revenue under $10K but page views over 50/month: Tier 2, likely a data or pricing issue. Zero sales but added in last 90 days: Hold in Tier 2 for one quarter, then reassess. Low revenue and low traffic: Tier 3, enrich on-demand only.
Field prioritization: what actually drives conversion
Not all product data fields have equal impact. Focus on fields buyers actually use to make decisions: image, voltage, amperage, wire gauge, enclosure type. Adding a fifth or sixth secondary spec field yields diminishing returns.
High-impact fields for Tier 1: product image (hero angle), short description (2-3 sentences), voltage, amperage, poles or wire gauge, enclosure type or mounting style, price and availability.
For Tier 2, cut to the essentials: image, 2-sentence description, and 3 key specs only. For Tier 3, baseline is manufacturer name, part number, and price. Enrich when traffic justifies it.
Before: Tier 1 circuit breaker at 40% completeness
- Part number: CTL3P240A
- Manufacturer: Eaton
- Price: $142.50
- Description: 3-pole circuit breaker
After: same SKU at 95% completeness
- Part number: CTL3P240A
- Manufacturer: Eaton
- Price: $142.50
- Description: Industrial molded case circuit breaker with thermal-magnetic trip, 240V AC rated
- Image: hero shot showing front panel and terminals
- Voltage: 240 V AC
- Amperage: 30 A
- Poles: 3
- Enclosure: NEMA 1
- Trip curve: Standard thermal-magnetic
Weeks 2-4: Enrich Tier 1 (top 500 SKUs)
Manual enrichment from manufacturer datasheets takes 10 minutes per SKU. You're looking up voltage, amperage, wire gauge, or enclosure type in a 40-page PDF and copying it into your PIM. With 2 people working 10 hours per week on enrichment, you can complete 50 SKUs per week. That's 200 SKUs in 4 weeks.
Template-based workflows drop this to 2 minutes per SKU. You pre-map common attribute combinations for contactors, circuit breakers, wire, and conduit. Most Tier 1 SKUs fit one of 15-20 templates. At 2 minutes per SKU, 2 people complete 500 SKUs in 3 weeks.
Automated extraction from manufacturer source documents gets throughput under 30 seconds per SKU, but only makes sense at 10,000+ SKUs.
Weeks 5-8: Enrich Tier 2 (next 1,500 SKUs)
Lower your target completeness to 70%. Skip the extended specs and marketing copy. Focus on image, short description, and 3 key attributes. At 5 minutes per SKU with templates, 2 people complete 125 SKUs per week. You finish all 1,500 Tier 2 SKUs in 12 weeks, but the 90-day plan allocates 4 weeks to this tier because Tier 1 completeness matters more.
Weeks 9-12: Triage Tier 3 (remaining 3,000 SKUs)
For the long tail, enrich on-demand when someone searches or views the product. Set up a workflow flag in your PIM: if page views exceed 5 in any 30-day period, promote to the enrichment queue. Otherwise, leave at baseline.
You'll opportunistically enrich 200-300 Tier 3 SKUs in the final 4 weeks. The rest sit at 40% completeness until traffic justifies the effort.
- Week 1: SKU ranking complete, Tier 1/2/3 assignments in PIM
- Week 2: 50 Tier 1 SKUs enriched to 95% completeness
- Week 3: 100 Tier 1 SKUs enriched (cumulative)
- Week 4: 150 Tier 1 SKUs enriched
- Week 5: All 500 Tier 1 SKUs complete, start Tier 2
- Week 8: 500 Tier 2 SKUs enriched to 70% completeness
- Week 10: 1,000 Tier 2 SKUs enriched (cumulative)
- Week 12: On-demand workflow live, 200-300 Tier 3 enriched
When to consider automation
If your backlog exceeds 10,000 SKUs, manual enrichment won't clear it even with tier-based triage. At that scale, automated extraction from manufacturer datasheets becomes cost-effective. We've cleared 47,000-SKU catalogs in 6 weeks with automated classification and enrichment from source documents. [See how it works]
