← Back to Blog

The Hidden Cost of Uncatalogued Collections

Somewhere in your store, there are boxes that nobody has described. Shelves of archival material that are logged in a transfer register but never properly catalogued. Collections that staff know exist but cannot confidently direct a researcher to, because the arrangement has never been documented.

This is not a niche problem. Estimates from surveys across the United Kingdom consistently suggest that between 30 and 50 per cent of archive and special collections holdings remain uncatalogued to any meaningful standard. The National Archives has long acknowledged the backlog problem in both local government record offices and national institutions alike. At the same time, funding applications increasingly demand that organisations demonstrate access, discovery, and measurable public benefit.

The hidden costs of an uncatalogued collection are real, accumulating, and mostly invisible until they are not.


What “uncatalogued” actually means in practice

The word covers a wide range. At one end, you have undocumented material with no entry anywhere — not in a management system, not in an accession register, not in any online catalogue. At the other end, you have partially described holdings: a box-level record exists, but there is no item-level description, no index, no subject headings, and nothing made available for research through a public interface.

Both situations carry costs. The distinction matters because organisations often underestimate how much uncatalogued material they are actually holding. A rough accession record is not a catalogue entry. A legacy database export that has never been cleaned or published is not accessible. An arrangement that exists only in the memory of a long-serving member of staff is not a guide that members of the public can use.


The funding cost

Grant bodies in the United Kingdom — from the National Lottery Heritage Fund to research councils — have become increasingly sophisticated about what they fund. Access and impact are now central to almost every application. Funding applications that cannot demonstrate existing discoverability, or that cannot show a clear pathway to making collections accessible, are at a disadvantage before the assessment process begins.

More practically: if your institution cannot demonstrate to funders that previous investment in collections has been catalogued, described, and made available for research, the case for further acquisition funding weakens. The survey question “what percentage of your holdings are catalogued?” has teeth now in a way it did not a decade ago.

There is also the hidden cost of time spent on unsuccessful applications. A specialist collections manager or archivist writing funding bids for projects that fail partly due to poor access metrics is time that is not being spent on the collection itself.


The researcher cost

Researchers — whether from a university library, a postgraduate programme, or members of the public investigating local history — encounter uncatalogued material in one of two ways. Either they do not find it at all, because it does not appear in online catalogues or national register searches. Or they discover it exists, submit a request, and wait while a member of staff manually checks whether the material is available, in what format, and under what access conditions.

The second scenario is particularly costly. A special collections reading room that spends staff time on enquiries that a well-maintained catalogue would answer automatically is not failing through incompetence — it is failing through underinvestment in the catalogue itself. Research materials that are not described cannot easily be cited, cannot easily be requested through standard document supply channels, and cannot easily be made available for collaborative projects or digital publication.

There is also a copyright dimension that often goes unexamined. Institutions considering whether to make digitised holdings available online, free of charge or otherwise, frequently find that uncatalogued material creates uncertainty about copyright status. Without a clear record — provenance, creator, date, transfer conditions — it is difficult to offer guidance or grant permission for reuse.


The operational cost

Ask any archivist working with a large uncatalogued backlog how much of their week is spent answering the same questions. The estimate is usually surprising. Where is the [X collection]? Is there anything on [subject]? Can I see the [specific record]?

These are questions a well-maintained catalogue and a clear online presence would answer without staff involvement. Instead, they become consultation work — drawing a librarian or archivist away from arrangement, description, preservation purposes, or acquisitions.

The operational drag compounds over time. When staff who hold institutional knowledge of uncatalogued holdings leave, that knowledge does not transfer automatically. The next person cannot answer the researcher enquiry. The collection becomes effectively unavailable, even though it is physically present. In this way, the cost of not cataloguing is partly a contingent liability: it grows in proportion to staff turnover and the passage of time.


The access and reputation cost

Archives, libraries, and museums in the United Kingdom exist in a competitive attention environment. Researchers have more options than they did twenty years ago. Born-digital alternatives, digitised collections available through electronic databases, and improved online catalogues at well-resourced institutions mean that a collection which is difficult to discover simply gets overlooked.

A museum or archive that cannot describe its holdings clearly in an accessible, searchable format does not appear in the places researchers now look first. It does not feature in discovery layers. It does not appear when a librarian at a university library runs a subject search. The collection exists. It is simply invisible.

This is a reputational cost as much as an operational one. When a special collections reading room cannot answer basic questions about its own holdings, the institution’s credibility with researchers, with funders, and with the public suffers in ways that are hard to quantify but easy to feel.


Why the backlog grows instead of shrinking

Cataloguing backlogs accumulate for structural reasons, not through negligence. Acquisition happens faster than description. Digital formats — born-digital records, digitised documents, electronic transfers — arrive faster and in greater volume than analogue material, but do not automatically come with structured metadata. Staff resource is finite and prioritised towards access and public-facing work.

The result is that the backlog grows as a proportion of total holdings even when cataloguing output is steady. Institutions that have invested heavily in digitisation often find that the digital collection is as poorly described as the physical one — sometimes more so, because digitisation projects do not always include a cataloguing phase.


Recognising the costs: a diagnostic checklist

The costs of an uncatalogued backlog are easier to address once they are made visible. The following checklist is designed to help you assess where your organisation sits. If you answer “yes” to three or more of these questions, the backlog is actively costing you — not just sitting quietly in storage.

  • Researcher delay. Do staff regularly spend time searching for material that a well-maintained catalogue would surface immediately? Are enquiries taking longer than they should because the answer depends on institutional memory rather than a searchable record?
  • Hidden collections. Are there collections in your store that do not appear in any public catalogue, online finding aid, or national register? Would a researcher searching for material on a topic covered by your holdings fail to find you?
  • Staff dependency. Is knowledge of certain collections held by one or two individuals? If those staff members left, would the ability to locate and interpret that material leave with them?
  • Grant weakness. Have funding applications been weakened — or failed — because you could not demonstrate adequate cataloguing, access metrics, or collection management standards?
  • Reputational exposure. Have researchers, partner institutions, or community groups expressed frustration about the discoverability or accessibility of your collections?
  • Missed public benefit. Are there collections with clear educational, community, or research value that remain effectively invisible because they have never been described?

This is not a scoring exercise. It is a way to move the conversation from “we have a backlog” to “the backlog is costing us these specific things” — which is a much stronger basis for securing the resources to address it.


How to start reversing the cost

The starting point is a realistic survey of what you actually hold and what proportion is described to what standard. Many institutions find this survey itself revelatory. Holdings that were assumed to be “basically catalogued” turn out to have significant gaps at item level. Material that was considered undocumented turns out to have partial records that can be built upon.

From there, the practical path involves prioritising by research demand, preservation risk, and funding opportunity. High-demand collections with active enquiries and clear funding prospects should move first. Material at physical risk — deteriorating formats, unstable media — should be described even if only to a basic standard, to enable preservation decision-making.

AI-assisted cataloguing tools have substantially changed what is achievable per member of staff per week. Rather than replacing archivist expertise, they handle the repetitive elements of description — suggesting titles, dates, subject headings, and summaries from the content itself — while the archivist reviews, corrects, and contextualises. For institutions with large backlogs and limited staff, this is not a luxury. It is how the gap gets closed in a reasonable timeframe.

For a detailed breakdown of how The Archiver compares to spreadsheets, manual cataloguing, and traditional CMS platforms, see our comparison page.

The Archiver’s features are built around this model: AI drafts the catalogue record, the archivist approves it. Collections that would take years to describe manually can be brought to a basic catalogued standard in weeks.


The cost of waiting

For directors and heads of collections, the uncatalogued backlog is often framed as a resourcing problem to be solved when budget allows. The more accurate frame is that the backlog is already costing money, every month, in missed funding, staff time, and research opportunities foregone.

The question is not whether you can afford to catalogue. It is whether you can afford not to. The return on investment case for cataloguing is straightforward once the hidden costs are made visible: more successful funding applications, lower staff burden on routine enquiries, greater researcher engagement, and a stronger case for institutional relevance.


Where to start

If you are not sure where your uncatalogued material sits or how to prioritise it, a useful first step is to work through the complete guide to archival cataloguing, which sets out a practical workflow from initial survey through to published catalogue records.

For institutions ready to move faster, request early access to The Archiver to see what AI-assisted cataloguing actually produces on your own collection before making any decisions about wider adoption.

The backlog is not inevitable. The costs are not fixed. But they do not reduce on their own.

Try The Archiver on your collection

Request early access and see what AI cataloguing can do for your collection.

Request early access

Updates from The Archiver