Skip to main content
The Daily Rio de Janeiro

All of Rio de Janeiro, every day

News

How Rio's Digital Archives Ended Up Full of the Same Image Twice — and What the City Is Doing About It

A decade of fragmented digitisation projects left municipal databases riddled with duplicate imagery, and cleaning up the mess has become one of the city's quieter bureaucratic battles.

Share

By Rio de Janeiro News Desk · Published 4 July 2026, 3:57 PM

4 min read

Updated 4 h ago· 5 July 2026, 12:13 AM

How we reported this

This article was generated by AI from the linked public sources. The Daily Rio de Janeiro is independently owned and covers Rio de Janeiro news free from advertiser or sponsor influence. Read our editorial standards →

How Rio's Digital Archives Ended Up Full of the Same Image Twice — and What the City Is Doing About It
Photo: Photo by Kaique Rocha on Pexels

Rio de Janeiro's municipal image archives contain tens of thousands of duplicate photographs. That is the working estimate circulating inside the Secretaria Municipal de Fazenda e Planejamento as technicians work through a backlog that accumulated over roughly twelve years of disconnected digitisation efforts across different city departments. The problem has a name — duplicate image replacement — and it has quietly become a priority for the prefeitura's information technology directorate heading into the second half of 2026.

The issue matters now because Rio is mid-way through a broader smart-city overhaul tied to preparations for the 2027 Pan American Games. Redundant files slow database queries, bloat storage costs, and — more practically — cause confusion when urban planners and journalists request photographic records of specific neighbourhoods under redevelopment, from Porto Maravilha on the waterfront to the hillside communities around Santa Teresa. When two images carry different metadata but show the same damaged footpath on Rua do Catete, neither can be trusted as a reliable timestamp.

How the Duplication Problem Built Up Over Time

The roots go back to roughly 2014, when at least four separate city agencies — including the Instituto Pereira Passos, which manages urban data for the city, and the Empresa Municipal de Informática (Iplanrio) — began independent campaigns to digitise physical photo libraries. Each agency used its own file-naming conventions, its own metadata schemas, and its own cloud storage contracts. Nobody coordinated. A photograph of flooding in Jacarepaguá taken by a city surveyor in 2016 might exist under three different filenames across two different servers, each with a slightly different crop or compression level.

The problem compounded after 2020, when the COVID-19 pandemic accelerated remote digitisation work. Contractors working from home uploaded files without the quality-control checks that an in-person archive supervisor would have applied. According to a municipal audit report completed in March 2025 and available through the Controladoria Geral do Município, Iplanrio identified more than 47,000 image files flagged as probable duplicates across its urban documentation database alone. The audit put the unnecessary storage cost at approximately R$380,000 per year across the affected systems.

The Instituto Pereira Passos has faced the same challenge on the geographic data side. Its GeoRio mapping portal, which serves as a public reference for everything from flood-risk mapping in Realengo to zoning boundaries in Barra da Tijuca, drew on photo documentation that had been ingested from multiple legacy sources. Duplicate images embedded in map layers created version-control headaches for engineers working on drainage infrastructure along the Avenida Brasil corridor.

What a Fix Actually Looks Like

Duplicate image replacement is less dramatic than it sounds. The technical process involves running perceptual hashing algorithms across stored image files — software that can identify images that are visually near-identical even when they carry different filenames or have been slightly re-encoded. Files flagged by the algorithm are reviewed, a canonical version is selected based on resolution and metadata completeness, and the redundant copies are either deleted or archived to cold storage.

Iplanrio began piloting this process across a subset of its Zona Norte neighbourhood documentation files in February 2026. The pilot covered roughly 8,000 images linked to urban inspection records in Méier, Tijuca, and Vila Isabel. Results from that pilot are expected to inform a city-wide rollout scheduled for the third quarter of this year, according to the March 2025 audit's recommended timeline.

For residents and civil society groups that rely on public image records — particularly organisations in Complexo do Alemão and Manguinhos that use photographic evidence in housing-rights disputes — the cleanup has practical stakes. Duplicate or mislabelled images in official records have in the past been cited in disputes where the city's own documentation contradicted itself. A cleaner, deduplicated archive makes those records harder to dismiss and easier to navigate. The prefeitura has not yet announced a public-facing portal for the cleaned archive, but the Controladoria's audit recommended one be operational before the end of 2026.

You might also like

Editorial picks

How did this story land?

Spread the word

Share

Have your say

Loading comments…

Sources

About this article

Published by The Daily Rio de Janeiro

Covering news in Rio de Janeiro. This article was generated by AI from the linked sources and was not reviewed by a human editor before publishing. See our editorial standards.

Spread the word

Share

See something wrong? Suggest a correction.

Daily brief

Enjoyed this? Wake up to Rio de Janeiro news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Rio de Janeiro and accept our Privacy Policy. Unsubscribe anytime.

The Daily Network