Rio de Janeiro's municipal image archives contain tens of thousands of duplicate photographs. That is the working estimate circulating inside the Secretaria Municipal de Fazenda e Planejamento as technicians work through a backlog that accumulated over roughly twelve years of disconnected digitisation efforts across different city departments. The problem has a name — duplicate image replacement — and it has quietly become a priority for the prefeitura's information technology directorate heading into the second half of 2026.
The issue matters now because Rio is mid-way through a broader smart-city overhaul tied to preparations for the 2027 Pan American Games. Redundant files slow database queries, bloat storage costs, and — more practically — cause confusion when urban planners and journalists request photographic records of specific neighbourhoods under redevelopment, from Porto Maravilha on the waterfront to the hillside communities around Santa Teresa. When two images carry different metadata but show the same damaged footpath on Rua do Catete, neither can be trusted as a reliable timestamp.
How the Duplication Problem Built Up Over Time
The roots go back to roughly 2014, when at least four separate city agencies — including the Instituto Pereira Passos, which manages urban data for the city, and the Empresa Municipal de Informática (Iplanrio) — began independent campaigns to digitise physical photo libraries. Each agency used its own file-naming conventions, its own metadata schemas, and its own cloud storage contracts. Nobody coordinated. A photograph of flooding in Jacarepaguá taken by a city surveyor in 2016 might exist under three different filenames across two different servers, each with a slightly different crop or compression level.
The problem compounded after 2020, when the COVID-19 pandemic accelerated remote digitisation work. Contractors working from home uploaded files without the quality-control checks that an in-person archive supervisor would have applied. According to a municipal audit report completed in March 2025 and available through the Controladoria Geral do Município, Iplanrio identified more than 47,000 image files flagged as probable duplicates across its urban documentation database alone. The audit put the unnecessary storage cost at approximately R$380,000 per year across the affected systems.
The Instituto Pereira Passos has faced the same challenge on the geographic data side. Its GeoRio mapping portal, which serves as a public reference for everything from flood-risk mapping in Realengo to zoning boundaries in Barra da Tijuca, drew on photo documentation that had been ingested from multiple legacy sources. Duplicate images embedded in map layers created version-control headaches for engineers working on drainage infrastructure along the Avenida Brasil corridor.
What a Fix Actually Looks Like
Duplicate image replacement is less dramatic than it sounds. The technical process involves running perceptual hashing algorithms across stored image files — software that can identify images that are visually near-identical even when they carry different filenames or have been slightly re-encoded. Files flagged by the algorithm are reviewed, a canonical version is selected based on resolution and metadata completeness, and the redundant copies are either deleted or archived to cold storage.
Iplanrio began piloting this process across a subset of its Zona Norte neighbourhood documentation files in February 2026. The pilot covered roughly 8,000 images linked to urban inspection records in Méier, Tijuca, and Vila Isabel. Results from that pilot are expected to inform a city-wide rollout scheduled for the third quarter of this year, according to the March 2025 audit's recommended timeline.
For residents and civil society groups that rely on public image records — particularly organisations in Complexo do Alemão and Manguinhos that use photographic evidence in housing-rights disputes — the cleanup has practical stakes. Duplicate or mislabelled images in official records have in the past been cited in disputes where the city's own documentation contradicted itself. A cleaner, deduplicated archive makes those records harder to dismiss and easier to navigate. The prefeitura has not yet announced a public-facing portal for the cleaned archive, but the Controladoria's audit recommended one be operational before the end of 2026.