Rio de Janeiro's municipal digital archive contains hundreds of thousands of duplicate images — the same photographs of favela urbanisation projects, public works and heritage sites stored multiple times across disconnected servers — and city officials, archivists and urban data specialists are now publicly disagreeing over how to fix the problem before it gets worse.
The debate has sharpened because the Prefeitura do Rio is mid-way through a broader digitisation push linked to the city's Plano Estratégico 2021–2024, a program that committed to migrating public records to centralised cloud infrastructure. With that plan's successor document already in draft, institutions that depend on visual records — from the Instituto Municipal de Urbanismo Pereira Passos to the Secretaria Municipal de Urbanismo — are under pressure to demonstrate their data is clean, searchable and non-redundant before new funding cycles open in August 2026.
What the Experts Are Saying
The core technical problem is straightforward. When city departments photograph the same construction site, public square or heritage building at different stages, images are often uploaded without consistent metadata tagging, creating what archivists call phantom duplication — files that are not identical byte-for-byte but represent the same subject, making automated deduplication software ineffective. Urban data specialists familiar with Rio's infrastructure have described the situation as a governance failure as much as a technical one, arguing that no single office holds clear authority over image classification standards.
Staff at the Instituto Pereira Passos, based in the Cidade Nova neighbourhood, have been among the most vocal internally about the need for a unified tagging protocol. The institute maintains cartographic and photographic records stretching back decades, including aerial surveys of the Zona Norte and detailed documentation of the Porto Maravilha urban regeneration zone. Without consistent duplicate-image management, searches within those archives produce redundant results that slow down planning decisions, particularly when environmental impact assessments reference visual evidence of how a site looked before development began.
Transparency advocates at Transparência Brasil have separately raised a different concern: that duplicated or disorganised image records complicate the public's ability to verify what city work actually looked like at a given date, undermining the Lei de Acesso à Informação, Brazil's freedom-of-information law enacted in 2011. When a document request yields thirty near-identical photos of the same stretch of Avenida Brasil without timestamps or project codes, the practical value of the disclosure drops sharply.
The Practical and Political Stakes
There is money involved. Municipal contracts for digital asset management are typically tendered through the city's procurement portal, and a previous contract for server storage and archive maintenance — covering the period through December 2025 — cost the Prefeitura approximately R$4.2 million. A new procurement round is expected before the end of the third quarter of 2026, and how the city specifies deduplication requirements in that tender will determine whether the underlying problem is addressed or simply migrated to a new server at equal cost.
The Museu da Imagem e do Som, located on Praça Pio X in the Centro district, operates its own photographic collection under a separate institutional framework and has piloted a hash-based duplicate detection system on a subset of roughly 80,000 digital items since early 2025. Officials there have not publicly released findings, but the pilot is being watched closely by other municipal departments as a possible model.
Urban planning professionals working on projects in Jacarepaguá and the Barra da Tijuca corridor have noted that the duplicate-image problem is not merely bureaucratic. When environmental licensing requires photographic baselines, redundant files without clear provenance create legal exposure if a project is challenged in court and the city cannot produce a clean, authenticated image record.
For residents and community organisations tracking infrastructure promises in areas like Complexo do Alemão or Maré, the practical advice from legal observers is the same: submit information requests through e.Rio, the city's digital services portal, and specifically ask for geotagged image files with project identification codes attached. That specificity forces clerks to locate structured records rather than bulk-exporting whatever the search function returns — which, right now, often means duplicates.