Rio de Janeiro's municipal government began a structured clean-up of its official digital image archive on Monday, July 1, after an internal audit identified more than 14,000 duplicate photographs stored across the Prefeitura do Rio's document management system — redundant files that had been consuming server capacity and slowing access to public records for city employees and journalists alike.
The problem is not new. But it became urgent after the Secretaria Municipal de Fazenda flagged rising IT infrastructure costs in a June budget review, noting that duplicated and orphaned media files were contributing to ballooning cloud storage expenses. The clean-up, which city contractors are expected to complete by July 31, is part of a broader digitisation push that has been underway since 2024 under the municipal transparency programme known as Rio Dados Abertos.
Where the Duplicate Files Came From
The duplicates accumulated across several years and multiple departments. The bulk originated in two areas: the urban planning secretariat's photographic records of construction permits in the Zona Norte, and image files tied to social welfare casework documentation processed at the Centro de Referência de Assistência Social units scattered across neighbourhoods including Madureira, Bangu, and Campo Grande. When departments migrated from legacy systems to the current cloud platform — a process that ran in phases between 2021 and 2023 — batch uploads were run without deduplication checks, effectively importing the same images two or three times.
Staff at the Arquivo Geral da Cidade, located on Avenida Gomes Freire in the Centro district, flagged the scale of the redundancy in a formal memo in March. The institution, which holds official municipal records, does not directly manage the Prefeitura's live digital systems but has an advisory role over archival standards. The memo recommended that the Empresa Municipal de Informática — IPLANRIO — conduct a full audit before the city's next scheduled infrastructure contract renewal in August.
IPLANRIO has been coordinating with a São Paulo-based technology contractor, though the specific firm has not been publicly named in any document reviewed by this newspaper. Work began at the Cidade Nova administrative complex this week, with technicians running automated hash-comparison scripts to identify exact-copy files before flagging near-duplicates for manual review.
What It Costs and Why It Matters to Residents
Server and cloud storage costs tied to the Prefeitura's document systems reached R$4.2 million in the 2025 municipal budget, according to figures published in the official Diário Oficial do Município in February 2026. City IT managers estimate that removing confirmed duplicate files could reduce active storage load by roughly 18 percent, though that figure has not been formally published and should be treated as a working projection rather than a confirmed outcome.
The practical effect on residents is indirect but real. Public requests for construction permit photographs — commonly filed by residents in areas like Tijuca, Santa Cruz, and the Baixada de Jacarepaguá who are challenging nearby developments — have at times returned multiple copies of the same image, creating confusion in administrative appeals. The deduplication process should make those searches faster and cleaner once complete.
The Rio Dados Abertos portal, accessible at dados.rio, currently hosts more than 900 publicly available datasets. Metadata quality has been a recurring criticism from civil society groups that track municipal transparency, and the image database clean-up is one of several remedial steps the city committed to in a transparency agreement signed with the Ministério Público do Estado do Rio de Janeiro in late 2025.
Residents or organisations that regularly access permit records, social programme documentation, or official city photography through the portal are advised to refresh any saved links after July 31, when the cleaned database is expected to go live. The Prefeitura has not announced a formal public communication campaign about the change, but IPLANRIO's service desk — reachable through the central 1746 city services line — is reportedly fielding inquiries from registered users of the document system ahead of the cutover date.