Publication | Open Access
Data Quality Barriers for Transparency in Public Procurement
Citations: 24
References: 17
Year: 2022
Engineering, Business Intelligence, Information Forensics, Data Infrastructure, Data Ecosystem, Data Science, Management, Data Integration, Data Quality Barriers, Data Governance, Data Management, Open Data, Public Policy, Multiple Data Silos, Knowledge Discovery, Data Privacy, Government Transparency, Information Management, Public Procurement, Machine-learning-based Procurement Analytics, Responsible Data Management, Anomaly Detection Techniques, Government Procurement, Big Data
Governments need to be accountable and transparent about their public spending decisions, both to prevent losses through fraud and corruption and to build healthy and sustainable economies. Open data act as a major instrument in this respect by enabling public administrations, service providers, data journalists, transparency activists, and regular citizens to identify fraud or uncompetitive markets by connecting related, heterogeneous, and originally unconnected data sources. In this article, we present our experience in the case of Slovenia, where we successfully applied a number of anomaly detection techniques to a set of disparate open data sets, including procurement, company, and spending data, integrated into a Knowledge Graph through a linked data-based platform called TheyBuyForYou. We then report a set of guidelines for publishing high-quality procurement data for better procurement analytics, since our experience has shown significant shortcomings in the quality of the data being published. This article contributes to enhanced policy making in three ways: (1) guiding public administrations at local, regional, and national levels on how to improve the way they publish and use procurement-related data; (2) developing technologies and solutions that buyers in the public and private sectors can use and adapt to become more transparent, make markets more competitive, and reduce waste and fraud; and (3) providing a Knowledge Graph, a data resource designed to facilitate integration across multiple data silos, and showing how it adds context and domain knowledge to machine-learning-based procurement analytics.