This page provides access to standardized methodologies, processing pipelines, and variable templates for transforming raw public procurement data into analytically usable formats. These resources support reproducible research and cross-country comparability.
Data Processing Approaches and Templates
| Source | Comments |
|---|---|
| GTI's Public Procurement Data Processing | Comprehensive guideline covering the scraping, matching, processing, and cleaning pipeline for public procurement data; it does not include a pipeline for calculating indicators. |
| DIGIWHIST | Template of variables for analysis (especially for integrity indicators); an updated version is here. |
| ProAct methodology report | Describes the data processing logic; applicable mainly to ProAct countries. |
| Open Contracting Data Standard (OCDS) [1, 2] | Works well for data publication but is less convenient for analysis. Analysis requires flattened CSV tables with a unified unit of observation (e.g., the lot) and a selected set of thoroughly filtered variables, which OCDS does not provide. |
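
To illustrate the last row of the table, here is a minimal sketch of flattening one nested, OCDS-style release into lot-level rows ready for a CSV file. It is not part of any of the pipelines listed above. The field names (`tender.lots`, `awards[].relatedLots`, `suppliers`) assume a publisher that uses the OCDS lots extension; the function names, the column selection, and the sample record are hypothetical and would need adapting to real data.

```python
import csv
import io

# Hypothetical selection of analysis variables; one row per lot.
FIELDS = [
    "ocid", "buyer_name", "lot_id", "lot_title",
    "lot_estimated_value", "currency", "award_value", "supplier_name",
]


def flatten_release_to_lots(release):
    """Return one flat dict per lot found in a single OCDS-style release."""
    tender = release.get("tender", {})
    buyer = release.get("buyer", {})
    awards = release.get("awards", [])
    rows = []
    for lot in tender.get("lots", []):
        lot_id = lot.get("id")
        # Assumption: awards point to lots through a `relatedLots` list.
        lot_awards = [a for a in awards if lot_id in a.get("relatedLots", [])]
        # Simplification: keep only the first award and its first supplier.
        award = lot_awards[0] if lot_awards else {}
        suppliers = award.get("suppliers", [])
        rows.append({
            "ocid": release.get("ocid"),
            "buyer_name": buyer.get("name"),
            "lot_id": lot_id,
            "lot_title": lot.get("title"),
            "lot_estimated_value": lot.get("value", {}).get("amount"),
            "currency": lot.get("value", {}).get("currency"),
            "award_value": award.get("value", {}).get("amount"),
            "supplier_name": suppliers[0].get("name") if suppliers else None,
        })
    return rows


def rows_to_csv(rows):
    """Serialize lot-level rows to CSV text with a fixed column order."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up sample record: two lots, only the first one awarded.
sample_release = {
    "ocid": "ocds-xxxx-0001",
    "buyer": {"name": "Example Ministry"},
    "tender": {
        "lots": [
            {"id": "LOT-1", "title": "Laptops",
             "value": {"amount": 1000, "currency": "EUR"}},
            {"id": "LOT-2", "title": "Printers",
             "value": {"amount": 500, "currency": "EUR"}},
        ]
    },
    "awards": [
        {"id": "AW-1", "relatedLots": ["LOT-1"],
         "value": {"amount": 900, "currency": "EUR"},
         "suppliers": [{"name": "Supplier A"}]},
    ],
}

rows = flatten_release_to_lots(sample_release)
csv_text = rows_to_csv(rows)
print(csv_text)
```

Unawarded lots are kept with empty award fields rather than dropped, so filtering stays an explicit later step. Keeping only the first award per lot is a simplification; real data may have several awards or suppliers per lot and would need an explicit rule for that case.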
WB PP Reproducibility Packages
| Country | Link | Notes |
|---|---|---|
| CN | Public procurement data: Auction results | — |
| BR | MiDES: disaggregated and harmonized dataset | — |