Data Processing Approaches and Templates

Methodological guidance and variable templates for procurement data transformation


This page provides access to standardized methodologies, processing pipelines, and variable templates for transforming raw public procurement data into analytically usable formats. These resources support reproducible research and cross-country comparability.

Data Processing Approaches and Templates

Source | Comments
GTI's Public Procurement Data Processing | A very comprehensive guideline covering the scraping, matching, processing, and cleaning pipeline for public procurement data; the indicator calculation pipeline is not included.
DIGIWHIST | Template of variables for analysis (especially for integrity indicators); an updated version is also available.
ProAct methodology report | Logic of data processing, applicable particularly to ProAct countries.
Open Contracting Data Standard (OCDS) [1, 2] | Works well for data publication but is not very convenient for analysis. Analysis requires flattened CSV tables with a unified unit of observation (e.g., the lot) and a selected set of variables with thorough filtering, which OCDS does not provide.
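
The flattening step mentioned for OCDS can be illustrated with a minimal Python sketch. It assumes a single release that uses the OCDS lots extension (tender.lots and awards[].relatedLots); the output column names, the "first award per lot" rule, and the sample release are hypothetical choices made for illustration, not part of any of the methodologies listed above.

```python
import csv
import io

# Hypothetical output columns: one row per lot.
COLUMNS = [
    "ocid", "lot_id", "lot_title", "lot_estimated_value",
    "currency", "award_value", "supplier_names",
]


def flatten_release_to_lots(release):
    """Return one flat dict per lot found in a single OCDS release."""
    tender = release.get("tender", {})

    # Index awards by the lot ids they reference (lots extension).
    awards_by_lot = {}
    for award in release.get("awards", []):
        for lot_id in award.get("relatedLots", []):
            awards_by_lot.setdefault(lot_id, []).append(award)

    rows = []
    for lot in tender.get("lots", []):
        lot_awards = awards_by_lot.get(lot["id"], [])
        # Simplifying assumption: keep only the first award linked to a lot.
        award = lot_awards[0] if lot_awards else {}
        suppliers = award.get("suppliers", [])
        rows.append({
            "ocid": release.get("ocid"),
            "lot_id": lot["id"],
            "lot_title": lot.get("title"),
            "lot_estimated_value": lot.get("value", {}).get("amount"),
            "currency": lot.get("value", {}).get("currency"),
            "award_value": award.get("value", {}).get("amount"),
            "supplier_names": "; ".join(s.get("name", "") for s in suppliers),
        })
    return rows


def rows_to_csv(rows):
    """Serialise lot-level rows to CSV text with a fixed column order."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Invented sample release, used only to exercise the functions above.
example_release = {
    "ocid": "ocds-xxxxxx-0001",
    "tender": {
        "lots": [
            {"id": "lot-1", "title": "Road repair",
             "value": {"amount": 1000, "currency": "EUR"}},
            {"id": "lot-2", "title": "Signage",
             "value": {"amount": 200, "currency": "EUR"}},
        ]
    },
    "awards": [
        {"id": "award-1", "relatedLots": ["lot-1"],
         "value": {"amount": 950, "currency": "EUR"},
         "suppliers": [{"name": "Alpha Ltd"}, {"name": "Beta SA"}]},
    ],
}

lot_rows = flatten_release_to_lots(example_release)
csv_text = rows_to_csv(lot_rows)
print(csv_text)
```

Each lot becomes one row whether or not it was awarded, so unawarded lots stay in the table with empty award fields and can be filtered explicitly later rather than dropped silently during flattening.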

WB PP Reproducibility Packages