Seven weighted criteria designed by procurement professionals for procurement decision-makers. Not generic software scores — criteria calibrated to the real demands of a CPO evaluating a £1m platform investment.
Each criterion is scored out of 10, then multiplied by its weight. Final scores out of 10 represent weighted aggregate performance across procurement-relevant dimensions.
Procurement Fit is our most heavily weighted criterion because a tool that is not purpose-built for procurement workflows will underperform regardless of its general capabilities. We assess whether the product's AI models were trained on procurement data, whether its terminology matches procurement practice (categories, commodity codes, contracts, POs, GRNs), and whether it integrates natively into P2P processes rather than being adapted from a generic workflow or document tool.
We evaluate the depth and breadth of procurement-relevant features. Breadth matters because procurement teams operate across multiple sub-processes; depth matters because a feature that exists in name but delivers 60% accuracy or requires significant manual correction provides limited value. We specifically test or verify: spend classification accuracy against UNSPSC/eCl@ss, three-way invoice matching rates, contract data extraction precision, supplier risk signal quality, and sourcing event automation completeness.
Hidden pricing is a procurement problem in itself. We score vendors on how clearly they communicate pricing, whether starting prices are publicly available, and whether the total cost of ownership (implementation, connectors, training, per-transaction charges) is disclosed upfront. Enterprise-only pricing is not penalised if it is clearly explained and a range is provided. "Contact sales" with no indication of scale is penalised.
Most procurement teams operate within a broader ERP landscape. An AI tool that requires significant middleware or custom development to connect to SAP, Oracle, or Workday creates integration debt that negates much of the claimed efficiency gain. We assess whether integrations are native/certified, what data flows bidirectionally, and how synchronisation is handled. We specifically check the depth of integration with the six most common enterprise ERP and procurement platforms.
Adoption drives value. A tool that procurement analysts find difficult to use or that requires extensive training before delivering productivity gains scores poorly here, regardless of feature depth. We assess UX through demo testing, user feedback from procurement professionals, and proxy indicators including NPS data, implementation timelines reported by customers, and G2 / Gartner Peer Insights review patterns specifically from procurement roles.
Procurement platforms are mission-critical. Invoice processing failures, sourcing event outages, and contract search downtime have direct financial consequences. Support scoring covers the accessibility and competence of technical support, the quality of the customer success function, and the availability of procurement-domain expertise within the vendor's support organisation — not just generic software support.
Browse our reviews and use the comparison tool to evaluate tools side-by-side against the criteria that matter most to your procurement stack.