Methodology Brief
Pre-specifying equivalence margins for dietary assessment non-inferiority claims
A methodology brief
Background
Non-inferiority and equivalence claims are increasingly common in dietary assessment: a new image-based tool may be claimed to be “as accurate as” 24-hour recall, or a simplified diary method may be claimed to be “non-inferior” to a weighed-food record. Such claims are meaningful only if the equivalence margin - the maximum accuracy gap that would still count as “not worse” - is specified in advance and justified on clinical or operational grounds. When margins are selected after seeing the data, or chosen to be conveniently large, non-inferiority inference loses its interpretive content.
The Initiative’s convention is that equivalence margins are pre-specified in the protocol, justified, and reported in every non-inferiority paper drawing on Initiative data.
The Method
Margin specification format. Equivalence margins are specified on the scale of the primary accuracy outcome. For MAPE-based comparisons, the margin $\delta$ is a number of percentage points (for example, “the candidate method is non-inferior if the upper 95% CI on $\text{MAPE}{\text{candidate}} - \text{MAPE}{\text{reference}}$ is below +3 percentage points”). For mean bias, $\delta$ is in the outcome’s units (kcal, g).
Justification. Every margin must be justified on one of three grounds, documented in the protocol:
- Clinical relevance. The margin corresponds to a difference plausibly capable of changing a clinical recommendation or an epidemiologic exposure classification. For example, an energy-estimation margin narrower than the typical within-person day-to-day variation in free-living intake.
- Operational relevance. The margin corresponds to a difference that would change a decision about tool deployment (for example, adoption by a clinic or a public-health programme).
- Historical precedent. The margin matches prior literature on the reference method’s own measurement error.
A pure “that’s what powered” justification is not acceptable; the margin drives the sample size, not the reverse.
Pre-specification vehicle. Margins are recorded in the public protocol (or preregistration) with a date stamp prior to data analysis. Any change to the margin after data collection begins must be disclosed and justified, and triggers a sensitivity analysis with the original margin.
One-sided vs. two-sided. Non-inferiority uses one-sided 97.5% CIs (equivalent to a two-sided 95%). Equivalence (bi-directional) uses two one-sided tests (TOST) with matched margins $\pm \delta$.
Analysis populations. The primary analysis is per-protocol for non-inferiority, with an intention-to-treat sensitivity analysis, reversing the usual convention for superiority trials, because per-protocol is conservative for non-inferiority.
Worked example
A protocol pre-specifies that a candidate image-based tool is non-inferior to 24-hour recall in estimating per-occasion energy if the upper bound of the 95% one-sided CI on the difference in MAPE is below +4 percentage points. The margin is justified on clinical-relevance grounds: a within-person day-to-day CV of about 25% implies a typical day-to-day energy variation well in excess of 4 percentage points of MAPE at the individual level.
Suppose the study enrols $n = 150$ and observes:
| Metric | Candidate | 24-h recall | Difference | 95% one-sided upper CI |
|---|---|---|---|---|
| MAPE | 13.1% | 15.2% | -2.1 pp | +1.4 pp |
| Mean bias (kcal) | -24 | +38 | -62 | - |
The upper CI bound on the MAPE difference (+1.4 pp) is below the pre-specified margin of +4 pp. The non-inferiority claim is supported at the pre-specified margin. The Initiative publication would report the margin, the justification, the upper CI bound, and a sensitivity analysis around the margin choice (for example, at +3 and +5 pp).
Common pitfalls
- Selecting a large margin to ensure the study is powered, without a substantive justification. This inverts the logic of non-inferiority.
- Changing the margin after inspecting the data. This is a form of hypothesising after results are known (HARKing) and invalidates inference.
- Confusing “no statistically significant difference” with non-inferiority. The two are distinct; absence of a significant difference does not establish non-inferiority without a margin.
- Using a two-sided 95% CI for a one-sided non-inferiority claim and thereby reporting a stricter test than claimed.
- Applying the intention-to-treat analysis as primary for non-inferiority. ITT tends to attenuate group differences, which is anti-conservative in this setting.
- Reporting non-inferiority at the overall level while showing adverse stratum-level differences in the secondary analyses, without acknowledging the conflict.
Recommended reporting
- The pre-specified margin, in the units of the primary outcome.
- The justification category (clinical, operational, historical) and the supporting reasoning.
- The date the margin was pre-specified and where it was recorded.
- The primary analysis population (per-protocol) and the ITT sensitivity analysis.
- The one-sided 95% CI (or equivalent) on the candidate-minus-reference difference.
- A sensitivity table at alternative margins around the pre-specified value.
- Any stratum-level analyses and whether the non-inferiority conclusion holds within each.
References
- Okafor N, Henriksen D. Equivalence and non-inferiority inference in dietary assessment: a critical review. Stat Med. 2025;44(4):1010-1024.
- Reinholt P. Margin justification in nutrition non-inferiority studies: a proposal. Nutrients. 2023;15(21):4620.
- Henriksen D. Per-protocol versus intention-to-treat analysis in non-inferiority trials: a primer for nutrition researchers. Am J Clin Nutr. 2024;120(2):330-338.
- Park S-H, Morales F. Within-person day-to-day variability in energy intake and its implication for equivalence margins. Br J Nutr. 2022;128(6):950-960.
- Tanaka M. Two one-sided tests (TOST) for equivalence in dietary method comparison. Public Health Nutr. 2023;26(2):350-359.
- Weiss R, Okafor N. Pre-registration of equivalence margins: practice and adherence. J Nutr. 2025;155(5):1212-1220.
- Linde J. When non-inferiority claims exceed the evidence: three case studies. JMIR mHealth Uhealth. 2024;12(3):e52014.
- Okafor N. A minimal checklist for non-inferiority reporting in nutrition technology validation. Nutrients. 2026;18(2):401.
Keywords
equivalence margin; non-inferiority; pre-specification; study design; dietary assessment; validation; hypothesis testing
License
This piece is distributed under a Creative Commons Attribution 4.0 International License (CC BY 4.0).