These are the reference datasets we have constructed for the Initiative's validation work. We release the full descriptions, schemas, and per-meal ground truth tables openly; the underlying photographic material is governed by the participant-consent terms documented on each dataset page.
Dataset · Released Apr 14, 2026 A flat CSV lookup table of per-100 g kcal values derived from the April 2026 snapshot of the USDA FoodData Central (FDC) Foundation Foods database. Intended as a stable reference table, pinned to a single FDC version so that the Initiative's reference sets can be reproduced downstream.
Dataset · Released Apr 14, 2026 A curated reference set of 180 meals prepared in a metabolic kitchen, with per-ingredient weighed grams, USDA FoodData Central entry mappings, and computed kcal ground truth. Meals are stratified across three cuisine buckets (Western, East Asian, Mediterranean). Tabular data are published openly; photographic material is released under restricted access, subject to participant consent.
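As a rough illustration of how a per-meal kcal ground truth can be computed from weighed grams and a per-100 g lookup table, here is a minimal Python sketch. The column names (`fdc_id`, `description`, `kcal_per_100g`), the helper functions, and all numeric values are hypothetical placeholders chosen for the example; they are not taken from the released files, whose schema may differ.

```python
import csv
import io

# Hypothetical lookup table: column names and kcal values are illustrative
# placeholders, not real FoodData Central entries.
LOOKUP_CSV = """fdc_id,description,kcal_per_100g
1001,Example food A,130.0
1002,Example food B,165.0
"""


def load_lookup(text):
    """Parse the CSV text into a dict mapping fdc_id -> kcal per 100 g."""
    reader = csv.DictReader(io.StringIO(text))
    return {row["fdc_id"]: float(row["kcal_per_100g"]) for row in reader}


def meal_kcal(ingredients, lookup):
    """Sum kcal over (fdc_id, weighed_grams) pairs.

    Each ingredient contributes grams * kcal_per_100g / 100.
    A missing fdc_id raises KeyError rather than being silently skipped.
    """
    return sum(grams * lookup[fdc_id] / 100.0 for fdc_id, grams in ingredients)


lookup = load_lookup(LOOKUP_CSV)
meal = [("1001", 200.0), ("1002", 100.0)]  # (fdc_id, weighed grams)
total = meal_kcal(meal, lookup)
print(f"{total:.1f} kcal")
```

Keying the lookup on the FDC identifier rather than on a food name is one way to keep the computation tied to a single pinned table version.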
Dataset · Released Mar 27, 2026 A pilot dataset of 32 restaurant-served meals with reference kcal values derived either from published chain-restaurant nutrition disclosures (verified by dismantle-and-weigh) or from dietitian-led recipe reconstruction. Released as a companion to the restaurant-extension preprint (DAI-PRE-2026-02); not yet a full reference set.
Dataset · Released Feb 18, 2026 A coding manual for assigning meals to cuisine buckets in dietary assessment research. Defines six buckets, provides worked examples and decision rules, specifies a two-rater plus adjudicator procedure, and reports inter-rater reliability (Cohen's kappa) from the Initiative's internal calibration.
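For readers unfamiliar with the reliability statistic, Cohen's kappa for two raters is $\kappa = (p_o - p_e) / (1 - p_e)$, where $p_o$ is the observed proportion of agreement and $p_e$ is the agreement expected by chance from each rater's label frequencies. The Python sketch below computes it for two lists of bucket assignments. The function name and the example ratings are made up for illustration and do not reproduce the Initiative's calibration data or procedure.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)

    # Observed agreement: fraction of items given the same label by both raters.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: product of the raters' marginal label proportions,
    # summed over every label either rater used.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    labels = counts_a.keys() | counts_b.keys()
    p_e = sum(counts_a[k] * counts_b[k] for k in labels) / (n * n)

    if p_e == 1.0:
        # Both raters used one identical label throughout; kappa is undefined.
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_o - p_e) / (1.0 - p_e)


# Illustrative ratings only (four meals, two raters).
a = ["Western", "Western", "East Asian", "Mediterranean"]
b = ["Western", "East Asian", "East Asian", "Mediterranean"]
kappa = cohens_kappa(a, b)
print(round(kappa, 3))
```

In a two-rater plus adjudicator procedure, a statistic like this would presumably be computed on the two raters' independent labels before adjudication, although the manual itself is the authority on that detail.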