RecipeScape: An Interactive Tool for Analyzing Cooking Instructions at Scale

Abstract

For cooking professionals and culinary students, understanding cooking instructions is an essential yet demanding task. Common tasks include categorizing different approaches to cooking a dish and identifying usage patterns of particular ingredients or cooking methods, all of which require extensive browsing and comparison. However, no existing system provides support for such in-depth and at-scale analysis. We present RecipeScape, an interactive system for browsing and analyzing the hundreds of recipes of a single dish available online. We also introduce a computational pipeline that extracts cooking processes from recipe text and calculates a procedural similarity between them. To evaluate how RecipeScape supports culinary analysis at scale, we conducted a user study with cooking professionals and culinary students with 500 recipes for two different dishes. Results show that RecipeScape clusters recipes into distinct approaches, and captures notable usage patterns of ingredients and cooking actions.

Minsuk Chang

KAIST

Leonore Guillain

EPFL

Hyeungshik Jung

KAIST

Vivian Hare

Stanford University

Juho Kim

KAIST

Maneesh Agrawala

Stanford University

Computational Pipeline

Computational Pipeline that uses Part-of-Speech tagger and human annotation to convert recipe text into a tree representation, and calculates pairwise distance to visualize the similarities.

Data Gathering: In the data gathering step, we crawl all search results for a queried dish, like chocolate chip cookie and tomato pasta, from recipe websites that use the schema.org’s Recipe scheme.
Parsing: We use off the shelf POS tagger and human annotation to parse tokens of the crawled recipes. More detail is provided in the section below on annotation interface.
Similarity Comparison: In order to obtain similarities between the recipes, we use a tree edit distance, a commonly used technique for comparing tree structures. However, to incorporate the semantic difference between individual cooking actions and ingredients in capturing the structural difference, we dynamically adjust the weights associated with the relabel operations. These weights are calculated using the cosine similarities of words from a pre-trained word embedding model.
Distance Matrix: This similarity information is stored in a pairwise distance matrix, where each element is the tree edit distance between the corresponding recipes. The distance matrix is then converted into x,y coordinates using the Gram matrix.
Hierarchical Clustering: We used hierarchical clustering to group recipes with procedural similarities.

Annotated Recipe Dataset

Please cite our work if you use our annotated data!

recipes from "Chocolate Chip Cookie" query

recipes from "Tomato Pasta" query

Source Code

Dashboard

Pipeline

CHI 2018 Paper (Camera-Ready)

ACM DL Link

Link to paper

Poster for HCI @ KAIST 2018 Winter Workshop

CHI 2018 Slides

Slides

Bibtex

@inproceedings{chang2018recipescape,
  title={RecipeScape: An Interactive Tool for Analyzing Cooking Instructions at Scale},
  author={Chang, Minsuk and Guillain, L{\'e}onore V and Jung, Hyeungshik and Hare, Vivian M and Kim, Juho and Agrawala, Maneesh},
  booktitle={Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems},
  pages={451},
  year={2018},
  organization={ACM}
}

CHI 2017 LBW Paper

While our most recent paper is getting ready for publication, you can access our previous exploration here - Link to LBW Paper. The interface and the computational details have changed, but the overall approach is similar.

ACM DL Link

Poster

Slides for HCI@KAIST Workshop - 2017.02.02

Bibtex

@inproceedings{chang2017recipescape,
  title={Recipescape: Mining and analyzing diverse processes in cooking recipes},
  author={Chang, Minsuk and Hare, Vivian M and Kim, Juho and Agrawala, Maneesh},
  booktitle={Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems},
  pages={1524--1531},
  year={2017},
  organization={ACM}
}

RecipeScape: An Interactive Tool for Analyzing Cooking Instructions at Scale

Abstract

Live Interface Dashboard

Computational Pipeline

Recipe Annotation Interface

Annotated Recipe Dataset

Source Code

CHI 2018 Paper (Camera-Ready)

CHI 2018 Slides

Bibtex

CHI 2017 LBW Paper

Bibtex