1 of 46

Pizzanalysis

Insights from scraped menus

2 of 46

We scraped Just Eat

3 of 46

Why?

4 of 46

I needed a project for a .pizza domain

5 of 46

How?

6 of 46

7 of 46

POST /android.php?

var0=effb75066e9800f8960cce6a4c13f947&

var1=getrestaurants&

var4=55.6760968&

var5=12.5683371

8 of 46

var0 = MD5(SECRET+LAT+LNG+...)

9 of 46

SECRET=4ndr01d

10 of 46

272,498 products

11 of 46

name

description

price

category

restaurant_id

Spice pizza [Alm.]

Med kebab, salat og creme fraiche dressing

83.00

Pizzaer

ON33N5PN

12 of 46

Let’s explore the data!

13 of 46

14 of 46

15 of 46

16 of 46

17 of 46

18 of 46

“with kebab, salad and creme fraiche dressing“

19 of 46

[

“kebab”,

“salad”, “creme_fraiche_dressing”]

20 of 46

pizza2vec

21 of 46

“Ingredients used together are closer together in space”

pineapple

pesto

parmesan

22 of 46

What is the closest ingredient to pineapple?

23 of 46

peas

24 of 46

Which ingredients are most similar to pesto?

25 of 46

mascarpone, mozzarella, olive oil

26 of 46

mascarpone, mozzarella, olive oil

(fancy pizzas)

27 of 46

Ingredient clusters

  • Salad pizzas
  • Pepperoni pizzas
  • Italian pizzas

28 of 46

29 of 46

What value does an ingredient add?

30 of 46

“Lets predict prices using ingredients”

31 of 46

32 of 46

33 of 46

Margarita index

34 of 46

Margerita index

35 of 46

Margharita index

36 of 46

Margherita index

37 of 46

Margh?[ae]rita index

38 of 46

Margherita index

  • Law of small numbers (small municipalities have the min and max)
  • Slices are cheap in big cities. Whole pizzas aren’t necessarily.

39 of 46

40 of 46

Further research

  • What is the opposite of a pineapple pizza?
  • Make the margherita index an interactive map
  • Track inflation
  • Find hyper-local purchasing power
  • Predict the stock market

41 of 46

Magenta special:

42 of 46

Hvad tror I en stavefejl koster?

43 of 46

Hvor meget billigere er en

“Pizza med killing”

end en

“Pizza med kylling”?

44 of 46

Ca. 2 kr. pr. stavefejl

45 of 46

46 of 46

github.com/volesen/just-scrape