FridgeBot — GPT-4o shopping list automation

saubury

Simon Aubury

Posted on May 21, 2024

FridgeBot — GPT-4o shopping list automation

FridgeBot — GPT-4o shopping list automation

Monitoring the contents of my fridge and automatically adding grocery items to my shopping list with the new GPT-4o vision API

OpenAI recently announced their new generative AI model GPT-4o — the “o” stands for “omni,” referring to the model’s ability to use mixed modalities including text, speech and video. I wanted to give GPT-4o a real challenge — helping me keep on top of the shopping list by automatically monitoring the contents of my fridge and adding grocery items when I had run out of something.

Processing steps

The fridge door light is used as a signal for when to start and stop looking for changes in the fridge. I’m assuming that anything taken from my fridge has been removed when the door is open and the door light illuminated.

The code does the following

  • Open the camera feed — and use the OpenCV library for real-time computer vision processing

  • Every 500ms we take an image from the video feed and convert the image to grey-scale

  • Calculate mean brightness of the grey-scale image — and use the average value as an indicator for if the fridge door is open

  • If the last image was “dark” and the current image is “light”, we assume the fridge door has just been opened. We wait 500ms for the camera white balance to settle, and save this as image-1 to represent the initial state.

  • We continue to take photos every 500ms, and save the image temporarily as we don’t know when the door is going to close

  • When the average brightness of the latest image drops significantly, we know the fridge door has just been closed. We discard this image (as it is dark), and use the last image taken and save this as image-2 to represent the final state.

Now we have the before and after images, use image-1 and image-2 as inputs to the OpenAI

  • We encode both images as base-64 to transform the binary image data into a sequence of printable characters

  • We use the OpenAI GPT-4o vision API along with prompt ‘What item is missing in the second image?’

  • The response is decoded — and is assumed to be a single word describing what item was in image-1 and is not present in image-2

Knowing the item, we can add to our shopping list

Hardware

The final form of this project runs on a RaspberryPi with a live video processing stream within my fridge. In reality this was a little impractical as both power and ethernet cables needed to be routed past the fridge seal.

To demonstrate the steps without the need for specific hardware (or a fridge) you can run this project on almost any machine. The setup steps below use a demonstration video file.

Overview

FridgeBot — fridge monitoring with GPT-4oFridgeBot — fridge monitoring with GPT-4o

Setup virtual python environment

Create a virtual python environment to keep dependencies separate. The venv module is the preferred way to create and manage virtual environments.

python3 -m venv .venv
Enter fullscreen mode Exit fullscreen mode

Before you can start installing or using packages in your virtual environment you’ll need to activate it.

source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
Enter fullscreen mode Exit fullscreen mode

API setup

Now it’s time to set up the local API secrets.

cp -i config_secrets_example.py config_secrets.py
Enter fullscreen mode Exit fullscreen mode

Edit config_secrets.py with OpenAPI tokens along with the Todoist API secret tokens

FridgeBot — video process only

To run FridgeBot with example video file without calling OpenAI run the following

python fridgebot.py --video media/pikelets.mov
Enter fullscreen mode Exit fullscreen mode

This should generate two files representing the first lit image and the last lit image within the video media/pikelets.1.jpg and media/pikelets.2.jpg

FridgeBot — video process and OpenAI

To run FridgeBot with example video file, calling OpenAI run the following

python fridgebot.py --video media/pikelets.mov --openai
Enter fullscreen mode Exit fullscreen mode

This should generate two files, and query the OpenAI API to identify the item removed — Pikelets

FridgeBot — video process, OpenAI and Todoist

To run FridgeBot with example video file, calling OpenAI and adding item to Todoist

python fridgebot.py --video media/pikelets.mov --openai --todoist
Enter fullscreen mode Exit fullscreen mode

This should generate two files, and query the OpenAI API and add Pikelets to the Todoist shopping list

FridgeBot Code

FridgeBot code — https://github.com/saubury/fridgebot_openai

💖 💪 🙅 🚩
saubury
Simon Aubury

Posted on May 21, 2024

Join Our Newsletter. No Spam, Only the good stuff.

Sign up to receive the latest update from our blog.

Related