Tired of overpaying just because splitting the bill equally was easier? Now, pay only your fair share with this app! Snap a photo of your receipt, specify who had what, and let the app handle the rest. It handles tips, taxes, and, if you're feeling a little charitable, credit card cashback as well!
Know Python? Then read on to run it locally.
Don't know enough Python? Watch this space.
You will need `ollama` installed and running, with a model of your choice available. The default is `tulu3:8b`; this is easily configurable, but you don't really need a more powerful model. If you have the compute, use `gemma2:27b` for higher accuracy. If you would like to pick a model yourself, run `pytest` and choose one that passes enough of the extraction tests.
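Once the environment described below is set up, a quick sanity check that the ollama server is reachable and your chosen model is pulled can look like the following. This is a minimal sketch using the `ollama` Python package, not part of the app itself; swap the model name for whichever one you picked.

```python
import ollama

MODEL = "tulu3:8b"  # or "gemma2:27b" if you have the compute

# Pull the weights if they aren't already present, then send a trivial
# chat request to confirm the ollama server is up and responding.
ollama.pull(MODEL)
reply = ollama.chat(
    model=MODEL,
    messages=[{"role": "user", "content": "Reply with the single word: ready"}],
)
print(reply["message"]["content"])
```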
Use `uv` from the folks over at astral.sh. After cloning the repository, run:

```bash
uv sync --extra dev
source .venv/bin/activate   # On macOS/Linux; .venv\Scripts\activate on Windows
export PYTHONPATH=$(pwd)
```
If you don't have `uv` or don't care for it, create a virtual environment and install the package itself. Below is an example using the built-in `venv` module.

```bash
python3 -m venv .venv       # Python 3.6+
source .venv/bin/activate   # On macOS/Linux; .venv\Scripts\activate on Windows
pip install .
export PYTHONPATH=$(pwd)
```
Note: the long-term intent is to move away from Gradio.
If everything's set up properly in the virtual environment, run:

```bash
python src/app/gradio_ui.py
```

By default, the app is served at `0.0.0.0:7860`.
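If you need a different port or bind address, Gradio exposes both through `launch()`. Below is a minimal, self-contained sketch of how that typically looks; the actual `gradio_ui.py` may be wired differently, and `split_receipt` here is just a placeholder.

```python
import gradio as gr

def split_receipt(receipt_text: str) -> str:
    # Placeholder for the real OCR + extraction + splitting pipeline.
    return receipt_text

demo = gr.Interface(fn=split_receipt, inputs="text", outputs="text")

# server_name="0.0.0.0" binds to all interfaces so other devices on your
# network can reach the app; server_port picks the port (7860 is Gradio's default).
demo.launch(server_name="0.0.0.0", server_port=7860)
```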
If you want to use the app from a mobile device while the processing runs on your computer, find your machine's IP address and make sure both devices are on the same network. Then, in your mobile browser, navigate to:

```
http://<machine-ip-address>:7860/
```
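If you're not sure what your machine's IP address is, `ipconfig` (Windows) or `ifconfig`/`ip addr` (macOS/Linux) will tell you; a small Python one-off works too:

```python
import socket

# Connecting a UDP socket towards a public address reveals which local
# interface the OS would route through; no packets are actually sent.
s = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
s.connect(("8.8.8.8", 80))
print(s.getsockname()[0])  # e.g. 192.168.1.42
s.close()
```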
Creating an LLM app that is actually useful is rather hard: LLMs are notoriously overconfident when wrong, and a major provider is likely to store the conversations you have with it. This is an attempt to work around these limitations by:
- Performing Optical Character Recognition (OCR) on images to extract text. OCR frameworks aren't generative models, so they are far less likely to go wrong. This avoids the unreliability of LLMs.
- Using an LLM as an Intelligent Document Processing (IDP) layer to extract the relevant fields from the OCR'd receipt text. LLMs are great at this: building the right context around text the same way a human would. This runs entirely locally, using `ollama` and downloaded weights (the default is an 8B model, which can run on machines with as little as 16GB RAM). A rough sketch of this step follows the list.
- Incorporating a Human-in-the-Loop workflow to verify uncertain data extractions. A human (you) is given a UI to correct any unreliable data the LLM may have extracted in step (2) and to configure exactly how you want to split the receipt. Of course, there isn't much of a formal loop here, but the concept still stands.
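To make step (2) concrete, here is a rough sketch of what an IDP-style extraction call can look like. The prompt wording, the field names, and the `extract_fields` helper are illustrative assumptions, not the app's actual implementation.

```python
import json
import ollama

def extract_fields(receipt_text: str, model: str = "tulu3:8b") -> dict:
    """Ask a local LLM to turn OCR'd receipt text into structured fields."""
    prompt = (
        "Extract the line items, tax, and tip from the receipt below. "
        "Respond with JSON using the keys 'items' (a list of objects with "
        "'name' and 'price'), 'tax', and 'tip'.\n\n" + receipt_text
    )
    reply = ollama.chat(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        format="json",  # constrain the model to emit valid JSON
    )
    return json.loads(reply["message"]["content"])
```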
Hope you find this useful!
This web app uses the following components aside from Gradio:

- `surya` for OCR capabilities.
- `ollama` for running LLMs locally.
- All the LLM providers that open-sourced their weights.
Did you know that the abbreviation IRS conflicts with the Internal Revenue Service?
Yes.
What are the limitations?
- Receipts need to be horizontally aligned to be read correctly by OCR.
- The smaller the LLM, the more mistakes it will make. In favor of speed, the LLM makes only one pass over the receipt data, so there are no internal feedback loops like Chain-of-Thought or Prover-Verifier implemented.
- You're a bit out of luck if you don't know at least a little Python. This is being worked on, but progress will be slow. The easiest way out would be to switch both the OCR and the LLM calls to paid or free APIs, but then it becomes harder to ensure that everything stays on-device.