an abstract photo of a curved building with a blue sky in the background

llm-box

Run LLMs locally with a clean web UI

Overview

Automatic License Plate Recognition + gate control on Raspberry Pi — captures license plates via camera and triggers a GPIO relay to open a gate.

Details

This project demonstrates how to containerize and serve a Large Language Model (LLM) using FastAPI, Docker, and pre-trained transformer models, with a lightweight browser-based chat interface.

Built as part of the Introduction to Artificial Intelligence course at Southeast European University.

Serves a pre-trained transformer model (microsoft/phi-1_5) for text generation.
FastAPI-based backend with /chat endpoint.
Minimal browser UI for real-time chatting with the LLM.
Fully containerized with Docker (multi-stage build).
Benchmarking for latency, memory usage, and container performance.
Early CI/CD groundwork laid for automated builds.

Resources & Links

Docker Image

GitHub Repository

DockerHub

GitHub