an abstract photo of a curved building with a blue sky in the background

llm-box

Run LLMs locally with a clean web UI

Overview

Automatic License Plate Recognition + gate control on Raspberry Pi — captures license plates via camera and triggers a GPIO relay to open a gate.

Details

This project demonstrates how to containerize and serve a Large Language Model (LLM) using FastAPI, Docker, and pre-trained transformer models, with a lightweight browser-based chat interface.

Built as part of the Introduction to Artificial Intelligence course at Southeast European University.

  • Serves a pre-trained transformer model (microsoft/phi-1_5) for text generation.

  • FastAPI-based backend with /chat endpoint.

  • Minimal browser UI for real-time chatting with the LLM.

  • Fully containerized with Docker (multi-stage build).

  • Benchmarking for latency, memory usage, and container performance.

  • Early CI/CD groundwork laid for automated builds.

Resources & Links

Docker Image

GitHub Repository