|

|

Project Overview

EN | DE

Behind the Scenes

Take a closer look at the development process, the technology stack, and the key features of our project, explained with both text and images.

Ana-Maria Frank

  • Monitoring
  • Admin page design
  • Prompt engineering
  • Image-to-text generation
auto_awesome AI Model

Gemini 2.5 Flash

This large-scale multimodal model forms the core of the image-to-text pipeline. Integrated through Vertex AI, it processes individual images as well as bulk uploads, producing precise alt texts across multiple languages. Carefully engineered prompts and evaluation logic ensure consistent quality aligned with accessibility standards.

Gemini logo
analytics Monitoring

Langfuse

Langfuse provides detailed observability for all AI-driven processes. It tracks token usage, model latency, error rates, and operational costs, enabling informed optimisation decisions during development and deployment. This ensures long-term stability and transparent performance metrics.

Langfuse logo
animation Animations

Lottie animations

Lottie animations enhance the usability and aesthetics of the admin interface. Their lightweight, vector-based structure enables smooth transitions and clear visual guidance without compromising performance, contributing to a refined and engaging user experience.

Lottie logo
cloud_upload File upload

R-Clone

R-Clone supports structured file management within the bulk-processing pipeline. It synchronises datasets, prepares image source collections, and ensures reliable handling of large media volumes before they are passed into the generation service.

RClone logo

rocket_launch Deployment

Google Cloud Platform

Google Cloud Platform hosts the complete application stack, from compute resources to serverless components. Infrastructure design, automated deployments, and operational monitoring ensure a resilient environment capable of handling production workloads. The cloud setup supports scalability, security, and efficient long-term operation.

Google Cloud Platform logo
forward_to_inbox API

Fast API

FastAPI powers the system’s central service layer and exposes the REST endpoints used by the CMS, admin interface, and automation components. It manages communication across the database, batch processor, and LLM pipeline, providing a stable and high-performance integration framework.

FastAPI logo
task_alt Testing

Bruno testing

Bruno enables systematic verification of endpoint behaviour throughout development. Test collections ensure functional accuracy, validate data flow between services, and support regression testing—resulting in a reliable and maintainable API architecture.

Bruno testing logo
fact_check Load testing

Locust load testing

Locust simulates high-volume traffic to assess system behaviour under real-world load. These tests identify performance limits, detect bottlenecks, and verify that the alt-text generation pipeline remains stable even during peak processing tasks.

Locust logo

Stefan Ebner

  • Testing
  • Deployment
  • Core service
  • User Endpoints
  • Cloud Infrastructure

Leopold Klocker

  • Database
  • Admin Web App
  • Bulk-processing
  • Eventarc-Trigger
storage Database

Firebase Firestore

Firestore serves as the central data repository for image sources, generated alt texts, confidence scores, and multilingual variants. The NoSQL schema is designed for fast access, structured document storage, and seamless integration with the generation and admin services.

Firestore logo
flash_on Trigger

Eventarc trigger

Eventarc triggers initiate automated workflows whenever new CSV files are uploaded to Cloud Storage. The system extracts, validates, and forwards all image URLs for processing, enabling efficient handling of large-scale datasets without manual intervention.

Eventarc logo
web Web-App

React

The React-based admin dashboard provides a clear and functional interface for reviewing, editing, regenerating, and validating alt texts. It integrates AI testing tools, real-time updates, and Google OAuth authentication to ensure usability and secure access for authorised staff.

React logo
brush Design

Bootstrap design

Bootstrap supports a consistent design language across the admin interface. Its responsive grid system and styling components ensure accessible layouts, clean visual structure, and seamless user interaction on all device types.

Bootstrap logo

Meet Our Innovators

Driven by innovation and a passion for technology, our team members bring fresh perspectives and expertise to every project we tackle.

Stefan Ebner

Backend Guru

Latency fears my name

Ana-Maria Frank

LLM Overlord and Team Lead

I speak tokens fluently

Leopold Klocker

Database Captain

SQL is my love-language

Dipl.-Ing. Dr. Daniel Wischounig-Strucl

Supervisor

I turn clarity into confusion

Lukas Gregori

Supervisor Netconomy

The real MVP