Real-time Object Detection for the Visually Impaired An On-Device Federated Learning Approach
2024 (English)Independent thesis Advanced level (degree of Master (One Year)), 10 credits / 15 HE credits
Student thesis
Abstract [en]
Visually impaired people often face difficulties in recognizing objects around them, which can make everyday tasks harder and reduce independence. Advances in artificial intelligence and computer vision now make it possible to design systems that detect and describe objects in real time. The goal of this project was to create such a system with a focus on speed, efficiency, and privacy, so it could run directly on affordable devices without relying on cloud services. The model was trained on the VizWiz dataset, which contains photos taken by blind and low-vision users. A lightweight YOLOv8-small model was trained and fine-tuned, then optimized through model compression techniques, including pruning, knowledge distillation, and quantization. The final version was converted into TensorRT engines in FP16 and INT8 formats, which are suitable for high-speed, low-power inference on devices such as the NVIDIA Jetson Nano. Privacy was a key consideration. Instead of sending images to external servers, the system was designed with future support for Federated Learning, allowing devices to train locally and share only model updates. Although full implementation of Federated Learning was not part of this work, the design allows easy integration. Testing on a laptop with a webcam showed that the system runs smoothly on limited hardware while providing useful object detection. Future improvements could include running tests on real devices such as smart glasses or mobile boards, involving visually impaired users for feedback, expanding the dataset, and allowing each model to adapt to personal needs. Together, these steps can turn the system into a practical and trustworthy tool that gives visually impaired people more confidence and independence in their daily lives.
Place, publisher, year, edition, pages
2024. , p. 56
Keywords [en]
Object Detection, YOLOv8, Federated Learning, Jetson Nano, Assistive Technology, TensorRT
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:his:diva-25922OAI: oai:DiVA.org:his-25922DiVA, id: diva2:2007312
Subject / course
Informationsteknologi
Educational program
Data Science - Master’s Programme
Supervisors
Examiners
2025-10-172025-10-172025-10-17Bibliographically approved